Text
                    Special Functions
Special functions, which include the trigonometric functions, have been used for
centuries. Their role in the solution of differential equations was exploited by
Newton and Leibniz, and the subject of special functions has been in continuous
development ever since. In just the past thirty years several new special functions
and applications have been discovered.
This treatise presents an overview of the area of special functions, focusing pri-
primarily on the hypergeometric functions and the associated hypergeometric series.
It includes both important historical results and recent developments and shows
how these arise from several areas of mathematics and mathematical physics.
Particular emphasis is placed on formulas that can be used in computation.
The book begins with a thorough treatment of the gamma and beta functions,
which are essential to understanding hypergeometric functions. Later chapters
discuss Bessel functions, orthogonal polynomials and transformations, the Selberg
integral and its applications, spherical harmonics, g-series, partitions, and Bailey
chains.
This clear, authoritative work will be a lasting reference for students and re-
researchers in number theory, algebra, combinatorics, differential equations, math-
mathematical computing, and mathematical physics.
George E. Andrews is Evan Pugh Professor of Mathematics at The Pennsylvania
State University.
Richard Askey is Professor of Mathematics at the University of Wisconsin-
Madison.
Ranjan Roy is Professor of Mathematics at Beloit College in Wisconsin.


ENCYCLOPEDIA OF MATHEMATICS AND ITS APPLICATIONS EDITED BY G.-C. ROTA Editorial Board B. Doran, M. Ismail, T.-Y. Lam, E. Lutwak Volume 71 Special Functions 27 N. H. Bingham, С. М. Goldie, and J. L. Teugels Regular Variation 28 P. P. Petrushev and V. A. Popov Rational Approximation of Real Functions 29 N. White (ed.) Combinatorial Geometries 30 M. Pohst andH. Zassenhaus Algorithmic Algebraic Number Theory 31 J. Aczel and J. Dhombres Functional Equations in Several Variables 32 M.Kuczma, B.Choczewski,andR. Ger Iterative Functional Equations 33 R. V. Ambartzumian Factorization Calculus and Geometric Probability 34 G. Gripenberg, S.-O. Londen, and O. Staffans Volterra Integral and Functional Equations 35 G. Gasper and M.Rahman BasicHypergeometric Series 36 E. Torgersen Comparison of Statistical Experiments 37 A. Neumaier Interval Methods for Systems of Equations 38 N. Korneichuk Exact Constants in Approximation Theory 39 R. Brualdi and H. Ryser Combinatorial Matrix Theory 40 N. White (ed.) Matroid Applications 41 S.Sakai Operator Algebras in Dynamical Systems 42 W. Hodges Basic Model Theory 43 H.Stahland V. Totik General Orthogonal Polynomials 45 G. Da Prato and J. Zabczyk Stochastic Equations in Infinite Dimensions 46 A. Bjorner et al. OrientedMatroids 47 G. Edgar and L. Sucheston Stopping Times and Directed Processes 48 C.Sims Computation with Finitely Presented Groups 49 T. Palmer Banach Algebras and the General Theory of *-Algebras 50 F. Borceux Handbook of Categorical Algebra I 51 F. Borceux Handbook of Categorical Algebra II 52 F. Borceux Handbook of Categorical Algebra III 54 A. Katok and B. Hasselblatt Introduction to the Modern Theory of Dynamical Systems 55 V. N. Sachkov Combinatorial Methods in Discrete Mathematics 56 V. N. Sachkov Probabilistic Methods in Discrete Mathematics 57 P. M.Cohn Skew Fields 58 R. Gardner Geometric Topography 59 G. A. Baker Jr. and P. Graves-Morris Pade Approximants 60 J. Krajicek Bounded Arithmetic, Prepositional Logic, and Complexity Theory 61 H. Groemer Geometric Applications of Fourier Series and Spherical Harmonics 62 H. O. Fattorini Infinite Dimensional Optimization and Control Theory 63 A.C.Thompson Minkow ski Geometry 64 R. B. Bapat and Т. Е. S. Raghavan Nonnegative Matrices with Applications 65 K.Engel Sperner Theory 66 D. Cverkovic, P. Rowlinson, S. Simic Eigenspaces of Graphs 67 F. Bergeron, G. Labelle, and P. Leroux Combinatorial Species and Tree-Like Structures 68 R. Goodman and N. Wallach Representations and Invariants of the Classical Groups
ENCYCLOPEDIA OF MATHEMATICS AND ITS APPLICATIONS Special Functions GEORGE E.jANDREWS RICHARD ASKEY RANJAN ROY CAMBRIDGE UNIVERSITY PRESS
tier PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United Kingdom CAMBRIDGE UNIVERSITY PRESS The Edinburgh Building, Cambridge CB2 2RU, UK 40 West 20th Street, New York, NY 10011-4211, USA 10 Stamford Road, Oakleigh, Melbourne 3166, Australia Ruiz de Alarcon 13, 28014 Madrid, Spain Dock House, The Waterfront, Cape Town 8001, South Africa http://www.cambridge.org © George E. Andrews, Richard Askey, and Ranjan Roy 1999 This book is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 1999 First paperback edition 2000 Printed in the United States of America Typeset in 10/13 Times Roman in iftgX [ТВ] A catalog record for this book is available from the British Library Library of Congress Cataloging in Publication data is available ISBN 0 521 62321 9 hardback ISBN 0 521 78988 5 paperback
To Leonard Carlitz, От Prakash Juneja, and Irwin Kra
Contents Preface page xiii 1 The Gamma and Beta Functions 1 1.1 The Gamma and Beta Integrals and Functions 2 1.2 The Euler Reflection Formula 9 1.3 The Hurwitz and Riemann Zeta Functions 15 1.4 Stirling's Asymptotic Formula 18 1.5 Gauss's Multiplication Formula for Г(mx) 22 1.6 Integral Representations for Log Г(х) and тр-(x) 26 1.7 Kummer's Fourier Expansion of Log Г(х) 29 1.8 Integrals of Dirichlet and Volumes of Ellipsoids 32 1.9 The Bohr-Mollerup Theorem 34 1.10 Gauss and Jacobi Sums 36 1.11 A Probabilistic Evaluation of the Beta Function 43 1.12 The p-adic Gamma Function 44 Exercises 46 2 The Hypergeometric Functions 61 2.1 The Hypergeometric Series 61 2.2 Euler's Integral Representation 65 2.3 The Hypergeometric Equation 73 2.4 The Barnes Integral for the Hypergeometric Function 85 2.5 Contiguous Relations 94 2.6 Dilogarithms 102 2.7 Binomial Sums 107 2.8 DougalPs Bilateral Sum 109 2.9 Fractional Integration by Parts and Hypergeometric Integrals 111 Exercises 114
viii Contents 3 Hypergeometric Transformations and Identities 124 3.1 Quadratic Transformations 125 3.2 The Arithmetic-Geometric Mean and Elliptic Integrals 132 3.3 Transformations of Balanced Series 140 3.4 Whipple's Transformation 143 3.5 Dougall's Formula and Hypergeometric Identities 147 3.6 Integral Analogs of Hypergeometric Sums 150 3.7 Contiguous Relations 154 3.8 The Wilson Polynomials 157 3.9 Quadratic Transformations - Riemann's View 160 3.10 Indefinite Hypergeometric Summation 163 3.11 The W-Z Method 166 3.12 Contiguous Relations and Summation Methods 174 Exercises 176 4 Bessel Functions and Confluent Hypergeometric Functions 187 4.1 The Confluent Hypergeometric Equation 188 4.2 Barnes's Integral for 1F1 192 4.3 Whittaker Functions 195 4.4 Examples of i Fx and Whittaker Functions 196 4.5 Bessel's Equation and Bessel Functions 199 4.6 Recurrence Relations 202 4.7 Integral Representations of Bessel Functions 203 4.8 Asymptotic Expansions 209 4.9 Fourier Transforms and Bessel Functions 210 4.10 Addition Theorems 213 4.11 Integrals of Bessel Functions 216 4.12 The Modified Bessel Functions 222 4.13 Nicholson's Integral 223 4.14 Zeros of Bessel Functions 225 4.15 Monotonicity Properties of Bessel Functions 229 4.16 Zero-Free Regions for i Fi Functions 231 Exercises 234 5 Orthogonal Polynomials 240 5.1 Chebyshev Polynomials 240 5.2 Recurrence 244 5.3 Gauss Quadrature 248 5.4 Zeros of Orthogonal Polynomials 253 5.5 Continued Fractions 256
Contents ix 5.6 Kernel Polynomials 259 5.7 Parseval's Formula 263 5.8 The Moment-Generating Function 266 Exercises 269 6 Special Orthogonal Polynomials 277 6.1 Hermite Polynomials 278 6.2 Laguerre Polynomials 282 6.3 Jacobi Polynomials and Gram Determinants 293 6.4 Generating Functions for Jacobi Polynomials 297 6.5 Completeness of Orthogonal Polynomials 306 6.6 Asymptotic Behavior of Pff -^ (x) for Large n 310 6.7 Integral Representations of Jacobi Polynomials 313 6.8 Linearization of Products of Orthogonal Polynomials 316 6.9 Matching Polynomials 323 6.10 The Hypergeometric Orthogonal Polynomials 330 6.11 An Extension of the Ultraspherical Polynomials 334 Exercises 339 7 Topics in Orthogonal Polynomials 355 7.1 Connection Coefficients 356 7.2 Rational Functions with Positive Power Series Coefficients 363 7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 371 7.4 Positive Polynomial Sums and the Bieberback Conjecture 381 7.5 A Theorem of Turan 384 7.6 Positive Summability of Ultraspherical Polynomials 388 7.7 The Irrationality of ?C) 391 Exercises 395 8 The Selberg Integral and Its Applications 401 8.1 Selberg's and Aomoto's Integrals 402 8.2 Aomoto's Proof of Selberg's Formula 402 8.3 Extensions of Aomoto's Integral Formula 407 8.4 Anderson's Proof of Selberg's Formula 411 8.5 A Problem of Stieltjes and the Discriminant of a Jacobi Polynomial 415 8.6 Siegel's Inequality 419 8.7 The Stieltjes Problem on the Unit Circle 425 8.8 Constant-Term Identities 426 8.9 Nearly Poised 3F2 Identities 428 8.10 The Hasse-Davenport Relation 430
Contents 10 11 8.11 A Finite-Field Analog of Selberg s Integral Exercises Spherical Harmonics 9.1 9.2 9.3 9.4 9.5 9.6 9.7 9.8 9.9 9.10 9.11 9.12 9.13 9.14 9.15 9.16 Harmonic Polynomials The Laplace Equation in Three Dimensions Dimension of the Space of Harmonic Polynomials of Degree к Orthogonality of Harmonic Polynomials Action of an Orthogonal Matrix The Addition Theorem The Funk-Hecke Formula The Addition Theorem for Ultraspherical Polynomials The Poisson Kernel and Dirichlet Problem Fourier Transforms Finite-Dimensional Representations of Compact Groups The Group SUB) Representations of SUB) Jacobi Polynomials as Matrix Entries An Addition Theorem Relation of SUB) to the Rotation Group SOC) Exercises Introduction to ^-Series 10.1 10.2 10.3 10.4 10.5 10.6 10.7 10.8 10.9 10.10 10.11 10.12 The <?-Integral The ^-Binomial Theorem The q -Gamma Function The Triple Product Identity Ramanujan's Summation Formula Representations of Numbers as Sums of Squares Elliptic and Theta Functions g-Beta Integrals Basic Hypergeometric Series Basic Hypergeometric Identities <7-Ultraspherical Polynomials Mellin Transforms Exercises Partitions 11.1 11.2 11.3 Background on Partitions Partition Analysis A Library for the Partition Analysis Algorithm 434 439 445 445 447 449 451 452 454 458 459 463 464 466 469 471 473 474 476 478 481 485 487 493 496 501 506 508 513 520 523 527 532 542 553 553 555 557
Contents xi 11.4 Generating Functions 559 11.5 Some Results on Partitions 563 11.6 Graphical Methods 565 11.7 Congruence Properties of Partitions 569 Exercises 573 12 Bailey Chains 577 12.1 Rogers's Second Proof of the Rogers-Ramanujan Identities 577 12.2 Bailey's Lemma 582 586 589 590 595 595 597 В Summabilitv and Fractional Integration 599 599 602 604 605 607 С Asymptotic Expansions 611 C.I Asymptotic Expansion 611 C.2 Properties of Asymptotic Expansions 612 C.3 Watson's Lemma 614 C.4 The Ratio of Two Gamma Functions 615 Exercises 616 D Euler-Maclaurin Summation Formula 617 D.I Introduction 617 D.2 The Euler-Maclaurin Formula 619 D.3 Applications 621 D.4 The Poisson Summation Formula 623 Exercises 627 E Lagrange Inversion Formula 629 E. 1 Reversion of Series 629 E.2 A Basic Lemma 630 E.3 Lambert's Identity • 631 E.4 Whipple's Transformation 632 Exercises 634 12.3 12.4 Watson's Transformation Formula Other Applications Exercises Infinite Products A.I Infinite Products Exercises Summability and Fractional Integration B.I B.2 B.3 B.4 Abel and Cesaro Means The Cesaro Means (C, a) Fractional Integrals Historical Remarks Exercises
xii Contents F Series Solutions of Differential Equations 637 F.I Ordinary Points 637 F.2 Singular Points 638 F.3 Regular Singular Points 639 Bibliography 641 Index 655 Subject Index 659 Symbol Index 661
Preface Paul Turan once remarked that special functions would be more appropriately la- labeled "useful functions." Because of their remarkable properties, special functions have been used for centuries. For example, since they have numerous applications in astronomy, trigonometric functions have been studied for over a thousand years. Even the series expansions for sine and cosine (and probably the arc tangent) were known to Madhava in the fourteenth century. These series were rediscovered by Newton and Leibniz in the seventeenth century. Since then, the subject of spe- special functions has been continuously developed, with contributions by a host of mathematicians, including Euler, Legendre, Laplace, Gauss, Kummer, Eisenstein, Riemann, and Ramanujan. In the past thirty years, the discoveries of new special functions and of applica- applications of special functions to new areas of mathematics have initiated a resurgence of interest in this field. These discoveries include work in combinatorics initiated by Schiitzenberger and Foata. Moreover, in recent years, particular cases of long familiar special functions have been clearly defined and applied as orthogonal polynomials. As a result of this prolific activity and long history one is pulled different di- directions when writing a book on special functions. First, there are important results from the past that must be included because they are so useful. Second, there are recent developments that should be brought to the attention of those who could use them. One also would wish to help educate the new generation of mathematicians and scientists so that they can further develop and apply this subject. We have tried to do all this, and to include some of the older results that seem to us to have been overlooked. However, we have slighted some of the very important recent developments because a book that did them justice would have to be much longer. Fortunately, specialized books dealing with some of these developments have recently appeared: Petkovsek, Wilf, and Zeilberger [1996], Macdonald [1995],Heckman and Schlicktkrull [1994], and Vilenkin and Klimyk
xiv Preface [1992]. Additionally, I. G. Macdonald is writing a new book on his polynomials in several variables and A. N. Kirillov is writing on R-matrix theory. It is clear that the amount of knowledge about special functions is so great that only a small fraction of it can be included in one book. We have decided to focus primarily on the best understood class of functions, hypergeometric functions, and the associated hypergeometric series. A hypergeometric series is a series Y,an with an+i/an a rational function of n. Unfortunately, knowledge of these functions is not as widespread as is warranted by their importance and usefulness. Most of the power series treated in calculus are hypergeometric, so some facts about them are well known. However, many mathematicians and scientists who encounter such functions in their work are unaware of the general case that could simplify their work. To them a Bessel function and a parabolic cylinder function are types of functions different from the 3 — j or 6 — j symbols that arise in quantum angular momentum theory. In fact these are all hypergeometric functions and many of their elementary properties are best understood when considered as such. Several important facts about hypergeometric series were first found by Euler and an important identity was discovered by Pfaff, one of Gauss's teachers. How- However, it was Gauss himself who fully recognized their significance and gave a systematic account in two important papers, one of which was published posthu- posthumously. One reason for his interest in these functions was that the elementary functions and several other important functions in mathematics are expressible in terms of hypergeometric functions. A half century after Gauss, Riemann de- developed hypergeometric functions from a different point of view, which made available the basic formulas with a minimum of computation. Another approach to hypergeometric functions using contour integrals was presented by the English mathematician E. W. Barnes in the first decade of this century. Each of these different approaches has its advantages. Hypergeometric functions have two very significant properties that add to their usefulness: They satisfy certain identities for special values of the function and they have transformation formulas. We present many applications of these properties. For example, in combinatorial analysis hypergeometric identities classify single sums of products of binomial coefficients. Further, quadratic transformations of hypergeometric functions give insight into the relationship (known to Gauss) of elliptic integrals to the arithmetic-geometric mean. The arithmetic-geometric mean has recently been used to compute л to several million decimal places, and earlier it played a pivotal role in Gauss's theory of elliptic functions. The gamma function and beta integrals dealt with in the first chapter are essential to understanding hypergeometric functions. The gamma function was introduced into mathematics by Euler when he solved the problem of extending the factorial function to all real or complex numbers. He could not have foreseen the extent of its importance in mathematics. There are extensions of gamma and beta functions
Preface xv that are also very important. The text contains a short treatment of Gauss and Jacobi sums, which are finite field analogs of gamma and beta functions. Gauss sums were encountered by Gauss in his work on the constructibility of regular polygons where they arose as "Lagrange resolvents," a concept used by Lagrange to study algebraic equations. Gauss understood the tremendous value of these sums for number theory. We discuss the derivation of Fermat's theorem on primes of the form An + 1 from a formula connecting Gauss and Jacobi sums, which is analogous to Euler's famous formula relating beta integrals with gamma functions. There are also multidimensional gamma and beta integrals. The first of these was introduced by Dirichlet, though it is really an iterated version of the one- dimensional integral. Genuine multidimensional gamma and beta functions were introduced in the 1930s, by both statisticians and number theorists. In the early 1940s, Atle Selberg found a very important multidimensional beta integral in the course of research in entire functions. However, owing to the Second World War and the fact that the first statement and also the proof appeared in journals that were not widely circulated, knowledge of this integral before the 1980s was restricted to a few people around the world. We present two different evaluations of Selberg's integral as well as some of its uses. In addition to the above mentioned extensions, there are q-extensions of the gamma function and beta integrals that are very fundamental because they lead to basic hypergeometric functions and series. These are series Ес„ where cn+\/cn is a rational function of q" for a fixed parameter q. Here the sum may run over all integers, instead of only nonnegative ones. One important example is the theta function Y^oo Q" x" ¦ This and other similar series were used by Gauss and Jacobi to study elliptic and elliptic modular functions. Series of this sort are very useful in many areas of combinatorial analysis, a fact already glimpsed by Euler and Legendre, and they also arise in some branches of physics. For ex- example, the work of the physicist R. J. Baxter on the Yang-Baxter equation led a group in St. Petersburgh to the notion of a quantum group. Independently, M. Jimbo in Japan was led by a study of Baxter's work to a related structure. Many basic hypergeometric series (or g-hypergeometric series), both polyno- polynomials and infinite series, can be studied using Hopf algebras, which make up quantum groups. Unfortunately, we could not include this very important new approach to basic series. It was also not possible to include results on the multidi- multidimensional U(n) generalizations of theorems on basic series, which have been studied extensively in recent years. For some of this work, the reader may refer to Milne [1988] and Milne and Lilly [1995]. We briefly discuss the q- gamma function and some important g-beta integrals; we show that series and products that arise in this theory have applications in number theory, combi- combinatorics, and partition theory. We highlight the method of partition analysis.
xvi Preface P. A. MacMahon, who developed this powerful technique, devoted several chapters to it in his monumental Сombinatory Analysis, but its significance was not realized until recently. The theory of special functions with its numerous beautiful formulas is very well suited to an algorithmic approach to mathematics. In the nineteenth century, it was the ideal of Eisenstein and Kronecker to express and develop mathematical results by means of formulas. Before them, this attitude was common and best exemplified in the works of Euler, Jacobi, and sometimes Gauss. In the twenti- twentieth century, mathematics moved from this approach toward a more abstract and existential method. In fact, agreeing with Hardy that Ramanujan came 100 years too late, Littlewood once wrote that "the great day of formulae seem to be over" (see Littlewood [1986, p. 95]). However, with the advent of computers and the consequent reemergence of computational mathematics, formulas are now once again playing a larger role in mathematics. We present this book against this background, pointing out that beautiful, interesting, and important formulas have been discovered since Ramanujan's time. These formulas are proving fertile and fruitful; we suggest that the day of formulas may be experiencing a new dawn. Finally, we hope that the reader finds as much pleasure studying the formulas in this book as we have found in explaining them. We thank the following people for reading and commenting on various chapters during the writing of the book: Bruce Berndt, David and Gregory Chudnovsky, George Gasper, Warren Johnson, and Mizan Rahman. Special thanks to Mourad Ismail for encouragement and many detailed suggestions for the improvement of this book. We are also grateful to Dee Frana and Diane Reppert for preparing the manuscript with precision, humor, and patience.
The Gamma and Beta Functions Euler discovered the gamma function, Г (x), when he extended the domain of the factorial function. Thus Г(х) is a meromorphic function equal to (x — 1)! when x is a positive integer. The gamma function has several representations, but the two most important, found by Euler, represent it as an infinite integral and as a limit of a finite product. We take the second as the definition. Instead of viewing the beta function as a function, it is more illuminating to think of it as a class of integrals - integrals that can be evaluated in terms of gamma functions. We therefore often refer to beta functions as beta integrals. In this chapter, we develop some elementary properties of the beta and gamma functions. We give more than one proof for some results. Often, one proof gener- generalizes and others do not. We briefly discuss the finite field analogs of the gamma and beta functions. These are called Gauss and Jacobi sums and are important in number theory. We show how they can be used to prove Fermat's theorem that a prime of the form An + 1 is expressible as a sum of two squares. We also treat a simple multidimensional extension of a beta integral, due to Dirichlet, from which the volume of an n-dimensional ellipsoid can be deduced. We present an elementary derivation of Stirling's asymptotic formula for n! but give a complex analytic proof of Euler's beautiful reflection formula. However, two real analytic proofs due to Dedekind and Herglotz are included in the exercises. The reflection formula serves to connect the gamma function with the trigonometric functions. The gamma function has simple poles at zero and at the negative inte- integers, whereas esc nx has poles at all the integers. The partial fraction expansions of the logarithmic derivatives of Г(х) motivate us to consider the Hurwitz and Riemann zeta functions. The latter function is of fundamental importance in the theory of distribution of primes. We have included a short discussion of the func- functional equation satisfied by the Riemann zeta function since it involves the gamma function. In this chapter we also present Kummer's proof of his result on the Fourier expansion of log Г(х). This formula is useful in number theory. The proof given 1
2 1 The Gamma and Beta Functions uses Dirichlet's integral representations of log Г(х) and its derivative. Thus, we have included these results of Dirichlet and the related theorems of Gauss. 1.1 The Gamma and Beta Integrals and Functions The problem of finding a function of a continuous variable x that equals n! when x = n, an integer, was investigated by Euler in the late 1720s. This problem was apparently suggested by Daniel Bernoulli and Goldbach. Its solution is contained in Euler's letter of October 13,1729, to Goldbach. See Fuss [1843, pp. 1-18]. To arrive at Euler's generalization of the factorial, suppose that x > 0 and n > 0 are integers. Write *! = ?±^!, A.1.1) ( + 1)' where (a)n denotes the shifted factorial defined by (a)n=a(a + l)---(a+n-l) for n > 0, (aH = 1, A.1.2) and a is any real or complex number. Rewrite A.1.1) as !)* n]n* (" + *)* Since we conclude that lim = 1, n—>oo nx n\nx x!= lim . A.1.3) П-+СО (x + 1)„ Observe that, as long as x is a complex number not equal to a negative integer, the limit in A.1.3) exists, for and
1.1 The Gamma and Beta Integrals and Functions Therefore, the infinite product ... +3>7 converges and the limit A.1.3) exists. (Readers who are unfamiliar with infinite products should consult Appendix A.) Thus we have a function k\kx П(х) = lim —'¦ A.1.4) к-кх (x + l)k defined for all complex x ф — 1, —2, —3,... and П(л) = n\. Definition 1.1.1 For all complex numbers x ^= 0, — 1, —2, ..., the gamma func- function Г(х) is defined by к\кх~1 Г(х)=Ит——. A.1.5) An immediate consequence of Definition 1.1.1 is Г(х + 1)=хГ(х). A.1.6) Also, Г(л + 1) = л! A-1.7) follows immediately from the above argument or from iteration of A.1.6) and use of ГA) = 1. A.1.8) From A.1.5) it follows that the gamma function has poles at zero and the negative integers, but 1/ Г(х) is an entire function with zeros at these points. Every entire function has a product representation; the product representation of 1/Г(х) is particularly nice. Theorem 1.1.2 1 A.1.9) I \ П I I n=\ where у is Euler 's constant given by у = lim > --logn . A.1.10)
4 1 The Gamma and Beta Functions Proof. 1 = Um x(x+ !)¦•• (*+«-!) X n The infinite product in A.1.9) exists because and the factor e~x/n was introduced to make this possible. The limit in A.1.10) exists because the other limits exist, or its existence can be shown directly. One way to do this is to show that the difference between adjacent expressions under the limit sign decay in a way similar to 1/n2. ¦ One may take A.1.9) as a definition of Г(х) as Weierstrass did, though the formula had been found earlier by Schlomilch and Newman. See Nielsen [1906, p. 10]. Over seventy years before Euler, Wallis [ 1656] attempted to compute the integral /o Vl - x2dx = \ J^(l-xI/2(l+xI/2dx. Since thisintegral gives thearea of a quarter circle, Wallis's aim was to obtain an expression for jr. The only integral he could actually evaluate was /0 xp(l — x)qdx, where p and q are integers or q = 0 and p is rational. He used the value of this integral and some audacious guesswork to suggest that 1 .. Г 2-4-6---2И 1 I2 /3 xldx = - hm • —pz = ГI - A [l - 3 -5-- -B/г-1) VJ \2 A.1.11) Of course, he did not write it as a limit or use the gamma function. Still, this result may have led Euler to consider the relation between the gamma function and integrals of the form Jo xp(\ — x)qdx where p and q are not necessarily integers. Definition 1.1.3 The beta integral is defined for Rex > 0, Re у > 0 by B(x,y)= f tx-l{\-t)y~ldt. A.1.12)
1.1 The Gamma and Beta Integrals and Functions 5 One may also speak of the beta function B(x,y), which is obtained from the integral by analytic continuation. The integral A.1.12) is symmetric in x and у as may be seen by the change of variables и — \ — t. Theorem 1.1.4 Г(х)ГСу) B^y)=nx-^7y AЛЛЗ) Remark 1.1.1 The essential idea of the proof given below goes back to Euler [1730, 1739] and consists of first setting up a functional relation for the beta function and then iterating the relation. An integral representation for Г(х) is obtained as a byproduct. The functional equation technique is useful for evaluating certain integrals and infinite series; we shall see some of its power in subsequent chapters. Proof. The functional relation we need is B(x,y) = X-^B(x,y + \). A.1.14) First note that for Rex > 0 and Re у > 0, /¦i B(x,y + 1) = tx-\\-t){\-t)y-ldt Jo = B(x,y)-B(x + \,y). A.1.15) However, integration by parts gives \-tx(l - ty] + У- f = У-В(х + \,у). A.1.16) X Combine A.1.15) and A.1.16) to get the functional relation A.1.14). Other proofs of A.1.14) are given in problems at the end of this chapter. Now iterate A.1.14) to obtain m ^ (x + y)(x+y + l) (x+y)nDt , . B(x, y) = B(x, у + 2) = • • • = —— B(x, у + n). Я? +1) (y)n Rewrite this relation as B(x, y + l)= \-tx(l - ty] + У- f tx(\ - [ J J (У)п Jo \n ,y-i = ——^jl—— / tx~x[\ — ) dt.
6 1 The Gamma and Beta Functions As n -» oo, the integral tends to /0°° tx~le~'dt. This may be justified by the Lebesgue dominated convergence theorem. Thus B(x, y) = ^ / t^e-'dt. A.1.17) Г(х + у) J Set у = 1 in A.1.12) and A.1.17) to get -= f tx~xdt = B(x, 1) = ГA) Г x Jo Г(х + 1) 7o Then A.1.6) and A.1.8) imply that /0°° tx~le~'dt = Г(х) for Rex > 0. Now use this in A.1.17) to prove the theorem for Rex > 0 and Key > 0. The analytic continuation is immediate from the value of this integral, since the gamma function can be analytically continued. ¦ Remark 1.1.2 Euler's argument in [1739] for A.1.13) used a recurrence relation in x rather than in y. This leads to divergent infinite products and an integral that is zero. He took two such integrals, with у and у = m, divided them, and argued that the resulting "vanishing" integrals were the same. These canceled each other when he took the quotient of the two integrals with у and у = m. The result was an infinite product that converges and gives the correct answer. Euler's extraordinary intuition guided him to correct results, even when his arguments were as bold as this one. Earlier, in 1730, Euler had evaluated A.1.13) by a different method. He expanded A — t)y~l in a series and integrated term by term. When у = n + 1, he stated the value of this sum in product form. An important consequence of the proof is the following corollary: Corollary 1.1.5 For Rex > 0 Г(х) = / tx-'e-'dt. A.1.18) Jo The above integral for Г(х) is sometimes called the Eulerian integral of the second kind. It is often taken as the definition of Г(х) for Rex > 0. The Eulerian integral of the first kind is A.1.12). Legendre introduced this notation. Legendre's Г(х) is preferred over Gauss's function П(х) given by A.1.4), because Theorem 1.1.4 does not have as nice a form in terms of П (x). For another reason, see Section 1.10. The gamma function has poles at zero and at the negative integers. It is easy to use the integral representation A.1.18) to explicitly represent the poles and the
1.1 The Gamma and Beta Integrals and Functions analytic continuation of Г(х): T(x) = / Jo J\ /•OO -ie-dt + l t^e-'dt —11" Г°° —^+ / r'-^-'dr. A.1.19) + x)n\ Ji The second function on the right-hand side is an entire function, and the first shows that the poles are as claimed, with (—1)"/и! being the residue at x = —n, n = 0.1,.... The beta integral has several useful forms that can be obtained by a change of variables. For example, set t — s/{s + 1) in A.1.12) to obtain the beta integral on a half line, Then again, take t = sin2 в to get Г/2 sin2-1 в cos2? Ode = Г(Х)Г°° . A.1.21) Jo 2Г(х+у) Put x = у = 1/2. The result is _ 2ГA) 2' or ГA/2) = V5F. A-1.22) Since this implies [Г(|)]2 = я/4, we have a proof of Wallis's formula A.1.11). We also have the value of the normal integral = ^. A.1.23) -oo JO Jo Finally, the substitution t = (u — a)/(b — a) in A.1.12) gives A.1.24) The special case a — — l,b = Us worth noting as it is often used:
8 1 The Gamma and Beta Functions A useful representation of the analytically continued beta function is Г(х)Г(у) (x+y)^ A + ^) B(x,y)= = TT^—w \ч- A.1.26) T{x+y) xy l=l(l + ^)(l + z) This follows immediately from Theorem 1.1.2. Observe that B(x,y) has poles at x and у equal to zero or negative integers, and it is analytic elsewhere. As mentioned before, the integral formula for Г(х) is often taken as the defini- definition of the gamma function. One reason is that the gamma function very frequently appears in this form. Moreover, the basic properties of the function can be devel- developed easily from the integral. We have the powerful tools of integration by parts and change of variables that can be applied to integrals. As an example, we give another derivation of Theorem 1.1.4. This proof is also important because it can be applied to obtain the finite field analog of Theorem 1.1.4. In that situation one works with a finite sum instead of an integral. Poisson [1823] and independently Jacobi [1834] had the idea of starting with an appropriate double integral and evaluating it in two different ways. Thus, since the integrals involved are absolutely convergent, /CO ЛОО ЛОО ЛОО / tx-]sy-]e-{s+l)dsdt = / t^e-'dt / s^e^ds = Г(х)Г(у). Jo Jo Jo Apply the change of variables s = uv and t = u(l — v) to the double integral, and observe that 0 < и < oo and 0 < v < 1 when 0 < s, t < oo. This change of variables is suggested by first setting s + t = u. Computation of the Jacobian gives dsdt = ududv and the double integral is transformed to A comparison of two evaluations of the double integral gives the necessary result. This is Jacobi's proof. Poisson's proof is similar except that he applies the change of variables t = r and s = ur to the double integral. In this case the beta integral obtained is on the interval @, oo) as in A.1.20). See Exercise 1. To complete this section we show how the limit formula for Г(х) can be derived from an integral representation of Г(х). We first prove that when n is an integer > 0 and Rex > 0, j; tx-\\-t)ndt = —: -. A.1.27) x(x + \){x+n) This is actually a special case of Theorem 1.1.4 but we give a direct proof by induction, in order to avoid circularity in reasoning. Clearly A.1.27) is true for
1.2 The Euler Reflection Formula n = 0,and 1 И tx-\\-t)n+ldt= I tx-\\-t){\-t)ndt о Jo (x)n+1 (x + l)n+i = (и + 1)! (x)n+2 This proves A.1.27) inductively. Now set t = u/n and let n —> oo. By the Lebesgue dominated convergence theorem it follows that tx-le~'dt = lim for Rex > 0. n^oo (x)n Thus, if we begin with the integral definition for Г(х) then the above formula can be used to extend it to other values of x (i.e., those not equal to 0,-1,-2,...). Remark 1.1.3 It is traditional to call the integral A.1.12) the beta function. A better terminology might call this Euler's first beta integral and call A.1.20) the second beta integral. We call the integral in Exercise 13 Cauchy's beta integral. We shall study other beta integrals in later chapters, but the common form of these three is jc[l\{t)]p[l2(t)\4dt, where l\(t) and I2(t) are linear functions of t, and С is an appropriate curve. For Euler's first beta integral, the curve consists of a line segment connecting the two zeros; for the second beta integral, it is a half line joining one zero with infinity such that the other zero is not on this line; and for Cauchy's beta integral, it is a line with zeros on opposite sides. See Whittaker and Watson [1940, §12.43] for some examples of beta integrals that contain curves of integration different from those mentioned above. An important one is given in Exercise 54. 1.2 The Euler Reflection Formula Among the many beautiful formulas involving the gamma function, the Euler reflection formula is particularly significant, as it connects the gamma function with the sine function. In this section, we derive this formula and briefly describe how product and partial fraction expansions for the trigonometric functions can be obtained from it. Euler's formula given in Theorem 1.2.1 shows that, in a sense, the function \/T(x) is half of the sine function. Theorem 1.2.1 Euler's reflection formula: ^ A.2.1)
10 1 The Gamma and Beta Functions Remark The proof given here uses contour integration. Since the gamma function is a real variable function in the sense that many of its important characterizations occur within that theory, three real variable proofs are outlined in the Exercises. See Exercises 15, 16, and 26-27. Since we shall show how some of the theory of trigonometric functions can be derived from A.2.1), we now state that sinx is here defined by the series The cosine function is defined similarly. It is easy to show from this definition that sine and cosine have period 2л and that e1" = — 1. See Rudin[ 1976, pp. 182-184]. Proof. Set у = 1 - x, 0 < x < 1 in A.1.20) to obtain Г(х)ГA-х)= / 1—dt. A.2.2) Jo J +' To compute the integral in A.2.2), consider the integral -dz. I )c 1 -1 where С consists of two circles about the origin of radii R and € respectively, which are joined along the negative real axis from — R to — e. Move along the outer circle in the counterclockwise direction, and along the inner circle in the clockwise direction. By the residue theorem L z dz = -2л i, A.2.3) с 1-z when zx has its principal value. Thus fn iRxeix9 1€е t? dt+ -^d0+ dt. ^J l'e J \ IR l+t }n l-ee'" Л l+t Let R —> oo and e —> 0 so that the first and third integrals tend to zero and the second and fourth combine to give A.2.1) for 0 < x < 1. The full result follows by analytic continuation. One could also argue as follows: Equality of A.2.1) for 0 < x < 1 implies equality in 0 < Rex < 1 by analyticity; for Rex = 0, x ф 0 by continuity; and then for x shifted by integers using Г(х + 1) — хГ(х) and sin(x + л) = — sinx. ¦ The next theorem is an immediate consequence of Theorem 1.2.1. Theorem 1.2.2 ™( x2 Sin7TX = ЛХ \\ 1 г
1.2 The Euler Reflection Formula 11 71 COt7TX =l+Ё (— + -1-) = lim E -4' Sin7TX n=l = lim 7isecnx= lim : + i-x A.2.6) A.2.7) A.2.8) 4У ^ Sin27TX ^^ (Х+ИJ /I——OO Proof. Formula A.2.4) follows from the product formula r<*> »=. proved in the previous section and from Theorem 1.2.1 in the form Г(х)ГA— х) = —хГ(х)Г(—x) = л I sin7rx. Formula A.2.5) is the logarithmic derivative of A.2.4), and A.2.6) follows from A.2.5) since esc x = cot |—cotx. The two formulasA.2.7)andA.2.8)are merely variations of A.2.5) and A.2.6). Formula A.2.9) is the derivative of A.2.5). ¦ It is worth noting that A.2.6) follows directly from A.2.1) without the product formula. We have /00 fX— 1 />1 ^Л—1 /0 ^X — 1 df = / df + / -— l+t Jo l+t Jl 1+t dt \n+\tn+\ dt where k=—n f\tn" JO t"-x+l )dt n+x + 1 ' n —x +2 Thus A.2.6) has been derived from A.2.1). Before going back to the study of the gamma function we note an important consequence of A.2.5). Mathernati:?; V :¦
12 1 The Gamma and Beta Functions Definition 1.2.3 The Bernoulli numbers Bn are defined by the power series ex- expansion n °° 2k 4Ч1 ^4Ч It is easy to check that -?y + | is an even function. The first few Bernoulli numbers are Bx = -1/2, B2 = 1/6, B4 = -1/30, B6 = 1/42. Theorem 1.2.4 For each positive integer к, - 2 f^2k Bk)\ Proof. By A.2.10) eix+e~ix 2ix ^ ... 2lkxlk xcotx = ix- =ix + —. = 1 - > (-\)k+yB2k and A.2.5) gives the expansion x cot x = n=\ n=\ k=\ Лк Now equate the coefficients of x in the two series for x cot л to complete the proof. ¦ Eisenstein [ 1847] showed that a theory of trigonometric functions could be sys- systematically developed from the partial fractions expansion of cot x, taking A.2.5) as a starting point. According to Weil [1976, p. 6] this method provides the sim- simplest proofs of a series of important results on trigonometric functions orginally due to Euler. Eisenstein's actual aim was to provide a theory of elliptic functions along similar lines. A very accessible account of this work and its relation to mod- modern number theory is contained in Weil's book. Weil refers to Пт,,^^ Yl"-n ak as Eisenstein summation. Theorem 1.2.2 shows that series of the form where к is an integer, are related to trigonometric functions. As we shall see next, the "half series" , 0
1.2 The Euler Reflection Formula 13 bears a similar relationship to the gamma function. In fact, one may start the study of the gamma function with these half series. Theorem 1.2.5 Г'A) = -У, A.2.12) A.2.13) Г(х) ^ ^fL A-2.14) froo/. Take the logarithmic derivative of the product for 1 / Г (x). This gives Г(х) ' ' f^\x+k-l k, The case x = 1 gives A.2.12). The other two formulas follow immediately. ¦ Corollary 1.2.6 Log Г(х) is a convex function of x for x > 0. Proof. The right side of A.2.14) is obviously positive. ¦ Remark The functional equation A.1.6) and logarithmic convexity can be used to derive the basic results about the gamma function. See Section 1.9. We denote Г" (x) / Г (x) by xfr (x). This is sometimes called the digamma function. Gauss proved that is(x) can be evaluated by elementary functions when x is a rational number. This result is contained in the next theorem. Theorem 1.2.7 ++ +т x x+1 x+n—1 A.2.15) p\ n np 4^' 2лпр / nn )Y cot log^+ 2> coslog 2 sin ¦ f p\ n np 4^' 2лпр / t[-)=-Y- ^cot— -log^+ 2> cos log 2 sin A.2.16) where 0 < p < q; ^' means that when q is even the term with index n = q/2 is divided by 2. Here lq/2\ denotes the greatest integer in q/2. Proof. The first formula is the logarithmic derivative of Г(х +и) = (x+n- l)(x+n -2)---
14 1 The Gamma and Beta Functions We derive Gauss's formula A.2.16) by an argument of Jensen [1915-1916] using roots of unity. Begin with Simpson's dissection [1759]: k-l n=0 j=0 where w = el7Z'lk is a primitive fcth root of unity. This is a consequence of Y^ wjm = 0, m ф O(modfc). Now by A.2.13) 00 / 1 = У) (—-r - j^ p + nq = lim У" ( ?—\tp+nq =: lim s(t) by Abel's continuity theorem for power series. From the series — log(l — t) = YlT=i tn/n>an^ Simpson's dissection with со = e2nilq, we get q-\ s(t) = -tp~q log(l - tq) + ^оо~пр log(l - wnt) \-tq VX = -tp~glog (tp~g - l)log(l -t) + 22o~nplog(l -oft). Let? ->¦ 1~ to get q-\ if(p/q) = -y - log? + ]Tornp iog(i - a,"). n=\ Replace p by q — p and add the two expressions to obtain The left side is real, so it is equal to the real part of the right side. Thus (p\ (q — p\ %-^ 2irnp ( 2ттп\ - }+ir[ ?—!- =-2y-21og^+> cos -log 2-2cos . q) \ q ; ^ q \ q ) A.2.17) But f(x)-\lr(l -x) = —logr(x)r(l -x) = - dx
1.3 The Hurwitz and Riemann Zeta Functions 15 So i,(p/q) - YK1 - (p/q)) = -л cotnp/q. A.2.18) Add this identity to A.2.17) to get (p\ л лр 1 t-^ 2лnp, / 2лп\ - = -у cot — - logo + - > cos log 2 - 2cos . 4J 2 4 2flt 4 \ 4 ) A.2.19) But cos2jr(^ — n)/q = cos2jrn/^, so the sum can be cut in half, going from 1 to lq/2], where |_*J denotes the greatest integer in x. Thus р\ л лр L^J, 2л tip , ( 2лп\ — = — у cot log о + > cos log 2 — 2 cos ч) 2 ч 7^ ч \ q ) л лр 4~~*' 2л tip ( лп\ = — у cot log о + 2 у cos log 2 sin — . щ 2 ч t=i ч \ ч J 1.3 The Hurwitz and Riemann Zeta Functions The half series 00 for*>0, A.3.1) called the Hurwitz zeta function, is of great interest. We have seen its connection with the gamma function for positive integer values of s in the previous section. Here we view the series essentially as a function of* and give a very brief discussion of how the gamma function comes into the picture. The case x = 1 is called the Riemann zeta function and is denoted by ?(.?). It plays a very important role in the theory of the distribution of primes. The series converges for Re s > 1 and defines an analytic function in that region. It has a continuation to the whole complex plane with a simple pole at 5 — 1. The analytic continuation of ?(s) up to Re s > 0 is not difficult to obtain. Write the series for as a Stieltjes integral involving [jcJ . Thus for Re s > 1 c Ixjdx n=\ = 1 + —-+s I ^—^dx. s - s+l The last integral converges absolutely for Re s > 0 and we have the required
16 1 The Gamma and Beta Functions continuation. The pole ats = l has residue 1 and, moreover, lim f 1 ) f°° UJ im r{s) = 1 + / V- 1- / i , / L*J -x = lim 1 + / z—dx /1 \ = lim V log« = y. A.3.2' The best way to obtain analytic continuation to the rest of the plane is from the functional relation for the zeta function. We state the result here, since the gamma function is also involved. There are several different proofs of this result and we give a nice one due to Hardy [1922], as well as some others, in the exercises. In Chapter 10 we give yet another proof. Theorem 1.3.1 For all complex s, (°)/2) - s). A.3.3) If s < 0, then 1 — s > 1 and the right side provides the value of f (s). This relation was demonstrated by Euler for integer values of s as well as for 5 = 1/2 and i = 3/2. He had proofs for integer values of 5, using Abel means. An interesting historical discussion is contained in Hardy [1949, pp. 23-26]. The importance of %(s) as a function of a complex variable in studying the distribution of primes was first recognized by Riemann [1859]. The last section contained the result The following corollary is then easy to prove. Corollary 1.3.2 <A -2k) = —-B2k, ?@) = — and f(-2/t)=O /or Jk = 1, 2, 3,.... A.3.4) Corollary 1.3.3 С '@) = —1ог2л\ A.3.5) 2 Proo/. From the functional equation and the fact that
1.3 The Hurwitz and Riemann Zeta Functions 17 we have -?A - s) = *-s+l'z ^^ (s - l)f (s). A.3.6) sv 2F(C-5)/2)v Now A.3.2) implies that (s - \)S(s) = 1 + y(s - 1) + A(s - IJ -\ .So take the logarithmic derivative of A.3.6) to get _,,__, y+2A(s-\) = l°g,r--^-,j--, Set 5 = 1 and use Gauss's result in Theorem 1.2.7 with p = 1 and q = 2. This proves the corollary. ¦ There is a generalization of the last corollary to the Hurwitz zeta function S(x,s). A functional equation for this function exists, which would define it for all complex s, but we need only the continuation up to some point to the left of Re s = 0. This can be done by using the function l;(s). Start with the identity S(x, s) - «E) - sxS(s + 1)) = x~s + 2^n~'[(l +x/n)-s - A - sx/n)}. n=\ The sum on the right converges for Re s > — 1, and because Sis) is defined for all s, we have the continuation of Six, s) to Re 5 > — 1. The following theorem is due to Lerch. Theorem 1.3.4 =,„81« „3.7, V2 s=0 V ?Л Proof. The derivative of the equation Six + 1, s) = Six, s) — x's with respect to 5 at s = 0 gives /dS(x+l,s)\ fdSix,s)\ - =logA\ A.3.8) For Re s > 1, «=o so A39)
18 1 The Gamma and Beta Functions Now A.3.8) and A.3.9) together with A.2.14) of Theorem 1.2.5 imply that To determine that the constant С = — \ log 2тт, set x = 1 and use Corollary 1.3.3 This completes the proof of Lerch's theorem. ¦ For a reference to Lerch's paper and also for a slightly different proof of Theoren 1.3.4, see Weil [1976, p. 60]. 1.4 Stirling's Asymptotic Formula De Moivre [1730] found that и! behaves like Cnn+]/2e~n for large n, where С is some constant. Stirling [1730] determined С to be -J2n; de Moivre then used г result of Stirling to give a proof of this claim. See Tweddle [1988, pp. 9-19]. This formula is extremely useful and it is very likely that the reader has seen applications of it. In this section we give an asymptotic formula for Г(х) for Rex large, wher Imx is fixed. First note that log T{x + n + 1) = J21=\ l°g(& + дг) + log Г(* + 1) We then employ the idea that an integral often gives the dominant part of the surr of a series so that if the integral is subtracted from the series the resulting quantity is of a lower order of magnitude than the original series. (We have already used this idea in Equation A.3.2) of the preceding section.) In Appendix D we prove the Euler-Maclaurin summation formula, a very precise form of this idea when the function being integrated is smooth. Two fuller accounts of the Euler-Maclaurir summation formula are given by Hardy [1949, pp. 318-348] and by Olver [1974. pp. 279-289]. Theorem 1.4.1 Г(х) ~ *j2nxx~'42e-x as Re* -> oo. Proof. Denote the right side of the equation n-\ log T(jc + n) = ]T log(* + jc) + log Г(х + 1) *=] by с„, so that cn+i -с„ =log(x+n). By the analogy between the derivative and the finite difference we consider cn to be approximately the integral of log(jc + n) and set cn = (n+ x) log(« + x) - (n + x) + dn.
1.4 Stirling's Asymptotic Formula 19 Substitute this in the previous equation to obtain log(* +n) = (n+\+x)log(n + 1 + jc) - (n + x)log(n + x) + dn+x -dn-\. Thus dn+i - dn = 1 - (и + х + 1) к \ « -г л / 1 1 1 1 1 2(n+jt) 6(n + xJ Proceeding as before, take dn=en-- log(n and substitute in the previous equation to get e», ~ «. = '- log(l {п+х)г)' Now en-e0 = l>+i ~ek) = ^ |- щк + xJ + O[ -^—^ ) \; A.4.1) therefore, limn^.oo(en — eo) = A"] (jc) exists. Set 1 c"-"^12(n+jt)^V(«+*)V' where ^T(jc) = ^Ti(jc) + e0. The term (и + jc) comes from completing the sum in A.4.1) to infinity and approximating the added sum by an integral. So we can write 1 cn = (n + x) log(n +x)-(n + x) - - log(n + *) + logC(*) +o( ), I2(n+x) \(n+xJJ where ^T(jc) = log C(jc). This implies that \ .+O[ ^ )}. Щп+х) \(n+xJJ] A.4.2)
20 1 The Gamma and Beta Functions We claim C(x) is independent of x. By the definition of the gamma function .. Г(л+л:) Г(лО (х)п T(x) T(y) lim ny = lim ny x = • = 1. A.4.3) п-+осГ(п + у) Г (у) п^оо (у)„ Г (У) Г(х) Now, from A.4.2) and A.4.3) we can conclude that , ,. -хГ(п + х) C(x) ( x\n _x C(x) 1 = hm n x^—— = ——- hm 1 + - } e = Г (и) C@)»-»»\ n) C@) Thus C(x) is a constant and T(jc) ~ Cxx~l/2e~x as Re* -> oo. To find C, use Wallis's formula: 22«(»!J 1 = lim — n-*oo Bи)! Vй 2 = lim "^° С This gives С = л/2тт and proves the theorem. Observe that the proof gives the first term of an error estimate. ¦ We next state a more general result and deduce some interesting consequences. A proof is given in Appendix D. For this we need a definition. The Bernoulli polynomials Bn(x) are defined by 7 e' — 1 n=0 The Bernoulli numbers are given by Bn @) = Bn for n > 1. Theorem 1.4.2 For a complex number x not equal to zero or a negative real number, „,, ,,.4.5) 2m 7 (x + 02ffl Г/ге va/ие о/log jc is //ге branch with logx rea/ w/геи jc is rea/ and positive. The expansion of log F(jc) in A.4.5) is an asymptotic series since the integral is easily seen to be O(x~2m+l) for |argx\ < л - 8, S > 0.
1.4 Stirling's Asymptotic Formula 21 From this theorem the following corollary is immediately obtained. Corollary 1.4.3 For S > 0 and |argx\ < ж - S, Г(х) ~ -/2nxx~y2e-x as \x\ -+ oo. Corollary 1.4.4 When x = a + ib,a\ < a < a2 and \b\ —> oo, then \Y(a + ib)\ = V2^\b\a-^2e^w/2[l + O(l/\b\)], where the constant implied by О depends only on a\ and a2. Proof. Take \b\ > \,a > 0. It is easy to check that the Bernoulli polynomial \\B2 - B2(t)\ < \\ B2 - B2(t) =t -t2. Thus \\B2 - B2(t)\ < \\t(\ -01 < | forO < t < l.So A.4.5) with m = 1 is log Г (a + ib) = (a + ib--\ log(a + ib) - (a + ib) + - login + R(x), and dt 1 __, \b\ L , n Now Re ( a + ib - - ) log(a + ib)\ = (a - -) log(a2 + b2I'2 - barctan-. V 2 / V 2 / a Also, log(a2 +b2)l<2 =l-\ogb2 + ilog^l + °^\ =\og\b\ + o(~ Moreover, This gives b a \t, arctan - + arctan - = < 2Л a b [ -f, if b < 0. b , Г , л- а \ -b arctan - = —b\± \- O\ — a [2b \fc2 Putting all this together gives log \Г(а + ib)\ = (a - ^ log \b\ - ||ft| + 11оё2я + О l^~ ).
22 1 The Gamma and Beta Functions The condition a > 0 is removed by a finite number of uses of the functional equation A.1.6) and the corollary follows. Observe that the proof only uses a = o{\b\) rather than a bounded. ¦ Corollary 1.4.5 For |argjc| < л - 8, 8 > 0, Corollary 1.4.4 shows that Г (a + ib) decays exponentially in the imaginary direction. This can be anticipated from the reflection formula, for or or Similarly, and \2 г( [it >) \+ib) 2я 7TD _|_ я— Л"О -VS.— —ibsmnbi e "'* as /><7 as 2я ±O0 V27T\b\'l/2e~nlbl/2 as fe -> ±c». Since Г(д:) increases rapidly on the positive real axis and decreases rapidly in the imaginary direction, there should be curves going to infinity on which a normalized version of P(jc) has a nondegenerate limit. Indeed, there are. See Exercise 18. 1.5 Gauss's Multiplication Formula for Г(отдг) The factorization together with the definition of the gamma function leads immediately to Legendre's duplication formula contained in the next theorem. Theorem 1.5.1 ГBа)Г( l-] = A.5.1)
1.5 Gauss's Multiplication Formula for Г(тх) 23 This proof suggests that one should consider the more general case: the factor- factorization of (a)mn, where m is a positive integer. This gives Guass's formula. Theorem 1.5.2 Г(та)Bл)(т-])/2 = тта/2Г(а)Г (а+-)---г(а+ ^—!Л. A.5.2) V m) V m ) Proof. The same argument almost gives A.5.2). What it gives is A.5.2) but with »-ii /IV (m-\\ Bл)—т-1 replacedby Г - ...Г =: P. A.5.3) \ m I \ m I To show that A.5.3) is true, we show that P2 = 2 Bя)"-] m By the reflection formula sin^' m So it is enough to prove , л 2л (т — Х)л 2 sin — sin — • • • sin = m. mm m Start with the factorization Г x — 1 Let x ->¦ 1 to obtain m-I m = JJA k=\ , л 2л (т — \)л = 2м sin — sin sin —. mm m This proves A.5.3). ¦ Remark 1.5.1 A different proof of A.5.1) or A.5.2) that uses the asymptotic formula for Г(х) and the elementary property Г(х + 1) = хГ(х) is also possible. In fact it is easily verifed that S ГA/2)ГBд:) satisfies the relation g(x + 1) = g(x). Stirling's formula implies that g(x) ~ 1 as jc —>¦ oo so that ЦгПп-юо g(x + n) = 1 when n is an integer. Since g(x+n) = g(x) we can conclude that g(x) = 1. A similar proof may be given for Gauss's formula. This is left to the reader.
24 1 The Gamma and Beta Functions An elegant proof of the multiplication formula using the integral definition of the gamma function is due to Liouville [1855]. We reproduce it here. The product of the gamma functions on the right side of A.5.2) is xm vtf+((m— l)/m)~l//v Xm UXm rOQ rOQ r / e-'^r'Aci / e^xa2+(l/m)-ldx2 ¦¦• Jo Jo Jo CO /-CO /.CO / ¦¦•/ ^ о Jo Jo Introduce a change of variables: _ z _ _ X\ — , X2 — X2, ... i Xm — Xm. X2 ' ' ' xm The Jacobian is easily seen to be mzm'1 х2хъ ¦ ¦ ¦ xm and the integral can be written roc /-oo exp | — ( X2 + хъ + ¦ ¦ ¦ + х„ о Jo a+(l/m)—1 a-\-((m~l)/m)—l z j j j n ил™. Set t = X2 + хт, + ¦ ¦ ¦ + xm + zm/(x2X3 ¦ ¦ ¦ xm), and rewrite the integral as /•CO /"CO /"CO m / ¦•¦/ e-tzma-lx{2l"n)-lxf"n)~X---x«m-l)lm)-'dzdx2---dxm. Jo Jo Jo A.5.4) First compute POO POO m~l I = • ¦ ¦ / e~* TT Xj_li~ dx2dxT, ¦ ¦ -dxm. Jo Jo J=l Clearly, dl m_j f™ f -t T[ (j/m)-\dx2 ¦ ¦ -dxm dz Jo Jo jJi J+i x2---xm Now introduce a change of variables, x2 =zm/(xiX3---xm),x3 =x3,...,xm =xm,
1.5 Gauss's Multiplication Formula for Г(тх) 25 and h = x3 + x4 H h xm + xi + zm/(x3 ¦ ¦ ¦ xmxi). The Jacobian is and j- is given by dl m_, Г00 dz ~ mZ Jo m-\ •п^и)- = -m ¦¦¦ Jo = -ml. Therefore, x2x3 Г "'¦ ( Jo e V \dx\dxj, ¦ ¦ dxm Г/хг /•со m—l Jo )}2]+l ¦Xm-l m \(l/m)-l ) C1X3 ¦¦¦Xm) n)—\ ((m—l)/m)—1 , X1 tZX / = Ce~mz. To find C, set z = 0 in the integral for / as well as in the above equation and equate to get m By A.5.3), С = Bn){m-l)l2m-ll2 and / = {2n)(m-^l2m-xl2e'mz. Substitution in A.5.4) gives РОС Г(а)Г(а + 1/m) ¦¦¦Г(а + (т- l)/m) = т1/2Bя)(т-1)/2 / e-mzzma~ldz, Jo which is Gauss's formula. Remark 1.5.2 We pointed out earlier that 1 / Г (x) is a half of sin n x. In this sense the duplication formula is the analog of the double angle formula ( 1\ sin 2nx = 2 sin nx sin n I x + - I. This is usually written as sin2^x = 2 sin л* cos л* and so is thought of as a special case of the addition for sin(x + y). The gamma function does not have an addition formula.
26 1 The Gamma and Beta Functions 1.6 Integral Representations for Log T(x) and ip(x) In A.2.13), we obtained (^) from the product for 1/ Г(x). We start this section by rederiving it from the beta integral. Note that, for x > 1, -(x - 1) / Jo -t)dt = T{x-l)(l-€)X+k 'Ьо (*+*)(*+*) by term-by-term integration, which is valid because of uniform convergence in [0, 1-е]. Now let € -> 0. By Abel's continuity theorem for power series, r-2 V^ (X ~ О t log(l — t)dt = y^ • We can introduce log(l — t) in the beta integral /0' tx~2(l — t)ydt by taking the derivative with respect to y. In that case, Г(*+у) ' or /•I (x-l) tx~2(\ - t)y log(l - t)dt Jo _ Г(х)Г'(у + \)Г(х + у)~ Г(х)Г(у + 1)Г'(* + у) ~ Г(х + уJ ' The case у — 0 gives the necessary result. The differentiation is justified since the integrands involved are continuous. Some care should also be taken of the fact that the integrals are improper. The details are easy and left to the reader. The next theorem gives the integral representations of f(x) due to Dirichlet and Gauss. Theorem 1.6.1 For Re x > 0, Jo z\ A+. (ii) f(x) = J (:y-y:—^)rfz (Gauss). (i) fix) = -[e~z- )dz (Dirichlet), J \ A + zYJ
1.6 Integral Representations for Log Г(х) and \jr(x) 27 Proof, (i) Evaluate the integral /0°° /* e~ndtdz in two different ways by changing the order of integration to get the formula e-z _ g-sz - - A.6.1) Similarly, the double integral POO pO Jo Jo p —-Ui-t-^./ : dsdz z when first integrated with respect to z yields (by A.6.1)) d f°° _s . logs ds = —— / e s ds = Г (х). о dx Jo If we integrate the double integral with respect to s we get ri/ i \ Г(х) / -(e-t-—-—)dz. Jo z\ (l+z)xJ Equate the last two expressions to get Dirichlet's formula. (ii) Gauss's formula is obtained from Dirichlet's by a change of variables: = lim / — dz- / -dt + \JS z J 1 * —dz+ (—-t^ 7)dt since 1 g dz z -dz = log ^ ^ ^0 as S -> 0. г log(l + S) /log(l+«) Z ' lOg(l + S) This proves (ii). ¦ The integrated form of the last theorem is given in the next result. Theorem 1.6.2 For Re x > 0, (l) --o- V-/ I ! V" -/- jQg^
28 1 The Gamma and Beta Functions and (ii) log Г(*) = / (*-!)«"'- -L 00 < _t 1-е-' Proof. The integrals in Theorem 1.6.1 are uniformly convergent for Re x > S > 0, so we can integrate from 1 to x under the sign of integration. The integrals in Theorem 1.6.2 are the corresponding integrated forms. A change of variables и = e~' in (ii) gives \ du -x+l — • d-6-2) 1 — и ) log и There are two other integrals for log Г (х) due to Binet that are of interest. These are given in the next theorem. A proof of one of them is sketched and the other is left as an exercise. See Exercise 43. ¦ Theorem 1.6.3 ForRex>0, _ / 1 \ 1 f°° /1 1 1 \ e~tx and 1\ , 1 „ , „ f°° arctan(r/x) (ii) \ogT(x)lx Proof. Gauss's formula in Theorem 1.6.1 together with Equation A.6.1) give 1A(x + l) ^logr(x + l) +logx f (+ dx 2x Jo \2 t e' - \ Integrate from 1 to x, changing the order of integration to get Use log Г(х + 1) = log Г(х) + logx to rewrite the above formula as 'tx where Stirling's formula applied above gives / = 1 — A/2) log 2л. The second Binet formula can be used to derive the asymptotic expansion for log Г(х) contained in Corollary 1.4.5.
1.7 Kummer's Fourier Expansion of Log Г(х) 29 Expand \/(e2nt — 1) by the geometric series and integrate term by term to see that (-1) it- (L6-4) The last equality comes from Theorem 1.2.4. Now, gives, after integration, t W3 , ] t5 , (-1)" t2n~l , (-1)" f z2ndz x 5x* 5 xJ In - 1 xLn~l xLn~ Substitute this in Binet's formula (ii) and use A.6.4) to arrive at ii _ x ^_ _ j.— o_ , \ ) log, , + iog&r + g 2 .B. ;)х2;_1 For |argx| < | — €, € > 0, it can be seen that |^~i I < esc ze for all г > 0. This implies that the last term involving the integral is 0(,l+l). So we have the asymptotic series but only for |argx| < | - € instead of |argx| < л — e. Whittaker and Watson [1940, §13.6] show how to extend the range of validity. It is also possible to derive an asymptotic formula for log Г(х) from Binet's first formula. See Wang and Guo [1989, §3.12]. For references to the works of Gauss, Dirichlet, Binet, and others, see Whittaker and Watson [1940, pp. 235-259]. ¦ 1.7 Kummer's Fourier Expansion of Log T(x) Kummer [1847] discovered the following theorem: Theorem 1.7.1 For 0 < x < 1 logloBsin) + ^] logBsin^x) + (y+log2^)(l2x) + ^] \1tz 2 2 л where у is Euler's constant. Proof. Start with the identity „Aitix (mix -log(l - e2™) = e2™ + ^- + — + ..., 0<
30 1 The Gamma and Beta Functions The real and imaginary parts are Ecos 2knx 7 A-7.1) *=i k and л ?^ d.7.2) Since log Г(х) is differentiable in 0 < x < 1, it has a Fourier expansion logГ(х) = C0 + 2^2ckcos2knx + 2^ Dk si <t=l k=\ where Ck= I XogV(x)cos2knxdx and Dk = \ogT(x)sin2knxdx. A.7.3) We use Kummer's method to compute Ck and Ць The Ck are easy to find. Take the logarithm of Euler's reflection formula A.2.1): logr(x)+logr(l -x) = log 2л -IogBsin7rx) = Iog2^ + cos 2лx H— cos4^x + • • • ¦ The Fourier series of log Г(х) gives logr(x)+logT(l -x) = 2C0 + 4C{ cos2лx +4C2cos4^x -\ . Equating the last two relations gives Co = - log 2л and Ck = — for к > 1. 2 4k Now use integral A.6.2) for log Г(х) in A.7.3) so that 1 fl-ux'] \sin2knxdudx A io Jo \ l-« / fl -n /' Уо ' Jo log и But 2кл' and sin 2knxdx = The first two integrals are easy to solve and the third is the imaginary part of ex(logu+2kn,)dx _ 1 M ~ l
1.7 Kummer's Fourier Expansion of Log Г(х) 31 Therefore, f1 f -2kn 1 \ du °k~ Jo \u((loguJ +4k2n2) + 2/br, or, with и = e~2knt, Take к = 1 and we have 1 -2яг Moreover, x = 1 in Dirichlet's formula (Theorem 1.6.1) gives у 1 f°° ~2n ~ 2n Jo where у is Euler's constant. Therefore, 00 e-t - e~27" 1 +2^ By A.6.1), the first integral is log In and a change of variables from t to 1 /t shows that the second integral is 0. Thus To find Dk, observe that „-2л7 _ e-2knt 1 "' In where the integral is once again evaluated by A.6.1). Thus 1 f°° e~2wt — e 1 kDk-Dl = — dt = — log*, In Jo t In = —(y+log2*jr), *= 1,2,3,.... 2А:я The Fourier expansion is then logr(*) = - log In + V C0S f X + - (y + log In) V^ 2/fc ' /'T '^ 2* 1 7 log A: si H> Apply A.7.1) and A.7.2) to get the result. ¦ Kummer's expansion for log (T(x)/V2n) and Theorem 1.3.4 have applications in number theory. Usually they give different ways of deriving the same result. This suggests that the Hurwitz zeta function itself has a Fourier expansion from
32 1 The Gamma and Beta Functions which Kummer's result can be obtained. Such a result exists and is simply the functional equation for the Hurwitz function: 2r(l-s)| 1 *^cos2m7rx 1 ^sin2m^x| Ц^<sin^> tcosTrs) ¦ }. A.7.4) BлI-* 2 ^ m1-* 2 ^-j m1" V. m=l m=l / The functional equation for the Riemann zeta function is a particular case of this when x = 1. See Exercises 24 and 25 for a proof of A.7.4) and another derivation of Kummer's formula. 1.8 Integrals of Dirichlet and Volumes of Ellipsoids Dirichlet found a multidimensional extension of the beta integral which is use- useful in computing volumes. We follow Liouville's exposition of Dirichlet's work. Liouville's [1839] presentation was inspired by the double integral evaluation of the beta function by Jacobi and Poisson. Theorem 1.8.1 If"V is a region definedby xt > 0, / = 1,2,.. .,n,andJ2xi 5 1. then for Re a, > 0, f J xa.-i Proof. The proof is by induction. The formula is clearly true for n = 1. Assume it is true for и = к. Then for &(k+ 1)-dimensional V J-.J x«>-lx? /1 /-1-х, f / •¦¦/ Jo Jo ¦ dxk+l 1 /-1-х, fl-x,~x2 xt / / ll -* 1 ""' X/c — 1 «1-1 at-l Xl ' ' ' Xk 0 ./0 ¦ A — xi — • • • — Xk)at+'dxkdxk-i ¦ ¦ ¦ dx\ Now set Xk = A — xi — ¦ ¦ • — Xk-i)t to get ч'Г- о Jo A — xi — ¦ ¦ ¦ — xk-i)ak+ak*[tak~ A — t)ak+ldtdxk-i ¦ ¦ ¦ dxi Г(ак)Г(ак+] +1) ак+1Г(ак + ak+l II Jo Jo
1.8 Integrals of Dirichlet and Volumes of Ellipsoids 33 Compare this with the integral to which the change of variables was applied and use induction to get ( f Г(«))Г(at + ak+l) ак+1Г(ак + ak+l + 1) ГA + Y^i «/) This reduces to the expression in the theorem. ¦ Corollary 1.8.2 If V is the region enclosed by x; > 0 and J2 (•*; M Y' < 1. 2 dx Proof. Apply the change of variables, y, = (x,¦ /щ)р', i = ],... ,n. Then Эх, 1 x, Эу; Pi yi and the Jacobian is 1 X1X2 ¦ ¦ • Xn P\P2---Pn У\Уг---Уп The integral becomes PlP2'-Pn ¦ dyn, where V is defined by yt > 0 and J2 У' 5 1 ¦ The corollary now follows from the theorem. ¦ Corollary 1.8.3 ThevolumeenclosedbyJ2(xi/ai)Pi < 1, x, >0is Пм^ /и particular the volume of the n-dimensional ellipsoid ^(x,/a,J < 1 is n/2 -an ГО+и/2) Proof. For the first part of the corollary take a, = 1. For the particular case take (|) = \ Pi — 2 and use the fact that Г(|) = \yfn. Corollary 1.8.4 IfV is given by x, > 0 andJ2(^~)Pi < ^ ги Dirichlet's integral, then its value is Liouville also gave the following extension of Dirichlet's result, which can be proven in the same way.
34 1 The Gamma and Beta Functions Theorem 1.8.5 If V consists of xt >0, t\ <J2(xi/ai)Pl < t2 and f is a continuous function on (?ь h), then f ¦¦ f +¦¦¦ + (xnlanY"\dxx ---dx» Па"'Г (aj/pi)/pi A related integral is given next. Theorem 1.8.6 IfV is the set xt > 0, YTi=\ xi = 1. then This is a surface integral rather than a volume integral, but it can be evaluated directly by induction or from Corollary 1.8.2. It is also a special case of Theorem 1.8.5 when f(u) is taken to be the delta function at и = 1. This function is not continuous, but it can be approximated by continuous functions. 1.9 The Bohr-MoIIerup Theorem The problem posed by Euler was to find a continuous function of x > 0 that equaled n! at x = n, an integer. Clearly, the gamma function is not the unique solution to this problem. The condition of convexity (defined below) is not enough, but the fact that the gamma function occurs so frequently gives some indication that it must be unique in some sense. The correct conditions for uniqueness were found by Bohr and Mollerup [1922]. In fact, the notion of logarithmic convexity was extracted from their work by Artin [1964] (the original German edition appeared in 1931) whose treatment we follow here. Definition 1.9.1 A real valued function f on (a, b) is convex if f(Xx + A - k)y) < kf{x) + A - k)f(y) for x, у € (a, b) andO < X < 1. Definition 1.9.2 A positivefunction f on (a, b) is logarithmically convex if log / is convex on (a, b). It is easy to verify that if / is convex in (a, b) and a<x<y<z<b, then /(У) ~ fix) < f(z) - f(x) < f(z) - f(y) у — x ~~ z — x ~ z - у With these definitions we can state the Bohr-Mollerup theorem:
1.9 The Bohr-Mollerup Theorem 35 Theorem 1.9.3 If f is a positive function on x > 0 and (i) /A) = 1, (ii) f(x + 1) = x/(x), and (Hi) f is logarithmically convex, then f(x) = Г(х) for x > 0. Proof. Suppose и is a positive integer and 0 < x < 1. By conditions (i) and (ii) it is sufficient to prove the theorem for such x. Consider the intervals [n, n + 1], [n + 1, n + 1 + x], and [n + 1, n + 2]. Apply A.9.1) to see that the difference quotient of log /(x) on these intervals is increasing. Thus lo /(" + !) , l lQ /(n + l+x) /(и+ 2) Simplify this by conditions (i) and (ii) to get \(x+n)(x+n-l)---xf(x)] xlogn <log <xlog(n + 1). L «! J Rearrange the inequalities as follows: Therefore, n\nx fix) = lim = Г(х), "->oo x(x + 1) • ¦ • (X +П) and the theorem is proved. ¦ This theorem can be made the basis for the development of the theory of the gamma and beta functions. As examples, we show how to derive the formulas /•OO Г(х) = / e~'tx~ldt, x > 0, = / e-'t Jo and tx~\\ - ty~ldt = Г(Х)Г(:У), x > 0 and у > 0. A.9.2) /о Г(х + у) We require Holder's inequality, a proof of which is sketched in Exercise 6. We state the inequality here for the reader's convenience. If / and g are measurable nonnegative functions on (a, b), so that the integrals on the right in A.9.3) are finite, and p and q positive real numbers such that \/p + \/q = 1, then fgdx<^J fPdxj i^J g<dx) . A.9.3) It is clear that we need to check only condition (iii) for log Г(х). This condition can be written as Г(ах + Ру)<Г(х)аГ(у)/}, a>0,p>0 and a + p = I. A.9.4)
36 1 The Gamma and Beta Functions Now observe that and apply Holder's inequality with a = \/p and fi = \/q to get A.9.4). To prove A.9.2) consider the function Г(х + у)В(х,у) fix) = =-— • г Су) Once again we require the functional relation A.1.14) for B(x, y). This is needed to prove that / (x +1) = xf (x). It is evident that / A) = 1 and we need only check the convexity of log/(x). The proof again uses Holder's inequality in exactly the same way as for the gamma function. We state another uniqueness theorem, the proof of which is left to the reader. Theorem 1.9.4 If f{x) is defined for x > 0 and satisfies (i) /A) = 1, (ii) f{x + 1) = xf(x), and (шДт^со f(x + n)/[nx/(«)] = 1, then f(x) = Fix). For other uniqueness theorems the reader may consult Artin [1964] or Anastassiadis [1964]. See Exercises 26-30 at the end of the chapter. Finally, we note that Ahern and Rudin [1996] have shown that log |Г(дг + iy)\ is a convex function of x in Rex > 1/2. See Exercise 55. 1.10 Gauss and Jacobi Sums The integral representation of the gamma function is Г(х) Here dt/t should be regarded as the invariant measure on the multiplicative group @, oo), since d(ct) dt ct t To find the finite field analog one should, therefore, look at the integrand e~cttx. The functions e~ct and tx can be viewed as solutions of certain functional relations. This point of view suggests the following analogs. Theorem 1.10.1 Suppose f is a homomorphism from the additive group of real numbers R to the multiplicative group of nonzero complex numbers C*, that is,
1.10 Gauss and Jacobi Sums 37 and f(x + y) = fix)fiy). A.10.1) /// is differentiable with /'@) = с ф 0, then fix) = ecx. Remark 1.10.1 We have assumed that /(x) фО for any x but, in fact, the relation g(x + y) — gix)g(y), where g : R —>• C, implies that if g is zero at one point it vanishes everywhere. Proof. First observe that /@ + 0) = /@J by A.10.1). So /@) = 1, since /@) cannot be 0. Now, by the definition of the derivative, ,,, , .. fix + 0 - fjx) fix)fjt) - fjx) / (x) = lim = lim ,, ,,. /@-/@) = fix) lim = сf{x). So fix) = e". Ш Remark 1.10.2 In the above theorem it is enough to assume that / is continuous or just integrable. To see this, choose а у e R such that /oy fit)dt / 0. Then fix) H fit)dt = H fix + t)dt = Jxx+y fit)dt. So This equation implies that if / is integrable, then it must be continuous and hence differentiable. Corollary 1.10.2 Suppose g is a homomorphism from the multiplicative group of positive reals R+ to C*, that is, gixy) = gix)giy). A.10.2) Then gix) — xc for some с Proof. Consider the map / = g о exp : R ->• C*, where exp(x) = ex. Then / satisfies A.10.1) and giex) = ecx. This implies the result. ¦ A finite field has p" elements, where p is prime and и is a positive integer. For simplicity we take n = 1, so the field is isomorphic to Ъ(р), the integers modulo p. The analog of / in A.10.1) is a homomorphism TJf.Zip) -+ C*. Since "Lip) is a cyclic group of order p generated by 1 we need only specify i^ (!)• Also, VK1K = VKO) = 1 and we can choose any of the p\h roots of unity as the
38 1 The Gamma and Beta Functions value of фA). We therefore have p different homomorphisms ij,j(x) = ei*iixlp, j = O,l,...,p-l. A.10.3) These are called the additive characters of the field. In a similar way the multi- multiplicative characters are the p — 1 characters defined by the homomorphisms from Щр)* to C*. Here Щр)* = Z(p) - {0}. Since Щр)* is a cyclic group of order p — 1, we have an isomorphism Щр)* = Щр — 1). The p — \ characters on Щр)* can be defined by means of this isomorphism and A.10.3). We denote a multiplicative character by either x or r\, unless otherwise stated. It is now clear how to define the "gamma" function for a finite field. Definition 1.10.3 For an additive character т/л,- and multiplicative character Xt we define the Gauss sums gj (xd, j = 0, 1,..., p — 1 by the formula x=o where we extend the domain of Xi by setting x,- @) = 0. It is sufficient to consider g(x) := g\ (x)> for when j ф 0, 8j(x) = Y, X(x)fj(x) = X(j)g(x)- A.10.5) This formula corresponds to /0°° e~jxxs~ldx = F(s)/js, where j is a nonzero complex number with positive real part. When j = 0 in A.10.4) the sum is Ylx X(x), which can be shown to be zero when х{х)ф 1 for at least one value of x. Theorem 1.10.4 For a character x, Remark 1.10.3 The identity character is the one that takes the value 1 at each point in Щр)*. Proof. The result is obvious for x = id. If x ф id, there is а у е Щр)* such thatx(j) ф l.Then
1.10 Gauss and Jacobi Sums 39 which implies the theorem. There is a dual to A.10.6) given by the following theorem: ¦ Theorem 1.10.5 For the sum over all characters we have Proof. It is sufficient to observe that if x ф 1, then there is a character x such that x (x) Ф 1 • The theorem may now be proved as before. ¦ We now define the analog of the beta function. Definition 1.10.6 For two multiplicative characters x and r\ the Jacobi sum is defined by J(X,V)= Yl Х(хШУ)- A-10.8) x+y=\ The following theorem gives some elementary properties of the Jacobi sum. We denote the trivial or identity character by e. The reader should notice that the last result is the analog of the formula B(x, у) = Г(х)Г(у)/[Г(х + у)]. Theorem 1.10.7 For nontrivial characters x and i), the following properties hold: J(e,X)=0. A.10.9) J(e,e) = p-2. A.10.10) ,X~1) = -X(-D- A.10.11) then J(x,r1) = 8ix)8(f. A.10.12) Remark 1.10.4 From the definition of characters it is clear that the product of two characters is itself a character and so the set of characters forms a group. The additive characters form a cyclic group of order p and the multiplicative characters a cyclic group of order p — 1. Also, x(x) = x(x~') = l/x(*) and since |x(*)l = 1 it follows that x~lix) = Proof. The first part of the theorem is a restatement of Theorem 1.10.3 and the second part is obvious. To prove A.10.11), begin with the definition ¦/(X, X) = Now note that as x runs through 2,..., p—l, then x(l - x) runs through 1,..., p — 2. The value у — p— 1 = — l(mod;>) is not assumed because x = уA+у)~1 ¦ Therefore, J(x,x~l)= уф-\
40 1 The Gamma and Beta Functions by Theorem 1.10.4. This proves the third part. The proof of the fourth part is very similar to Poisson's or Jacobi's proofs of the analogous formula for the beta function. Here one multiplies two Gauss sums and by a change of variables arrives at a product of a Jacobi sum and a Gauss sum. Thus, for x Ц ф е, 8(X)g(ri) Х(*ЖH+ Y, X(x)r](t-x)e -x)e2*it/p x+y=0 The first sum is ^ X(x)i(-x) — rj(—l) YjX Xi(x) = 0 since xi Ф id. The second sum with x — st is -s)e2nitlp = = g(X4)J(X,4). This proves the fourth part of the theorem. ¦ We were able to evaluate Г (s) in a nice form for positive integer values and half- integer values of s. Evaluations of special cases of Gauss sums are also possible and important, but in any case the magnitude of the Gauss sum can always be found. Theorem 1.10.8 For nontrivial multiplicative and additive characters x and \j/, Proof. By A.10.5) it is enough to prove that |gi(x)l2 — l#(x)l2 = P- \8(X)\2 = Set x = ty. Then \8ix)\2 = r.v^O 1 tjtO or 1
1.10 Gauss and Jacobi Sums 41 The first sum is p — 1 and the inner sum in the second term is — 1. Thus \g(x)\2 = p-i-J2x{t) = P-l + l = P> and the result is proved. ¦ Corollary 1.10.9 If x, Ц and xi] are nontrivial characters, then \J(X,ri)\=VP- A.10.13) Proof. This follows from Theorems 1.10.7 and 1.10.8. ¦ As an interesting consequence we have: Corollary 1.10.10 If p = An + 1 is a prime, then there exist integers a and b such that p = a2 + b2. Proof. The group 2>{p)* is of order p — 1 = An, which is also isomorphic to the group of multiplicative characters on Z(p)*. Since the latter group is cyclic there exists a character x of order 4 that takes the value ±1, ±i. It follows that J(X, x) = a + bi for integers a and b. Since x2 Ф id, apply Corollary 1.10.9 to obtain the desired result. ¦ Corollary 1.10.10 is a theorem of Fermat, though Euler was the first to publish a proof. See Weil [1983, pp. 66-69]. Later we shall prove a more refined result that gives the number of representations of a positive integer as a sum of two squares. This will come from a formula that involves yet another analog of the beta integral. We have seen that characters can be defined for cyclic groups. Since any abelian group is a direct product of cyclic groups, it is not difficult to find all the characters of an abelian group and their structure. The following observation may be sufficient here: If xi is a character of a abelian group G\, and xi of G2, then we can define a character / : Gx x G2 -> C* by x(x, y) = Xi(x)Xi(y)- We thus obtain n additive characters of Z(n) and ф(п) multiplicative characters of Z(«)*. The Gauss and Jacobi sums for these more general characters can be defined in the same way as before. Gauss [1808] found one derivation of the law of quadratic reciprocity by evaluating the Gauss sum arising from the quadratic character. (A character x ф id is a quadratic character when x2 = id.) Details of this connection are in Exercise 37 at the end of the chapter. One problem that arises here, and which Gauss dealt with, is evaluating the sum G = Ylx=o e27lix2/N. As in Theorem 1.10.8 one can show that G2 = ±N depending on whether N = 1 D) or 3 D). The problem is to determine the appropriate square root for obtaining G. According to Gauss, it took him four years to settle this question. Dirichlet's evaluation of Ylx=o е2т"х ^N by means of Fourier series is given in Exercise 32.
42 1 The Gamma and Beta Functions Jacobi and Eisenstein also considered the more general Jacobi sum J(Xi,X2,...,)U)= J2 Xi(ti)X2(t2)---Xdti). A.10.14) This is the analog of the general beta integral in Theorem 1.8.6. Eisenstein's result, corresponding to the formula in Theorem 1.8.6, follows. Theorem 1.10.11 If xi, X2, ¦ ¦ ¦ > Xi are nontrivial characters and X1X2 ¦ • ¦ Xi ™ nontrivial, then j, ч g(Xi)g(X2) ¦ ¦ ¦ g(Xe) /11Л1СЧ J(Xi,X2,---,Xt) = : : • A.10.15) g(XiX2---Xe) The proof of this is similar to that of Theorem 1.10.7, and the reader should fill in the details. In Section 1.8 the volume of «-dimensional objects of the form ayx\[ + aix5^ + h a^xf < b was determined by means of the gamma function. In the same way, for finite fields, the number of points satisfying а\х\* +a2x2 H 1-a**** = b can be found in terms of Gauss sums. Gauss himself first found the number of points on such (but simpler) hypersurfaces and used this to evaluate some specific Gauss sums. Weil [1949] observed that it is easier to reverse the process and obtain the number of points in terms of Gauss sums. For an account of this the reader should see Weil [1974]. It may be mentioned that Weil's famous conjectures concerning the zeta function of algebraic varieties over finite fields are contained in his 1949 paper. It also contains the references to Gauss's works. One may also consult Ireland and Rosen [1991] for more on Jacobi and Gauss sums and for references to the papers of Jacobi and Eisenstein. The form of the Gauss sums also suggests that they are connected with Fourier transforms. Let T denote the vector space of all complex valued functions on Z(N), the integers modulo N. Let F be the Fourier transform on T defined by 1 N-\ (Ff)(n) = -= V f{x)^l"'N. A.10.16) It can be shown that the trace of this Fourier transform with respect to the basis {So, Si,... ,S/v}, where '. x Ф У, , x=y, is the quadratic Gauss sum Ylx=o e2l"x ^' ¦ Schur [1921] gave another evaluation of this sum from this fact. The details are given in Exercise 47. One first proves that the fourth power of F is the identity so that the eigenvalues are ± 1, ±i and the essential problem is to find the multiplicity of these eigenvalues.
1.11 A Probabilistic Evaluation of the Beta Function 43 Discrete or finite Fourier analysis was not applied extensively before 1965 be- because of the difficulty of numerical computation. This changed when Cooley and Tukey [1965] introduced an algorithm they called the Fast Fourier Transform (FFT) to reduce the computation by several orders of magnitude. The reader may wish to consult the paper of Auslander and Tolimieri [1979] for an introduction to FFT, which emphasizes the connection with group theory. Some of the earlier instances of an FFT algorithm are mentioned here. Computational aspects are also interesting. See de Boor [1980] and Van Loan [1992, §1.3]. 1.11 A Probabilistic Evaluation of the Beta Function When a and fi are positive integers, I It seems that it should be possible to arrive at this result by a combinatorial argu- argument. But working with only a finite number of objects could not give an integral. Here is a combinatorial-cum-probabilistic argument that evaluates the integral. Choose points at random from the unit interval [0, 1]. Assume that the probability that a point lies in a subinterval (a, b) is b — a. Fix an integer n and let P(xk < t) denote the probability that, of n points chosen at random, exactly к of them have values less than t. The probability density function for P(xk < t) is ,.\ v P{Xk < t + Af) ~ P{Xk < r) p(t) = hm hm . д/->о At Now P(xk <t + At)- P(xk < t) — the probability that one point lies in (t, t + At), к — 1 points less than t and n — к points greater than t + At, + the probability that two points lie in (t, t + At), к — 2 points less than t and n — к points greater than t + At, Since there are n points, the number of ways that one point is in (t, t + At), к — 1 points are less than t, and n — k, are greater than t + At is n-l\fn-k\ (n-\ -i L-J="U-i
44 1 The Gamma and Beta Functions The probability of each such event is Attk~l A —t — At)n~k, and since the events are mutally exclusive, we get P(xk <t + At)- P(xk < t) n~1 )At tk~l(\ -t- At)n~k к — I I + ¦•• = "(?"! V~'d - t - At)n~kAt + O((AtJ). \ к - 1 / Therefore, Since s: we obtain (n-k+l) /' Jo n\ Г(« + 1) We made use of probability theory here to indicate its relationship with the beta function. Though we do not use it elsewhere, probability theory can be used to derive formulas involving some extensions of the beta function. 1.12 Thep-adic Gamma Function In number theory there are completions of the rationals other than the reals that are of great importance. These are the p-adic completions of the rationals. There is an analog of the gamma function defined on the p-adic numbers that is useful. The following is a very brief account of the p-adic gamma function. The interested reader should consult the references given later. Suppose a is an integer and p a prime. Define ord^a to be the highest power of p that divides a. Let Q be the set of rational numbers. For x = alb e Q, where a and b are integers, define ord^x = ord^a — oxdpb. The p-adic norm | 1^ on Q is defined by l/pordp\ x ф 0, 0, x = 0.
1.12 The /?-adic Gamma Function 45 Thus in the p-adic norm, p" gets small as n gets large. In contrast, for negative values ofn,p" becomes big. So it is reasonable to write numbers in powers of p. An integer would have an expansion of the form a0 + a\p + агР2 Л +anp", where a,¦ e {0, 1, 2,..., p — 1}. For rational numbers negative powers of p will also be involved. The p-adic norm is non-Archimedean, that is, and so the triangle inequality holds. This gives a metric on Q. We can then obtain a completion of Q with this metric in the same way as we get the real numbers by taking the ordinary metric on Q. This involves taking the Cauchy sequence of rationals. The p-adic completion is denoted by Qp. The p-adic numbers can be represented by the series b-m Ь-m+l b_i 2 , 1 r -\ 1 h b0 + hx p + Ъгр Л • pm pm-1 р The subset of Qp which contains all numbers with nonnegative powers of p forms a ring denoted by Zp. This is the ring of p-adic integers. The positive integers Z+ form a dense subset of Zp. This makes sense because a member of Zp can be represented as an infinite series ao + a\P + агр1 + ¦ ¦ ¦, a, e {0, 1,... p - 1}, and the partial sums are integers that converge to the p-adic number. So, if there is a function / defined on the positive integers and the values of / at two integers that are p-adically close are close to each other, then / has a unique continuous extension to Zp. Define a function / on the positive integers n by the formula рЧк It is not difficult to show that /(« + pmt) = f{n)modpm, where n,m, and I are positive integers. Now n + p"? and n are p-adically close to each other and the values of / at these points are also p-adically close. Consequently, / has an extension to Zp. This extension gives the p-adic gamma function due to Morita [1975]. The p-adic gamma function is defined by ГДх) = —f(x — 1). This function also has a functional relation and other useful properties. There is a formula of Gross and Koblitz [1979] that gives the Gauss sum as a product of values of the p-adic gamma function.
46 1 The Gamma and Beta Functions A good treatment of the p-adic numbers and functions is given in Koblitz [1977]. An account of the p-adic gamma function and the Gross-Koblitz formula is available in Lang [ 1980], including a reference to a paper by "Boyarsky" [ 1980]. In fact, p-adic extensions of the beta function, and more generally, the Mellin transform, are also available. Exercises 1. Use the change of variables s = ut to show that /•00 pOG Jo Jo is Г(х + y)B(x, y). (Poisson) 2. Let / = /0°° e-*2dx. Observe that /2 = /0°° /0°° e-^'+>'2)dxdy. Evaluate this double integral by converting to polar coordinates and show that / = ф:/2. 3. A proof of Wallis's formula is sketched below: (a) Show that /•00 Jo dx ж /o x2+a (b) Take the derivative of both sides n times with respect to the parameter a to conclude that roc dx 1 • 3 • 5 • • • In - 1 л 1 f Jo o (x2+a)"+! 2-4-6---2n 2 (c) Set x = y/s/n, a = 1, let n —>- oo, and use Exercise 2 to obtain Wallis's formula. 4. Evaluate j\{i — t2)x~ldt in two different ways to prove the duplication formula given in Theorem 151To get another proof evaluate formula given in Theorem 1.5.1. To get another proof evaluate /-Л-/2 / sin2* 2<Ы6> Jo in two ways. 5. Suppose that / is twice differentiable. Show that /" > 0 is equivalent to /(ax + fiy) < a/(x) + fif{y) for a and /9 nonnegative and a + /9 = 1. 6. Convexity can be used to prove some important inequalities, for example, Holder's inequality: b 11//p Г гь \ 0 / fga Ja where / and g are integrable functions and | + - = 1. We sketch a proof
Exercises 47 (a) Note that ex is a convex function; use this and the result of Exercise 5 to show that if и and v are nonegative real numbers then uv < 1 . p q Equality holds if and only if up = vq. (b) Deduce Holder's inequality from (a). It might be appropriate to call this the Rogers-Holder inequality since Rogers [1888] had the result before Holder [1889]. Other important results of L. J. Rogers are discussed later in the book. 7. Here is another proof of the functional relation ) Write Jo B(x,y+l)= I tx+y~M — 1 dt and perform an integration by parts to show that B(x,y + l) = -?—B{x, x + y 8. Show that B{x,y) f00 tx~ldt ¦y Jo (c + t)x+y Take the derivative with respect to с and derive the functional equation Give a similar argument using t*-\c-tydt. Jo 9. Write Gauss's formula as . = Bл-)(п-1)/2„A/2)-х • Show that the right side satisfies all the conditions of the Bohr-Mollerup theorem. This proves the formula. 10. Give a proof of Gauss's formula by using the definition of Г(х). 11. Prove Gauss's formula by the method given in the remark after Theorem 1.5.2. 12. It is clear from Г(х + 1) = хГ(х) that //+1 logT{t)dt = xlogx -x + C. Show that С = j log27T. Stirling's formula will work, but there is a more elegant argument using Gauss's multiplication formula first.
48 1 The Gamma and Beta Functions 13. There is another beta integral due to Cauchy defined by dt п21-х-УТ{х+у-\) (a) To prove this, show: (i) Integration by parts gives C(x,y + 1) — *,C(x + l,y). (ii) Write . . (-l-it /-00 = 2C(x,y+l)-C(x-l, This together with (i) gives C(x, y) = —-C(x, v + 1). x + у - 1 (iii) Iteration gives 22" (*)„(>% - lJn = / — 00 ¦00 -oo A+?2)"A+/0л"A -ity' Set f -> 11yfn in the second integral and let n —> со. (b) The substitution t = tan в leads to an important integral. Find it. 14. Use the method for obtaining Stirling's formula to show that 11 1 ,- 1 / 1 ~r + ~F + • ¦ ¦ + —F = 1^n + C ° where Sum n n * cn = у (Ck — Ck-i) with cn = > —= — and use some algebra to change cn — cn_i to an expression that goes to zero like и/2 to show that See Ramanujan [1927, papers 9 and 13] for further results of this type.
Exercises 49 15. Here is an outline of a real variable proof of A.2.1). Let N . Ж v-^ 1 g(x) = lim > . tan7TX N^oc*-^1 n + X — N oo (a) g'(x) = -7T2/sm27TX + Y,l/(n+xJ- —00 (b) g'(x) is continuous forO < x < 1 if g'@) = g'(l) = 0. (c) g'(x/2) + g'((x + l)/2)=4g'(x). (d) Let M = maxo<x<i \g'(x)\. Then M < M/2 so M = 0. 00 (e) g(x/2) - g((x + l)/2) = 27r/(sinjrx) - 2^ (-!)"/(« + x). —00 (f) g(x + l) = g(x). (g) g(x) = constant. (h) / tx-1/(i+t)dt = Y/(-iy/(n+x)+Y/(-iT/(n + i-x) J° n=0 n=0 so A.2.1) holds. This proof, due to Herglotz, was published by Caratheodory [1954, pp. 269-270]. Bochner's [1979] review of the collected works of Herglotz also includes this proof. 16. The following is Dedekind's [1853] proof of Г(х)ГA -x)=n/ sin7rx. Set (a) Show that /•OO fX — l Jo st + 1 ~ and (b) Deduce that 00 fX — 1 Л = (/>(x)^. -1 (.~*"\ Z100 fx s - 1 Jo (*? (c) Use the second formula in (a) to get ,00 j / ,00 fX-| X = J0 VTTkL 7T~sdt)ds'y
50 1 The Gamma and Beta Functions then change the order of integration to obtain [ф(х)]2 = / f-dt. Jo t - 1 (d) Deduce y />oo *v—1 / /y />oo *v—1 f — v №(x)fdx= / —dt. -y Jo I ~ 1 (e) Integrate (b) with respect to s over @, oo) and use (d) to derive /x f00 tx~l log? m)?dt = 2 / f dt = 2ф'(х). J l + (f) Show that ф(х) = ф(\-х) implies ф'ф = 0 and = 2 / l-л: Jl/2 (g) Deduce that ф(х) f Jl/2 (h) Show that ф satisfies the differential equation фф" — (ф1J = ф4. (i) Solve the differential equation with initial condition ф(^) = ж and ф'(\) = 0 to get ф(х) = ж свсях. 17. Show that Г *-xQ-*y-1* = W) ,Rex>0,Re>,>0,fl>0, Jo [at + b(l -t)]x+y ахЪУТ(х+у) 18. Show that hm sin ax 1 h ,cscGrb/2) —— dx = -жаь~1— b 2 19. Prove that for a > 0, X *b "" 2"" Г(Ь) and f°° cos ax 1 , _, 8ес(яЬ/2) / ;—dx = -жа , С Jo xb 2 Гф) 20. For к > 0, x > 0 and —я/2 < a < я/2, prove that f tx-le-ktcosaCQS(kt ^na)dt = X-x Jo
Exercises 51 and ~v~ isinax. f Jo /o 21. Prove that n-sl2Y(s/2)l;(s) = л--A-5)/2Г(A - s)/2)f(l - s) as follows: (a) Observe that , ¦ = (— l)mW4 for mn < x < (m + lOr, m = 0, 1, ^ 2n + l (b) Multiply the equation by xs~l@ < s < 1) and integrate over @, oo). Show that the left side is V(s) sin(sjr/2)(l - 2-s-x)t;(s + 1) and that the right represents an analytic function for Res < 1 and is equal to 2A - 2i+1)f A - s) for Res < 0. (c) Deduce the functional equation for the zeta function. (Hardy) 22. Let С be a contour that starts at infinity on the negative real axis, encircles the origin once in the positive direction, and returns to negative infinity. Prove that -L = -L / л-*. i(s) 2ni Jc This formula holds for all complex s. (a) Note that the integral represents an analytic function of s. (b) С may be taken to be a line from -oo to —8, then a circle of radius 8 in the positive direction, and finally a line from —8 to —oo. Show that / e'rsdt = 2i sinns / e~"u~sdu + I, Jc Js where / is the integral on the circle \t\ =8. This representation of the gamma function is due to Hankel; see Whittaker and Watson [1940, p. 244]. 23. Prove that e"ijrsr(l -s) Г ts-ye~xl Г ts-ye / Jc 1 - Tdt, t(x,s) Ц 2ni where С starts at infinity on the positive real axis, encircles the origin once in the positive direction, excluding the points ±2nni, n > 1 an integer, and returns to positive infinity. Hint: First prove that and then apply the ideas of the previous exercise. Note also that t(x,s) is now
52 1 The Gamma and Beta Functions defined as a meromorphic function by the contour integral with a simple pole at s — 1. 24. Prove the functional equation 2Y{\—s) Г I. m=\ m=\ ) Hint. Let С denote the line along the positive real axis from oo to B« + 1)ж, then a square with corners Bn + 1)jt(±1 ± /), and then the line from (In + 1) to oo. Show that f ts-xe~xl Г f-xe'" / "i Tdt = / i Tdt Jc 1 - e-< JCn 1 - e~< — the sum of the residues at ± 2mni, m = 1,..., n, where С is the curve in the previous exercise. Note that the sum of the residues at ±2mni is У'" sinBm7rx + ns/2). Now let n -*¦ oo and show that /c ->•(). 25. Show that the functional equation for f (x, s) easily implies (a) the functional equation for t, (s), (b) Kummer's Fourier expansion for log Т{х)/^/Ът. The next five problems are taken from Artin [1964]. 26. For 0 < x < oo, let ф(х) be positive and continuously twice differentiable satisfying (а) ф(х + 1) = ф(х), (b) ффф(^) = dф{x), where d is a constant. Prove that ф is a constant. Hint:Letg{x) = ^ log ф(х). Observe that g(x + l) = g(x) and i(g(f ) + g(^r))=g(x). 27. Show that ф(х) = r(x)r(l-x)sin7rx satisfies the conditions of the previous problem. Deduce Euler's reflection formula. 28. Prove that a twice continuously differentiable function / that is positive in 0 < x < oo and satisfies (a) f(x +1) = xf(x) and (b) 22x~1f(x)f(x + \) = л/л fBx) is identical to Г(х). 29. It is enough to assume that / is continuously differentiable in the previous problem. This is implied by the following: If g is continuously differentiable, g(x + 1) = g{x), and g(f) + g(*±!) = g{x), then g = 0. Hint: Observe that k=0 -I The left side tends to /J g'{x)dx = g{\) - g@) = 0 as n -> oo.
Exercises 53 30. Prove that the example g{x) = X^li ^ sinBn7rx) shows that just continuity is insufficient in the previous problem. 31. Suppose / and g are differentiable functions such that f(x +y) = f(x)f(y) — g (x) g(y) and g(x+y) = f(x)g(y)+g(x)f{y). Prove that/(x) = eaxcosbx and g(x) = eax sin их, unless f(x) — g{x) = 0. 32. Prove that Yfx^ elni%1lN = Ц^-VN, where i = v^l. (a) Set f(t) = J2^e2nKx+lJ/N,0 < t < 1. Note that /@) = /(l) and extend /(?) as a periodic function to the whole real line. (b) Note that fit) = ?-oc ane2*inl, where an = ft f{t)e-2*imdt. Conclude that /@) = Et"o *W/" = E-oo ««• (c) Show that an = e-™N*l* J%f} e (d) Show that -nN/2 rN-nN/2 -E/ (e) Use Exercise 19 to evaulate the integral. Another way is to take N = 1 in (d). (Dirichlet) 33. If p is an odd prime, then there is exactly one character /2 that maps Z(p)* onto {±1}. Recall that Z(p)* is the integers modulo p without 0. Prove that X2(a) = 1 if and only if x2 —a mod p is solvable, that is, a is a square in Z(p)*. Usually one writes x2 (a) = (-), which is called the Legendre symbol. 34. Prove that if a is a positive integer prime to p, then ap~x/2 = (-) (mod p). Here p is an odd prime. (Use the fact that Ъ{р)* is a cyclic group.) 35. For pan odd prime, use the previous problem to prove that (—) = (—1)^p~1^2 and i2-) = (-l)^2-"/8. (Use 2p/2 = {еп11А + e~ni/4)P = (epni/4 + e-pni/4) (modp). Consider the two cases p = ±1 (mod 8) and p = ±3 (mod 8) separately.) 36. Prove the law of quadratic reciprocity: For odd primes p and q, (?)(?) = (a) For S = Y?x=\ (f )e2nix/p, show that S2 = {-~)p- (The proof is similar to that of Theorem 1.10.8.) (b) Use (a) and Exercise 34 to prove that Sq~l = (-l)^1?1 (?) (mod^). (c) Show that Sq = T,xZl(pe2nillx/p = (pS (modq). (d) Deduce the reciprocity theorem from (b) and (c). (Gauss) 37. For integers a and N with N > 0, define G(a, N) = ^^o e2niax2/N. (a) For p prime, show that G(l, p) = ??=i(f У2*1'*7*"- (b) For p prime show that G(a, p) = (-)G(l, p).
54 1 The Gamma and Beta Functions (c) Prove that G(q, p)G(p, q) = G(l, pq) when p and q are odd primes. (d) Now use the result of Exercise 32 to deduce the reciprocity law. (Gauss) For a discussion of Exercises 32-37 and for references, see Scharlau and Opolka [1985, Chapters 6 and 8]. 38. Prove Theorems 1.8.5 and 1.8.6. 39. Prove Theorem 1.9.4. 40. Weierstrass's approximation theorem: Suppose / is a continuous function on a closed and bounded interval, which we can choose to be [0, 1] without any loss of generality. The following exercise shows that / can be uniformly approximated by polynomials on [0, 1]. (a) Show that it is enough to prove the result for /@) = /A) = 0. Now extend / continuously to the whole real line by taking / = 0 on x < 0 andx > 1. (b) Observe that is a polynomial such that Qn(t)dt=l. Show that Pn(x) = J_{ f{x + t)Qn{t)dt is a polynomial in x for x e [0, 1]. (c) Use Stirling's formula to show that for S > 0and<5 < \t\ < 1, Qn{t) -*¦ 0 uniformly as n —>¦ oo. (d) Note that forO < x < 1, Pn(x) - f(x) = J^lfix + t) - f(x)]Qn(t)dt. To show that Р„(х) ->¦ f(x) uniformly on [0, 1], break up the integral into three parts, J~{ + J_s + Js, and use (c). 41. Prove Plana's formula (see Whittaker and Watson [1940, p. 145] for references to Plana): For positive integers m and n k=m ф(т)+ф(п) / фШх - i / Jm JO ф(п + iy) - ф(т + ty) - ф(п - ty) + ф(т - ly) X z : flV, e2ny _l •y' where ф(х + iy) is a bounded analytic function in m < x < n. Hint: (a) Consider the integral /c ф{г)/(е~2пп — \)dz where С is a suitable in- indented rectangle with vertices ifc,ifc+l,ifc+l + L/, and к + Li. Then let L —>¦ oo. (b) Now replace i with —i in the contour С and repeat the process in (a). (c) Add the results in (a) and (b) and sum over k.
Exercises 55 42. (i) In Plana's formula let m = 0, n -> oo, and suppose that ф(п) -> О, ф(п ± iy) -> 0, to get v>(*) = W- Г\,._... Г*<оО-*(-оО ?? 2 Jo Л (ii) Deduce Hermite's formula (for reference, see Whittaker and Watson, [1940, p. 269]) x~s xl~s f°° (x2 + t2)~s/2 sin(s arctanf/x) (iii) Conclude that ?(x, 2) = —5Ч h , / , —^ , 2x2 x Jo (x2 + t2J(e2711 - I) 43. (a) For f{x) = T'ix)/ Г(х), note that fix) = ?(x, 2). (b) Deduce that ltdt fix) =lnx- (Use part (iii) of the previous exercise.) (c) Deduce Binet's second formula ( x\ ! 1пГ(х)^^ I x — — 1 in x — x -\— \ 2) 2 where x is complex and Re x > 0. (d) Use Hermite's formula in the previous problem to obtain Lerch's formula A.3.7) for ij-Jix,s))s=0. 44. Prove the following properties of Bernoulli polynomials: (a) Bqix + \) - Bq{x) = qxq~l. (b) 1 n=M (c) Bn(jt) = 2j ,_ )BkXn- k=0 4=0 45. Prove that (a) B2q-i(x -[x]) = 2(
56 1 The Gamma and Beta Functions and (b) Deduce CBq) = (—X)q~ , q > 1, B#)! 2 and 46. Prove that B2n=G2n- p (p-\)\2n F where dn is some integer and p is a prime such that p — 1 divides 2и. (Clausen-von Staudt) Hint: Define ]>X0 %xn = ?X0 If-*" (mod k)if к divides an - bn for all и > 0. Show that ^ Z- + ¦ ¦ ¦ ) (mod4). (b) For prime p, (c) For composite m > 4 (ez-l)m-1=O (modm). «0 4,^ + ^ + . ez — 1 2 3 4 Deduce the result on Bernoulli numbers. (See Polya and Szego [1972, Vol. II, p. 339]. 47. Let C(Z(n)), where Z(n) is the integers modulo n, be the set of all complex functions on Z(n), where n is an odd positive integer. Define F: C(Z(n)) —> C(Z(/i)) by 1 ""' (F/)(*) = -= V /(t)^"" for x e Z(n). (a) Show that Trace F = -^ YXX e2nik2/n. Hint: Use the functions Sx,x e Z(n), where 5^(y) = 0, x ф у, and ^ (x) = 1, as a basis for Z(n).
Exercises 57 (b) Prove that (F2f)(x) = /(-*). Conclude that F4 = id and hence that ±1, ±i are the eigenvalues of F. Let m\, rm, m^, m4 be the multiplicities of 1, i, — l.and —i respectively. Thus mi + гп2 + тз +П14 = 1. (c) Show that Trace F2 — 1 and conclude that m\ — ni2 + ni-$ — тц = 1. (d) Show that 1;^ ЕЙ e27Iik2/n\2 = 1. Use (a) to get (m, - miJ + (w3 - (e) Prove that f l " «з)гA-")/2, И = 1D), F - ;(«l-«4)+(»i-«3)-(»+D/2 _ "U-m^10'2, n = 3D), = det f-Le2nixy/n and also 0<дс,у<я-1 where Л" is a positive number. (f) Show that nti = a + 1, and»?2 = m^ = m.4 — a when n — Aa + 1 and 7щ = ni2 = mi = a and m.4 = a — 1 when и = Aa — 1. (g) Obtain the value of -^ ЕЙ e2nik^n for и odd. (Schur) Let mbea positive integer and let / be a character on the group Z(m)*. The function / can then be defined on all the integers by setting / (k) =0 when gcd(?, m) > 1. Clearly / has period m. We call / primitive if it does not have a smaller period. Also, / is even if / (— 1) = 1 and odd if Also, define m-l n=0 48. For a e Z(p), let N(xn = a) denote the number of solutions of the equation xn = a. If n I p — 1, then prove that N(xn = a) = 1 -f where the sum is over all nontrivial characters of order dividing n. Let a be a nonzero integer. Consider the elliptic curve E defined by xqxI — x\ — cixq = 0, which in affine coordinates is y2 = x3 + a. Suppose p ф 2 or 3 is a prime that does not divide a. Then у2 = хъ + a is an elliptic curve over Z(p) with a point at infinity. If Np denotes the number of Z(p) points on the curve, then Np = 1 + N(y2 — x3 + a). (a) Show that if p = 2 (mod 3), then Np = p + 1. (b) Let p = 1 (mod3) and let /3 and /2 denote the cubic and quadratic characters ofZ(p)*. Note that N(y2 = x3 + a) = EB+V=a N(y2 = u)
58 1 The Gamma and Beta Functions N(x3 = -v). Deduce that Np=p+l+ X2Xi(a)J(Xi, Хз) + Х2Хз(а) ДХг, Хз)- (с) Show that if Np = p + 1 - ap then \ap\ < 2^/p. 49. By the method used in the previous problem, show that \N(x3+y3 = \)-р + 2\<2^/р. For Exercises 48 and 49, see Ireland and Rosen [1991, Chapters 8 and 18]. 50. Prove that if x is primitive, then ( ) = fx(k)g(x) when gcd(k,m) = l, 8kKX) \0 when gcd(?, m) > 1. Define the Dirichlet L-function by oo X(n) n=\ The series converges for Re s > 0, when % is a nontrivial character, that is, X(n) ф 1 for at least one n e Z(m)*. 51. (a) Prove that when / is nontrivial . m— 1 L(X, 1) = — m (b) Show that if / is primitive ix) V > X(k) log sin — H . k?Z(m)* Ч 7 (c) Prove that when / is even, 53 X (k)k = 0. and also, when x is odd, Ex(*)l°g (d) Prove that f ^r E k€Z(mf X №) log sin ^, when x is even, 1^?2) h x is odd. 52. Prove that (a) l-| + 5-^ + --- = f (Madhava-Leibniz) (b) l + 5-5-^ + 5 + TT = i7! (Newton) (c) 1-1 + 1-1 + 1-1 + ... = ^= (Euler)
Exercises 59 (d) 1 + 1-1 + 1-1-1 + 1 + ... = ^ (Euler) V4/ * 2 3 "г 4 ' 6 7 8 ' 9 '" 11 ¦¦'-"V5l0S 2 The series for n/4, usually called Leibniz's formula, was known to Madhava in the fourteenth century. See Roy [1990]. Newton [1960, p. 156] produced his series in response to Leibniz's formula by evaluating the integral ' 1+x2 , / Jo in two different ways. Series (c) and (d) are attributed to Euler by Scharlau and Opolka [1985, pp. 30 and 83]. Define the generalized Bernoulli numbers by the formula a=l n=0 53. (a) Prove the following functional equation for L(x,s), x primitive: 2is \r where 8 = 0 or 1 according as / is even or odd. Hint: Consider the integral pat —dt, L Ic emt - 1 where С is as in problems 23 and 24. Follow the procedure given in those problems, (b) For any integer n > 1, show that n (c) For n > 1 and n = 8 (mod 2) (8 as defined in (a)), prove that 54. Let P be any point between 0 and 1. Show that A+,o+,i_.o-) ^_^ ^ ^_^ ^ Jp ГA— The notation implies that the integration is over a contour that starts at P, encircles the point 1 in the positive (counterclockwise) direction, returns to P, then encircles the origin in the positive direction, and returns to P. The 1 —, 0— indicates that now the path of integration is in the clockwise direction, first around 1 and then 0. See Whittaker and Watson [1940, pp. 256-257].
60 1 The Gamma and Beta Functions 55. Let G(z) = log Г (г). Show that (a) If x > 1/2, then Re G"(x + iy) > 0 for all real y. (b) If x < 1/2, then Re G"{x + iy) < 0 for all sufficiently large y. (c) If 1/2 < a <b, then is an increasing function of у on (—oo, oo). (d) The conclusion in (c) also holds if 0 < a < 1/2 and b > I — a. (Ahern and Rudin) 56. Show that This problem was given without the value by Amend [1996]. FOXTROT © 1996 Bill Amend. Reprinted with permission of Universal Press Syndicate. All rights reserved.
The Hypergeometric Functions Almost all of the elementary functions of mathematics are either hypergeometric or ratios of hypergeometric functions. A series Ес„ is hypergeometric if the ratio cn+i/cn is a rational function of n. Many of the nonelementary functions that arise in mathematics and physics also have representations as hypergeometric series. In this chapter, we introduce three important approaches to hypergeometric func- functions. First, Euler's fractional integral representation leads easily to the derivation of essential identities and transformations of hypergeometric functions. A second- order linear differential equation satisfied by a hypergeometric function provides a second method. This equation was also found by Euler and then studied by Gauss. Still later, Riemann observed that a characterization of second-order equations with three regular singularities gives a powerful technique, involving minimal calculation, for obtaining formulas for hypergeometric functions. Third, Barnes expressed a hypergeometric function as a contour integral, which can be seen as a Mellin inversion formula. Some integrals that arise here are really extensions of beta integrals. They also appear in the orthogonality relations for some special orthogonal polynomials. Perceiving their significance, Gauss gave a complete list of contiguous relations for 2Fi functions. These have numerous applications. We show how they imply some continued fraction expansions for hypergeometric functions and also contain three-term recurrence relations for hypergeometric orthogonal polynomials. We discuss one case of the latter in this chapter, namely, Jacobi polynomials. 2.1 The Hypergeometric Series A hypergeometric series is a series Y^, cn such that cn+\/cn is a rational function of n. On factorizing the polynomials in и, we obtain cn+i _ (n + a\)(n + a2) • ¦ • (n + ap)x 61
62 2 The Hypergeometric Functions The x occurs because the polynomial may not be monic. The factor (n + 1) may result from the factorization, or it may not. If not, add it along with the compensating factor (n + 1) in the numerator. At present, a reason for inserting this factor is to introduce n! in the hypergeometric series J2 cn • This is a convenient factor to have in a hypergeometric series, since it often occurs naturally for many cases that are significant enough to have been given names. Later in this chapter we shall give a more intrinsic reason. From B.1.1) we have > с» = Cn ) =: en п га I \ х ). B.1.2) to 0t0(b0n---(bq)nnl Ч*ь-Л ) Here the ft, are not negative integers or zero, as that would make the denominator zero. For typographical reasons, we shall sometimes denote the sum on the right side of B.1.2) by pFq{a\,..., ap; ftb ..., bq; x) or by pFq. It is natural to apply the ratio test to determine the convergence of the series B.1.2). Thus, cn+\ < \x\nP-i-\\ + \a\\/n) ¦ ¦ ¦ (\ + \ap\/n) An immediate consequence of this is the following: Theorem 2.1.1 The series pFq(a\, ...,ap;b\, ...,bq;x) converges absolutely for all x if p < q and for \x\ < 1 if p = q + 1, and it diverges for all x^O if p > q + 1 and the series does not terminate. Proof. It is clear that \cn+\/cn\ —>¦ 0 as n —>¦ oo if p < q. For p = q + 1, linWoo \cn+\/cn\ — \x\, and for p > q + 1, \cn+\/cn\ -> oo as n -> oo. This proves the theorem. ¦ The case \x \ = 1 when p = q + 1 is of great interest. The next result gives the conditions for convergence in this case. Theorem 2.1.2 The series q+\Fq(a\,..., aq+l; b\, ..., bq; x) with \x\ = 1 con- converges absolutely /fRe(^fc, — Y2ai) > 0- ^e series converges condition- conditionally if x = е'в ф 1 and 0 > Re(^fc, — ^a,) > -1 and the series diverges if Proof. The coefficient of nth term in q+l Fq is (fli)n • ¦
2.1 The Hypergeometric Series 63 and the definition of the gamma function implies that this term is ПГ(а,-) as n —>¦ oo. Usually one invokes Stirling's formula to obtain this, but that is not necessary. See Formula A.4.3). The statements about absolute convergence and divergence follow immediately. The part of the theorem concerning conditional convergence can be proved by summation by parts. ¦ This chapter will focus on a study of the special case 2F\(а, b; с; x), though more general series will be considered in a few places. The 2Ft series was studied extensively by Euler, Pfaff, Gauss, Kummer, and Riemann and most of the present chapter and the next one is a discussion of their fundamental ideas. We saw that 2^1 (a, b; c; x)diverges in general forx = landRe(c — a—b) < 0. The next theorem due to Gauss describes the behavior of the series as x —>¦ 1~. A proof is given later in the text, where it arises naturally. Theorem 2.1.3 //Re(c - a - b) < 0, then ,. 2Fi(a,b;c;x) Г(с)Г(а+Ь-с) hm = ' *-*i- (l-x)c-a-b Г(а)Г(Ь) and for с = a +b, 2Fl(a,b;a+b;x) Г(а+Ь) *->i- log(l/(l-*)) Г(а)Г(Ь) The next result about partial sums of 2Fi(a,b; c; 1) is due to Hill [1908]. It can be stated more generally for p+\Fp. The proof is left as an exercise. Theorem 2.1.4 Let sn denote the nth partial sum of 2F1(a,b;c; 1). For Re(c - a - b) < 0, T{c)na+b-c S" ~ Г(а)Г(Ь)(а+Ь-сУ and for с = a + b, ^ Г (с) log n Sn~ Г(а)Гф)' The theorem is easily believable when we note that the nth term is Г(с) па+ь-с-\ The necessary result would now follow if we replace the sum with an integral.
64 2 The Hypergeometric Functions Many of the elementary functions have representations as hypergeometric series. Here are some examples: tan x = x 2FX ( ^ 2 l; -xl ) ; B.1.4) U/2-"- B.1.5) B.1.6) This last relation is merely the binomial theorem. We also have sin x=x0Fl( ~ ¦ -x2/4 J ; B.1.7) cosx=0F^i/2;—J; B.1.8) () B.1.9) The next set of examples uses limits: (hbl) B.1.10) coshx = lim г/7]! ; —r 1 ; B.1.11) ,*(%)= lim 2*(в'*;^; B.1.12) Vc ) b-oc \ с bj 0Fi(~;jc) = lim 2Fj (а'Й; 4) . B.1.13) \ с J a,b-><x> \ с ab) The example of log(l — x) = —x 2/^i A, 1; 2; x) shows that though the series con- converges for \x\ < 1, it has a continuation as a single-valued function in the complex plane from which a line joining 1 to oo is deleted. This describes the general sit- situation; a 2Fj function has a continuation to the complex plane with branch points at 1 and oo. Definition 2.1.5 The hypergeometric function 2F\ (a, b; c; x) is defined by the series ^ (a)n(b)n „ / \ fl=0 for \x\ < I, and by continuation elsewhere.
2.2 Euler's Integral Representation 65 When the words "hypergeometric function" are used, they usually refer to the function 2F1 (a, b\ c; x). We will usually follow this tradition, but when referring to a hypergeometric series it will not necessarily mean just iF\. Hypergeometric series will be the series defined in B.1.2). 2.2 Euler's Integral Representation Contained in the following theorem is an important integral representation of the 2F\ function due to Euler [1769, Vol. 12, pp. 221-230]. This integral also has an interpretation as a fractional integral as discussed in Section 2.9. Theorem 2.2.1 If Re с >Reb>0, then Г(Ь)Г(с-Ь) Jo in the x plane cut along the real axis from 1 to 00. Here it is understood that arg? = arg(l — t) = 0 and A — xt)~a has its principal value. Proof. Suppose at first that \x\ < 1. Expand A — xt)~a by the binomial theorem given in B.1.6) so that the right side of the formula becomes x n\ This is a beta integral, which in terms of the gamma function is Ь)Г(с-Ь) T(n + Substitute this in the last expression to get „ „ (a,b xn = 2FA ;x ^ л!Г(л + с) \ с n~\) This proves the result for \x\ < 1. Since the integral is analytic in the cut plane, the theorem holds for x in this region as well. ¦ The integral in Theorem 2.2.1 may be viewed as the analytic continuation of the jF\ series, but only when Re с > Reb > 0. The function A — xt)~a in the integrand is in general multivalued and one may study the multivalued nature of 2Fi(a,b;c;x) using this integral. To discuss analytic continuation more deeply would require some ideas from the theory of Riemann surfaces, which goes beyond the scope of this book. See Klein [1894]. It is also important to note that we view 2^1 (a. b; c; x) as a function of four complex variables a,b,c, and x instead of just x. It is easy to see that ^ 2 F\ (a, b; c; x) is an entire function of a, b, с if x is fixed and \x\ < 1, for in this case the
66 2 The Hypergeometric Functions series converges uniformly in every compact domain of the a, b, с space. Analytic continuation may be applied to the parameters a,b,c. The results may at first be obtained under some restrictions, and then extended. For example: Theorem 2.2.2 (Gauss [1812]) For Re(c - a - b) > 0, we have ^ (g)n(b)n _ р /a,b \ = Г(с)Г(с-а-Ь) ^ n\{c)n \ с ' ) T{c-a)T{c-b) Proof. Let x —>¦ 1 ~ in Euler's integral for 2 F\. The result is, by Abel's continuity theorem, b; Л = Г(с) r(b)r(c-b)J0 _ Г(с)Г(с-а-Ь) ~ Г(с-а)Г(с-Ь)' when Re с > Re ? > 0 and Re(c — a - b) > 0. The condition Re с > Re b > 0 may be removed by continuation. It is, however, instructive to give a proof that does not appeal to the principle of analytic continuation. Our first goal is to prove the relationship c(c — a — b) If , (a)n(b)n (a)nib)n An = and Bn = , л!(с)„ л!(с+1)„' then <2,л) 1 c-a-b- и!(с+1)„_1 and с(иА„ — (и + l)An+i) = — я и!(с+1)„_! L c- So, since the right sides in the last two expresions are equal, c(c —a — b)An = (c - a)(c — b)Bn + cnAn — c(n and N N c(c — a — b)y An = (c — a)(c — b)y Bn — c(N о о
2.2 Euler's Integral Representation 67 Now let N -* oo and observe that (N + l)AN+i ~ \/Nc~a'b -> 0, because Re(c — a — b) > 0. This proves B.2.1). Iterate this relation n times to get Г(с-а)Г(с-Ь) -.(а,Ь \ Г(с+ n - а)Г(с+ n - b) ( a,b Г(с)Г(с-а-Ь) \ с J Г(с + п)Г(с + п-а-Ь) \c + n It is an easy verification that the right side —>• 1 as n —*¦ oo. This proves the theorem for Re(c — a — b) > 0. The theorem is called Gauss's summation formula. ¦ The case where one of the upper parameters is a negative integer, thereby making the 2F1 a finite sum, is worthy of note. This result was essentially known to the thirteenth century Chinese mathematician Chu and rediscovered later. See Askey [1975, Chapter 7]. Corollary 2.2.3 (Chu-Vandermonde) -n,a \ _ (c-a)n с ) (с)„ Euler's integral for 2F\ can be generalized to pFq. Rewrite it as Thus, integrating a i Fo with respect to the beta distribution tb~l A — t)c~b~l gives a 2F1, that is, a parameter i> is added in the numerator and с in the denominator of the original \Fo(a; t). More generally, we have /fli a,,yi \_ 1 9+1\bu...,bq,bq+l'X) f и---,aP 4\bu...,bq J when Rei>9+i > Reap+i > 0. This condition is needed for the convergence of the integral. By a change of variables the expression on the right of B.2.2) also equals B.2.3) Note also that B.2.2) can be used to change the value of a denominator or numerator parameter in pFq(a\, ...,ap;bi,...,bq;x). For example, take ap+\ — bq in
68 2 The Hypergeometric Functions B.2.2) to get , U1, „ , " «v*i.---.Vi.Vi'"y r(bq)r(bq+i-bq) ¦ [ tb<-l(l-t)b'+1-b<-1pFq(a,u"-'a,p;xt)dt. B.2.4) Jo \bu---,bq J It should be remarked that when x is a complex variable in B.2.2) to B.2.4), then the pFq is in general a multivalued function. Thus, the variable x has to be restricted to a domain where the pFq in the integrand is single valued. One must take care to state the conditions for single-valuedness. We note a special case of B.2.4). Theorem 2.2.4 For Rec > Red > 0, x ф 1, and |arg(l — x)\ < я, r{d)T(c-d)J0 \ d ' One pecularity of Euler's integral for 2Fi is that the 2F\ is obviously symmetric in the upper parameters a and b, whereas it is not evident that the integral remains the same when a and b are interchanged. Erdelyi [1937] has presented a double integral from which the two representations can be obtained: [Г( ^ /' /Vv-'o-o'-*-1 -a)F(c-b)Jo Jo Г(а)Г(Ь)Г(с • A - s)c-"-1 A - tsx)~cdtds. B.2.5) The next theorem gives an important application of Euler's integral to the deriva- derivation of two transformation formulas of hypergeometric functions. Theorem 2.2.5 b) ^C;b ^) (Pfaff), B.2.6) в;^)=A-*)~-*2*(С~в;С~*:*) (Euler). B.2.7) Proof. Replace t with 1 — s in Euler's integral (Theorem 2.2.1) to obtain r Г(Ь)Г(с-Ь) Jo
2.2 Euler's Integral Representation 69 This proves Pfaff's [1797] transformation for Re с > Reb > 0. The complete result follows by continuation of с and b. The hypergeometric function is symmetric in the parameters a and й, so we apply Pfaff's transformation to itself: This is Euler's [1794] formula and the theorem is proved. ¦ The right-hand series in Pfaff's transformation converges for \x/(x — 1)| < 1. This condition is implied by Rex < 1/2; so we have a continuation of the series 2F1 (a, b\ c; x) to this region by Pfaff's formula. The following two examples give an indication of the power of the transforma- transformation formulas. By Pfaff's transformation, 1 + x2 In Chapter 1 we showed how the gamma function could be used to develop some aspects of trigonometric functions, starting with the series definitions of the sine and cosine functions. The above relation can now be the basis for the connection between the trigonometric functions and a right triangle. For the second example, write Euler's transformation as Equate the coefficient of x" on both sides to get a)j(b)j(c-a-b)n-j (c-a)n(c-b)n j^ j\(c)j(n-j)l n\(c)n Rewrite this as: Theorem 2.2.6 (Pfaff-Saalschutz) 3F2( n'a'b Л = (са)АсЬ)п г\с,1+а + Ь-с-п' ) (c)n(c-a-b)n Gauss's 2F1 sum (Theorem 2.2.2) follows from this by letting n -> 00. The limiting procedure may be justified by Tannery's theorem, which is a discrete form of Lebesgue's dominated convergence theorem. Theorem 2.2.6 was first discoverd by Pfaff [1797a] and rediscovered by Saalschlitz [1890]. It is often called Saalschiitz's theorem but this nomenclature does not give due credit to
70 2 The Hypergeometric Functions Pfaff. Surprisingly, a special case seems to have been found by Chu. See Takacs [1973]. Remark 2.2.1 The Chu-Vandermonde identity (Corollary 2.2.3) gives the sum of a terminating 2^1- The Pfaff-Saalschiitz identity involves a special type of terminating 3F2. The sum of denominator parameters is one more than the sum of the numerator parameters. Such a series is called balanced. This identity was obtained by a factorization of a jF\ and it is worth noting that the Chu- Vandermonde identity can be derived from a factorization of a 1 Fq. Thus, one may equate the coefficients of x" in A - xTa{\ - x)'b = A - x)-{a+b) to get an equivalent identity: {а)кф)п-к (a+b)n The right side of Pfaff's transformation formula when expanded as a series equals (аЖе - b)k k (a+k)jxj X , Note that (a)k(a + k)j = (a)j+k; then write j + к = п to see that the sum is —x — > x , where the inner sum was evaluated by the Chu-Vandermonde identity. This gives another proof of Pfaff's transformation. The following definition is suggested by the Pfaff-Saalschiitz formula. Definition 2.2.7 A series /au...,ap+i p+\tp I ,X \ 0\, .. . ,0p is called balanced ifx — 1, one of the numerator parameters is a negative integer, and a\-\ h ap+\ + 1 = b\ H \-bp. Remark 2.2.2 The Pfaff-Saalschiitz identity can be written as -n, -a, -b (c)n(c + a + b)n3F2 [ ; \c, l—a—b — n—c
2.2 Euler's Integral Representation 71 This is a polynomial identity in a, b, с Dougall [1907] took the view that both sides of this equation are polynomials of degree и in a. Therefore, the identity is true if both sides are equal for n + 1 distinct values of a. Clearly the result is true when n = 0. Assume the result true for n = 0, 1,..., к — 1. Now set n — k. By symmetry in a and n, it follows that the identity is true for a = 0, 1,..., к — 1. These are к values, so if we can find one more value of a for which the identity holds, then it is proved. Note that (l-a-b-n-c)j v ' v So, if a = —b — с then both sides of the identity are equal to (—a)n(—b)n. This proves the identity. Dougall showed that a more general identity could be proved by this method. The identity is a, 1 + ^a, —b, —c, —d, —e, —n lFe у \a, 1 + a + b, 1 + a + c, 1 + a + d, 1 + d = A + a)n(l +a+b + c)»(l + a + b + d)n(l +a + c + d)n ~ {l+a+b)n{l+a+c)n{l+a+d)n{l+a+b + c + d)n' where l+2a+b + c + d + e + n — 0 and и is a positive integer. This condi- condition means that the series terminates and the sum of the denominator parameters is 2 more than the sum of the numerator parameter. Such a series is called 2- balanced. (A 1-balanced series is balanced as in Pfaff-Saalschiitz.) Note that the sum of the parameters in a column in this tF6 add up to the same quantity. Thus, \ + a = 1 + \a + \a = 1 + a + b — b = I + a + с — с and so on. This type of series is called well poised. Dougall's identity thus gives the sum of a class of well-poised 2-balanced 7F6. These series are called very well poised because the series contains the factor (f + 1),. a + 2k This identity was also discovered by Ramanujan at around the same time as Dougall's discovery. See Hardy [1940, p. 102]. A proof of the sum B.2.9) is given later. To obtain another important identity from Dougall, let n —>¦ oo to get / a,a/2+ \,-b,-c,-d 5 4 \a/2,a+b + l,a + c+ l,a+d + l' T(a+b + 1)Г(д +с+ 1)Г(а + d + 1)Г(а +b + c + d+l) + + d + l)
72 2 The Hypergeometric Functions when Re(a + b + c + d + l)>0. Then take d = -a/2 to get f ( a' ~b' ~c ¦ 3 2 \a+b+l,a+c+l' b + с + I) Г (а + 1)Г(§ + b + 1)Г(| + с + 1)Г(а + Ъ This gives the sum of a general well-poised 3F2 series. It is due to Dixon [1903]. The limiting procedure above may be justified by Tannery's theorem. A more general result than B.2.10) had been found by Rogers [1895]. This will be given later. Remark 2.2.3 We have seen that the Chu-Vandermonde identity can be obtained from the Euler's integral in Theorem 2.2.1, which is an immediate consequence of the value of the beta function. A type of converse holds. The Chu-Vandermonde identity is a discrete form of the beta integral formula, .e-i,, .,*-, ,. Па)Т{Ъ) ! t"-l(l- Jo By Remark 2.2.1, we have n! ^ ia)k{b)n-k _ (a + Ь)„ f^ We briefly sketch the argument showing that a limiting form of this identity is the beta integral formula. Rewrite the identity as (a+b)n n + l f-? k\{n-k)\ or A [(* + l)l-a(a)k/k\][(n + 1 - k)'-\b)n-k/(n - *)!] Recall that by definition k^o jfei = r^)"
2.3 The Hypergeometric Equation 73 If we break up the sum as logn n—logn n ?+? + ?' k=0 logn n—logn the first and third sums go to zero and the second tends to This expression equals 1 and we have the result. The reader should try to find the beta integral that corresponds to Gauss's formula for 2^1 at x = 1. 2.3 The Hypergeometric Equation The hypergeometric function satisfies a second-order differential equation with three regular singular points. This equation was found by Euler [1769] and was extensively studied by Gauss [1812] and Kummer [1836]. Riemann [1857] in- introduced a more abstract approach, which is very important. Our treatment will basically follow Riemann, in a more explicit form given by Papperitz [1889]. The reader who has never seen series solutions of differential equations with regular singular points might find it helpful to read Appendix F first. Let p(x) and q(x) be meromorphic functions. Suppose that the equation ii+P(x)^+q(x)y=0 B.3.1) has regular singularities at the finite points a, ft, у and that the indicial equations at these points have solutions a\, a2; bu bj; and с\,сг respectively. Assume that a\ — аг,Ъ\ —Ъг, and c\ — сг are not integers. Set x = \/t so that the differential equation is transformed to Since 00 is an ordinary point, 2x —x2p(x) and x4q (x) are analytic at 00. Moreover, since a, fi, у are regular singular points, ABC p(x) = + + + Ui(x) x — a x — p x — у and (x - a)(x - p){x - y)q(x) = + + + U2(x), x — a x — p x - у where u\{x) and ui{x) are analytic functions.
74 2 The Hypergeometric Functions The last two relations together with the analyticity of 2x —x2p(x) and xAq{x) at infinity imply that A + B + C = 2 and u\ (x) = u2(x) = 0. Suppose a solution has the form YlT=o а"(х ~ a)n+X> where the exponent X satisfies the indicial equation X(X - 1) + XA + -? = 0. (a - P)(a - y) Since a\ and a2 are roots of this equation, fll + U2 = 1 — A and D axa2 = {a - /J)(a - y) Therefore, A = 1 — a\ - аг and D = (a — f})(a — y)a\a2. Similarly, B = l-bi-b2 and E = (P - a)(P - у)Ьф2, and C=\—C\-C2 and F = (y - a)(y - P)c\c2. Since Л + В + С = 2, it follows that the exponents of the differential equation are related by the equation fCl+c2 = l. B.3.3) We summarize the results in the following theorem due to Papperitz [1889]. Theorem 2.3.1 A differential equation with three singular points a, p, у and exponents a\, a2; b\, b2; and Ci, c2 respectively has the form d2y \\-a\-a2 \-b\-b2 1 - C\ - c2 \ dy d2y (I — a\— a2 1 —b\—b2 ~dx^+ \ x-a + x-p {x - a){x - p){x - y) x — у ) dx (a - P)(a - y)axa2 | (p - a)(p - у)Ьф2 | (y - a)(y - P)ac2\ x-p x-y / and the exponents satisfy B.3.3).
2.3 The Hypergeometric Equation 75 It is customary to take the regular singularities at 0, 1, and oo. So let a = 0, /3 = 1, and у -> oo in the above differential equation to obtain x2(x - \)ld-\ + {A - fl, - аг)х{х - IJ + A - bx - b2)x\x - dxl + {axa2{l-x)+b\b2x+clc2x{x- \)}y = 0. B.3.4) The hypergeometric equation is obtained from this one by another simplification. Write this equation in the form B.3.1). If у satisfies B.3.1) and у = xxf, then / satisfies d2f ( 2X\ df ( Xp(x) X(X-\)\ -4 + p{x) + — )-L+( q{x) + J^-L + / = 0. dx2 \ x ) dx \ x x1 ) This equation also has 0, 1, oo as singular points, but the exponents are different. Equation B.3.4) has exponents ax and a2 at 0; the new equation has exponents a\ — X and a2 — X at zero. The exponents at oo, however, are a + X and a + X. By this procedure we can arrange that one exponent at 0 and one exponent at 1 be equal to 0. (At x = 1, we set у = A — x)x f{x).) Thus the new equation has exponents 0, a2 — a\; 0, b2 — b\\ c\ + a\ + b\, c2 + a\ + b\. This brings considerable simplification in B.3.4) since the terms a\a2 and b\b2 vanish. It is traditional to write a — c\ + a\ + b\, b = c2 + a\ + b\, and с = 1 + a\ — a2. After simplification, the equation becomes x{\ - xy-?- + [c - (a + b + l)x]~ - aby = 0. B.3.5) dx2 dx This is Euler's hypergeometric differential equation. It has regular singularities at 0, 1, and oo with exponents 0, 1 — c; 0, с — a — b; and a, b respectively. Unless specifically stated, we assume that c, a — b, and с — a — b are not integers. Riemann [1857] denoted the set of all solutions of the equation in Theorem 2.3.1 by B.3.6) In particular, the set of solutions of B.3.5) is denoted by
76 2 The Hypergeometric Functions Our earlier discussion implies that xx(\-xYP lax ex bx x B.3.7) Every conformal mapping of the Riemann sphere С U {oo} is of the form Xx + [i t = Sx + v ' where X v — /л8 = 1. Such a mapping takes any set of three distinct points {a, f), у} to another set of three distinct points {a\, fix> Y\ }¦ In this case [a P Y ) («i А И ) P lax bx cx x \ = P I ax bx cx t\. B.3.8) [a2 b2 c2 J [ a2 b2 c2 J This is easily checked. Moreover, there are six linear fractional transformations that will map a set of three points to a permutation of the three points. For example, the set {0, 1, oo} will be mapped to itself by the mappings 11 1 x-1 1 x x-+x,l-x,-, , 1 - - = , —- = -. B.3.9) X I — X X X 1 — l/X X — 1 We note a few particular cases of B.3.7)-B.3.9): 0 oo 1 = P< 0 a 0 1-х с — a — b b I — с B.3.10) (A) (B) л — i I c-b b-a ) (Note: A ^y)" = A — x)~a.)
2.3 The Hypergeometric Equation 77 = Р{а О О -) (С) (D) (Е) О оо 1 = {\-х)с-а-ьР \ 0 с-а О х) (F) 1-е с — Ъ а+Ь — с О оо 1 = х1'ср{ 01+а-с 0 х). (G) кс—1 l+b — c c — a — b Now from the full set of solutions P { } of B.3.5) we choose two solutions about x = 0 which form a basis. For a solution of the form xx Y^ anxn, X is either 0 or 1-е. When X = 0, the coefficients an satisfy So one solution is 2Fi(a,b;c;x). If с is not an integer, then the hypergeometric equation has only one independent solution analytic at x = 0. In particular, 2F\{a,b;c;x) is the only solution analytic at x = 0 and with value 1 at x = 0. The other solution is of the form W = xl~cg, where g is analytic at x = 0. It follows from B.3.10G) that g = k2Fx(a + 1 - c, b + 1 - c; 2 - c; x), where к is a constant. Therefore, the independent solutions are 2F\(a,b;c;x) and x1~c2F\(a + 1 — c, b + 1 — c; 2 — c; x). In a similar way, B.3.10) immediately implies that two independent solutions at x — 1 are 2FX (a, b\a +b+ 1 - c; 1 - x) and A - x)c~a~b2Fi{c - а, с - b; с + \-a-b\l-x) and at oo are (-xya2Fi(a, a + l-c;a+l-b; l/x) and b + l-c\b + l-a; l/x).
78 2 The Hypergeometric Functions The powers of — 1 have been introduced for convenience in expressing some later formulas. It is important to note that some of the hypergeometric transformation formu- formulas can also be obtained from B.3.10). Pfaff's transformation 2F\{a, b; c; x) = A — x)~a2F\(a,c — b; c; x/(x — 1)) is an immediate consequence of B.3.10B), combined with the fact that there is only one solution analytic and equal to 1 at x = 0. Euler's transformation 2F\_(a, b; c; x) = A — x)c~a~b2F\{c — а, с — b;c; x) follows from B.3.10F). A number of relations involving hypergeometric functions arise from the fact that the hypergeometric equation has two independent solutions, so that any three solutions must be linearly related. Theorem 2.3.2 v( a'b iF\ ; 1 — x \a+b+l -c (a,b \ , /1 +a - c, 1 +b - с \ = A2Fll Ус -xj+Bx'-^FA 2c ;x\, B.3.11) where T = (а + 1-с)Гф+1-с) . T{a+b+l-c)T{l-c) , n Г(с-1)Г(а A = and В — Г( 1)Гф1) /a,b \ (a,a-c+\ 1 2F1[ ;x)= C{-x)-a2Fx ^ ; - ; / \ a — b + I x b-a+l where С = Г(с)Г(Ь - a) and D = ~ T{)T{b) пП ~ and D Proof. When x = 0 and Re с < 1, Gauss's summation formula gives Г(а+1-с)Г(Ь+1-с)' Observe that Re с < 1 was used twice. First it was used to make the second term on the right vanish; it is also the condition for the series on the left to converge at x = 0. Then x = 1 gives (for Re(c - a - b) > 0) l = лГ(с)Г(с-а-Ь) вГB-с)Г(с-а-Ь) ~ Г(с-а)Г(с-Ь) ГA-
2.3 The Hypergeometric Equation 79 After some tedious trigonometric calculation, which comes in after applying Euler's reflection formula to the second term on the right, we arrive at the value of В required by the theorem. This proves B.3.11). Suppose Reb > Rea. The right side of B.3.12) as x —> oo is ~C(—x)~". To see the behavior of the left side, apply Pfaff's transformation. Then 5 1 x - I Г(с-а)Г(Ь)' The assumption that Reb > Re a was used in the last step to evaluate the 2FX by Gauss's formula. It follows that Г(с)Г(Ь-а) Г(с-а)Гф)' The value of D follows from the symmetry in a and b. ¦ Corollary 2.3.3 'a,b \ T(c)T(c-a-b) Г(а)Г(й) ( Х) 2^{l+c-a-b'1 *) ' B.3.13) 7 ;) (Pfaff) B.3.14) +l-n-c ) Proof. In B.3.11), replace x by 1 - x and с by a + b + 1 - с Then B.3.14) l Г(-л) follows from B.3.13). Just take a = —и and recall that r ' , = 0 when и is a nonnegative integer. ¦ The first part of Theorem 2.1.3 also follows from B.3.13) above. It should also be noted that, since Pfaff's formula 2F\{a,b;c\x) = A - x)~a2F\{a, с - b; c; x/{x - 1)) gives a continuation of 2F\ from |jc| < 1 to Rex < \, then B.3.13) gives the continuation to Rex > 1/2 cut along the real axis from x = 1 to x — oo. The cut comes from the branch points of A — x)c~a~b, and once this function is defined on a Riemann surface, 2F\ {a, b; c; x) is also defined there.
80 2 The Hypergeometric Functions Now consider the function f0° dt S(x) 4° Jo We show how Theorem 2.3.2 can be employed to find the asymptotic expansion of the function given above. Wong [1989, p. 18] used this function to demonstrate that a certain amount of care should be taken when finding the asymptotic expansion of a function. In this instance, when the method of integration by parts is applied, one gets the expansion z.r.^^-i)^" as x' <) which is obviously incorrect, since the integral is positive and every term of the expansion is negative. However, for t > 1 we have «=o If this series is substituted in the integral, then term-by-term integration produces the divergent integrals /•oo ,-«-1/3 / JO x+t -dt. If these are interpreted in a "distributional" sense the value of the above integral can be set equal to 2л (-1)" With this interpretation, S(x) has the expansion (after termwise integration) S(x) ~%Y Ш«х-«-1/з as x -+ oo. B.3.16) The correct result is, however, the sum of the two expansions in B.3.15) and B.3.16). We obtain this from Theorem 2.3.2. First note that forRe(a + 1-е) > 0 and Re b > 0, /•00 \C-b-\. / tb~l{\+t)c~b~l{l+xt)~adt Jo Г(а + 1-с)Г(Ь) ( a,b \ = — — -2FA ;l-x). B.3.17) F(a + b+l— c) \a+b+l—c ) This follows from Euler's integral representation of a iF\ given in Theorem 2.2.1. To reduce this integral to Euler's form, set r = и/A — и). From B.3.17) it follows
2.3 The Hypergeometric Equation 81 that 1 f° = - / X Jo ц /i,i i 3 /1,1 1 Apply B.3.11) to get 3 [ГD/3)Г(-2/3) /1,1 1 ГB/3)ГD/3) 23 /1/3,1/3 1 * 2f4 1/3 ;x FJ + X 4 1/3 ' 3 f/^1'1 l\ л.27Г : F /1/3,1/3 1 2x \5/3 x/ V3* / V 1/3 ¦« This is equivalent to the sum of the series in B.3.15) and B.3.16). Remark 2.3.1 We have seen that 2F1 {a, b; c; x) is one solution of the hyperge- hypergeometric equation. This fact was used to show that another independent solution is xl~c2Fx(a + 1 - c, b+ 1 - c; 2 - c; x). Here we show how the other solutions can be obtained formally from the series for 2F1. We should write the hypergeometric series as a bilateral series Since ГA + x) has poles at x = —1, —2,... , the series has no negative powers of x. It is clear that a change of variables n -> n + m, where m is an integer, does not change the bilateral series. Consider the transformation n -> n + a, where a is a noninteger. Now the series takes the form (a)n+a(b)n+axn+a _ (a)a(b)a a ^ (a+a)n(b ^ (a)n+a(b)n+axn+a _ (a)a(b)a a ^ (a+a)n(b + u) n fl (c)n+a(\)n+a (c)e(l)a ,f-L(c+ «)„(!+а)„ «= — 00
82 2 The Hypergeometric Functions In the latter expression, the terms with negative values of n vanish if we set c+a = lorl+a = l. The last condition gives back the original iF\ series. The first case, where a = 1 — c, gives (a + 1 - c)n{b + 1 - c)n j_ (a + l-c,b+l-c =X 2^1 ^ >x ^ A)вB-с)„ 2 Ч 2-е which is the second independent solution. The solutions at 00 are obtained in a similar manner by changing и to —и. In this case (A) becomes kxa X where к is a constant. Since (a + «)_„ = (—1)"/A — a — a)n, write the last series as kxa V (l-c-a)nX~n(-<x)n -n Once again, to eliminate that portion of the sum involving negative values of и, take either a = —aora = — b. In the first case we get ex ~a 2^1 (я + 1-е, a; a + l—b; l/x) and in the second case cx~b2F\(b + 1 — c, b; b + \-a\ \/x). Remark2.3.2 In Section 2.1 we proved that the series 2F\(a, b; c; x) has radius of convergence 1. We now reverse the point of view taken in Remark 2.3.1 and see how to obtain convergence from the theory of differential equations. Since the singularities of the equation are at 0, 1, and 00, the radius of convergence is at least 1. If it is more than 1, then the series is an entire function. Moreover, by B.3.12) it is a linear combination of x~a f\(x) and x~bf2(x), which are solutions at 00. This is possible only if either a or b is an integer. Otherwise, both solutions at 00 are multivalued. (Both a and b cannot be integers since a — b is not an integer.) Liouville's theorem shows us that the integer must be negative and that the 2F1 is a polynomial. Hence, if the 2^1 is an infinite series then the radius of convergence must be 1. We now consider the case where с is an integer. Suppose с is a positive integer; then Г(а)Г(Ь) (a,b к!Г(с + к) and Г(а+\-с)Гф + \-с) x_ (a + c- l.fc+1 -c F2:= ^—; x 2F>( 2_c ;* Г (a + 1 - с + к)Г(Ь + l - с + fc) к\ГB-с + к)
2.3 The Hypergeometric Equation 83 are equal. To find the second solution in this case, suppose a and b are not negative integers. Consider the limit hm = — (F, - F2) \c=n с-m С — П ОС Г(а+к)Г(Ь + к)Г'(п+к) к к\Г(п+к)Г(п + к) logx k\T{2-n+k) f^Q к\ГB-п+к) '(a + l-n+Jfc) Г'(Ь + 1 - n + к) — hmx Г'B-с + к) к The second series is Г»Г(?), ^ (a,b Г (и) ;x and the first n — 1 terms in the third series are zero because TAln+K) = 0 for к = 0,1,... ,n — 2. The same is not true in the fourth series, because Г'B— с + к)I ГB — с + к) has poles at these points. By Euler's reflection for- formula, + cos лхТ{х). [ГA-дс)р Г(х)ГA-х) Set x — n — к — 1 to get The fourth series can now be written as n — \ ,, Г (a + 1 - n + k)V{b + 1 - n + к) к
84 2 The Hypergeometric Functions So, when a and b are not negative integers, the second solution is ; x log* n ) E°° (a)k(b)k k {yf(a + k) + yf(b + k) — i/f(l + k) — Ф(п + k)\x (*-1)!Г(«-*)Г(Ь-*) (лЛ1)! V 7 where f{x) = Г'(х)/Г(х). If a is a negative integer, say —m, then -ф-(а + к) is undefined for some values of к. Consequently, the solution given above does not work. To resolve this difficulty observe that lim {ir(a + k)-if(a)} = ir(l+m-к)-if A+m) fotk<m. B.3.19) a> m a—>— m Now, if \p-(a) iF\(a, b; c; x) is subtracted from the second term in B.3.18), then the resulting series is again a solution of the hypergeometric equation except that now we may let a tend to the negative integer —m. The reader should verify B.3.19) and also that in this case the second solution is 2F1I ;x 1 logx ( \ (h\ 7\ №Q +mk) + ir(b + k) ф(п + к) фA + к)}хк к=о Г(Ь) (^ (п - к - 1I(т + 1)к The case where both a and b are negative integers may be treated in the same way. When с — 0, — 1, —2,..., then the indicial equation shows that the first solution is a -c + l,b -c + 1 The second solution in this case can be obtained from B.3.18) by replacing a and b with a — с + 1 and b — с + 1 respectively. Theorem 2.3.2 and its corollary must be modified when c, a — b, or с - a — b is an integer. The reader should work out the necessary changes. An interesting history of the hypergeometric equation is contained in Gray [1986].
2.4 The Barnes Integral for the Hypergeometric Function 85 2.4 The Barnes Integral for the Hypergeometric Function In a sequence of papers published in the period 1904-1910, Barnes developed an alternative method of treating the hypergeometric function 2 F\. A cornerstone of this structure is a contour integral representation of jF\(a, b; c; x). An under- understanding of this representation can be obtained through the concept of a Mellin transform. We begin with a simple and familiar example: T(s) = /0°° xs~xe~xdx. It turns out that it is possible to recover the integrated function e~x in terms of a complex integral involving Г (s). This inversion formula is given by -* = -L Г 2ni Jc_i ¦C+iOO —s x~sr(s)ds, о 0. B.4.1) This can be proved by Cauchy's residue theorem. Take a rectangular contour L with vertices с ± iR, с — (N + |) ± iR, where N is a positive integer. The poles of F(s) inside this contour are at 0, —1,..., —N and the residues are {—\УЦ\ at j = 0, 1,..., N. Cauchy's theorem gives ^ JL x~sT(s)ds = ^=0 (-l)jxj/jl. Now let R and N tend to infinity and use Theorem 1.4.1 and Corollary 1.4.4 to show that the integral on L minus the line joining с — iR to с + iR tends to zero. This proves B.4.1). The Mellin transform of a function /(jc) is defined by the integral F(s) = /0°° xs~l f(x)dx. We have studied other examples of Mellin transforms in Chap- Chapter 1. The integral /J jcs-1A - x)'~ldx is the transform of I — jc)'-1, 0<jc<1, and poo rs~X -dx A+JC)'" is the transform of f(x) = 1/A + x)'. Once again, one can prove that A - jc)' = —- / x~s -!— ds, Re t > 0 and c> 0, B.4.2) )+ 2ni Jc_ioo r(s + r) and 1 I pc+ioo (l+хУ 2niT{t) /C+l x~sr(s)T(t - s)ds, 0 < с < Re t. B.4.3) —ioo The phenomenon exhibited by B.4.1) through B.4.3) continues to hold for a fairly large class of functions /(jc). Thus, if F(x) = f™xs~lf(x)dx then /(jc) = 2S7 Sc-ioo X~S F(s)ds is true for a class of functions. We do not develop this theory as a whole but prove a few interesting cases. The Mellin transform is further discussed in Chapter 10 with a different motivation.
86 2 The Hypergeometric Functions The above discussion shows that if we want a complex integral representation for the hypergeometric function we should find its Mellin transform. Now, '00 xs~l2F\ ( ; — x \dx с T{b)(c-b)Jo Jo (\+xt)a T(s)T(a-s)Tic) f\b-s-4l_tr-b-,dt Г(а)Г(Ь)Г(с-Ь)Л ( ' r(s)r(a - s)F(c) Г(Ь - s)V(c - b) T(a)T(b)T(c - b) T(c-s) T(c) T(s)T(a-s)T(b-s) B.4.4) Г(а)Г(Ь) T(c-s) These formal steps can be justified by assuming min(Re a, Re b) > Res > 0. The auxiliary condition Re с > Re b can be removed by analytic continuation or via contiguous relations, which are treated in Section 2.5. Note that we integrated 2F\ at — x because, in general, 2F\(a, b; c; x) has branch points at x = 1 and x = oo and in B.4.4) the integral is over the positive real axis. There is another proof of B.4.4) in Exercise 35. We expect, by inversion, that T(a)T(b) (a,b \ 1 [k+i°° Г(х)Г(а - s)F(b - s) K\ = JL fk+ic ) 2xi Jk-ioo T{c) "'\ с  2л-/ Л-,-оо Пс-s) v ' ' B.4.5) where min(Re a, Reb) > к > 0 and с ф 0, — 1, -2, This is Barnes's formula and it is the basis for an alternative development of the theory of hypergeometric functions. It should be clear that we can represent a pFq by a similar integral. The precise form of Barnes's [1908] theorem is given next. Theorem 2.4.1 T(a)T(b) „ (a,b \ 1 fi0° Г (a + s)T(b + s)V(-s) ¦x)=t- „, , ч (-x)sds, Г(с) 'V с J 2niJ_i0 |arg(—jc)| < n. The path of integration is curved, if necessary, to separate the poles s = —a — n,s = —b — n, from the poles s = n, where n is an integer > 0. (Such a contour can always be drawn if a and b are not negative integers.) Proof. Let L be the closed contour formed by a part of the curve used in the theorem from —(N+ i)i to (N + |)i together with the semicircle of radius N +^
2.4 The Barnes Integral for the Hypergeometric Function 87 drawn to the right with 0 as center. We first show that the above integral defines an analytic function in |arg(— jc)| < n — <5, 8 > 0. By Euler's reflection formula, write the integrand as By Corollary 1.4.3, this expression is asymptotic to a л SHlSTT Set* = it to get „i»(log|jt|+iarg(-jt)) -{it)a+b-c~l2ni = p—7X1 pTlt for |arg(— x) | < n — 8. This estimate shows that the integral represents an analytic function in |arg(—jc)| < л — 8 for every 8 > 0, and hence it is analytic in |arg(—jc)| < n. We now show that the integral represents the series xn for \x < 1. This will prove the theorem by continuation. (Note that, if we start with the series for iF\(a, b; c; x), then Barnes's integral gives the continuation to the cut region |arg(-x)| < л.) On the semicircular part of the contour L the integrand is sin sn for large N. For s = (N + \)ew and \x\ < 1, sinra Since —л + 8 < arg(—x) < n — S, the last expression is In 0 < \в\ < |,cos6» > ^ and in I < \в\ < \, |sin6»| > ^. So, since log|jcI < 0, the integrand is O(Na+b-c-leT2iN+^bglxl) for 0 < |6»| < f and O(Na+b-c-\e-j-24N+{)^ for n_ < щ < | This implies that the integral on the semicircle -^ 0 as N ->¦ oo. Since the pole s = n of the integrand has residue n\T(c the theorem is proved. ¦ Г(а+п)Г(Ь + п) п
88 2 The Hypergeometric Functions We can recover the asymptotic expansion contained in B.3.12) from Barnes's integral quite easily. Suppose a — bis not an integer. Move the line of integration to the left by m units and collect the residues at s = —a — n and s = —b — n. The residue at s — —a — n is „•-а-п)Г(а + п)(-хГа-" п\Т(с-а-п) Г(с-а) ' n\(l+a-b)n ' after a little simplification. Thus, Г(а)Г(Ь) (a,b \ 1 Гт+ ; x = / Чс У 2ni J_m_io Г (с) 1\ с J 2л i J_m_ioo Г (с + (~хГа Г (с - а) а - b) ^ (b)n(l + b - с)п + ( X) T(c-b) ^ n\(l+b-a)n X ' B.4.6) where w(a) is the largest integer n suchthata+л < m. We define m(b) similarly. The integral is equal to Х—х-т [' —^—Ш ' "/- v~ ' ''" {-x)'ds. 2ni J-ioo Г (с — /и + *)ГA — w + s) sin л* For |arg(— jc)I < n — S, S > 0, the last integral is a bounded function of m and x. This implies that the expression is O(l/:cm), so that we have an asymptotic expansion for 2F1 (a, b; c; x) in B.4.6). If a — b is an integer, then some of the poles of Г(а + s)F(b + s) are double poles and logarithmic terms are involved. The reader should work out this case as an exercise. To gain insight into the next result of Barnes [1910], suppose F(s) andG(s)are the Mellin transforms of /(jc) and g(jc) respectively. The problem is to determine how the Mellin transform of f(x)g{x) is related to F(s) and G(s). Formally, it is easily seen that /00 1 /-oo /-c+zoo *s-7(jc)g(;c)rfjc - — / x'-lg(x) / F(t)x-'dtdx Ini Jo Jc-ioo 1 pc~\~ioo poo = ;r-7 / FW / xs-'~lg(x)dxdt 2ni Jc-ioo Jo 1 pc+ioo / F(t)G(s - t)dt. B.4.7) 2ТГI J^^
2.4 The Barnes Integral for the Hypergeometric Function 89 The case of interest to us is where s = 1. Then /00 1 /-C+/00 f(x)g(x)dx = — / F{t)G{\ - t)dt. B.4.8) ШI Jc-ioo Apply this to the Mellin pairs tt л х" cv л r(b + s)r(a-b-s) f(x) = ?ГТ7?' F(s) = and xd nr^ r(d + s)T(c-d-s) to obtain 1 fk+i0° Г(Ь + s)T(a -b- s)T(d + 1 - s)V(c - d - 1 + s) iTii Л-,-» Г(а)Г(с) -L 00 xb+d T{b+d+ \)T{a +c-b-d-l) -dx = Г(а+с) for a suitable k. By renaming the parameters, this can be written as 1 rlOQ I 1 id 2Л7 J-ioo B.4.9) This formula is due to Barnes and the above proof is due to Titchmarsh [1937]. It is correct when Re(a, b, c, d) > 0. We give another proof, because we have not developed the general theory of Mellin transforms in a rigorous way here. But first note that, if we take f(x) = jc?A - jc)^ and g(x) = xc-\\ ~ x)d+c~x in B.4.8), we get 2лi Л-ioo r(b + s)T(d-s) T(b-a)T(d-c)T(b + d-Y) max(-a, -b) < к < min(c, d). B.4.10) Theorem 2.4.2 If the path of integration is curved to separate the poles of Г {a + s)F(b + s)from the poles ofV(c - s)F(d - s), then 1 f'°° I := : Г(а+ s)T(b + s)T(c - s)T(d - s)ds Г (a + с)Г(а + d)T(b + с)Г(Ь + d) Note that a+c, a + d,b + c, b + d cannot be 0 or a negative integer.
90 2 The Hypergeometric Functions Proof. As in the proof of the previous theorem, use Euler's reflection formula to write the integrand as ГA — с + s){\ — d + s) sinn(c -: Also, let L be a closed contour formed by a part of the curve in the theorem together with a semicircle of radius R to the right of the imaginary axis. By Stirling's theorem (Corollary 1.4.3), the integrand is o(sa+b+c+d~2e'2j']lms]) as \s\ —>¦ oo on L. So the integral in Theorem 2.4.2 converges, but for JL we see that Im s can be arbitrarily small when \s\ is large. Thus, we have to assume that Re(a + b + с + d — 1) < Oto ensure that JL on the semicircle tends to 0 as R ->¦ oo. By Cauchy's residue theorem, _ ^ Г (а + с + п)Г(Ь + с + n)V(d - с - и)(-1)и ^ Г(а + d + п)Г(Ь + d + п)Г(с - d - п)(-\)" = Г(а + с)Г(Ь + c)T(d - cJFi i ~ •--¦-. х 1 +с -d Г(а + d)T(b + */)Г(с - dJFx (, ^ V 1 +J- The 2^iS can be summed by Gauss's formula. After some simplification using Euler's reflection formula and trigonometry, the right side of the theorem is ob- obtained. This is under the condition Re(a + b + с + d — 1) < 0. The complete result follows by analytic continuation of the parameters a,b,c,d. ¦ Theorem 2.4.2 is the integral analog of Gauss's summation of the jF\ at x = 1. Moreover, if we let b — e — it, d = f — it, and s = itx in the theorem and let t —> oo, we get, after some reduction employing Stirling's formula, i: о i ya-t с + е + j ) Thus, Barnes's integral formula is an extension of the beta integral on @, 1) and so will be called Barnes's beta integral. It is also called Barnes's first lemma. Theorem 2.4.2 can also be used to prove that (a,b \ T(c)T(c-a-b) ( a,b Г(с)Г(а + Ь-с) c^a_b ( c-a,c-b + Г(а)Г(Ь) ( ~X) lF
2.4 The Barnes Integral for the Hypergeometric Function 91 с — a — b ф integer. The proof is an exercise. This result was derived from the hypergeometric differential equation in the previous section. The next theorem gives an integral analog of the Pfaff-Saalschiitz identity. Theorem 2.4.3 For a suitably curved line of integration, so that the decreasing sequences of poles lie to the left and the increasing sequence of poles lies to the right of the contour: 2л- i J_ 100 Г(а + s)T(b + s)r(c + s)T(l -d- s)r(-s) ds —ioo ^ \^ ' S) Г(д)Г(Ь)Г(с)ГA -d + а)ГA -d + Ь)ГA - d + с) Г(е-а)Г(е-Ь)Г(е-с) where d-\-e = a-\-b-\-c-\-\. Proof. Start with the following special case of Theorem 2.4.2: Г(с - а)Г(с - b)T(a + п)Г(Ь + n) Г(с + п) 1 riOQ = -— / Г(а + s)T(b + э)Г(п - s)T(c -a-b- s)ds. 2ni J-ioo Multiply both sides by (d)n/[n\(e)n] and sum with respect to n. The result is Г(с-а)Г(с-Ь)Г(а)Г(Ь) (a,b,d rw 3 1 Г°° 2tu J_@O 1 Г°° Г 2ni J_ioo T{e-d) -s d ; e Г(е) r(a + s)r(b + s)r(ed + s)r(cabs)r(s) B.4.11) Take с = d, so that the 3F2 on the left becomes tF\. Thus we have Г(а)Гф)Г(е)Г(е -а- Ь)Г(с - а)Г(с - b) Г(с)Г(е-а)Г(е-Ь) Г(е) i J-ioo -ds. Г(е - с) 2ni This is the required result after renaming the parameters. Note that some of the operations carried out in the proof can be done only after appropriate restrictions on
92 2 The Hypergeometric Functions the parameters. These restrictions can be removed later by analytic continuation. The reader can check the details. ¦ A corollary of the proof of the previous theorem is the following interesting formula: Theorem 2.4.4 a,b,e-c F (a'b'C-l) _ r(d)T(dab) / a,b,ec 3 2\ d,e ' ) r(d-a)T(d-bK 2\e,l+a + b-d' r(d)T(e)r(d + e-a-b- с)Г(а +b-d) + T(a)T(b)T(d + e-a- b)T{e - c) /d — a,d — b,d + e — a—b — с \ '3 2V d + e-a-b,d+\-a-b ' )' Proof. As a consequence of Cauchy's theorem, B.4.11) is equal to Г (a + n)T(b + n)T(e -d + n)T(c -a-b- n)(-l)n п\Г(е Г(е) Т(е -d) Ln=Q .... ч- , .., n=Q T(c-b + n)T(c-a+n)T(c + e-a-b-d + n)T(a + b-c- n)(-l)" n\T(c + e-a -b + n) Set this equal to the 3F2 on the left of B.4.11) and the theorem is obtained after reduction. ¦ Corollary 2.4.5 Ifd + e = a+b+c+\, then 'a,b,c \ T(d)T(e)T(d-a-b)T(e-a-b) d,e ) T{d-a)T{d-b)T{e-a)T(e-b) 1 T(d)T(e) a + b-d T(a)T(b)T(d + e - a - b) d-a,d-b,\ \ + e —a —b,d+l—a—b' ) Proof. When d + e=a+b + c + l, the first 3 F-i on the right in Theorem 2.4.4 becomes iF\. Evaluate the 2^1 by Gauss's formula and get the result. ¦ Note that when a or b is a negative integer the second expression on the right vanishes because of the factor l/[F(a)F(b)] and we recover the Pfaff-Saalschiitz formula. Thus this result is the nonterminating form of the Pfaff-Saalschiitz identity.
2.4 The Barnes Integral for the Hypergeometric Function 93 The second term on the right in Theorem 2.4.4 vanishes if we take с = e + n — l, where n > 1 is an integer. The formula obtained is / a,b,l-n / a,b,ln \ T(d-a)T(d-bY \a + b-d+l,e' )' B.4.12) This leads to an interesting result about the partial sums of iF\(a, b; e; 1). Set d = a + b + n+€ and let e -> 0 to get a,b ; 1 [to n terms] = ) п)Г(Ь + п) ; 1 . ) xFo \ e ) Г(а + Ь + п)Г(п) \ e,a + b + n B.4.13) A particular case, where a = b = | and e = 1, was given by Ramanujan in the following striking form: 1 1.3 2Л 1 Bailey [1931, 1932] also proved the next, more general, theorem. Theorem 2.4.6 Г(х + m)T(y + m) fx,y,u+m — l 3^2 ; 1 Г(т)Г(х + y + m) \ u,x + y + m Г(п)Г(х -3F2 x,y,u+n-l to n terms to m terms. There is a simple proof from Theorem 3.3.3. Remark The Mellin transform can be seen as the Fourier transform carried over to the multiplicative group @, 00) by means of the exponential function. In F(s) = f™xs-lf(x)dx, write s = a + it and x = е2ли to get F{a + it) = 2л- X!^(/(e2jr") ¦ elnau)elnitudu =: In /^ g(u)e2nitudu. A new feature in the theory of Mellin transforms is that F(s) is analytic in a vertical strip. Just as the gamma function, a special case of a Mellin transform, has a finite field analog, so does the more general case. Let Fq denote a finite field with q elements and F* its multiplicative part. Let / be a complex-valued function on F*. Its Mellin transform is denned on the group of characters of F* as F(x) = YlaeF* X(a)f(a)- The reader may verify that this has an inversion given by f(s) ~ ~~[ S/ X(a)F(x)- There is also an analog ofBarnes's formula (Theorem 2.4.2) due to Helversen-Pasotto [1978]. A proof based on Mellin transforms is given in Helversen-Pasotto and Sole [1993].
94 2 The Hypergeometric Functions 2.5 Contiguous Relations Gauss defined two hypergeometric functions to be contiguous if they have the same power-series variable, if two of the parameters are pairwise equal, and if the third pair differ by 1. We use F(a±) to denote 2F\(a ± 1, b; c; x) respectively. F(b±) and F(c±) are defined similarly. Gauss [1812] showed that a hyperge- hypergeometric function and any two others contiguous to it are linearly related. Since there are six functions contiguous to a given 2F1, we get (!j) = 15 relations. In fact, there are only nine different relations, if the symmetry in a and b is taken into account. These relations can be iterated, so any three hypergeometric functions whose parameters differ by integers are linearly related. These relations are called contiguous relations. In this section we show how Gauss's fifteen relations are de- derived. Then we briefly point out connections with continued fractions and orthog- orthogonal polynomials. Contiguous relations can be iterated and we use the word contiguous in the more general sense when the parameters differ by integers. It is easily verified that d r(a,b \ ab /a + l,b+l \ -riFii ;x )=— 2FA ;x\. B.5.1) ax \ с J с \ с + \ J Since this 2F\ satisfies the equation x(l - x)y" + [c - (a + b + \)x]y' - aby = 0, we get the contiguous relation l) fa + 2, b + 2 —2Fi[ ;x c(c By means of transformation formulas, this can be changed into other contiguous relations. Apply Pfaff's transformation г, b \ _a fa,c — b x с '*) ~ X 21V с 'x-1, to each term in the above equation. After a little simplification where we set и — x/(x - 1) and replace с — b by b, the result is v(a,b \ c+(a-b+l)u fa + l,b 2ri[ ; и = 2^11 , i :' \ с J с \ с +1 (a + l)(c — b + 1)m /a+2,b . 2fA . ' ;u . B.5.3) c(c+l) \ c + 2
2.5 Contiguous Relations 95 This is a contiguous relation due to Euler, who derived it in a different way. If we apply Euler's transformation с — а,с — b с to B.5.2), then we get another contiguous relation: 'a,b \ c-{2c-a-b + Y)x ( a,b ' -x . B.5.4) c(c + 1) This is one of the relations Gauss obtained. Euler's method for obtaining B.5.3) was to use his integral representation of 2^1- By direct integration he found a formula of which the following is a particular case: 1 ta~\\ -t)c~a~\\ -tx)~bdt 0 = (c + (a + 1 - b)x) I ta(\- 0е"" A - txybdt Jo ta+\\ - t)c'a-\\ - txybdt. B.5.3') 0 This is identical with B.5.3). See Exercise 23 for a simple way of proving this identity. As another example of how the integral can be used, observe that A — xt)~~a = A — xt)~a~x{\ — xt) = A — xt)~"~l[l — x + A - t)x]. Substitute the right side in ; x = ) ; x ггмг с ) Г(Ь)Г(с- to obtain / Cl- Clio ~ldt \x )=(l-xJFi[ ¦x)+ 2^1 ,, \x). с ) \ с ) с \ с + 1 ) The above examples show how contiguous relations arise. Now we give a derivation of Gauss's basic contiguous relations. It is enough to obtain a set of six relations from which Gauss's fifteen are obtained by equating the Сл pairs of
96 2 The Hypergeometric Functions them. The first three in this set are where dF x—=a(F(a+)-F), B.5.5) dx dF x =b(F(b+)-F), B.5.6) dx dF x— = (c-l)(F(c-)-F), B.5.7) dx a' ;xj and F(a+) := 2FX (° + ' ;x and so on. The nth term of F(a+) - F is \(a+\)n(b)n (a)n(b)n] „ (a + l)n-i(b)n x = (a + n — a)x I n\(c)n n\(c)n \ n\(c)n а я!(с)„ and „ X = > ———X". dx ^ n\(c)n This proves B.5.5). Formula B.5.6) follows by symmetry in a and b, and B.5.7) is proved similarly. To obtain the other three equations, set 8 := x ? and verify that the hypergeometric equation can be written as [8(8 + c-l)-x(8+ a)(8 + b)]y = 0. So, [8(8 + с - 1) - x(8 + a - 1)(<5 + b)]F(a-) = 0. Now 8(8 + с - 1) = (8 + a - 1H$ +c-a)-(a- l)(c - a) so that [(8+c-a)- x(8 + b)](8 + a - l)F(a-) = (c - a)(a - l)F(a-). Apply B.5.5) in the form (8 + a - l)F(a-) — (a - l)F to the above equation to get [(8+c-a)- x(8 + b)]F = (c - a)F(a-), or dF x(l - x)— = (c - a)F(a-) + (a - с + bx)F, B.5.8) dx
2.5 Contiguous Relations 97 The remaining two relations are dF x(l-x) — = (c - b)F(b-) + (b - с + ax)F, B.5.9) ax dF c{\ -x) — = (c - a)(c - b)F(c+) + c(a + b - c)F. B.5.10) dx Formula B.5.9) is obtained from B.5.8) by symmetry in a and b, and B.5.10) is proved in a manner similar to B.5.8). Gauss's fifteen contiguous relations are obtained by equating two values of x^ in B.5.5) to B.5.7) and in B.5.8) to B.5.10). For example, [c-2a-(b- a)x]F +a{\ - x)F(a+) - (c - a)F{a-) = 0 follows from B.5.5) and B.5.8), whereas B.5.5) and B.5.10) give c[a - (c - b)x]F - ac(l - x)F(a+) + (c - a)(c - b)xF(c+) = 0. We have seen a special case of the last contiguous relation. Set x = 1 to obtain c(c-a-b)F[ ; 1 = (с - a)(c- We used this relation in Section 2.2 to derive Gauss's formula for 2^1 (a, b; c; 1). The reader should derive a few more contiguous relations as an exercise. It must also be clear that once a contiguous relation is given, it is very easy to verify by considering the coefficient of xn in each term. For example, (a,b \ fa,b+l \ a{c-b) fa + l,b+l 2Fi{ ; x = 2^11 ; x \ Х2Г1 ; x \ с J 2 \ c+\ ) c(c+\) \ с+ 2 B.5.11) is true because (a)n(b + !)„ a(c - b) {a + l)n_i(b + !)„-! = (a)n(b)n n\(c + l)n c(c+l) (n — l)!(c + 2)n_! n\(c)n Gauss used B.5.11) to obtain an interesting continued fraction for the ratio of two associated hypergeometric series. Rewrite B.5.11) as 2Fi(a, b; c: x) a(c — b) 1 - j y . — I — _ Vxx U2X
98 2 The Hypergeometric Functions where (a+n-l)(c-b + n-l) (b+n)(c-a+n) u and vn = (c + 2n~2)(c + 2n- 1) (c + 2n-l)(c + 2n) Surprisingly, Euler had earlier found a different continued fraction for 2F\ (a, b; c; x)/2Fi(a, b + 1; с + 1; x). This comes from B.5.3). Interchange a and b in B.5.3) and rewrite it as 2F\{a, b; c\ x) (b + l)(c — a + \)x We now have the continued fraction {b + \)(c - a + \)x (b + 2)(c-a+2)x с + A + fe я)х (с + j + B + fo _ fl)x)_ (с + 2 + C + b _ ( It is clear that numerous examples of continued fractions for appropriate ratios of hypergeometric functions can be obtained in this way. Note that B.5.2), which is just the differential equation for the hypergeometric equation, also gives a contin- continued fraction for 2F\(a + I, b + 1; с + 1; x)/2F\(a,b; c; x). It involves quadratic terms in x and is given in Exercise 26. We have not given conditions for conver- convergence of these infinite continued fractions. The reader should see Lorentzen and Waadeland [1992] for a discussion of convergence of continued fractions. Also see Berndt [1985, pp. 136-137] for the reference to Euler's continued fraction and for the work of Ramanujan on this topic. One way of arriving at a connection between hypergeometric functions and orthogonal polynomials (defined below) is through a formula of Jacobi, which we now derive. Multiply the hypergeometric equation by xc~l(l — x)a+b~c and write it as — [x(l - x)xc-'(l - x)a+h-cy'] = abxc-\\ - x)a+b-cy, dx where у = 2F\(a, b\ c; x). From B.5.1) it follows that the derivative of a hypergeometric function is again hypergeometric, so that it satisfies the differential equation with a, b, с changed toa + l,fe+l,c+l.By induction, this implies that — \xk(l -х)кМуЩ =(a+k- l)(b + k- 1)**-Ч1 -x)k-lMy«-l\ dx J where M = xc~x A — x)a+b~c. Consequently, we have the recurrence relation — [х\1-х?МУ™}={а+к-\)ф + к-\I-ГГ1 = (a)k{b)kMy.
2.5 Contiguous Relations 99 Substitute (k) _ (a)k(b)k /a + k,b + k У (c)k l\ c+k in the above equation to get dk \ к,л ,кг, г fa + k,b + k \] fa,b —- x A - x) M2F\ I ; x I = {c)kM2F\ I ;x If b is a negative integer — n, then for к — n we have Jacobi's formula 2Fi(~n'a;x) =-—A~X) ^—[xc+n~\\ -х)а~с1 B.5.12) \ с J (c)n dx" For a more symmetrical expression in B.5.12) set x = A - y)/2, с = a + 1, and a = n + a + fi + 1 to get n,n+a+p + 1 1 -y a + 1 ' 2 - v)-a(l + v)-^, _„ cf Definition 2.5.1 The Jacobi polynomial of degree n is defined by a + 1 ' 2 One of its fundamental properties is that r+i P(a,p)/ \p(a,p)/r\/i уЛаП -I- y\PHy -1 + р + 1) 1)Г(л + a + P + 1)л! ""*' This is easy to prove. Use B.5.13) and integration by parts. Remark 2.5.1 Formula E.13), which can be written as A -xf(l +хI>Р<а-»(х) = t^-?.[(l -xf+a{\ +хГ^], B.5.13') is often called the Rodngues formula for Jacobi polynomials. The particular case for Legendre polynomials, where a = fi = 0, was published by O. Rodngues in 1816 in an Ecole Poly technique journal. Unfortunately, Rodrigues's paper did not receive much attention. The formula was rediscovered independently by J. Ivory in 1822 and Jacobi in 1827. It is amusing to note that Jacobi later suggested to Ivory that they write a joint paper on this important formula and publish it in France as it was not known there. Their paper appeared in Liouville's journal in 1837.
100 2 The Hypergeometric Functions Interestingly, Rodrigues's teacher, Laplace, in the course of his work in probability A810-11), found a similar formula [see F.1.3)] for Hermite polynomials. For references, see Roy [1993]. A set of polynomials {pn(x)} is called orthogonal if there is a positive measure d/x(x) with finite moments of all orders so that oo Pn(x)pm(x)dix(x) = 0, тфп. —oo Thus, the Jacobi polynomials are orthogonal with respect to the measure d (x)= (A-х)аA+х)Ых, -1<х<1, \ 0, otherwise. We shall see in Chapter 5 that any set of orthogonal polynomials satisfies a three-term recurrence relation: хр„(х) = Anpn+i(x) + Bnpn{x) + Cnpn-i(x), pO(x) = 1,/>_!(*) = 0, An, Bn, Cn+i real, AnCn+i > 0, и = 0, 1,.... Conversely, any set of polynomials that satisfies this recurrence relation is orthog- orthogonal with respect to a positive measure, which may not be unique. The three-term recurrence relation for Jacobi polynomials comes from the con- contiguous relation 2b(c-a)(b-a- c - [A - 2x)(b -a-lK + (b- a)(b + a- l)Bc -b-a- l)]2F{( °' * V с -2a{b-c)(b-a+\JFAa+ '*" ; x ) = 0, B.5.15) V c ) after proper identification. In particular we require that a = —n, where и is a positive integer. But B.5.15) continues to hold when the series does not terminate. Another contiguous relation that gives a set of orthogonal polynomials is a(l-xJFi(a+^b;x\ + [c - 2a - (b - a)x]2Fx( "' ; x B.5.16) We shall study orthogonal polynomials in detail in later chapters. That will provide the natural setting for some of the contiguous relations in the sense implied by the above remarks.
2.5 Contiguous Relations 101 Kummer [1836] considered the problem of extending the contiguous relations to pFq, but he stopped with the remark that for 3F2 (a, b, c; d, e; x) the formulas are more complicated. In particular, the linear contiguous relations require four functions, a 3 F2 and three contiguous to it. Kummer also noted that only when x = 1 did the formulas simplify. That is the key to three-term relations for some higher p and q. These will be discussed in the next chapter. Remark 2.5.2 It should be noted that the continued fraction from Gauss's re- relation B.5.11) contains a continued fraction for arctanx as a special case. The Taylor series for arctan x converges very slowly when x = 1, but the convergence of the continued fraction is extremely rapid and was once useful in computing approximations of n. See Exercise 25. Remark 2.5.3 The orthogonality relation B.5.14) is a generalization of the well- known fact from trigonometry: cos тв cos n9d9 = 0 Ытфп. B.5.17) о This becomes clear by setting x = cos# and cosnd = Tn(x). Then Tn{x) is a polynomial of degree n and B.5.17) becomes / Tm{x)Tn{x){\ -x)-1/2(l+x)-1/2dx=0 Ытфп. It is not difficult to show that Г„(х) = CPn(-1/2'/2)(x),whereC = Bn)!/[22n(n!J]. Another set of polynomials is n < m sin(n + 1N/ Un(cos9) = Tn(x) and Un(x) are called Chebyshev polynomials of the first and second kind respectively. The three-term recurrence relation for Tn (x) is xTn(x) = -Tn+1(x) + -rn_i(x). This is the trigonometric identity 2cos#cosn# = cos(n + \)в + cos(n - 1)#. Several properties of the Chebyshev polynomials translate to elementary trigono- trigonometric identities and they form the starting point for generalizations to Jacobi and other sets of orthogonal polynomials. For this reason the reader should keep them in mind when studying the "classical" orthogonal polynomials. A number of exercises at the end of this chapter deal with Chebyshev polynomials. We also
102 2 The Hypergeometric Functions refer to sin{Bn + 1)9/2} cos{Bn + 1N/2} Vn(cos0) = r—- and Wn(cos9) = — sin #/2 cos #/2 as Chebyshev polynomials of the third and fourth kinds respectively. These poly- polynomials are of lesser importance. 2.6 Dilogarithms All the examples given in Section 2.1 of special functions expressible as hyper- hypergeometric functions were either 2Fi or of lower level. The dilogarithm function is an example of a 3F2. This function was first discussed by Euler and later by many other mathematicians including Abel and Kummer. But it is only in the past two decades that it has begun to appear in several different mathematical contexts. Its growing importance is reflected in the two books devoted to it and its generalization, the polylogarithm. See Lewin [1981, 1981a]. Here we give a few elementary properties of dilogarithms. The reader may also wish to see Kirillov [1994] and Zagier [1989]. The latter paper gives a number of interesting applications in number theory and geometry. The dilogarithm is defined by the series :=2^—, for |x| < 1. B.6.1) 71 = 1 П From the Taylor expansion of logA — t) it follows that Li2(x) = - / — -dt. B.6.2) Jo t The integral is defined as a single-valued function in the cut plane С — [1, oo); so we have an analytic continuation of Li2(x) to this region. The multivaluedness of Li2(x) can also be studied easily. There are branch points at 1 and oo. If Li2(x) is continued along a loop that winds around x — 1 once, then the value of Li2 changes to Li2(x) — 2ni logx. This is easily seen from the integral definition. We now obtain the hypergeometric representation from the integral Г (hi \ fl (hi \ / Li:(x)= / 2Fi( 2 ¦,t\dt = xl 2FA ^ \xu\du = xiF1[ 1,1,1 by formula B.2.2). Though it is possible to develop the properties of the diloga- dilogarithm without any reference to the theory of hypergeometric series, we note one example where Pfaff 's transformation is applicable.
2.6 Dilogarithms 103 Theorem 2.6.1 Li2(x) + Li2(x/(x - 1)) = -j[log(l - x)]2 (Landen's trans- transformation). Proof. By Pfaff's transformation (Theorem 2.2.5) Set и = t/(t - 1) in the last integral to get x/{x-\) /, ! \ du 2FA ;u о V 2 У 1-й The first integral is Li2(x/(x — 1)) and the second integral is >dll = j x) й 2 /о i-м 2 This proves the result. ¦ We give another proof since it involves a different expression for the dilogarithm as a hypergeometric function. This expression is given by Li2(x) = lim 1 Lfi ( *' ^; x J - l|. B.6.3) Let x be in the region {x||x| < S < 1} П {x\\x/(x - 1)| < S < 1} = S5, where 5 > 0. Apply Pfaff's transformation to B.6.3) to get Li2(x) — lim — 2 1 f / 6 = lim -{( 1 -t log(l -x) + -[log(l -x)]2 + Now, 00 t / чи 00,/ \ n 00/-. -. -1 X- 1
104 Thus, Li2(x) = 1 2 ( The Hypergeometric Functions : log(l — x) H [log(l — x)] 2 = ~[log(l-x)]z-Li2f X The limit operation may be justified by the fact that for x e S$ and |e| < 1/2, the relevant series represent analytic functions of x and e. This proves the theorem again. There is another proof in Exercise 38. Theorem 2.6.2 Ifw" = \, then -Li2(x") = V Li2(a/x). B.6.4) n z—' -Li2(x2) = Li2(x) + Li2(-x). B.6.5) 2 Li2(x) + Li2(l -x) = — -logxlog(l -x). B.6.6) 6 Proof. To prove B.6.4) start with the factorization A -1") = (I - t)(l - wt) ¦ ¦ ¦ A — con~lt). Take the logarithm and integrate to get fx log(l -t") Г log(l -1) - / dt = - dt Jo * Jo t fx log(l - wt) fx log(l - wn~lt) - / dt / dt. Jo t Jo t A change of variables shows that the integral on the left is ^Li2 (x"). This proves B.6.4), and B.6.5) follows by taking n = 2. To derive B.6.6) integrate by parts: ?. , ч Г logd - 0 J , , /л Гх log? Li2(x) = -/ -5 dt = -\ogxlog(\ -x)- / —^rff. Jo t Jo 1 — t The last integral, after a change of variables и = 1 — t, is i-'-Mogd - и) -<зм = Li2(l — x) —
2.6 Dilogarithms 105 But 2 by Theorem 1.2.4. The proof of the theorem is complete. ¦ Apparently, the only values x at which Li2(x) can be computed in terms of more elementary functions are the eight values x = 0, ±1, |, ''^ , ^y^, ^^. Theorem 2.6.3 Li2(O) = О, Li2(D = ^-, 6 12 n Г2' 2 Li2 Ll2 V5 15 4L""°V 2 'л/5-Р12 1 7Г2 1 л/5-l V5+ B.6.7) B.6.8) B.6.9) B.6.10) B.6.11) B.6.12) B.6.13) B.6.14) Proof. Relation B.6.7) is obvious and B.6.8) was done in the proof of Theorem 2.6.2. For B.6.9), observe that 7Г2 111 7T2 6 22 ' 6 \2 Set x = | in B.6.6) to get B.6.10).
106 2 The Hypergeometric Functions The identities B.6.11) and B.6.12) can be derived as follows: Landen's trans- transformation and B.6.5) combine to give Li2 (j^-j-) + ^Li2(x2) - Li2(-x) = -^[log(l - x)]2. B.6.15) Set the variables in the first two dilogarithmic functions equal to each other. Then x/(x - 1) = x2 and x2 - x - 1 = 0. A solution of this is x = A - л/5)/2. Substitute this x in B.6.15) to obtain To find another equation involving Li2(C — л/5)/2) and Li2((l — л/5)/2) take x = C - л/5)/2 in B.6.6) to arrive at 3-V5 Now solve these equations to obtain the necessary result. The proofs of the for- formulas B.6.13) and B.6.14) are left as exercises. ¦ There are also two variable equations for the dilogarithm. The following is usually attributed to Abel though it was published earlier by Spence. See Lewin [1981a] for references. The formula is - log(l-x)log(l-y). B.6.16) This is easily verified by partial differentiation with respect to x or у and is left to the reader. More generally we can define the polylogarithm by the series oo „ Limx:=V— for|x| < 1,/и =2, 3, .... B.6.17) Z—/ „m The relation , n" --Lim(x) = -Lim_ ax x is easy to show and one can use this to define the analytic continuation of Lim(x). The polylogarithm can be expressed as a hypergeometric function as well. The formula is Lim(x)=xm+1Fm( ' ""' \ z,..., z
2.7 Binomial Sums 107 We do not go into the properties of this function any further but instead refer the reader to Zagier's [1989] article and the books mentioned earlier. 2.7 Binomial Sums One area where hypergeometric identities are very useful is in the evaluation of single sums of products of binomial coefficients. The essential character of such sums is revealed by writing them as hypergeometric series. Sums of binomial coefficients that appear to be very different from one another turn out to be examples of the same hypergeometric series. One reason for this is that binomial coefficients can be taken apart and then rearranged to take many different forms. A few examples given below will explain these points. Consider the sum n /k\fk-l-j\ n +7 j=0 J ^ j=0 To write this as a hypergeometric series, look at the ratio cj+\/cj as we did when we defined these series. A simple calculation shows that cj (J-k+ 1H" + 2) So Л k-l Now, as explained in Section 2.1, we could introduce j\ — A), in the numerator and denominator to get We have learned to sum two 3F2 series: the balanced and the well-poised series (see Section 2.2). This is neither though it is actually "nearly" poised. However, we have not yet considered this type of series. But there is another way out. Note that the denominator has B)j, which can be written as (l)y+i. Then S = k-l\ (-k) * k-l\ (-k) y (n + l)(k + I) j^ QM-k)t (-к) f !)(* + !) L V -к f „l,*l _
108 2 The Hypergeometric Functions The 2-Fi can be evaluated by the Chu-Vandermonde formula and after simplifica- simplification we get S = 1 Jfe+1 [\n As another example take the sum (-1)" k>0 n+k \Bk\(-l)k k>0 Here after simplification. Thus (k + n + \)(k+\)(k-m + n) (jt + f + l)(t+=±i)(Jt + 2) 5 = m — 1 — n + m, n, —I ; Г1 2 ' 2 The 3F2 is balanced and we can apply the Pfaff-Saalschiitz identity. We get s = m (— -n) (-) \ 2 /n+l-m V 2 /n+l-m _ i (m=l) (- — n) .V 2 /n+l-m V 2 "Jn + l-m which simplifies to {nm_x)¦ The reader may verify that E k>0 n-k I \m +n also reduces to an example of a Pfaff-Saalschiitz series. As the final example take the series In = S, where we are assuming that I = min(€, m,n). This reduces to the series (/n - ?)!(/n + t)\(n - t)\(n +t)\ This is a well-poised series (see Section 2.2) that can be summed by Dixon's for- formula. However, this result cannot be applied directly because we get a term
2.8 Dougall's Bilateral Sum 109 ГA — ?)/ ГA — 21) that is undefined. A way around this is to use the following case of Dixon's formula: -2г-2е,-т-г-е,-п-г-е m-l-e + l,n-l-€ + l ' ГA - ? - е)Г{1 + m - ? - е)ГA + m +n + ? + e) ~ ГA -21- 2е)ГA + /п)ГA + и)ГA + т + п) Apply Euler's reflection formula to the right side to get +т -1-е)ГA +п-?-€)ГA+т+п In the limit as e —> 0, this expression is . B? - 1)! (m - l)\{n - ?)\{m (?-\)\ m\n\(m+ri)\ Thus (? + m + n)\B?)\Bm)\Bn)\ S = (? + m)\(? + n)\{m + n)\?\m\n\' Another example that gives a well-poised 3F2 is J2l=i ^H^I )(?+„)• Examples can be multiplied but the discussion above is sufficient to explain how hyperge- hypergeometric identities apply to the evaluation of binomial sums. See Exercise 29 in Chapter 3. 2.8 Dougall's Bilateral Sum The bilateral series Г(а+п)Г(Ь + п) is the subject of this section. In fact, the hypergeometric series 2^2 (a,b\ c; 1) should be regarded as a special case of the bilateral series where d = 1, since 1/ Г(и) — 0 for nonpositive integers n. This explains why we introduced 1/n! in the nth term of the hypergeometric series. The above remark gives us a way of evaluating the above bilateral series. Ford = 1,2,..., the sum reduces to a series that can be evaluated by Gauss's summation of iF\(a, b; c; 1). The following theorem of Carlson allows one to evaluate the bilateral series from its values at d = 1,2,....
110 2 The Hypergeometric Functions Theorem 2.8.1 Iff(z) is analytic and bounded for Re z > Oandif f(z) = Ofor z = 0,1, 2,..., then f(z) is identically zero. Remark The boundedness condition can be relaxed. We need only assume that f(z) = O(e*|z|), where к < it. The simple proof given below of the particular case is due to Selberg [1944]. Proof. As a consequence of Cauchy's residue theorem, f(z) = (a-\)(a-2)---(a-n) f° (z-aKz-l)---(z-n) for n > a > 0. Then, for a > 1, Z 2л- 2nn\ Let n -»¦ oo to see that /(a) =0 for all real a > 1. This implies the theorem. Theorem 2.8.2 (Dougall) For 1 + Re(a + b) < Re(c + d), Г(а+п)Г(Ь+п) n2 F(c + d-a-b-l) п=оо Г(с + n)V(d + n) sin л-a sin nb Г(с - а)Г(^ - а)Г(с - 6)Г(^ - &) Proof. For Re<i > Re (a + b — c) + 1, the functions on both sides are bounded analytic functions of d. Let m be an integer in this half plane. For d = m the series on the left is , Г(с + п)Г(т+п) Г(а-т+ \)Г(Ь -т + \)^(а-т + \)еф - T(c-m This series can be summed by Gauss's iF\ formula. Thus Dougall's result can be verified for d equal to an integer in the half plane. Carlson's theorem now implies Theorem 2.8.2. ¦ Gauss's iF\ sum is itself a consequence of Theorem 2.8.1. We have to prove that iFA t"";l) = с J Г(с-а)Г(с
2.9 Fractional Integration by Parts and Hypergeometric Integrals 111 This relation is true for b = 1, 2,... by the Chu-Vandermonde identity, and then by the above argument the general case follows. For a different example where Carlson's theorem applies, consider the formula i: xa~l{l-. It is easy to prove this by induction when a is a positive integer. We saw this in Chapter 1. The integral is bounded and analytic in Re a > S > 0 and so is the right side of the formula. This proves the result. 2.9 Fractional Integration by Parts and Hypergeometric Integrals Theorem 2.2.1 gives Euler's integral representation for a hypergeometric function. One drawback of this representation is that the symmetry in the parameters a and b of the function is not obvious in the integral. We observed that Erdelyi's double integral B.2.5) gives the two different representations by changing the order of integration. In this section, we show how fractional integration by parts can be used to transform the integral for 2F\ (a, b; c; x) to another integral in which a and b have been interchanged. In fact, fractional integration by parts is a powerful tool and we also use it to prove a formula of Erdelyi. This contains some of the integral formulas considered in this chapter as special cases. It also has implications in the theory of orthogonal polynomials, which will be discussed in Chapter 6. -iet (//)(*) := / f(t)dt Ja and (hf)(x) ¦= [ f f{h)dndt = f (x- t)f(t)dt. J a J a J a Inductively, it follows that, for a positive integer n, (/„/)(*):= f [¦¦¦[" ' f(ta-i)dtn-i-..dtidt J a J a J a {x-tf-lf{t)dt. (П ~ 1)! Ja A fractional integral /„, for Re a > 0, is then defined by '' Г (a) Ja (/„/)(*) := ¦=$- f (x - t)a-lf{t)dt. B.9.1)
112 2 The Нуpergeometric Functions The restriction Re a > 0 can be removed by using contour integrals. An in- interpretation of Euler's integral for 2^1 (a, b; c; x) as a fractional integral is now evident. The fractional derivatives can also be defined by the formal relation '^—w»-\ B.9.2) dwv when the right side is meaningful. To state the formula for fractional integration by parts, suppose и and v are functions defined by 00 00 Ar(x — ay , v = у r=0 s=0 Then fb dvv fb dvu / и dx = / v dx, B.9.3) Ja d(b-x)v Ja d(x-aY provided that the integrals exist. This can be verified directly by substituting the series for и and v and their derivatives, which are obtained by applying B.9.2) term by term. The two are seen to be identical after integration term by term. It is noteworthy that if Re v < 0 in B.9.3), then B.9.3) is equivalent to the identity fb \ fb 1 fb Г fx 1 / u(x)\ I (y-x)"~1v(y)dy\dx= / v(x)\ / (x - y)"~1u(y)dy\dx, Ja L J x J Ja L J a J where v = —a. This formula holds because both sides equal the double integral и u(x)v(y)(y - xf^dxdy. We now show how to transform one integral representation of 2F1 (a, b; c; x) to the other. It is clear from B.9.2) that — f -b)JQ T(b)T{c ) f -a)J0 »-\\-ty-b-\\-xt)-adt Г{Ь)Г(с By the integration by parts formula B.9.3), we get Г(с) r(b)r(c-a)J0 r1 / a-ty-a- Jo
2.9 Fractional Integration by Parts and Hypergeometric Integrals 113 The binomial theorem and B.9.2) give db-atb-\{y_xtya ^ db-a ~ r(fl + r)f6+r-l xr dtb-a ~ dtb-a f^ Г(а)г! 2. Г(в)г! Substitute this in B.9.5) to obtain r, *}? . ff-\\-ty—\\-xt)-bdt. Г(а)Г(с-а) Уо This is the expression in B.9.4) with a and ? interchanged; our claim is proved. As another example of the use of fractional integration by parts, we re-prove the following formula contained in Theorem 2.2.4: : d when Rec > Red > 0, x ф 1, |arg(l - x)| < n. Use B.9.2) to see that B.9.4) is equal to = Г(с) Г(Ь Also, dtb~d dtb~d f^ Г(а)г! r=0 Substitute this in the last integral to complete the proof of B.9.6). We now state and prove the formula of Erdelyi [1939] mentioned earlier. Theorem 2.9.1 For Rec > Re д > 0, x ф 1, |arg(l — x)| < n, we have ) f t»-\\ - ty-»-\l - xtf—b -ц) Jo \xtJFi Г(ц)Г(с Х-а,Х-Ь
114 2 The Hypergeometric Functions Proof. Apply Euler's transformation in Theorem 2.2.5 to the 2F\ inside the inte- integral in B.9.6). The result is '<*=*)- By B.9.2) and the series representation of 2F\, we see that k-a,k~b \ d^'k { t^~l /k-a,k-b ¦,xt)=-—r\-—2Fl( ;xt . Substitute this in the last integral and use the fractional-integration-by-parts for- formula B.9.3) to get a,b \ Г(с) fx ^_, ^ (k-a,k-b с "' ¦ -ТГ— Г 1 —^C1 - xt)k-"-b \ , d(\ -ty~k \ Г(с -к) J Write the expression in curly braces as T(c-k) V 1- 4 ' ^ r\T(c-k) Take the (/л — A.)th derivative of this expression to obtain 'a + b-k,c-k (l-t)x c-fi x-1 By Pfaff's transformation (Theorem 2.2.5), this is equal to 1 a + b-k,k-n (l-t)x ; с -11 \-xt Substitute this in the last integral. The result is proved. ¦ Exercises 1. Complete the proof of the second part of Theorem 2.1.2 concerning conditional convergence.
Exercises 115 2. Suppose that YTk=\ak I anc^ S/Ui \ak I tendto infinity in the same way, that is, < к Е- k=\ for all n and К is independent of n. Prove that = lim **, provided that the right-hand limit exists. 3. Use the result in Exercise 2 to prove Theorem 2.1.4. 4. (a) Show that I((l + x)" + A - x)") = 2F\(-n/2, -{n + l)/2; 1/2; x2). Find a similar expression for |(A + x)" — A — x)"). (b) Show that A +x)" = 1 +nx2F\{\ -n, 1; 2; -x). 5. Derive the Chu-Vandermonde identity by equating the coefficient of x" on each side of A -x)"a(l - x)~b = A -x)-(a+b). 6. Suppose that log(l + x) is defined by the series B.1.3). Use Pfaff's transfor- transformation (Theorem 2.2.5) to show that log(l + x) = -log(l + x). 7. Show that Pfaff's transformation is equivalent to -n,c-b \ {b)n ; 1 = -—-, n = 0, 1,.... с ) (с)„ This is one way of removing the restriction Re с > Re b > 0 used in the proof of Theorem 2.2.5. 8. Prove the identities B.1.10) to B.1.13). 9. Show that xF\(a\ c; x) = ex,Fx(c - a; c\ -x). 10. Suppose x is a complex number not equal to zero or a negative integer. Show that ; x+1 r*) Jx Note that the i F\ series exhibits the poles and residues of Г(х). 11. Show that /•OO / JO Г(а) (ах,...,ар,а х b\,..., bq ' when p < q, Res > 0, Re a > 0, and term-by-term integration is permitted.
116 2 The Hypergeometric Functions 12. Show that cos mx = 2F\[ 2', 2 ; sin2 x I, V / \+m \-m \ sinmx = m sini2fi I 2 3 2 ;sin2xj. V 2 / 13. Prove that the functions and / 2a, 2fc, a + b У2 = 3^2 r, , ~, , l , 1 ' X \2a + 2b, a + b + \ both satisfy the differential equation x2(x - \)y'" - 3 + {[2(a2 + b2 + Aab) + 3(a + b) + l)]x - (a + b)Ba + 2b+ 1)}/ + 4ab(a + b)y = 0. Thus prove Clausen's identity (see Clausen [1828]), a,b \J „( 2a,2b,a+b Also prove that 2^1 ' a-b+\,b-a 2' 2. v 14. Show that the Pfaff-Saalschiitz identity B.2.8) can be written in a completely symmetric form as (a,b,c \ 7r2r(rf)r(e)[cos7rJcos7re+cos7racos7rfecos7rc] 3 2V d,e ' J ~ r(J - a)Y{d - fe)r(J - с)Г(е - a)T(e - b)T(e - c) when the series terminates naturally and d+e = a+b+c+l. This observation was made by R. William Gosper.
Exercises 117 IS. Prove that k=0 where Tn(x) is the Chebyshev polynomial of the first kind. Find a similar expression for Un(x), the Chebyshev polynomial of the second kind. 16. Prove the following analog of Fermat's little theorem: If x is apositive integer and p an odd prime then Tp(x) = T\ (x) (mod p). 17. Pell's equation is x1 — Dy2 = 1, where D is a square free positive integer. Let S be the set of all positive solutions (x, y) of Pell's equation. Let (x\, yi) be the solution with least x in S. Show that (Tn(x\), yiUn-\(x\)), n > 1, is a solution and thus that Pell's equation has infinitely many solutions, if it has one. 18. Let Sn(x) = Un-i(x) with U_i(x) = 0. Prove that gcd(Sn(x), Sm(x)) = Sgcd(m,n)(x), where x, m, and n are positive integers. 19. (a) Show that pn(a,-l/2)Bx2 _ 1)/J(x)A _ X2ydx = Oi where p(x) is any polynomial of degree <2n — 1. Deduce that ( (b) Show that (e,e) _ ГBп + а + 2)п\ п+i U) - Г(л + a (c) What do (a) and (b) mean for the Chebyshev polynomials of the first and second kind respectively? 20. Prove that (« + !)„/1+лсу /-n,-n-ptx-l n! V 2 J - \ a+l x + —n, —n — a n\
118 2 The Hypergeometric Functions 21. Letx = cos 6». Prove that dn~x sin2" в (-1)" Bn)! sinn#. (Jacobi) := dx"-x n 2nn\ 22. Prove that the Jacobi polynomials Р„(а/3)О) satisfy the three-term recurrence relation 2(n + l)(n + a + j9 + l)Bn + a f = Bn + a + P+ l){Bn + «+/? + 2)Bn + a + p)x + a2 - /?2} x Р^а-Р\х) - 2(n + a)(n + P) xBn + a + p + 2)Pn("f'(x) =0, n = 1, 2, 3,.... (Compare this with B.5.15).) 23. Prove Euler's contiguous relation expressed as the integral formula B.5.3') by observing that 0= / — (f"(l -t)c-"(l-tx)l~b)dt, Rec>Rea>0. Jo at 24. Prove the following contiguous relations: (a) (c-2a-(b- a)x)F + a(\ - x)F(a+) - (c - a)F(a-) = 0. (b) (c - a - l)F + aF(a+) - (c - l)F(c-) = 0. (c) (b - a)(l - x)F - (c - a)F(a-) + (c - b)F(b-) = 0. (d) c(b - (c - a)x)F - bc{\ - x)F(b+) - (c - b)F(b-) = 0. 25. Prove x l2x l2x 22x 22x (b) log(l+x) = — eV T 1+ 2+ 3+ 4+ 5+ x \2x2 22x2 32x2 (c) arctanx = — 1+3+ 5+ 7+ 4 1+ 3+ 5+ 7+ " The 9th approximant gives n/4 correctly up to seven decimal places. x+1 2 - - - (e) lOg - 2 3 4 X i x .x тЛ tX arcsinx x l-2x2 1 • 2x2 3 ¦ 4x2 3-4x2
Exercises 119 26. Show that fa,b \ / fa + l,b+l с V c+l (a+2)(fc+2) _ с - (a +b + l)x xu x; c(c+1) xu x-> (c+Q(C+2) ~~ r ' c+\-(a+b+3)x + c+2-(a+b+5)x +"''- c+l c+2 27. Use Barnes's integral representation of 2F\ and Barnes's beta integral (Theo- (Theorem 2.4.2) to prove formula B.3.13). Also consider the case where c — a — b is an integer. 28. Multiply the equation in B.3.13)byxd(l-x)e"'/ and integrate over @, 1) to obtain another proof of Theorem 2.4.4. 29. Prove the following formulas of Ramanujan [1927, paper 11] Kx/O>+1)J l + (x/(b + 2)J j Г Jo 1 + (x/aJ 1 + (x/(a + I)J Г0 X (b) ' dx (I + (jc/aJ)(l + (x/(a + I)J) • • • (I + (x/bJ)(l + (x/(b + I)J) • • ^Г(а)Г(а + {)Г(Ь)Г(Ь+{)Г(а + Ь) r , a > U, b > U. 2 r(a+b+\) 30. Prove that for Re t > 0 oo , «=—oo v n——oo ЯшГ: Denote the left side by /(a) and note that / has period one. Expand / as a Fourier series oo and observe that ^ A .„ = Г e-'^e-^^dy. J — oo 31. (a) Let x be a nontrivial even primitive character (mod ./V). Prove that oo n=— oo where Re t > 0 and a=l
120 2 The Hypergeometric Functions Hint: First observe that n=—00 a=l n=—00 Then apply the result of the previous exercise, (b) Let x be a nontrivial odd primitive character (mod N). Prove that 32. (a) Show that л "¦Т(я)?Bя) is the Mellin transform of oo ? e~" nt for Re s > 1. n=l (b) Use Exercise 30 to show that s \ — s (c) Observe that the expression on the right in (b) does not change under s -+ I — s. Deduce the analytic continuation and functional equations of the zeta function. 33. Obtain the functional equation of L(x, s), x primitive (mod N), using Exercise 31 and the idea of Exercise 32. (By Exercise 1.53 the functional equation is ЩЛ-s) Ш -_ 8(X) Bл У VA' ' 2is \N J Г(s)cosn(s-S)/2' where S = 0 or 1 depending on whether x is even or odd.) 34. Assuming the functional equation of the zeta function, apply Mellin inversion to prove that 35. Evaluate the Mellin transform of 2F\ (a, b\ c\ —x) as follows: f Jo Jo a,c-b_ x Set и = x/(x + 1) and integrate the 2F\ series term by term.
Exercises 121 36. Let Ft (s) be the Mellin transform of /, (x). Show that -J-r / Fl(s)F2(s)F3(s)ds 2ЛI Jk-ioo uv J uv Deduce Barnes's continuous extension (Theorem 2.4.3) of the Pfaff-Saalschiitz identity. 37. (a) Prove that J2 , Ima > O,Res > 1, n=l n=—oo by using Carlson's theorem. (The formula is due to Lipschitz.) (b) Expand 2^°oo(a + n)"f, Im a > 0, as a Fourier series Y,Ame27Tima. Ex- Express Am as an integral over (—oo, oo). Use the result in (a) to deduce Hankel's formula for 1/ F(s). See Exercise 1.22. 38. (a) Verify Theorem 2.6.1 by differentiating both sides of the equation. (b) Verify the Abel-Spence identity B.6.16). (c) Prove the identities B.6.13) and B.6.14). 39. (a) Prove that /y\ fl-xl ГуA-х Li2(x) - Li2()O + Li2 [y- )+ Li2 - Li2 л2 [1-х = -г -logxlog 6 г logxlog 6 \_\-y (b) Prove that Prove that Li2 , x(\-y) — jc)J = Li2 \-x---^ +Li2 — 1- i + Li2 J- + -log 2y. (Kummer) 1-х 2 See Lewin [1981a]. (c) Prove that Li2(x) + Li2(y) - Li2(xy) T. (x(\-y)\ t T. (y{\-x) = Ll2 + Ll; V 1 - xy J (Rogers) (d) Show that if 0 i;x < 1 and f(x) e C2(@, 1)) and satisfies B.6.6) and the functional relation in part (c), then /(x) = Li2(x). See Rogers [1907].
122 2 The Hypergeometric Functions 40. Suppose X is a primitive Dirichlet character mod N. Show that N n N n=l n=\ where g(x) is the Gauss sum defined in Exercise 31. 41. Suppose и is a positive integer. Define [logo, n = pk, a power of a prime, Л(и) = < [0, otherwise. (a) Show ?(я) = ПрО - P~"TX for Res > 1. (b) Show that for Re s > 1. X (c) Let i[r(x) = Yln<x Л(п). Show that rjr{x) = 0(x) and x~s~xilr{x)dx = , Res s (d) Prove the inversion formula yjr{x) = / ds, c>l and x not an integer. 2лi Jc_ioo * ?(•*) (e) Let f{t)dt. x Show that rco 1 C'(s) n , Res 0 5E -' (f) Prove the inversion of (e), 1 rc+ioo xs = — -—г / 2ЛI Jc-ioo S(S+ ds, c>\ and x not an integer. (g) Show that the Mellin transform of J2T=iA(n)e~"x is -T(s)$'(s)/Z(s). (h) Prove the Mellin inversion formula of (g), that is, i pc+ioo t'(i) ^L / rT(s)—ds=VA(n)rH, о 1, x>0. 2ni Jc_loo y't;{s) V
Exercises 123 42. Let a, b, c, d be complex numbers. (a) Suppose с ф an integer and Re {a + b) > — 1. Prove that K=—oc Л (a + c+ \)T{b-c+ (b) Suppose Re {a + b + с + d) > -1. Prove that oo - /-" T{a-k+ \)T{b -k+ 1)Г(с + к+ l)T(d + k+l) T(a + b + c + d+\) ~ Г (a + с + 1)Г(Ь + с+ 1)Г(а + d+ 1)Г(Ь + d + 1)' (с) Which beta integral is extended to the sum in part (b)? 43. This problem gives a quick way of obtaining the differential equation for у = 2Fi(a, b\ c; x). From the series for у and Euler's transformation, we have (a) y' = — 2Fi(a + l,b+l;c+l;x) с = —A - x)c-a-b-\Fx{c -a,c-b;c+l; x), and (b) ^-[xc-x2Fx{a,b;c;x)} = (с - l)xc-22Fx(a, b; с - 1; x). dx J Use (a) and (b) to see that — [x'(l - x)a+b+l-cy'] = abxc-l2Fx(c -a,c-b;c; x) dx = abxc-\l-x)a+b-cy. The differential equation is obtained after computing the derivative on the left side. 44. Evaluate /¦7Г/2 J-Tit 1-71/2 by use of the binomial theorem and Theorem 2.2.1, and so show that лТ(а + ? + 1) / (cos 0)a+fl cos(a - Jo 1)'
Hypergeometric Transformations and Identities Gauss's work on the hypergeometric equation contains a discussion of the mon- odromy question for the solutions of this equation. Gauss found and analyzed a quadratic transformation of hypergeometric functions; this apparently led him to the problem of monodromy. Unlike the linear (fractional) transformations of these functions, of which Pfaff's formula in Theorem 2.2.5 is an example, quadratic transformations exist only under certain conditions on the parameters. Neverthe- Nevertheless, they are important and useful. We have given some applications of these transformations after deriving a few basic formulas. An interesting application deals with the problem of proving Gauss's arithmetic-geometric mean to be ex- expressible as an elliptic integral. This chapter also contains a discussion of some methods for the summation of certain types of hypergeometric series. We use a quadratic transformation to obtain Dixon's identity for a well-poised 3F2 at x = 1. We then apply a method of Bailey to derive identities for special types of p+\ Fp with 2 < p < 6, including Dougall's identity, which was mentioned in Remark 2.2.2 in the previous chapter. An important transformation formula due to Whipple is obtained by the same method. Just as Barnes's integral on the product of gamma functions was an analog of Gauss's 2 F\ sum, these identities also have integral analogs and we discuss them. The hypergeometric identities provide a systematic approach to the evaluation of single sums of binomial coefficients. Contiguous relations for hypergeometric series contain an enormous amount of hidden information. There are three-term relations for balanced 4F3 functions as noted by Wilson [ 1977] and independently by Raynal [ 1979] and for 3 F2 functions at x — 1, a fact pointed out much earlier by Kummer. We describe Wilson's simple technique for deriving the contiguous relations that contain the three-term recur- recurrence for Wilson polynomials. These 4F3 polynomials contain a whole range of classical orthogonal polynomials as special or limiting cases. We devote a section of this chapter to the definition and orthogonality of Wilson polynomials. They are 124
3.1 Quadratic Transformations 125 orthogonal with respect to a weight function that occurs as the integrand in an integral analog of a 5 F4 identity. Gosper, Zeilberger, and Wilf have done significant work toward devising com- computer algorithms for finding and proving hypergeometric identities. We discuss the Wilf-Zeilberger method and compare it with that of Pfaff; both are applications of contiguous relations. 3.1 Quadratic Transformations Exercise 2.19 asks the reader to prove the following relation satisfied by Jacobi polynomials: lJn p(g,-l/2),2 2 _ n ГЗ 1 П This important formula can be stated in hypergeometric form as /-2и,2и + 2а +1 \ (-n,n + a + \ Д 2FA ^ ;(l-x)/2)=2F,( + . 2\l-x2 . C.1.2) Note that the 2F\ on the left is linear in x and the one on the right is quadratic in x. This is an example of a quadratic transformation. In this section we give the fundamental results on quadratic transformations with two free parameters. It is natural to suspect that C.1.2) continues to hold when the two series do not terminate. This can be shown directly by rewriting C.1.2) as 2a,2b \ ( a,b \ , ;x =2^1 , ;4x(l -x) C.1.3) :+fo+2 / \a + b+2 J and expanding the right-hand side as a power series in x. The coefficient of x" is a balanced 3F2, which can be summed by the Pfaff-Saalschiitz identity (Theorem 2.2.6). However, two cases have to be considered: n even and n odd. An equivalent identity C.1.4) does not have this problem. Apply Pfaff's transformation (Theorem 2.2.5) to the left-hand side and re-label the parameters and variable. The result we want to prove is Theorem 3.1.1 For all x where the two series converge, a,b \ „ ч_„ „/a/2, (l+a)/2-b -4x C.1.4) u-oti (i — X)" j Proof. Write the series on the right as y^ Kul^)kK~" T V" T L)l*-)k , ,fc.j ч-я+2* ^ Ща-Ь+\)к ^л (a/2)k(—b + (a + l)/2)t j.^(a + 2fe); • = > (-Ax) > -x1. L~l k\(a-b+\)k z—' /'! к=0 y=0
126 3 Hypergeometric Transformations and Identities It is easy to see that the coefficient of x" in the last expression is (-b + (a + l)/2)t(-4)*(fl + Ы)п-и f^ (a - b + l)kk\(n - k)\ Now observe that (a)n+k {a)n(a+n)k ( ' ' ia + 2k)-k= (aht -2Ща/2Ы(а So C.1.5) is the same as {a)n v^ {—b + (a + \)/2)k(a + n)k(-n)k n\ f^0 (a-b+l)k«a+l)/2)kk\ Application of the Pfaff-Saalschiitz identity shows that this balanced 3 F2 is equal to n\(a-b+ l)n' Clearly, this is also the coefficient of x" on the left side of C.1.4). This proves the theorem. ¦ Corollary 3.1.2 (Kummer [1836]) Proof. Let x —> —1 in C.1.4) and conclude by Abel's continuity theorem that a-b+ V ) \ a-b+ Now sum the 2 Fi on the right by Gauss's summation formula (Theorem 2.2.2). The corollary follows after an application of Legendre's duplication formula (Theorem 1.5.1). ¦ The quadratic transformation in Theorem 3.1.1 holds when the two series converge. This is no longer true for C.1.3). Both sides of C.1.3) converge for I < x < 1. Letting x —> 1 on both sides gives 2a,2b \ , 1 ; 1 = 1. + fc+ ) = 1. C.1.6) This implies, by Gauss's summation, that T(a +b + \)Г(\ - a -b) _ cosn(a-b) Г(Ь — a + 5Jr(a - b + \) cosn(a+b) This identity is not true in general, although it is true when a or b is an integer. When a or b is a negative integer, C.1.3) holds for all x. Otherwise it holds in the connected component of x = 0 when both \x\ < 1 and |4x(l — x)\ < 1. Gauss was the first to remark on this, although his paper remained unpublished until after analytic continuation was discovered. Gauss understood that a hypergeometric
3.1 Quadratic Transformations 127 function is many valued, citing sin x as a similar case; thus, he saw that C.1.3) does not hold for all x where the two sides converge. See Exercise 6 for the correct identity when i < x < 1. The set j < x < 1 in C.1.3) maps to x < — 1 in C.1.4). The function on the right- hand side is evaluated at a point inside the unit circle, but on the left an analytic continuation is needed to make sense of the function. Our first evaluation of 2F1(a, b; c; x) at x = 1 depended on Euler's integral representation ^ f t-\l ~ 0—41 - xt^dt. с-а) Jo It is possible to derive Kummer's identity in Corollary 3.1.2 by taking x = — 1 in the integral. In this case, с = a — b + 1, which makes the powers of 1 — t and 1 — xt equal. The other fundamental quadratic transformation comes from taking the powers of t and 1 — t to be the same, that is, с = 2a. Theorem 3.1.3 For all x where the series converge Proof. The left side equals ГBо) Г(а)Г(а) Substitute s = 1 — 2t or t = A — s)/2 and simplify to get ГBо)A - {x/2))~b П J-A x- ГBо)A - When n is odd, the last integral is zero. Otherwise it is Jo where n = 2m. So (a,b \ ГBа)(\-(х/2)Гь J^ (ЬJтГ(т+ к)Г(а) ( х N 2m 242e:xj = - _ rBa)r(I)(l-(x/2))-fe /fo/2,(fo+l)/2. / x \2\ Г(а)Г(а + |J2«-1 21\^ a + i ' \2 - x) )' C.1.8) An application of Legendre's duplication formula proves the theorem. ¦
128 3 Hypergeometric Transformations and Identities Remark It is worth observing that Legendre's duplication formula follows from C.1.8) by taking *=0. Theorems 3.1.1 and 3.1.3 contain the basic quadratic transformations. Others can be derived from these two by using the fractional linear transformations or the three- term relations connecting different solutions of the hypergeometric differential equation. Apply Pfaff 's transformation to the right side of C.1.4) to obtain ) 4 a-b+l ;( Replace 4jc/A + xJ with x to derive the equivalent formula \ a,b C.1.10) A combination of C.1.10) and C.1.7) is another useful transformation, namely, a b Ax \ , (a,a+\ -b : ; (i+^Ja2^i Д ; 2F1 To prove this, replace x with 4x/( 1 + xJ in C.1.7) and interchange a and b to get By C.1.10) it follows that the right side of the last formula is equal to the right side of C.1.11). This proves C.1.11). To see how three-term relations can be used to derive more quadratic transfor- transformations, recall formula B.3.13), Г(с)Г(с-а-Ь) ( a,b Г(с)Г(а + fo -?) A _ x)C-a-b2F.f c-a,c-b Apply this to C.1.3). The result is 2a,2b x+l\ Г(а + Ь+\)Г(\) ^(a,b 2 ( )() (a+\,b+\ 2 2*\ X Г(а)Гф) 2*\ 3/2 > C.1.12)
3.1 Quadratic Transformations 129 Some general remarks about quadratic transformations are now in order. There are two sides to the standard quadratic transformations: the linear side and the quadratic side. On the linear side, the variable is linear or the image of linear under the linear fractional transformation x{x — I). The parameters on the linear side are restricted by one condition, which comes from writing the 2F\ as an integral and equating two of the exponents of the functions in the integrand. This gives с = 2а, с = a — b + I, and a + b = 1. To obtain the complete list, symmetry in the parameters a and b and the Pfaff transformation are used. The complete list of conditions is с = 2a,c = 2b, c = a-b+l,c = b-a + l,a + b = \,c = . C.1.13) On the quadratic side, the variable appears in a quadratic form; the parameters are restricted by requiring that those in the numerator differ by j, or the denominator parameters differ by \ (that is, с = \ or с = |), or the Pfaff transformation is used. The conditions are 1 2' 1 a + -,c -a 1 + +-,c-a 1 + ~ 2'C 1 ~ 2'C 3 ~ 2" C.1.14) Note that we use a, b, с to denote generic parameters and that they may differ in a specific formula. The two sides of the quadratic transformation are split into two groups; one has с = la or с = 2b on the linear side and any of the other conditions C.1.13) in the other group. On the quadratic side, one set has с = | or с = |, and the other set has the remaining conditions in C.1.14). Notice that C.1.7) connects с = 2a with b = a + \ and that C.1.4) connects с = a -b+ 1 with с = (a + b + l)/2. The transformations that go to the group с = \ and с = | come from three-term relations. An example of this is C.1.12). Notice a simple way to distinguish those quadratic transformations that come from C.1.4) from those that come from C.1.7) via linear transformations. In C.1.4) the denominator parameters are equal; in C.1.7) they differ except for one value. Observe that C.1.11) is not one of these quadratic transformations. Both sides have conditions in C.1.13), and they were shown to be equal by show- showing that each is equal to a function with one of the conditions in C.1.14) satis- satisfied. Let us return to the proof of Theorem 3.1.1 and note that the balanced 3F2 that occurs in its proof has one specialization not needed to sum it. There is likely to be a more general quadratic transformation with one more degree of freedom. It would be nice if this were a transformation of a general 2^1, but our discussion of the
130 3 Hypergeometric Transformations and Identities connection between the quadratic transformation and Euler's integral makes this unlikely. The extra freedom comes at the 3F2 level. Whipple found the following quadratic transformation: F ( a,b,c 31 \a-b+l,a-c+l'X l_Li'a °, 1 ; 71 ^)- С3-1-15) a—b+l,a—c + \ A— xJ ) The proof of this formula is very similar to that of Theorem 3.1.1. The coefficient of x" in the expression on the right of C.1.15) is again a balanced 3 F2. The details are left to the reader. There exist examples of cubic transformations, though these are not as well understood as quadratic transformations. Two examples are given in Exercise 38. Kummer [1836] gave some quadratic transformations of 2F\ s with one free pa- parameter. The parameter was chosen so that more than one transformation could be applied to the 2 F\ function. Consider the following two fundamental transforma- transformations: and The 2^is on the left become identical when we make the denominator parameters equal. This means 2b = a — b + 1 orb = (a + l)/3. In this case = f аЛа+D/3 2Г\Bа+2)/3'Х The last equation was obtained by an application of the Pfaff transformation. Let
3.1 Quadratic Transformations 131 x —> —1 in the first and last expressions of C.1.16) to get F"/  \" ' //. I I I г "/ "' \" ¦ // . I Ba + 5)/6 '9j \3J 2 \ 2(a 2)/3) 4)/6)Г((а + 1)/2)" Now use Gauss's quadratic transformation C.1.3) and then Pfaff's transformation B.2.6) to obtain Recall that Gauss's formula holds in the connected component of the region — x)\<\ that contains the origin. Thus Vl — 4x + Ax2 = \ —2x. (A sim- similar argument applies in the derivation of the last equation in C.1.16).) Combine C.1.16) and C.1.17) to get Let x ->¦ | in the first equation to arrive at (a /fl/2,(a + l)/6 2(fl + l)/3 '9y~V3/ V 2(fl + l)/3 ' The result in B.3.13) connects the following three fl/2,(fl + l)/2 \ /fl/2,(fl + l)/2. 2(a + l)/3 У' 2 Л Bа + 5)/6 ' and G-2fl)/6
132 3 Hypergeometric Transformations and Identities Thus the third 2F1 can also be computed. In Exercise 2 we give a few more results of Kummer on quadratic transformations with one free parameter. 3.2 The Arithmetic-Geometric Mean and Elliptic Integrals Definition 3.2.1 LetO < к < 1. Following Legendre, the integral dt for xe [-1,1] C.2.1) о УA - t2){\ - k2t2) is called an elliptic integral of the first kind. Ifx = 1, this definite integral is called a complete elliptic integral and denoted by K. Thus К := K(k) := Г f * = Г f M . C.2.2) Jo Ja-t2)(i-k2t2) Jo J\ -k2sm2e To fully understand the integral C.2.1), one has to study its inverse, which is a Jacobi elliptic function. This is similar to looking at the integral /q* A — t2)~l/2dt as the inverse of the sine function. We shall take a very brief look at the Jacobi elliptic functions in Chapter 10. Here we consider how some theory of hypergeometric functions can be used to obtain interesting results about integral C.2.2). First note that the binomial expansion of the integrand A — k2 sin2 в)~1/2, integrated term by term, gives K1/21/24 (з-2-з) Now replace x with A — Vl — x2)/(l + Vl — x2) in the quadratic transformation C.1.11) to obtain 2b'" J V 2 У *"y b+l/2 Apply this to the hypergeometric function in C.2.3). The result is C-2'4) where k'2 = 1 — k2. To iterate this result, we introduce the following notation: ko:=k,k'm:=Jl-k2m, and *m+1 := iz^-, M = 0, 1,2,.... C.2.5) 1 + Km It follows from C.2.4) and C.2.5) that K(k) = fl Y^TK(km+i)- m=0 m
3.2 The Arithmetic-Geometric Mean and Elliptic Integrals 133 Observe that So Moreover, ^n < i - (i -4) <4- km < к and km —> 0 as m -> oo. l-K. m—0 m=0 converges, since J^mLo^ ~~ л/^ ~~ ^m) < SmU)^2™ converges. Thus on _ oo 7Г ш=0 ш ш=0 The fc^s can be obtained successively from C.2.6) The right side is the ratio of the geometric mean and the arithmetic mean of 1 and k'm. Now suppose there are two sequences {an} and {bn} with a0 = 1 and bo = к' such that b^ = k'n and *±L = C.2.7) Then and an an_i n-l -П oo , ,, Moreover, fe^ -> 1 implies that m=0 lim fon = lim an. n->oo n->oo From C.2.6) and C.2.7) it follows that an +bn and bn+i = \janbn. C.2.8)
134 3 Hypergeometric Transformations and Identities Conversely, it is easy to see that if two sequences {an} and {bn} satisfy C.2.8) with uq = 1 and bo = k', then C.2.7) also holds. The common limit of the two sequences is called the arithmethic-geometric mean of the sequences and it is equal to 2 Г1 л Jo dt This result is due to Lagrange and Gauss independently. See Cox [1984]. We state it as a theorem after formally defining the arithmetic-geometric mean. Definition 3.2.2 Suppose {an} and {bn} are two sequences such that a =: ao and b =: bo are real with a > b > 0 and an+\ = (an + bn)/2, bn+\ = y/anbn. Then the two sequences converge to a common limit M(a, b), called the arithmetic- geometric mean. Theorem 3.2.3 dd 1 _ 2 Г1 a, b) ж Jo M(a Proof. It is clear from the definition that M(Xa, Xb) — kM(a, b). So M{a, b) = aM(l, bid). The theorem follows from the result in the previous paragraph by taking k' = b/a. ¦ The fact that the two sequences in Definition 3.2.2 have a common limit can be obtained directly. It is easy to see that a = a0 > fli > a2 > • • • > an > ¦ ¦ ¦ > bn > bn_\ > • • • > b0 = b. Also, an + bn an - bn an+\ — t>n+i S an+\ — on = — bn = —-—. Thus a-b an-bn< 2" and the two sequences converge to the same limit. From the equation (an - bnJ 4(an +i it follows that the convergence is quadratic. Gauss saw this rapid convergence by computing the numerical example with a — -s/2 and b = 1. The first few values
3.2 The Arithmetic-Geometric Mean and Elliptic Integrals 135 of an and bn are n 0 1 2 3 4 an 1.41421356237 1.20710678118 1.19815694809 1.19814023479 1.19814023473 К 1.00000000000 1.18920711500 1.19812352149 1.19814023467 1.19814023473 Gauss calculated up to twenty-one decimal places but the above table is sufficent to illustrate our point. Somewhat later he calculated the ratio л/со where со = 2 f\l-x4r1/2dx. Jo On May 30, 1799, he noted in his diary that Af (a/2, 1) and ж/сЪ agreed to eleven decimal places and he conjectured that they were equal. Later he proved the more general result contained in Theorem 3.2.3. Once one has made this conjecture, however, it is not too difficult to prove the result. If I (a, b) denotes the integral in the theorem, then it is enough to prove I(a,b) = I(a\,b\), C.2.9) for then I(a,b) = I{a\,b\) = •¦¦ — I{am,bm) = ¦•¦ — lim I(am,bm). m->oo To prove C.2.9), Gauss defined a new variable в\ by 2a sin 0, a + b + (a - b) sinz 0] and asserted that (a2cos20 + fc2sin20)~1/2d0 = (a2cos20! +fc2sin201)~1/2rf01. This requires some computation, and C.2.9) follows. We started with an elliptic integral and then introduced the arithmetic-geometric mean. Clearly Gauss was approaching the problem from the opposite direction, as presented above. Let us now consider another way of arriving at the elliptic integral from the arithmetic-geometric mean. Gauss also studied the function /w- ' M(l+x, 1 -x) Now 2/ MA+,M-,2) = ('+'V(>'). <3.2.,0)
136 3 Hypergeometric Transformations and Identities Assume that / is analytic about t = 0. We want the analytic function that satisfies /@) = 1 and the functional relation in C.2.10). Since / is clearly even, f(x) = g(x2) for some function g and Replace t2 with x to get Ax - )=(l+x)g(xz). Write g(x) = J2T=o anx"> and use this functional equation to get I2 ¦ 32 I2 • 32 ¦ 52 d\ = #o/4, по = ~T^ 7^"Я(Ь 6E3 = — — —-dc\ 2 * 4 2 • 4 • о This suggests that + X) ) = A + X) lF\ Formula C.1.11), Ffa'b- lF\ 2b ' suggests the same identity. It is possible to show directly that f{x) = g(x2) is analytic. See the first few pages of Borwein and Borwein [1987]. However, it is easier to do the argument in the other direction, as done above. Definition 3.2.4 The complete elliptic integral of the second kind is defined as rn/2 E:=E(k):= A - k2 sm2e)l/2d6. C.2.11) Jo A theorem of Legendre connects the complete elliptic integral of the first kind with that of the second kind. Before proving it we state a lemma about the Wronskian of an hypergeometric equation. (For the reference to Legendre's book, where the result appears, see Whittaker and Watson [1940, p. 520].) Definition 3.2.5 If y\ and y2 are two solutions of a second-order differential equation, then their Wronskian is W(y\, y2) := у\у'2 — Угу\- Lemma 3.2.6 Ify\ and y2 are two independent solutions of the hypergeometric equation y" + (c — (a + b + l)x)y' — aby = 0, then A xc(l-x)a+b-c+1' where A is a constant.
3.2 The Arithmetic-Geometric Mean and Elliptic Integrals 137 Proof. Multiply the equation jcA - x)y'2' + (c-(a + b+ \)x)y'2 - aby2 = 0 by y\ and subtract from it the equation obtained by interchanging yt and уг- The result is x(l - x)(yiy'i - y2y'l) +(c-(a + b+ \)x)(yiy'2 - y2y[) = 0 or jcA - x)W'(yi, y2) + (c-(a + b+ l)x)W = 0. Now solve this equation to verify the result in the lemma. ¦ We shall need particular cases of the following two independent solutions of the general hypergeometric equation: fa, b \ У1 =2Fii c ;xj, C.2.12) C.2.13) Observe that from C.2.11) IT /I -I Л C.2.14) Theorem 3.2.7 EK' + E'K - KK' = f, w/геге /iT' := ?(?'), ?' := ?(*'), and ка=1- к2. Proof. Set x — k2 so that 1 — x = /c'2. The contiguous relation B.5.9) gives us Similarly, х(\-х)^=1-Е-\-(\-х)К. C.2.15) ax 2 2 -X{l-X)d?=l-E'-X-K>. C.2.16) Multiply C.2.15) by K' and C.2.16) by К and add to get EK' + E'K - KK' = 2jcA - x)W(K', K). With a = b = | and с = 1, Lemma 3.2.6 gives ЩАГ', K) = A/x{\ - x). So EK' + E'K — KK' is a constant. An examination of the asymptotic behavior of К as л; ->¦ 1 shows that the constant must be (?r/2). This is left to the reader. ¦
138 3 Hypergeometric Transformations and Identities It is possible to prove the following more general result of Elliot [1904] in exactly the same way. The proof is left as an exercise. The formula in Theorem 3.2.7 is called Legendre's relation. Theorem 3.2.8 „/\+a,-\-c \ „/'\-a,c+\ iFA l , , \x\2FA 2, ,l; 1 — x \ a+b+l )l \ b + c+l a+b+l ) \ b+c+l a + \ c \ г e+V+i >XJF Salamin [1976] and Brent [1976] independently combined Legendre's relation with the arithmetic-geometric mean to find an algorithm for approximating ж. We conclude this section with a brief sketch of this application. Some of the details are left for the reader to work out. Lemma 3.2.9 If{an] and {bn} are sequences in Definition 3.2.2, then 2Jn+i — Jn— anbnln, where Jn := f (a2 cos2 в + b2n sin2 в) 1/2d6, Jo and In has — | as the power in the integrand. Lemma 3.2.10 E(k)= M- V и=0 where c2n = a2n - b\. Proof. From C.2.9) we know that /„ = I (a, b) =: I. By Lemma 3.2.9, 2(jn+1 - a2n+1l) - (Jn - a2nl) = (а„Ьп-2а2п+1+а2пI
3.2 The Arithmetic-Geometric Mean and Elliptic Integrals 139 Rewrite this equation as Z [Jn+l an+\1) Z V" anl)—L Cn1 and sum it from n = 0 to n = m to get r-iAl, C.2.17) и=0 / where J := Jq. Now, *¦ \Jm+\ -am+1lm+i) — I cm+1 Since c^,+1 tends to zero quadratically, the last term tends to zero. Let m -*¦ oo in C.2.17) and then take a = 1 and b — k'. The lemma is proved. ¦ Theorem 3.2.11 7Г = where с2 = a2 — b2 with ao = 1 and bo = 4=. n n n " " v 2 Prao/ Take к = J=. Then /c' = J= and Legendre's relation becomes - = BE-K)K, where ? = ?D=) and K = K[~ 2 VV2/ VV2 This implies и=0 Since /i: = 7T/BMA, 4^)), the result follows. An algorithm based on this theorem has been used to compute millions of digits of it. Define лт '¦= _ Vm 2"r2 2^n=0Z Cn Then 7rm increases monotonically to n. Note that cn = y/a2 — b2 = c2_x/Aan. The an and bn are computed by the arithmetic-geometric mean algorithm. For more information on the computation of n, see Berggren, Borwein, and Borwein [1997].
140 3 Hypergeometric Transformations and Identities 3.3 Transformations of Balanced Series In the previous chapter we saw how a general 2F1 transforms under a fractional linear transformation and how to evaluate the sum of the series when x = 1. In the case of quadratic transformations, there were restrictions on the parameters. For higher p+iFp, transformations and summation formulas do not exist in general. There are, however, two classes of hypergeometric series for which some results can be obtained. Definition 3.3.1 A hypergeometric series is called k-balanced where к is a positive integer, if x = 1, if one of the a,s is a negative integer, and if p p /=0 /=1 The condition that the series terminates may seem artificial, but without it many results do not hold. The case к = 1 is very important, and then the series is called balanced or Saalschiitzian. Definition 3.3.2 If the parameters in the hypergeometric series satisfy the rela- relations a0 + 1 = a\ + bi = ¦ ¦ ¦ = ap +bp the series is called well poised. It is nearly poised if all but one of the pairs of parameters have the same sum. In Section 3.4 we shall give a connection between the two kinds of series consid- considered in Definitions 3.3.1 and 3.3.2. We begin with a study of balanced series. The main theorem of this section is the following result of Whipple, which transforms a balanced 4F3 to another balanced 4/% Theorem 3.3.3 (e - a)n(f - a)n d,e,f ' ) («)„(/)„ —n,a, d — b,d — с **F\d,a+\-n-e,a where
3.3 Transformations of Balanced Series 141 Proof. Start with Euler's transformation: Rewrite this with different parameters: Suppose c — a — b = f — d — e and multiply the two identities to get f-d.f-e \ (c-a,c-b The coefficient of x" on the left side is (a)k(bMf ~ d)n-k(f - e)n-t This expression can be rewritten as (f-d)n(f-e)n ( a,b,l- f-n,-n Equating this to the coefficient of x" on the right side, we obtain / -n,a,b, -f-n+ 1 4 \ d — f — +1 — f — +1 -n, с — а, с — b,\ — f — n U-d)n{f-e)n \ c,l-d-n,l-e-n ' )' This is equivalent to the statement of the theorem. This result is due to Whipple [1926]. For a different proof see Remark 3.4.2. ¦ The next result was given by Sheppard [1912]. Corollary 3.3.4 -n,a,b \ (d-a)n(e-a)n ( -n,a,a + b - n - d - e + 1 3^2 j . , .. . . ,; 11- d,e 'lJ- (d)n(e)n 3i\a-n-d+l,a-n-e+l Proof. Let / -*¦ oo but keep / — с fixed in Theorem 3.3.3 so that f-n,a,b,c \ f-n,a,b \ 4^3 , , ; 1 ->¦ 3^2 , ; 1 • V d,e,f ) \ d,e J
142 3 Hypergeometric Transformations and Identities A similar change takes place on the right side and the end result is —n,a,b \ (e — a)n d,e J (e)n Sheppard's transformation is obtained by applying this transformation to itself. The corollary is proved. ¦ The formula in Corollary 3.3.4 has some interesting special cases. For example, suppose the left side is ^-balanced, that is, d + e = k — n + a + b. Then the right side is a sum of к terms. In particular, к = 1 gives back the Pfaff- Saalschiitz identity. Corollary 3.3.5 a,b,c \ _ r(e)T(d + e— a—b — c) ( a,d-b,d — c d,e ' J ~ T(e -a)T(d + e - b -cK \d, d + e - b - с ' when the two series converge. Proof. Let n -> oo and keep f + n fixed. Since the number of terms of the series tends to infinity, Tannery's theorem may be used to justify the calculation. The left side in Theorem 3.3.3 becomes , ; \ d,e To find the right side, write (e-a)n(f-a)n T(e - a + n)T(f - a + n) Г(в)Г(/) («)„(/)„ Г(е-а)Г(/-а) Г(в + и)Г(/+и) Г(е) Г(д-/+1)Г(-и-/ + 1) Г(е-д+и) Г(е-а)Г(а-/-и + 1)Г(-/+1) Г(е + и) ' where Euler's reflection formula was used to derive the second equality. Recall that l-f-n — d + e-a-b-c. So (e - a)n(f - d)n j:{e)Y(d + e - a - b - с) / Г(д - / + 1)Г(е -а + п) As и -*¦ oo andn + /is fixed, we have— / -» oo and the expression in parentheses equals 1 in the limit. The corollary follows. ¦ Corollary 3.3.5 was given by Kummer [1836]. If we apply Kummer's transfor- transformation to itself we get a theorem of Thomae [1879]:
3.4 Whipple's Transformation 143 Corollary 3.3.6 \ d,e J Г (a) where s = d + e — a — b — с 3.4 Whipple's Transformation The main result of this section is an important formula of Whipple [1926] that connects a terminating well-poised jF6 with a balanced 4/% We prove it by a method of Bailey, which requires that we know the value of a general well-poised 3F2 at x = 1. In Chapter 2, we showed that the latter result, known as Dixon's theorem, is a consequence of Dougall's theorem. See Remark 2.2.2 in Chapter 2. Because Dougall's theorem is itself a corollary of Whipple's transformation, it would be nice if we had a directproof of Dixon's formula, that is, one that does not use Dougall's formula. Several such proofs are known. We give one that follows from a quadratic transformation given in Section 3.1. Theorem 3.4.1 f a,-b,-c 3 2l и = Г((а/2) + 1)Г(а + 6+1)Г(а- ~ Г (a + 1)Г((а/2) + * + 1)Г((а/2) + с + 1)Г(а + b + c+l)' Prvof. If a — —n, a negative integer, in the quadratic transformation C.1.15), then both sides of the equation are polynomials in x. Take x — 1. If a is an even negative integer, then we get -2n,b,c Л Bn)\(b + с + n)n \-2n-b,l-2n-c' ) (b + n)n(c + n)nn\ If a is an odd negative integer, then / -2n-l,b,c \ >F\-2n-b,-2n-c'l)=0- Thus Theorem 3.4.1 is verified when a is a negative integer. Now suppose that с is a positive integer and a is arbitrary. In this case both sides are rational functions of a and the identity is true for an infinite number of values of a. Thus, we have shown that the identity holds if с is an integer and a and b are arbitrary. In the general case, for Re с > Re(—a/2 — b— 1), both sides of the identity are bounded analytic functions of с and equal for с = 1, 2, 3, By Carlson's theorem the result is proved. ¦
144 3 Hypergeometric Transformations and Identities Kummer's identity, which gives the value of a well-poised 2^1 at x — —1, is a corollary of Dixon's theorem. To see this, let с -> oo. Remark 3.4.1 We noted earlier that the balanced identities in their simplest form come from the factorization A - x)-a(l - x)-b = A - x)-"-b. In a similar sense, the well-poised series comes from Equate the coefficient of x2n from both sides to get k\Bn -k)\ or -2n-b' ) n\(bJn This is Kummer's identity for a,b;-\ >F\a-b. when a is a negative even integer. As in the proof of Theorem 3.4.1, we can now obtain the general result of Kummer. The 2^1 result was so special that Kummer failed to realize that series of a similar nature could be studied at the 3 F2 and higher levels. This is not surprising for well-poised series are not on the surface. We also need the following lemma to prove Whipple's theorem. It is proved by Bailey's [1935] method mentioned at the beginning of this section. Lemma 3.4.2 Ff a,b,c,d, —m \a — b+ l,a — с + l,a — d + l,a + m + I' (a/2), a -b-c+l,d,-m ((a/2)+ \)m{a-d + \)m \a - b + I, a - с + \,d - m - a/2' Proof. By the Pfaff-Saalschiitz identity for a balanced 3 F2 we have (-n)r(a-b-c + l)r(a + n)r (b)n(c)n r\(a -b + \\(a -c+\)r ~~ (a-b + 1)„(а - с + 1)„ '
3.4 Whipple's Transformation 145 So / a,b,c,d,-m 5 \a-b+l,a-c + l,a-d + l,a + m + l' ^-^ (а)„(а)„(—от)и ^-^ (—n)r(a — b — с + l)r(a + n)r n\(a — d + \)n(a + m + 1)и ^ r!(a — & + l)r(a — с + l)r (— l)r (a)n+r(d)n(—m)n{a — b — с + l)r г — r)\r\{a — b + l)r(a — с + \)r(a — d + \)n(a + m + l)n (sett =n -r), (a)t+2r(d)t+r(-m)t+r(a -b-c+ l)r(-l)r r=0 (=0 l !r!(a - *+ l)r(a - с + l)r(a - J + l),+r(a + m \(a-b+ \)r(a -c+ \)r(a -d+ \)r(a + m + l)r (a+2r),(fif + r),(-m+r)( The inner sum can be computed by Dixon's identity (Theorem 3.4.1). An easy calculation then gives the required relation. ¦ Lemma 3.4.2 transforms a terminating well-poised 5F4 to a balanced 4F3. Corollary 3.4.3 a,(a/2) + l,c,d,-m \ (a + l)m(a - с - d + l)m ; 1 а/2,а-с+1,а-а" + 1,а+от-|-Г ) {a-c+\)m(a-d+\)m Proof. Take fe = (a/2) + 1 in Lemma 3.4.2. The 4F3 reduces to a balanced 3F2. ¦ Theorem 3.4.4 a,(a/2) + l,b,c,<i,e,-от 7 \a/2,a - (a + \)m(a -d -e+ \)m r ( a-b - с+\,d,e,-m (a~d+ \)m{a -e + l)m4'
146 3 Hypergeometric Transformations and Identities Proof. The proof of this theorem is exactly the same as that of Lemma 3.4.2 except that one uses Corollary 3.4.3 instead of Dixon's theorem. Thus a, (a/2) + 1, b, c, d, e, — m a/2, a -b+l,a-c+\,a-d+l,a-e+l,a+m + l' ^ (a)«((a/2) + l)n(d)n(e)n(-m)n f^Q и!((а/2))„(а - d + 1)„(а - e + 1)„(а + m + l)n E(-n)r(a - b - с + 1)Да + n)r r\(a - b + l)r(a - с + l)r After a calculation similar to the one in Lemma 3.4.2, this sum equals ^U (аJг((а/2) + l)r(d)r(e)r(-m)r yC (a + 2r),((a/2) + r + !),(«/ + r),(e + r),(-m + r), ¦^ t\((a/2) + r),(a — d + r + l),(a — e + r + I),(a + m +r + 1),' The inner sum can be evaluated by Corollary 3.4.3 and the result follows after a straightforward calculation. ¦ Since the two elementary identities in Remark 3.4.1 are not related, we see the very surprising nature of Whipple's identity. In a later chapter we give a more natural proof of Whipple's theorem as a consequence of some properties of Jacobi polynomials. We refer to the above 7 Ff, as a very well poised 7 F(,. The word "very" refers to the factor ((a/2) + l)t = а + 2k (a/2)t ~ а The 5F4 in Corollary 3.4.3 is also very well poised. Remark 3.4.2 Theorem 3.3.2 is a particular case of Theorem 3.4.4 because of the symmetry in the parameters b, c, d, e in the 7^. Whipple also stated a more general form of Theorem 3.4.4. Theorem 3.4.5 ( a,(a/2) + l,b,c,d,e,f 1 \a/2,a-b+l,a-c + l,a-d + l,a-e+l,a- /+Г Г (a - d + 1)Г(а -е+ 1)Г(а - / + 1)Г(а -d-e-f + l) Г (a + 1)Г(д -e-f+ 1)Г(а - а" - e + 1)Г(а - а" - / + 1)  1 \л ts ^i-,»*,^,^ .11
3.5 Dougall's Formula and Hypergeometric Identities 147 provided the series on the right side terminates and the one on the left con- converges. Proof. This is a consequence of Carlson's theorem and Theorem 3.4.4. The reader should work out the details or see Bailey [1935, p. 40]. ¦ Theorem 3.4.6 F( a,(a/2) + l,b,c,d,e 6 \a/2,a-b+l,a-c + l,a-d+l,a-e + l' _ Г(а-дЧ-1)Г(а-е+1) / a - b-с+ l,d,e d-e + lf \a - b+ I,a - с + Г Proof Let m -> oo in Theorem 3.4.4 to prove the result. ¦ Observe that Theorem 3.4.6 connects a general 3F2 at x = 1 with a very well poised 6^5 at x = —1. 3.5 Dougall's Formula and Hypergeometric Identities Set 2a + 1 = b + c + d + e — min Theorem 3.4.4. The 4F3 reduces to a balanced 3F2, which can be summed. The result is Dougall's formula. Theorem 3.5.1 F( a,{a/2) + \,b,c,d,e,-m 1 \a/2,a-b+\,a-c+ l,a-d+l,a-e+l,a + m + l' = (a + l)m(a -b-c+ l)w(a -b~d+ l)w(a - с - d + l)w (a - ft + l)m(a - с + l)m(a - J + l)m(a - ft - с - d + l)m ' when la + 1 =b + c + d + e — m. This formula sums a 2-balanced very well poised 7 Tv In the following identities, convergence conditions need to be imposed. They are not explicitly stated, since they are easy to work out in each case. Corollary 3.5.2 ( a,(a/2)+l,c,d,e 5 \a/2,a-c+l,a-d + l,a-e + l' _Г(а-с+ 1)Г(а - d + 1)Г(а -е + 1)Г(а -c-d-e + 1) ~ Г (а + 1)Г(а -d-e+ 1)Г(а - с - e + 1)Г(а - с - d + 1)' Prao/ Substitute b = 2a — c — d — e + m + lin Theorem 3.5.1 and let m-»oo. This procedure may be justified by Tannery's theorem. The corollary follows. ¦
148 3 Hypergeometric Transformations and Identities One may also derive this corollary from Corollary 3.4.3 of the previous section by an application of Carlson's theorem. Dixon's formula follows from Corol- Corollary 3.5.2 by taking e = a/2. The next corollary gives the value of a very well poised 4F3 at x = — 1. Corollary 3.5.3 ( a, (a/2) , .,о,„ . _ a/2,a-c + l,a-d+ Г / Г (a + 1)Г(а - с - d + 1) Proof. Let e -> —00 in Corollary 3.5.2 and the result follows; or else take Z? + c = a + lin Theorem 3.4.6. ¦ Here are a few more summation formulas. Theorem 3.5.4 <¦¦* M/2U Г(с/2)Г((с+1)/2) + в)/2)Г((с-в- Proof Let x -> — 1 in Pfaff's transformation (Theorem 2.2.5), to get There are two ways in which the series on the left becomes well poised so that it can be summed by Kummer's identity (Corollary 3.1.2). These cases are (i) 2c — b — a + 1 and (ii) a + с = с — b + \ or a + b = 1. The two parts of the theorem follow immediately. ¦ Theorem 3.5.5 ГA/2)Г(с + A/2))Г((а + b + 1)/2)Г(с -(a+b- Г((а + 1)/2)Г((* + 1)/2)Г(с - (a - 1)/2)Г(с - (b -
3.5 Dougall's Formula and Hypergeometric Identities 149 ,'a,b,c (и) з^2 , ; 1 лТ(е)Г(/) 22сГ((а + е)/2)Г((а + /)/2)Г((Ь + е)/2)Г((Ь - when а + Ъ - 1 and е + / = 2с + 1. Proof. These results follow from Thomae's formula (Corollary 3.3.6) by choos- choosing parameters appropriately. To obtain (i), choose the parameters so that the right side becomes well poised. Thus d — (a + b + l)/2 and e = 2c and Thomae's formula gives a,b,c Л Г((а+Ь+1)/2)ГBс)Г(с-(а+Ь-1)/2) (а+Ь+1)/2,2сУ ) Г(а)Г(с-(а-Ь-1)/2)ГBс-(а+Ь-1)/2) /2c-a,(b-a + l)/2,c-(a+b-l)/2 \ Now apply Dixon's identity to get (i). Note that we require с > (a + b - l)/2. The 3F2 also exists without this condition when с = — n, a negative integer, if the series is taken to terminate with n + 1 terms. The value of this series can be found, but it is not what one gets by letting с tend to —и. (ii) To prove this identity choose the parameters so that the right side is the 3F2 given by (i). Thus take a + b = 1 and e + f = 2c + 1 to get (a,b,c \ _ Г(е)Г(/)Г(с) /e-a,f-a,c 3 V,/' ) Г(а)Г(Ь + с)ГBс)Ъ \ b + c,2c ' An application of (i) at this point gives (ii). ¦ Note that Theorem 3.5.4(i) and (ii) are limiting cases of Theorem 3.5.5(i) and (ii) respectively. Let с -> 00 to see this. Part (i) of the last theorem is due to Watson [1925], who proved it for the case where a = —n, a negative integer. Watson's theorem can also be obtained by equating the coefficient of xn on each side of the quadratic transformation C.1.11). The general case then follows by Carlson's theorem. Another way is to multiply Equation C.1.3) by (x — x2)c~l and integrate over @, 1). This only works in the terminating case. Remark We end this section with the following comments on well-poised series. Let l ~n,aua2,. aq \ -n -a\,..., 1 -n —aq )
150 3 Hypergeometric Transformations and Identities Then the polynomial f(x) satisfies the relation fix) = (-iynx"fil/x). A polynomial g (x) that satisfies g(x) = xng(l/x)is called a reciprocal polynomial of degree n. These polynomials have the form gix) = ao + a\x + a2x2 + ¦ ¦ ¦ + a2x"~2 + a\xn~x + uqx" . Note that fix) is reciprocal if either q or n is even. It is easy to check that if g is the reciprocal polynomial given above then gix) = oo(l + x)" + aixil + x)"-2 + ¦¦¦ + avxvil + *T~lv C.5.1) for some an, й\,..., av which can be defined in terms of an, a\, Here v = [и/2]. It can be shown that aJ= 1^ 0<2r<n and 0^, 7"' Jn-j-i Note that C.5.1) can also be written as ) Uk 2k- C-5-4) Observe the connection of C.5.4) with the quadratic transformations C.1.4) and C.1.15). 3.6 Integral Analogs of Hypergeometric Sums In Chapter 2 we saw two integrals of Barnes that were continuous analogs of Gauss's 2 F\ identity and the Pfaff-Saalschiitz 3 F2 identity. There are other Barnes- type integrals that are analogs of the higher pFq sums we have considered in this chapter. The path of integration in the integrals will be parallel to the imaginary axis but suitably deformed so that the increasing sequence of poles of the integrand is separated by the contour from the decreasing sequence of poles. The following theorem of Bailey is an analog of Corollary 3.5.2, which sums a very well poised 5F4 at x = 1. See Bailey [1935, p. 47].
3.6 Integral Analogs of Hypergeometric Sums 151 Theorem 3.6.1 2) + 1 + s)T(b + s)T(c + sW(d + s)r(b -a- s)r(-s)ds J_ Г Г(а Ъп J Г((а/2) + «)Г(а - с + 1 + «)Г(а - d + 1 + s) Гф)Г(с)ГУ)Гф + с~ а)Гф + d-a) Proof. The proof is similar to that of Theorem 2.4.2. The reader should fill in the details. The residues at the poles of Г(Ь — a — s)V(—s) to the right of contour give the integral as the sum of two very well poised 5 F4. These can be summed by Corollary 3.5.2. The result follows. ¦ A different and useful form of Theorem 3.6.1 was given by Wilson [1978]. We note it here. Theorem 3.6.2 ! + s)T(a - х)Гф + х)Гф - s)T(c + s)T(c - s)V(d + s)F(d - s) 1 f Г (а bd J T{2s)T{-2s) 2Г(а + b)T(a + с)Г(а + d)Yф + с)Гф + d)V(c + d) ¦ds Here the contour is along the imaginary axis but suitably deformed. As always, we are assuming that a, b, c, d are such that this can be done. Proof. In Theorem 3.6.1 replace a with 2a; b, c, d with b + a,c +a, and d + a respectively; and s with s — a. We get Г (а + s)T{s + l)F(b + s)T(c + s)V(d + s)V(b - s)V(a - s) — f \ni J - d + s) Г (а + Ь)Г(а + с) Г (а + d)V(b + с)Г(Ь + d) -ds 2ГA-c-d)r(a + b + Use Euler's reflection formula to rewrite this as Г (а + s)V(b + s)V(c + s)V(d + s)V(a - s)V(b - s)V(c - s)V(d - s) 2ni J rBs)r(-2s) sin(c — s)n sin(d — s)?t ¦ ds —smsn cos sn sin(c + d)n _ 2Г(а + Ь)Г(а + с)Г(а + d)T{b + c)T{b + d)T{c + d) ~ Г (a + b + с + d) The trigonometric expression in the integrand is sin en sin dn cos2 sn + cos en cos dn sin2 sn 1 - sin(c + d)n sin sn cos sn
152 3 Hypergeometric Transformations and Identities Now observe that the second term involving the trigonometric functions changes sign when s is changed to —s. Hence that part of the integral vanishes and the theorem is proved. ¦ If a, b, c, d are positive or a = b and/or с = d and the real parts are positive, then we can write Wilson's formula as -I Г (a + ix)F(b + ix)F(c + ix)F(d + ix) z dx FBix) _ Г(д + b)F(a + с)Г(д + d)F{b + c)F(b + d)F(c + d) F(a+b + c + d) The result in C.6.1) was given also by de Branges [1972, 1972a]. Formula C.6.1) continues to hold when one of the parameters is zero. The following corollary is the result of letting d —> oo: 2n Jo Г (a + ix)F(b + ix)F(c + ix) dx = F(a+b)r(a+c)r(b FBix) C.6.1) There is also an analog of the Dougall 7 F6 formula. This is the next theorem, also given by Bailey. Theorem 3.6.3 Г (a + s)F((a/2) + 1 + s)F(b + s)F(c + s)V(d + s) In i J Г ((a/2) + s)V(a - с + 1 + s)F(a - d + 1 + s) F(e + s)F(f + s)F(b -а- ^г(~^)^ Г (a - e + 1 + s)F(a - f + 1 + s) F(b)F(c)F(d)F(e)F(f)F(b + c-a) 2F(a -d-e+ 1)Г(а -с-е+ 1)Г(а - с - d+ 1) F(b+d- a)F(b + e - a)F(b + / - a) " Г(в - с - / + 1)Г(а -d-f+ 1)Г(а - e - f + 1)' when 2a+ 1 = b + c + d + e + f. C.6.2) Proof. It is not possible to evaluate this integral as in Theorem 3.6.1. The two 7 F6 series cannot be summed by Dougall's formula unless they terminate. However, if they terminate, a contour separating the increasing and decreasing sequences of poles cannot be constructed. It is possible, nonetheless, to give a proof that is the integral analog of the proof of Theorem 3.4.4. Start with the formula in
3.6 Integral Analogs of Hypergeometric Sums 153 Theorem 2.4.3 in the following form: Г (а - d + 1+ s)V(a -e+l+ s)V(a - f+l+s) _ 1 ~ Г (a - d - e + 1)Г(а - J - / + 1)Г(а - e - f + 1) Г(</ + г)Г(е + г)Г(/ + ?)Г(а -rf-e-/ + l- г)Г(* - t) 2ni J dt. C.6.3) The left side of C.6.4) is a part of the integrand in C.6.3). Substitute this in C.6.3) to get 1 f V(d + t)T(e + t)V(f + t)V(a -d-e-f+l-t) 2ni J V(a-d-e+ 1)Г(а - d - f + 1)Г(а - e - f 1 f Г (а + *)Г((а/2) + 1 + s)T(b + s)V(c + s) ч! r((a/2)+s)F(a-c+l+s) r(b-a-s)r(s~t)r(-s)J ¦ dsdt. Evaluate the inner integral by Theorem 3.6.1. The resulting integral then reduces to 1, which can be computed by C.6.4) when 2a + l=b + c + d + e + f. The theorem follows. ¦ An analog of Dixon's well-poised 3F2 can also be derived. Take d = a/2 in Theorem 3.6.1 to get the required result: 1 ГГ(а+ s)V(b + s)V(c + s)V(b -a- s)V(-s) J ds 2ni J Г(Ь)Г(с)Г(а/2)Г(Ь + с - а)Г(Ь - u, -„ = C.6.4) 2Г((а/2)~с+1)Г(Ь + с-(а/2)) To obtain a more symmetrical form, replace a with 2a and b, с with b + a,c + a respectively and s with s — a to get Г (а + s)V(b + s)V(c + s)T(b - s)V(a - s) ds b)r(a + __z C65) 2ГA~с)Г(а+Ь + с) Now apply the reflection formula and do the simplification (as employed in the
154 3 Hypergeometric Transformations and Identities reduction from Theorem 3.6.1 to Wilson's integral) to get 1 f / Г (a + s)T(b + s)T(c + s)F (a - s)T(b - s)T(c - s) cos snds 2л i J Г(а)Г(Ь)Г(с)Г(а + b)T{a + c)T(b + c) = 2Г(а+Ь + с) • If a, b, and с are positive, we can write the formula as - / \Г(а + ix)T(b + ix)T(c + ix)\2 coshлxdx л- Jo Г(а)Г(Ь)Г(с)Г(а + Ь)Г(а + с)Г(Ь + с) Г(а 3.7 Contiguous Relations C.6.7) In the previous chapter we gave the three-term contiguous relations of Gauss for the 2F{ functions. More generally, there are (q +2)-term relations for pFqs with p < q + 2. Under certain conditions, these relations become three-term relations. Kummer observed that it was possible to obtain such relations for the 3F2s when x = 1. Bailey [1954] gave a procedure using the differential equation satisfied by the 3f2 to produce these relations. A simpler method was given by Wilson [1978]. This applies to more general pFqs. In this section we use Wilson's method to derive his results on the three-term contiguous relations for balanced 4F3S. These contain the three-term recurrence relations for a set of orthogonal polynomials due to Wilson, which contain the "classical" orthogonal polynomials as special cases. Before describing Wilson's method, we note that it is possible to obtain three- term relations for the 3 F2S from the contiguous relations for the 2 F\ s by integration. For example, multiply the equation (b - d)F + aF(a+) - bF{b+) = 0 by xd~l(l - x)e~d~l and integrate over @, 1) to get ,, . „(a,b,d \ (a + \,b,d Л , „(a,b+\,d Л „ (b-aKF2{ ;1 )+a3F2[ ¦,\)-b3F2[ ; 1 = 0. V c,e ) \ c,e J \ c,e J As another example, apply this procedure to а,Ь \ (c-b)ax (a+\,b+\ \x\ 2 е/ c(c + l) to arrive at a,b + l,d \ rfa,b,d Д a(c - b)d fa + \, ;1Ч ;1Ч c+l,e ;1ЛЧ с,е ;1Г^ПO3Ч с + 2,е
3.7 Contiguous Relations 155 Let us now turn to Wilson's procedure for systematically deriving all the con- contiguous relations of a balanced 4F3. Note that if only one of the parameters in a balanced 4F3 is altered, the new 4F3 is not balanced. Definition 3.7.1 Given a balanced 4F3, a contiguous 4F3 is obtained by altering two parameters by±\ in such a way that the new series is also balanced. As before, a relation among contiguous functions is called a contiguous relation. Denote the balanced 4F3(a, b, c, d; e, f, g; 1) by F. By definition, one of the numerator parameters is — n and the sum of the denominator parameters is one more than the sum of the numerator parameters. There are2 x G2) = 42 4F3s contiguous to F. Consider the difference F(a—, b+) — F. Since (a - \)k(b + l)k(c)k(d)k k\(e)k(f)k(g)k k\(e)k(f)k(g)k (a)k-i(b + l)t k\(eMf)k(g)k (a-b-l), (k - \)\(e)k(f)k(g)k we have Т7, и,л it (a-b- \)cd F(a-,b+)- F = F+(a-), C.7.1) efg where F+ is obtained from F by increasing every parameter by 1. Similarly, F(a-,e-)-F=f^-F+{fl-), C.7.2) (e - \)efg „, . ., j, (e-a)bcd F(a+, e+) - F = — —— F+(e+), C.7.3) e(e+l)fg (e - f + \)abcd F(e+, f-)-F= ^—^-777 F+(e+)- C-7-4) e(e + \)fg By symmetry in the parameters, we now have expressions for all the differences between F and a function contiguous to it. Some of the contiguous relations are immediate corollaries of these relations. By C.7.1), Therefore, equating the two expressions for F+(a~), we get b(a-c- l)(F(a-, b+) - F) = c(a - b - l)(F(a-, c+) - F). The other contiguous relations would follow once we find an equation connecting F with F+(a—) and F+(e+). Of course, this equation would imply the other
156 3 Hypergeometric Transformations and Identities necessary relations by symmetry. To derive the required relation, take a = —n and apply the transformation in Theorem 3.3.1 to obtain (f)n(g)n „ c( a,b,e-c,e-d (f-b)n(g-b)n \e,e + f-c-d,e + g-c-d' a, b + 1, e — с, е — d -) =4^3 (/-*)„(*-*)„ and (/+l)»-l(g +!)„-! (f - b)n-i(g - b)n-i a + l,b+l,e-c + l,e-d+l Now the connection between F, F(b+, e+), and F+(e+) is given by C.7.3). This implies the relation among F, F+(a—), and F+(e+), which is given by fgF - (/ - a)(g - a)F+{a-) + п(в ~ **"?*" ^ F+(e+) = 0. C.7.5) In this derivation we also used the fact that a+b + c + d+\ — e + f + g. We note that for C.7.5), it does not matter which numerator parameter is a negative integer. This is true because F is a rational function of the five free parameters. Thus we also have t г lf ил, им? ,и л , He - aXe-cXfi - d) fgF - (/ - bXg - b)F+{b-) -\ F+(e+) = 0. e(e + 1) Eliminate F+(e+) from the last two equations to get b(e - aXf ~ a){g - a)F+(a-) - a(e - bXf ~ b)(g - b)F+(b-) + {a- b)efgF = 0. C.7.6) The final relation we need is similarly obtained from C.7.5). It is (e - ci)(e - bXe - cXe - d) (/ - aXf ~ bXf ~ cXf ~ d) p /fi^ , f,f + V) F+(f+) + g(e- f)F = 0. C.7.7) All the contiguous relations can now be obtained from C.7.1) to C.7.7). As one more example, substitute the values of F+(a—) and F+(b—) from C.7.1) into
3.8 The Wilson Polynomials 157 C.7.6) to get Ke-a)(f-a)(g-a)i a-b-l a(e - b)(J - b)(g - b) (F(a+,b-)~ F)+cd(a-b)F = 0. C.7.8) b-a- 1 The contiguous relations for the 3 F2 s are found by letting n -» 00 in the relations for the 4F3S. One may also write down the fundamental relations corresponding to C.7.1) to C.7.7) and derive the others from these. We give a few examples. In C.7.1), let a — —n and n -» 00 to get, after renaming the parameters, F(a+) -F = ^F+, C.7.9) de where F stands for a general 3F2 at x = 1. In C.7.2), let d = —n and e = —n + a + b + с — f — g — 1 and n -> 00. After renaming parameters, F(«-) - F = -~F+{a-). C.7.10) de It is possible to get C.7.10) from C.7.1) too. Similarly, we have -) -F= abC F+ C.7.11) (d — \)de and F(d+)-F = - f\ F+(d+). C.7.12) a(a + l)e By taking limits in different ways in C.7.5) we get two more relations: deF -a(d + e-a-b-c- \)F+ - (d - a)(e - a)F+(a-) = 0 C.7.13) and eF~(e- a)F+(a~) - ^ ~^~ C) F+{d+) = 0. C.7.14) The rest of the relations can be found in a similar manner. In fact, all the 3 F2 contiguous relations follow from C.7.9) to C.7.14) if symmetry in the parameters is also used. 3.8 The Wilson Polynomials Consider the polynomial pn of degree n > 0 defined by the relation ;;l) C.8.1) a + b,a + c,a + d I
158 3 Hypergeometric Transformations and Identities where a, b, c, d are real and positive. From the contiguous relation C.7.8) we find that pn satisfies the three-term recurrence relation An(pn+i(x) - pn(x)) + С„(р„_,(х) - pn(x)) + (a2+x)pn(x) = 0, C.8.2) where (n + a+b + c + d- \){n + a + b)(n + a + c)(n + a + d) " Bn + a+b + c + d-l)Bn + a and n(n + c + d- \){n + b + d- \){n cn = c+d- 2)Bn + a+b + c+d-\)' As we have remarked before, since AnCn+x > 0 for n > 0, the polynomials pn are orthogonal with respect to some positive weight function. In fact, it will be shown that 2л- о Г (a + ix)T{b + ix)r(c + ix)T{d + ix) 2 2 2 pn(xz)pm(xz)dx rBi*) C.8.3) = ^m,nn\{n + a +b + с + d — 1)„ Г(а + b + п)Г(а + с + п) ¦ ¦ ¦ Г (с + d + п) х , Г(а +b + c + d + 2n) where р„(х2) - (а + Ь)п(а + с)„(а + d)npn(x2). The relations given above continue to hold when a — b and/or с = d and the real parts of these parameters are positive. Definition 3.8.1 The Wilson polynomials pn (x) are defined by Pn(x2) = (a + b)n{a + c)n{a + d)n ( —n, n + a + b + c + d — 1, a — ix,a + ix \ a + b,a + c,a + d where a,b,c,d are complex parameters. It is evident from the definition that pn(x) is symmetric in b, c, and d. An application of Theorem 3.3.3 shows that symmetry in all four parameters a, b, c, and d exists. The Wilson polynomials are orthogonal with respect to the integrand of Theorem 3.6.2 as a weight function. We denote the integrand by f(s).
3.8 The Wilson Polynomials 159 Theorem 3.8.2 With the contour and the parameters a, b, c, and d as in Theo- Theorem 3.6.2, t— / f(s)pn(-s2)pm(-s2)ds = 8m,n2n](n 2m J Г (a + b + п)Г(а +c + n)T(a + d + п)Г(Ь + с + n)T(b + d + п)Г(с + d + n) Г(а + b + c + d + 2n) Proof. First observe that we can write m pm(-s2) = ^Ak(b- s)k(b + s)k, *=0 where Ak are suitable constants. We compute ^*-r f f(s)pn(-s2)(b - s)k(b + s)kds 2ni J = (a + b)n(a + c)n(b + с)пУ (a+b)j(a+c)j(b- ¦ ^— I f(s)(a - s)j(a + s)j(b - s)k(b + s)kds. C.8.4) The integral in the sum can be rewritten as 1 f Г(о + j + s)T(a + j - s)T(b + к + s)T(b + к - э)Г(с + s)T(c - s)r(d + s)T(d - s) , J с This integral can be evaluated by Theorem 3.6.2. After a little simplification we see that C.8.4) is equal to 2Г(й + b+ к)Г(а +с + п)Г(а + d + п)Г(Ь + с + к)Г(Ь + d + к)Г(с + d)(a + b)n „ ( -n, n + a + b + c + d - l,a + b + k ¦3F2I ; 1 \ a+b, a + b + c + d +k This 3F2 is balanced and so can be summed by the Pfaff-Saalschiitz identity. We then get Ini J f(s)pn(-sz){b - s)k(b + s)kds = 2(-k)n T(a + b + k)T(a + с + n)T(a + d + n)T(b + с + k)T(b + d + к)Г(с +d + n) T(a + b + c + d+n+k) The factor (—к)„ is zero for к < п. By symmetry in a and b we know that Pn(s2) = {-=^-{n +a+b + c + d- \)n(b - s)n{b + s)n n\ k=0 This completes the proof of the theorem.
160 3 Hypergeometric Transformations and Identities The result in C.8.3) follows from Theorem 3.8.2 when a, b, c, and d are positive or when a — b, с = d, and the parameters have positive real parts. The Wilson polynomials contain many sets of orthogonal polynomials as limiting or special cases. Here we show how the Jacobi polynomials (introduced in the previous chapter) are derived. Set a = b = (a + l)/2, с = d = ф + l)/2 + ico, and x = «V(l - 0/2 in the 4F3 in Definition 3.8.1. Let со ->• oo. We get, except for a constant factor, the Jacobi polynomial a + 1 3.9 Quadratic Transformations - Riemann's View Riemann exploited to the fullest degree the idea that a function is determined to a large extent by its singularities. An example of this was given in the previous chap- chapter in a discussion of the hypergeometric differential equation. Here we show how the ideas developed there give Riemann's basic result on quadratic transformations. The hypergeometric equation has regular singularities at 0,1, and 00 but no other singularities. Suppose the exponents at 0 are 0 and 1/2. By a change of variables x = t2, the solutions at 0 become analytic. But now singularities are introduced at t = ±1 and we get another hypergeometric equation. To see the details, note that B.3.4) of Chapter 2 shows that C.9.1) C-9-2) is the set of solutions of the d2y /1 \-bi- dx2 \2x x-\ The change of variables x = dy 1 dy dx 2t dt ( 0 oo p\ 0 a I 1/2 c2 equation b2\dy | ( b )dx + {x = t2 implies and ^У 1 h b2 -i l 2t A j \ ее) У -Idy 1 d2y 2t2 dt 2t dt2 Substitute this in C.9.2) to obtain an equation that can be written as d*y dt2 t -1 2bxb2 l-bi-b2\dy ~dt C.9.3)
3.9 Quadratic Transformations - Riemann's View 161 Apply Theorem 2.3.1 to conclude that the set of solutions of C.9.3) is It follows that Theorem 3.9.1 = P{ bi 2ci bi t) C.9.4) -1 b\ b2 0  b2 oo 2c2 oo 2ci 2c2 1 1 ?l f > *2 J 1 h ^ *>2 This is Riemann's theorem on quadratic transformations. Let us derive the two basic quadratic transformations for the 2F\ contained in Theorems 3.1.1 and 3.1.3. Write C.9.4) as = P { с а 0 1-х2 x- 1 Set с = 0, and replace 2b with 1 — 2b + a, 2a with a, and J with b —ato get .6-0^-6 1/2 or
162 3 Hypergeometric Transformations and Identities Since there is only one solution analytic at 0 with value 1 at that point, we get b 4x il-xJF\a-b+l'X)=2F\ a-b+l This is the result of Theorem 3.1.1. Similarly, we have 0 oo 1 j Л ( 0 oo 1 P I a О с — \ = P I 2a с с or b 1/2 d ~ ) {2b d d 1+X Oool P I la с с х > = P I a 0 2b d d ) {b 1/2 d y2 X With appropriate changes in the parameters we can write this as or 2-х This proves Theorem 3.1.3 once again. The differential equation with regular singularities at — 1, 1, and oo but no other regular singularities when written as A - x2)y" - 2xy' + [v(v + 1) - -^-J у = 0 is called Legendre's differential equation. It is clear that the set of solutions is ( -1 oo 1 | P I pull v + 1 fi/2 x \ . C.9.5) [-H/2 -у -д/2 J A comparison of C.9.5) with C.9.4) shows that quadratic transformations apply to the solutions of the Legendre equation.
3.10 Indefinite Hypergeometric Summation 163 3.10 Indefinite Hypergeometric Summation We have derived a number of hypergeometric identities in this chapter. In this section we consider the problem of evaluating partial sums of hypergeometric series. Gosper [1978] has given an algorithm that yields the value of such a sum provided it is a hypergeometric term. To make this more precise, suppose that YH=i ck is a partial sum of a hypergeometric series. The problem is to find a function Sn such that ck=Sk-Sk-U C.10.1) when it is assumed that SkjSk-\ is a rational function of k. We refer to such an Sk as a hypergeometric term. From C.10.1) it is clear that *=1 Write Ck Pk 4k C.10.2) Ck-l Рк-l Гк where pk,qk, and rk are polynomials in к satisfying = 1, C.10.3) for all integers j > 0. The necessity of this condition will become evident later, but qk and rk can be chosen so that C.10.3) is true. Condition C.10.3) implies that if к + a and к + fi are factors of qk and rk respectively, then a - ft cannot be a nonnegative integer. Suppose that, initially in the decomposition C.10.2), gcd(qk, rk+j) = gk- Then replace qk, rk, and pk with , qk , rk qk = —. rk = —. and Pk = qk . k gk gk-j It is easy to check that It is now clear that after a finite number of repetitions of this process, condition C.10.3) will hold for all j > 0. The next step is to write Sk = —fkck. C.10.4) Pk Substitute in C.10.1) and use C.10.2) to find a relation for fk, which is the only unknown in C.10.4). Then pk = qk+ifk ~ rkfk-i- C.10.5)
164 3 Hypergeometric Transformations and Identities By the condition on Sk, we see that fk is a rational function of k. We show that it is a polynomial. Let fk = tk/mk, where lk and mk are polynomials in к with no common factors. Suppose mk is not independent of k. Let j be the largest nonnegative integer such that к + X and к + X + j are both factors ofmk. Substitute the expression for fk in C.10.5) to get pkmkmk-i = qk+xlkmk-X - rklk-\mk. C.10.6) Since к + X — 1 | mk-i, the last equation implies that к + X — 1 | rktk-\mk. But ^_i, tk-i) = 1 and к + X — 1 does not divide mk by the maximality of j. So * + A.-l|rt. C.10.7) Similarly, the fact that к + X + j | qk+\lkmk-\ implies that k + X + j\qk+loi k + X + j -l\qk. C.10.8) By C.10.7) and C.10.8), gcd(^b rk+j) ф 1. This contradicts C.10.3) and we can conclude that / is a polynomial of degree d, say, given by fk=aokd + aikd-l+--- + ad. C.10.9) Substitute this expression for fk in C.10.5) to obtain a system of linear equations satisfied by ao, a\,..., a^. If this is a consistent system the values ao, ai,..., a<j are obtained by solving the equations. From fk we get Sk by C.10.4). To obtain the possible degrees of the polynomial fk, write C.10.5) as z Afk + fk-l) , . , .(fk — fk-\) /Т1ПШЧ Pk = (Чк+i ~rk) + (qk+i+rk) . C.10.10) There are two cases. First suppose that deg(qk+i + rk) < deg(?i+i - rk) = d'. Since deg(fk - fk-\)/2 < d, it follows that d = deg pk - d!. Now suppose that (qk+l + rk)/2 = bkd' + ¦¦¦, ЪфО and (qk+l - rk)/2 = ckd'~x +¦¦¦.
3.10 Indefinite Hypergeometric Summation 165 Use these expressions in C.10.10) to obtain If aoc + a0bd/2 ф 0, then d = deg pk-d' +I. Otherwise, d = —2c/b and d > deg pk — d! + 1. The last value of d is used only if it is an integer greater than deg pk—d' + l. This completes Gosper's algorithm. It decides whether a partial sum of a hypergeometric series can be expressed as a hypergeometric term and gives its value if it does. Zeilberger [1982] extended the scope of this algorithm by taking ck as a func- function of two variables n and к rather than just k. We discuss the Wilf-Zeilberger method here and in the next section. This method is very powerful in proving hypergeometric identities. Suppose the identity to be proved can be written as к where A{n) ф 0 and n > 0. Divide both sides by A(n) and write the identity as ^F(n,fc)=l. C.10.11) к This implies that Earlier we were trying to express F(n,k), or rather T(n,k), as the difference «Sjt+i — <Si, but this is often not possible. As an example, consider the sum Л (-\f(-n)k ?-> k\ k=0 when j < n. By going through the steps in Gosper's algorithm, it can be seen that this sum is not expressible as a hypergeometric term. In Zeilberger's method one tries to write the difference F(n + 1 ,k) — F(n,k) as <Sjt+i — <Sjfc. This improves the situation. Suppose there is a function G(n, k) such that F(n + 1 ,k) ~ F(n, k) = G{n, к + 1) - G(n, k). C.10.12)
166 3 Hypergeometric Transformations and Identities This function G can be determined by Gosper's algorithm. Then к ^ (F(n + 1 ,k) ~ F(n, k)) = G(n, K + \)- G(n, -L). C.10.13) k=-L If we assume that G satisfies the property lim G(n,k) = 0, C.10.14) k—r±oo then it follows that У~] F(n,k) — constant. k It is then sufficient to verify the identity for one value of n, say n = 0. Thus to prove C.10.11), find a G that satisfies C.10.12) and C.10.14). If this G exists, then C.10.11) is known to be true after it is verified for n = 0. This method works for a very large class of identities. The reader may consult Petkovsek, Wilf, and Zeilberger [1996] and Nemes, Petkovsek, Wilf, and Zeilberger [1997] for examples and further results. As an example, consider the identity 2—i ьпп k\2n к Here F(n'k)= к\2п and one can show that G{n,k)=^— satisfies C.10.12) and C.10.14). The identity is therefore verified since it is true for n = 0. The next section contains further illustrations of the Wilf-Zeilberger method and a comparison with a related method. 3.11 The W-Z Method In a series of papers, Zeilberger (sometimes jointly with Wilf) developed a tech- technique he called creative telescoping. The method is often referred to as the W-Z method in that an important component of this work was presented in the Wilf- Zeilberger paper. The method is surprisingly easy to describe in full, but for sim- simplicity we shall apply it to some elementary identities and then compare this method with that of Pfaff.
3.11 The W-Z Method 167 Suppose that there is a linear, homogeneous recurrence relation that we wish to prove for a particular sum. In other words, suppose /t=-oo where for each n, F(n, k) is zero for all but finitely many k. Suppose we expect that a(n)S(n) = P(n)S(n - 1) (so that in fact S(n) = 5@) П"=1 (?(./)/«(./))• Then the w~z method constructs a function G{n,k) (which again for each n is 0 for all but finitely many k) so that a(n)F(n, k) - P(n)F(n - 1 ,k) = G(n, k) - G(n, к - 1). Then the desired recurrence follows immediately. Therefore, a(n)S(n) - p(n)S(n - 1) = J](«(n)F(n, k) - p(n)F(n - 1, k)) к к = 0 because the final sum telescopes. This example illustrates the appropriateness of the label "creative telescoping." The best way to appreciate this is through some examples. Consider the Chu- Vandermonde summation. We wish to prove Sv(n) = ~ , where Sv(n) = J] Fv(n, k) (b)n k=o and Fv(n,k)= —. k\(b)k In other words, we wish to find G(n,k) so that (b + n- l)Fv(n, k)-{b-a + n- l)Fv(n - 1 ,k) = G(n, k) - G(n, к - 1). C.11.1) Zeilberger has fully implemented on the computer an algorithm for finding G(n,k). For problems involving hypergeometric series such as Sv(n), G(n, k) has been shown by Zeilberger to be of the following form: G(n,k) = R(n,k)F(n- l,k).
168 3 Hypergeometric Transformations and Identities However, we can easily work out the value of G(n, k) by inspection. In C.11.1) set к = 0. Then, assuming G(n, — 1) = 0, we see G(n, 0) = (b + n- l)Fv(n, 0)-(b-a + n- \)Fv(n - l, 0) = a. Set* = 1 in C.11.1). Then G(n, 1) = G(n, 0) + (b + n- l)Fv(n, l)-(b-a + n- l)Fv(n - 1, In this manner we can work out (either by hand or with the use of a computer algebra system) the conjecture that in fact G(n,k) = (a+k)Fv(n-l,k). C.11.2) Once conjectured, the proof of C.11.2) is pure algebra. We have (b + n- l)Fv(n, k)-(b-a + n- l)Fv(n - 1, *) (b + n- l)(-w)t(q)fc (b-a + n- 1)(-и + 1)к(а)к k\(b)k k\(b)k (~" * uik~l(a)k № + n- l)(-n) -(b-a + n- \)(-n + k)] k\(b)k (n\, (-an-k(b-a+n-\)). C.11.3) к\(Ъ)к However, G(n,k)-G(n,k- 1) (a + k)(l - n)k(a)k (a + k- k\(b)k (*-l) ! l)ii(q)t {-an -k(b-a + n- k\(b)k = (b + n- l)Fv(n, k)-(b-a + n- l)Fv(n - 1, k), (by C.11.3)). Hence creative telescoping shows us that (b + n- l)Sv(n) = (b-a+n- l)Sv(n - 1), and so by iteration Sv(n) = (b — a)n/(b)n, as desired.
3.11 The W-Z Method 169 Now turn to the Pfaff-Saalschutz summation. This summation was initially treated in Chapter 2. It may be stated as follows: (c-a)n(c-b)n Ьр(П)- (c)n(c-a-b)n' П where Sp(n) = ^ Fp(n, k) with C.11.4) t=o {-n)k{a)k(b)k FJn.k) = . л k\(c)k(\-n + a + b-c)k Note that C.11.4) is clearly equivalent to (c + n- l)(c -a-b + n- l)Sp(n) = {c-a+n- l)(c - 2> + n - l)Sp(n - 1). We proceed as before. We let Gp(n,k) = Rp(n,k)Fp(n-l,k) and we wish to construct the rational function Rp(n, k) so that (c + n - l)(c - a - b + n - l)Fp(n, k) -(c-a + n- l)(c -b + n- \)Fp{n -\,k) = Gp(n, k) - Gp(n, к - 1). C.11.5) Consequently, with к = 0 in C.11.5) we get Gp(n, 0) = (c + n - l)(c -a-& + n-l)-(c-a+n- l)(c -b + n-I) = (c + n- l)(-a) + a(c - fc + n - 1) = -afc. Now by C.11.5) with A; = 1, G,(n, 1) = Gp(n, 0) + (c + n - l)(c -a-b + n- 1) - (c - a + n - l)(c - b + n - 1) cB — n + a+fo — c) B - cC - n + a + b - c) Thus as in C.11.2) we may conjecture Gp(n, k) = -(a + k)(b + k)Fp(n - 1 ,k). C.11.6) The proof of this conjecture is again merely an algebraic exercise. In each case it
170 3 Hypergeometric Transformations and Identities turns out that (c + n- l)(c-a-b + n- l)-(c-e+n- l)(c-b+n- \)Fp(n = Gp(ri,k)-Gp(n,k-l) k)(l-n)k(a)k(b)k k\(c)kB-n + a + b-c)k (a + k- \)(b + к - 1)A - w)t-i(a)t-i(b)t-i which simplifies to our desired result. So C.11.4) has been proved by the W-Z method. We now turn to Bailey's 4F3 summation, which is somewhat more difficult. Our object here is to prove (b - a)n ft/2, (Ы-l)/2, e + 1 •1J= Note that this is a balanced 4 F3. In notation suitable for the W-Z method, we wish to prove (b - a)n where SB(n) = ELo fb("' *) with + n)k(-n)k In this instance, our first expectation is that we can prove (b + n- l)SB(n)-(b-a+n- However, the method utilized in the previous two cases fails initially. It is at this moment that we realize how useful computer algebra is in such matters. It turns out that we can use the W-Z method to obtain a second-order recurrence, namely (n + 1)(—n — b + a)FB(n, k) + (-a2 + ba - a + 2nb + 3b + 2 + An + 2n2)FB(n + l,k) -(b + n + l)(a +n + 2)FB(n + 2, k) = GB(n,k)-GB(n,k-l), where GB(n,k) = FB(n,k). (b + n)(n -k + \)
3.11 The W-Z Method 171 This implies that (n + l)(-n -b + a)SB(n) + (-a2 + ba - a + 2nb + 3b + 2 2n2)SB(n + l)-(b + n+ \)(a + n + 2)SB(n + 2) = 0. C.11.8) Also, since SB@) — 1, SB(l) = (b — a)/b, and since (b - a)n/(b)n satisfies the above recurrence, we see that C.11.7) is proved. A couple of observations should be made here. First, an identity like C.11.7) often arises in practice as a conjecture. In other words, if C.11.7) is true, then something useful follows (c.f. Andrews and Burge [1993]). Consequently, one usually knows the form of the desired summation identity before looking for a proof. Second, suppose that we are dealing with a more complicated identity where we perhaps had not determined exactly what the summation should be like. Here Marco Petkovsek has produced an auxiliary algorithm for the W-Z method. It finds the minimal recurrence satisfied by the sum in question. Thus in this instance Petkovsek's algorithm applied to C.11.7) would reveal C.11.8). There is a somewhat different summation method due to Pfaff. This method is less algorithmic than the W-Z method. However, it spreads out the algebraic complications to systems of recurrences. Consequently, it may provide new sum- summations in addition to the one we wish to prove and it may allow the required algebra to be considerably simpler than that required by the W-Z method. Pfaff's method rather resembles the W-Z method; however, it allows the various additional parameters in the summation to play an important role. Pfaff's method begins very simply. We merely subtract term by term the sum at n — 1 from the sum at n. Consider the Chu-Vandermonde summation once again. We phrase the problem slightly differently. Let Now we note ySv(n-l,a + l,b+l). C.11.9) b
172 3 Hypergeometnc Transformations and Identities Note that C.11.9) together with Sv@, a,b) = 1 uniquely defines Sv(n, a, b). But if av(n,a,b) = then (b - a)n (b - a)n-i av{n, a, b) — crv(n — 1, a, b) = (b)n (b-a)n-i (-a) a (b — я)л-1 я = — Т. ~~п = — т~ ^и (и — 1, я Hence Sv(n, a, b) = av(n, a, b) — (b - a)n/(b)n as desired. Pfaff-Saalschiitz summation by Pfaff's method is as follows: Let Now we note 5p(n, a, fe, c) — Sp(n — 1, a, b, c) j^ j\(c)j(\ -n + a + b-c)j jl(c)jB -n + a + b-c)j = "^2 . J J~l —((-и)A -n + a + b-c + j) j=0 ^ 'J^ 'J~X n(a + b+l-cJ_^ c(l- У=о n + ( me n(a Я + + 1 -(a + b b C + 1 c)B -и -с)яй 1)у(Ь+1)у -с)я^ Sp(n - 1, a + \,b + 1, с + 1). — и + a + b — c)B — n + a + b — c) C.11.10)
3.11 The W-Z Method 173 The rest follows by showing that (c — a)n(c — b)n/[(c)n(c -a — b)n] satisfies the same recurrence and is equal to 1 when n equals 0. This is precisely the proof given by Pfaff for this formula in 1797. We finally look at Bailey's 4F3 summation by Pfaff's method. Just as the W-Z method did not work as expected, so too does Pfaff's method require a new twist. Here we wish to prove that SB(n,a,b)= (*~g)", C.11.11) where Subtracting term by term we find SB(n,a,b)-SB(n-l,a,b) a(l-b- In) b(b + 1) where TB(n-l,a + 2,b + 2), C.11.12) We have thus introduced a new sum. Calculation of this sum for и = 1,2, and 3 suggests that TB(n, a, b) = " . C.11.14) (b + 2n- l)(fc) We now try term-by-term comparison of TB(n,a,b) with SB(n,a,b) and SB(n — 1, a, b). The second comparison yields (b + n- l)(a + n) TB(n,a, b) - SB(n - 1,a, *) = + -TB(n - C.11.15) The two recurrences C.11.12) and C.11.15) together with the initial values SB@, a,b) = TB@,a,b) = l completely define SB(n,a,b) and TB(n, a, b). It is now again an easy algebraic exercise to see that (b — a)n/{b)n and (b — a)n/((b + 2n — \)(b)n-\) satisfy the same recurrences and initial conditions. Consequently, we have not only proved C.11.11) but we have also proved C.11.14).
174 3 Hypergeometric Transformations and Identities 3.12 Contiguous Relations and Summation Methods From B.5.6) and B.5.8) we get one of Gauss's contiguous relations, namely {c - a - b)F = {c - a)F(a-) - b{\ - x)F(b+). Set a = -n + 1 to find (c + n- lJFi(-n, b;c;x)-(c-b + n- lJFi(-n + 1, b; c; x) Now put x = 1 to see that precisely the creative telescoping of Zeilberger re- reduces the right-hand side to 0, and we have produced the W-Z proof of the Chu- Vandermonde sum. Similarly, (c + n — l)(c — a — b + n — \)T,F2(—n, a, b; с, 1 — n + a + b — c; x) - (c - a +n - l)(c-b + n - lKF2(-n + l,a,b;c, 2-n + a+b-c;x) j>0 v U - l)!(c);-iB -n+a + b- c);_i j\(c)jB- n + a+b -c)j = -ab{\ - xKF2(l -n,a + l,b+l;c,2-n + a + b-c;x), and if we set x = 1 we get the W-Z proof of the Pfaff-Saalschiitz summation. Thus we see how the W-Z method is an effective algorithm for discovering useful instances of contiguous relations. In the case of Bailey's 4F3, the W-Z method fails to find a first-order recurrence because there is no three-term contiguous relation relating b/2Ab+l)/2,a to a third 4 Ft, multiplied by a factor including A-х). However, when one moves to four terms, such a relation holds and the W-Z proof of Bailey's summation follows by setting x = 1.
3.12 Contiguous Relations and Summation Methods 175 Pfaff's is even more obviously a method of contiguous relations. In his method x is set equal to 1 before we begin. The full Pfaff proof of Chu-Vandermonde is given by directly establishing the contiguous relation -и.о. Л r/-n + U Л a р/-« + М The Pfaff proof of Pfaff-Saalschiitz is just -n,a,b \ / \-n,a,b iFl[ - ' ¦ - ¦ -- -"> *"\с,2-п+а+Ь-с' n(a+b+l-c)ab ^, > ,., „ ,.,„,. . c(l -n + a+b-c)B-n+a + b-c) \c +1,3 - n + a + b - с The proof of Bailey's formula relies entirely on two contiguous relations: /a/2, (a + l)/2, b + n, -n \ _ fa/2, (a + l)/2, b + n - 1, 1 - n . 4 3V b/2, (b+l)/2,a+l ' ' 4 3| a{\-b-2n) Па/2)+\,{а + Ъ)/2,Ъ + п,\п 4 3V Ъ/2+\ф + 3)/2а + 2 b(b+\) 4 3V Ъ/2+\,ф + 3)/2,а and fa/2,(a + l)/2,b + nl,n 4 \ */2 (*+l)/2,a fa/2,(a + l)/2,b + nl,ln 4 3V b/2,(b+\)/2,a + l (b + n-\){a+n) Па/2) + I, (a + 3)/2, b + n, I -n + 2 ; The discoveries of Wilf and Zeilberger truly revolutionized the study of sum- summations of terminating hypergeometric series. An important offshoot of this work is the MAPLE implementations of these algorithms also prepared by Zeilberger. Peripheral to this accomplishment has been a philosophical debate by one of us (Andrews [ 1994]) with Zeilberger [ 1994] about the implication of these discoveries for artificial intelligence. Currently the internal constraints in MAPLE have prevented a W-Z proof of j-, ( — 2n, x + 2n + 1, x — z + \, x + n — 1, z + n + 1 \_ (x/2) + 1, (x + l)/2, 2z + In + 1, 2x — 2z (\/2)n{2z-x)nBz-x + n + 2)n _ 2] (x + 1)„_зО + 2n- 2KO - z)n(z + n
176 3 Hypergeometric Transformations and Identities The Pfaff-method proof involves a gigantic simultaneous treatment of twenty iden- identities established in the manner discussed earlier. Undoubtedly, the improvements of software and hardware will eventually yield a W-Z proof of C.12.1). It is to be hoped that this contest between methods will serve to make clear that progress occurs when human thought aided by machines applies itself to any given problem. What should not get lost in the shuffle, however, is the observation that the Pfaff and W-Z methods are valuable applications of the classical theory of contiguous relations to summation problems. It would be hard to believe that these are the only such methods buried in contiguous relations, and further investigations are clearly merited. Exercises 1. Verify the following quadratic transformation formulas: (ь, (с) a,b 2b a,b a-b+Y \х \ = (\ - a,2b-a ь + \ ; a,a-b + ± 2a - 2b + 1 ' A + (e) = 2*1 + 2F1 /2a, -2b+1 1 \ a-b+l ' 2V l+x 2) 2. Verify the following one-parameter transformations of Kummer [1836]: 4х B(e
Exercises 177 a,Da- x\-"a/2,(a- 2 2 Ч Bа+5)/6 '\2-x a/2, (a + l)/2 Ax a + C/4) ; (l+x] 3. Show that ^/2,B-fl)/6. Л ^а/2 „ fa/2, (a Г(Bа+2)/3)ГA/2) Г((а (b) ^BТ+\1^г Г(Bа + 3)/4)Г(Bа- 4. Deduce Kummer's identity in Corollary 3.1.2 from Euler's integral for 2F1. 5. Note that C.1.2) is equivalent to Here и is a positive integer. Multiply by xc~l A - x)d~c~l and integrate over @,1) to get -2n,2b, с \ ( -n,b,c,d-c Deduce that 2a,2b,-k \ ( a,b,-k,d l)F where к is a positive integer, (d) Let d = — - + e and & ->• 00 to see that (a) holds without restriction on n.
178 3 Hypergeometric Transformations and Identities 6. Show that for 0 < x < \, 7. Prove formulas C.2.3) and C.2.14) for the elliptic integrals E and K. 8. Prove Elliot's result contained in Theorem 3.2.8. 9. Prove Lemma 3.2.9. 10. ^О \ l?l 1. (a) Prove that (92(q) + 042(<?))/2 = 032(?2) and /Цч)^) 4(^) (b) Deduce that the arithmetic-geometric mean ofe%(q) and ^(g) is 1 (i.e., M@22(<?) A2(<?)) = 1). (c) Prove that 9%(q) - 9%(q2) = 9%(q2). (d) Deduce from (c) and (a) that 9%(q2) - 9$(q2) = 0%(q) and 9%(q) = (e) For 0 < q < 1, let к := k(q) := 9$(q)/9i(q). Prove that 0 < it < 1 and Af A, it') = 03(?), where jfc'2 = 1 - к2. Also prove that K(k) = ~92(q). 11. (a) Use Exercise 2.30 to show that VJc 9ъ(е~пх) = 9ъ(е~ж1х)апйфс 9г(е~пх) = 94(е-л/х). (b) With k(q) as in Exercise 10, show that к(е~лх) = к'(е-ж'х). (d) Show that the unique solution of 9$(q)/92(q) = к for 0 < к < 1 is q = e-nK',K 12. Prove formula C.2.9), that /(a, b) = I ((a + b)/2, Jab). 13. Let a < b < с Prove that '* dx ж / J a f Jb — b)(x — c) M(Vc - a, ~Jc — b)' dx in -a)(x —b)(x -c) M{Jc -a, -Jb -a)
Exercises 179 14. Multiply Euler's transformation vfa'b \ ,л ^с-а-ь „fc-a,c-b lF\ с <Х) = A-ХУ 2^^ c ; by xd~l{\ - x)e~d~l and integrate over @, 1) to obtain Corollary 3.3.5. 15. Prove that -n,-a,-b . Л = Ф + с - \)n(a + c)n 3 2| cl-n-a-Ь-с1 ) (a + b + с - 1)„(с)и аи 1 + (b+c- Ща+с + п- 1)_ 16. Prove Watson's identity in Theorem 3.5.5(i) by multiplying Equation C.1.2), by xc 1(l — x)c 1 and then integrating over @, 1). (Note that и is a nonnegative integer in the above formula. Otherwise the formula does not hold over @, 1).) 17. (a) Show that pf a,b,c,d,-n j 5 \l+a-b, 1+a- c, 1+a-d, 1 + a + я ' when За + 2 = 2(b + с + d - n). (b) Prove that a,b,c,-n +a-fe, l+a-c, l+a + n' Г (a - 2fc + 2n + 1)Г(и + ^)Г(д - fr + 1)Г(д +n + 1)Г(^ - b) T{a -2b+ 1)Г(§)Г(а - b + n + 1)Г(в + In + 1)Г(| - b + n) when 1 + 2a = 2b + 2c — 2n. (c) Deduce that -k,a,b,c \ Ba)kBb)k(a +b)k ; i = 4 3| 1 - а - it, 1 - b - k, 1 - с - к' V (a)t(*)tBe + 2b)k' when 1 - 2c = 2a + 2b + 2k.
180 3 Hypergeometric Transformations and Identities (d) Deduce Clausen's identity: a,b 2a,2b,a+b (Hint: Equate the coefficient of x" on both sides.) Note that a different proof of this identity was given in Exercise 2.13. 18. Prove that (a-d-e+\)n (a) i-d + \)n(a -e + \)n ( a — b — с + \,d,e,—n \tз| ; l \a — b + \,a — с + \,d + e — a — n (a-b-c+\)n (a-b+\)n(a-c+\)n ( a — d — e+\,b,c,—n \ 4 \a-d+\,a-e+l,b + c-a-n' )' a, (a/2) + I, b,с Л Г (a - b + 1)Г(а - с + 1) ;1 = ^a/2,a-b+l,a-c+l' ) Г(а + 1)Г(а - b - с + I)' 19. (a) Use the method of Lemma 3.4.2 to prove that a,b,c,-n \ (d-a)n (b) (d)n a-d + l,a/2,(a + l)/2,a-b-c+l,-n . + l,a—c+l,a—d — n + l)/2, (a — d — n)/2 + 1' This transforms a nearly poised 4F3 into a balanced 5 Fa,. Deduce that d — a — n — \)(d — a)n-i a,fe,n \ (a-2*)B 3F2 _ ,,,„, , ', 1 = (а-Ь+\)п((а/2)-Ь)п(-2Ь)п ~ (a - b + l)n(-2b)n' (a - 2* - l)B( (a-b+ l)B
Exercises 181 -n,b,c,e \ (d-e)n \-n-b,\-n-c,d ) (d)n ( e,l-n-b-c,-n/2,(l-n)/2,l-n-d \ '5 \l-n-b, l-n-c, (l+e-d- n)/2, (e-d- n)/2 + 1' )' (Whipple) See Bailey [1935, §§ 4.5 and 4.7] for the reference to Whipple. 20. Prove that {Hint: Apply Exercise 19(f) to the 4F3 that appears after squaring the 2F1. Then apply Theorem 3.3.3.) 21. Obtain the transformation in Exercise 19(a) by multiplying Whipple's qua- quadratic transformation for 3F2 in C.1.15) by xd+n~l and equating the coeffi- coefficients of x". 22. Derive the formula / a,a/2+l,b,c,-n \ = (d - a -n - l)(d - a)n-i 5 \a/2ab + lac+ld' )~ a/2,a-b + l,a-c+l,d' ) (d)n ( a/2+l,(a + l)/2,abc + l,ad + l,n '5 \(a-d-n + 3)/2, (a-d-n+ 2)/2, a-b + l,a-c + V by using the formula in Exercise 19(b) instead of Dixon's theorem in the proof of Lemma 3.4.2. The above formula transforms a nearly poised 5 Fa, into a balanced 5F4. See Bailey [1935, §4.5]. 23. (a) By letting a ->• 0 in Corollary 3.5.2, evaluate the sum (b)n(c)n(d)n \-b)n(\-c)n{\-d)n (b) More generally, show that (a + b)n(a + c)n(a + d)n (a-b + \)n(a -c+l)n(a-d + 1)„ Г(а - b + 1)Г(а - с + 1)Г(а -d+ 1)ГA - b - с - d)r(b)r(c)F(d) Г (а + Ь)Г(а + с)Г(а + d)F(l -Ъ- с)ГA - Ъ - d)F(l -c-d)
182 3 Hypergeometric Transformations and Identities 24. (a) Observe that Dougall's formula can be written as / k,k/2+l,k + b-a,k+c-a,k + d-a,a+n,-n 7 6[k/2,a-b + l,a-c+l,a-d+l,k-a-n + l,k+n+l' (k + \Ub)n(c)n(d)n (a - k)n(a -b + \)n(a - с + \)n(a - d + 1)„ ' when k = 2a—b — c — d+l. (b) In the proof of Lemma 3.4.2 use (a) instead of the Pfaff-Saalschutz's identity to get / a,a/2+l,b,c,d,e,f,g,-n \ 9 ^8 I " 1 I \a/2, a-b+ \,a-c+\,a-d+\,a-e + \,a-f+\,a-g+\,a + n+l ) (а +\)„(к-е+ 1)„(к - f + 1)„(к - g + 1)„ {k+\)n{a-e + 1)„(а - / + 1)„(а - g + / k,k/2 + l,k + b-a,k + c-a,k + d-a,e,f,g,-n Fs\k/2,a-b + l,a-c+\,a-d+ l,k-e+ \,k - f+ l,k - g+ l,k + n + l' when к = 2a-b-c -d + 1 and b + c + d + e + f+g-n = (c) Deduce Theorem 3.4.5 from (b). See Bailey [1935, §4.3]. 25. (a) Show that / a,a/2+l,b,c 4 \a/2,a-b+l,a-c+l' Г (а - b + 1)Г(а - с + 1)Г((а + 1)/2)Г((а + 1)/2 - ?> - с) ~ Г (а + 1)Г(а - & - с + 1)Г((а + 1)/2 - &)Г((а + 1)/2 - с)' (b) Add this identity to the one in 18(b) to obtain a formula for / a a + \ a b b+l с d+l 2' 2 '4+ '2' 2 '2' 2 а а — i> a — b+l 1 а — с , а — с + I ' 26. Show that \ 4' ^ + ' 2 ' Г 1\3 /1-3\3 /1 - 3 • 5N 3 23/2 (c) 1-5 - + 1\5 „/1•3\5 _/1 - 3 • 5Ч 5 2) Ч2-4У 'Л2-4-6У {ГC/4)}4-
Exercises 183 27. Show that (a) s + (s +: \ Г((* + 1)/2)Г(A-3«)/2) /s\ fs(s + l)l sinm (b) s-(s + 2)(-\ +(s+4)l \ 2 > =-^~¦ (Dougall) For Exercises 26 and 27, see Hardy [1940, pp. 105-6] or Bailey [1935, p. 96]. 28. Prove that 2 ~ 22"+1(n!J — = 1 + V ——. (Takebe Kenko) 4 ^ Bn+2)! See Roy [1990] for reference. 29. Evaluate the sums 2n + l \ fp + к V 2n Л
184 3 Hypergeometric Transformations and Identities 'n\2 fm + 2n-ks (i) ? ^ 2n (n + k\Bk\{- 30. Prove that / a, I + a/2, d/2, (d + l)/2, a-d, I + 2a-d + m,-m 7 6W2, 1 + a - ^/2, a + A - ^)/2, l+d, d-a-m, l+a + m' (l+a)m(l+2a-2d)m l+a-d)m(l+2a-d) See Bailey [1935, p. 98]. 31. Prove that (a) 2f i 2a,2b,a+b = sF21 (b) 2Fi( \a-r v — j / \ы^о— 2 / / 9/7 9J, — 1 л д. A, _ 1 \ (Orr) — 2, a See Bailey [1935, p. 86]. 32. Prove Theorem 3.6.1. 33. Prove that Г(х+т)Г(у + т) (x,y,v + m-\ \ 3F2 ; 1 to n terms Г(т)Г(х + у + т) у v,x + y + m J Г(х+п)Г(у + п) (x,y,v + n~\ \ = ---—- -3^2 ;1 torn terms. Т{п)Г{х+у + п) у v,x + y + n J 34. Prove formula C.8.2). 35. Prove that Wilson's polynomial in Definition 8.1 is symmetric in a, b, c, and d.
Exercises 185 36. Show that a, (a + 2)/2, b, c, d, e, -n (a + \)n(a - b - c)n(a - b - d)n(a - с - d)n (a + 1 - b)n{a + 1 - c)n(a + 1 - d)n{a - b - с - d)n n(n + 2a — b — с — d)(a — b — с — d)] {a-b- c)(a -b- d){a - с - d) J' when e — 2a +n - b — с — d. The 7F6 is 4-balanced and very well poised. 37. Prove formulas C.5.2), C.5.3), and C.5.4) connected with reciprocal polyno- polynomials. 38. Prove Bailey's cubic transformations: (a) a,2b-a-l,a+2-2b x\ / a q+± a+2 ,-a ^ 3' з ' з -27* (b) (a,b-\,a + \-b 3 2\ 2b,2a + 2-2b ' For comments on these cubic transformations and for the reference to Bailey, see Askey [1994]. 39. Show that la a,a+l/2. 2a + ( ' and 40. Define a, a +1/2 2a 2a-1 n-\ ф(х;п) = j=o ф(х;0) =
186 3 Hypergeometric Transformations and Identities If show that See Gould and Hsu [1973]. 41. By appropriate choices of g(&) and </>(&; и), show that -n,n+a,l+a-b-c .A = Ф)п(с)„ \+a-b,\+a-c ' )- (\+а-Ъ)п{\+а-с)„ gives the sum of the terminating very well poised 5F4.
Bessel Functions and Confluent Hypergeometric Functions In this chapter, we discuss the confluent hypergeometric equation and the related Bessel and Whittaker equations. The Bessel equation is important in mathemati- mathematical physics because it arises from the Laplace equation when there is cylindrical symmetry. The confluent hypergeometric equation is obtained when we start with a second-order differential equation whose only singularities are regular singular- singularities at 0, b, and oo; we let b —> oo. The resulting equation has oo as an irregular singular point obtained from a confluence of two regular singularities. Thus, the confluent equation can be derived from the hypergeometric equation by chang- changing the independent variable x to x/b and letting b ->• oo. The solutions are \Fy functions, and some properties of these functions are limits of properties of 2^1 functions. However, it is often easier to derive the results directly than to justify the limiting procedures. Whittaker transformed the confluent equation to one in which the coefficient of the first derivative is zero. Solutions of this equation are called Whittaker functions. We find their series and integral representations and their asymptotic behavior and then give some important examples such as the error function and the parabolic cylinder function. The Bessel equation can be derived from a particular Whittaker equation and can be solved to obtain the Bessel functions to which we devote a good por- portion of this chapter. These functions are also important for their role in Fourier transforms in several variables. We present some integral representations of Bessel functions due to Poisson, Gegenbauer, and others. Later, we discuss some interesting finite and infinite integrals involving Bessel functions as inte- integrands. Some of these are really limits of generating functions for Jacobi poly- polynomials. The sine and cosine functions are particular cases of Bessel functions. Thus, it is useful to look for generalizations of formulas for these trigonometric functions to Bessel functions. Nicholson found a remarkable extension of sin2 x + cos2 x = 1 to Bessel functions; he expressed it as an integral formula. We present Nicholson's 187
188 4 Bessel Functions and Confluent Hypergeometric Functions formula and later show how Lorch and Szego used it to derive results about zeros of Bessel functions. We end this chapter with a discussion of some work of Saff and Varga on zero- free regions for the sequence of polynomials that are partial sums of the exponential function and, more generally, of the i F\ functions. 4.1 The Confluent Hypergeometric Equation It is easily seen that the hypergeometric series ) DЛЛ) is a formal solution of the differential equation [8(8 + bl-l)---(8 + bq-l)-x(8 + al)---(8 + ap)}y = 0, D.1.2) where When p > 2 or q > 1, this equation is of order max(p, q + 1) > 2, and the resulting equation is not as useful as the hypergeometric equation. When q = 1 and p = 0 or 1, the equation is still of second order with a regular singular point at x = 0, but the other singular point is at x = oo and is an irregular singular point. Although irregular singular points cause serious problems, it is still possible to say something about the solutions near them. We consider the case where p = q = 1. Then the equation is or xy" + (c - x)y' -ay = 0. D.1.3) This equation can be obtained from the hypergeometric equation x(l - x)y" + {c-(a + b+ \)x}y' -aby = 0 by the following process. Replace x with x/b, so that the new equation has singular points at 0, b, and oo. Now let b —>¦ oo so that infinity is a confluence of two singularities. The resulting Equation D.1.3) is called the confluent hypergeometric equation.
4.1 The Confluent Hypergeometric Equation 189 When с is not an integer, two independent solutions of the hypergeometric equation around x — 0 are / a + l-c,b+l-c and x C2F\ I ; x V 2-е Replace x with x/b in these expressions and let b —»¦ oo to get a;x] and j'-^J c У V 2-е These are two independent solutions of D.1.3) around x = 0. They are valid over the whole complex plane, since i F\ is an entire function. One has to be somewhat more careful to find the solutions around infinity. A solution of the hypergeometric equation about infinity is given by a+l-b ' x When x is changed to x/b and b ->¦ oo, this expression tends termwise to This series diverges, so it does not directly give a solution of Equation D.1.3). However, it is possible to find an integral representation of a solution of D.1.3) that has this series as an asymptotic expansion. To find this integral representation, start with Euler's hypergeometric integral a + 1-е,a. a + l-b > -/ (l + ~') ta-l(l + -) dl. D.1.5) Jo \ xj V b/ Г(а)ГA-Ь) It is possible to let b ->¦ -oo in the integral, though D.1.5) no longer makes sense in the limit. The right side of D.1.5) tends to r-a />oo / f\c-a-l , /-oo Г (a) Jo \ xj Г (a) Jo D.1.6) This integral converges for Re a > 0 and Re x > 0. It is easy to verify that D.1.6) is a solution of the confluent equation D.1.3). Remark 4.1.1 Here is another way of arriving at D.1.6). Let /•OO y(x) = / e~xtf(t)dt. Jo
190 4 Bessel Functions and Confluent Hypergeometric Functions Then xy" + (c — x)y' — ay = / (xt2 - (c - x)t - a)e~x'f(t)dt Jo = / e-xl{[t2f(t)]' + [tf(t)]' ~(a + ct)f(t)}dt = 0. Jo The last equation holds when f'(t) _ a - 1 + (c - 2)/ _ a - 1 с - а - 1 ДО or and we get integral D.1.6) once again. Suppose x > 1 in D.1.6). By Taylor's theorem Vм where 0 < 6 1 f00 where П V The integral x) 1 < l.So ) — converges _ + and i \ k=l (-1)" 1-е) )xn (a + 1 и! ¦it — г Jo [a + k\ - с ^ 1-е )« t" JC" Я+ 1 a+n-1 ¦)* (¦ - к I (] tk xk + X ) c)k(a)k JC J Thus we see that, except for a constant factor, , ._a „ /a, a+ 1 -c V — gives an asymptotic expansion of a solution of the confluent hypergeometric equa- equation when jc > 1 is large. In fact, we need not restrict ourselves to positive x if
4.1 The Confluent Hypergeometric Equation 191 instead of D.1.7) we use D.1.8) This holds as long as 1 + Ms not a negative real number. To remove the restriction Re* > 0, which is necessary for convergence in D.1.6), consider the integral Г@+) / {\ c-a-l or x~a / e-'f"! 1 + - ) dt. D.1.10) ioo V */ These integrals are also solutions of the confluent equation, but without the restric- restrictions on x and a needed in D.1.6). From D.1.10) and D.1.8), we can once again obtain the 2^0 asymptotic expansion for large |x|, when |arg x\ < л — 8 < л. Relations among solutions of the hypergeometric equation suggest correspond- corresponding relations among solutions of the confluent equation. These can then be proved rigorously. Similarly, transformations of hypergeometric functions imply transfor- transformations of the 1 F{ function. The following are a few examples. In Pfaff's transformation, change x to x/b and let b ->• 00 to get Kummer's first transformation, D-i.li) A similar procedure applied to the quadratic transformation, ab \x \ , (a,a + \ -b A U:)(i+j[^4 } CU1> leads to Kummer's second transformation, a . \ ъ Finally, the three-term relation 'a,a + I - с 1 (-xTa2Fu a+1-b x -с)Г(я + 16) (<*>Ь 1\ с ' Г (с - 1)Г(а + \-Ь) ,_с (а + \-с,Ъ+\-с Г(а)Г(с-Ь)
192 4 Bessel Functions and Confluent Hypergeometric Functions suggests that 2-е ' D.1.13) Formulas D.1.11) and D.1.12) can be proved directly. Thus, the coefficient of jc" on the right side of D.1.11) is ()(l)* = 1 (-n,c-a n\2 \ с ' (c)kk\{n-k)\ и! V c (a), n\{c)n' which is the coefficient of jc" on the left side. There is a similar proof of D.1.12). We give a proof of D.1.13) in the next section where we approach this topic from a different point of view. 4.2 Barnes's Integral for jfi We can find the contour integral representation for \F\{a; c; jc) by computing its Mellin transform. This is similar to finding such a representation for the hyperge- ometric function. Let /= / x-1lFdu;-x)dx. = By Kummer's first transformation D.1.11), /= Гxs-le-xiFi(C~a;x )dx Jo Vе o e^ (От = f(sJFi ; 1 = с ) Г (а) Г(с-5) By Mellin inversion, we should have or — (-x)sds. D.2.2)
4.2 Barnes's Integral for , F, 193 Of course, once we have seen Barnes's integral for a 2^1, this can be written by analogy. In D.2.2) we have — x > 0, but this can be extended. The next theorem gives the extension and is due to Barnes. Theorem 4.2.1 For |arg(—je)| < л/2 and a not a negative integer or zero, where the path of integration is curved, if necessary, to separate the negative poles from the positive ones. The proof follows the same lines as that of Theorem 2.4.1. The reader should work out the details. Again as in Chapter 2, this representation of 1F1 can be used to obtain an asymptotic expansion by moving the line of integration to the left. The residues come from the poles of Г (a + s) at л = —a — n. The result is contained in the next theorem. Theorem 4.2.2 For Rex < 0, Corollary 4.2.3 For Re x > 0, Г(с)е* (c-a,l-a Proof This follows from Theorem 4.2.2 after an application of D.1.11). Now note that the 2^0 in Theorem 4.2.2 suggests the integral J = / Г(-я)ГA -c-s)T(a+s)xsds. D.2.3) 2л-/ J_ioo Again the line of integration is suitably curved. By moving the line of integration to the left and picking up the residues at s = —k - a, where к > 0 is an integer, we get 1 r— a— n+ioo —: / r(-s)T(l-c-s)T(a + s)xsds. D.2.4) ?711 J —a—n—ioo To ensure the validity of this formula, we need an estimate of the integrand on 5 = a + iT, where T is large and — a — n < a < 0. By Stirling's formula (see
194 4 Bessel Functions and Confluent Hypergeometric Functions Corollary 1.4.4), Y — M The expression on the right-hand side dies out exponentially when |argx| < 37Г/2 — S < 37Г/2. We assume this condition and D.2.4) is then true. The last integral in D.2.4) is equal to Г (a + n - s)T(\ +a-c+n- s)F(s - n)xsds = 0(x~"'n) 2тп J_, when \x | is large. Thus Г(а)ГA +a- the asymptotic expansion being valid for |argx| < Зтт/2. However, when the line of integration is moved to the right, it can be seen that / = Г(а)ГA - с) iFi ; x )+ Г (я + 1 - с)Г(с - l)xl~ciFi a + 1 — с 2-е ;' D.2.6) when |argx| < 37Г/2. This proves the next theorem. Theorem 4.2.4 For |arg x \ < Зл"/2, D.2.5) and D.2.6) hold and Г(а)ГA - c)ifi Г°; x\ + Г (a + 1 - с)Г(с - Djc'^iJ Observe that this is the same as relation D.1.13). This theorem gives the linear combination of the two independent i F\ that produce the recessive solution of the confluent equation. This is of special interest in numerical work. To clarify this point, consider the simpler equation y" - у = О, which has independent solutions sinhx and coshx as well as ex and e~x. This equation has an essential singularity at oo and, in the neighborhood of this point for Re x > 0, e~x is the recessive solution. Any other solution independent of e~x is a dominant solution. Thus the combination Aex + Be~x can be computed very accurately from values of 0е and e~x. However, A cosh x — В sinh x creates problems, especially when А яа В and x has a large positive real part.
4.3 Whittaker Functions 195 4.3 Whittaker Functions Whittaker [1904] gave another important form of the confluent equation. This is obtained from Kummer's equation D.1.3) by a transformation that eliminates the first derivative from the equation. Set у = ex/1x~c/2co(x) in D.1.3). The equation satisfied by со is Two independent solutions of this equation can have a more symmetric form if we set с с = 1 + 2m, - — a = к or c- 1 1 m = —j—, a = -+m-k. The result is Whittaker's equation: ' = 0. D.3.1) From the solutions D.1.4) of D.1.3), it is clear that when 2m is not an integer, two independent solutions of D.3.1) are Mk,m(x) = e^2x^lFl ( 5 ++w2 * ; ^ D.3.2) and Mk,.m(x) = e-'^x^F! ( 5 -^ k;xy D.3.3) The solutions Mk,±m{x) are called Whittaker functions. Because of the factors x 5±m, the functions are not single valued in the complex plane. Usually one restricts x to |argx| < n. Formulas for \F\ obviously carry over to the Whittaker functions. Kummer's first formula, for example, takes the form х~1"тМКт{х) = {-хГ^тМ_Кт{-х). D.3.4) A drawback of the functions M^±m (x) is that one of them is not defined when 2m is an integer. Moreover, the asymptotic behavior of the solution of Whittaker's equation is not easily obtained from these functions. So we use the integral in
196 4 Bessel Functions and Confluent Hypergeometric Functions D.1.10) to derive another Whittaker function, Wktin(x). This is defined by Wk,m(x) := - —-I r(o+) 7 J CO (-0-*~i+m(l + ->) 2 е"'Л, D.3.5) where argx takes its principal value and the contour does not contain the point / = —x. Moreover, |arg(—1)\ < n, and when t approaches 0 along the contour, arg(l + t/x) ->¦ 0. This makes the integrand single valued. It is easily verified that Wk<m(x) is also a solution of D.3.1). Note that Whittaker's equation D.3.1) is unchanged when x and к change sign. Thus W-k,m(—x) is also a solution and is independent of Wk,m № ¦ This is clear when one considers the asymptotic expansion for Wk,m(x). The reader should verify that the remarks after D.1.10) imply that ) -oo, D-3.6) when |argx| < ж — S < тт. Consequently, This shows that Wk,m(x) and W-kiin(—x) are linearly independent. 4.4 Examples of iFi and Whittaker Functions This section contains some important examples of i F\ and Whittaker functions that occur frequently enough in mathematics, statistics, and physics to be given names. (a) The simplest example is given by ex = 1Fi(a;a;x). D.4.1) (b) The error function is defined by erfx = —= / e~'2dt = 1 - erfcx (x real), D.4.2) л/ТГ Jo where 2 /" erfcx = —— I e~' dt. V* Jx It is easy to see that erf x = ^1^A/2; 3/2; x).
4.4 Examples of i F] and Whittaker Functions 197 To express the error function in terms of Wk<m (x) we need to write D.3.5) as an integral over @, oo). Assume that Re (k — | — m) < 0; then D.3.5) can be written as n , ( 1 \ sin7T (k + A — m) Wk,m{x) = e'x'2xkT (k+--m\ ^p >- /¦OO ¦ / e-'rk-Li+m(l+t/x)k-L2+mdt Jo = —г —— e-'rk-^m(\+t/x)k^+mdt. D.4.3) Г(г - к + m) Jo The integral converges for Re (k — \ - m) < 0 and Re x > 0. Note the relation of D.4.3) with the integral in D.1.6). Now set t = u2 — s2, where и is the new variable. Then wk,m Set к Thus (X) = e-x/2xk2es< T(\-k + m) -1/4, m = 1/4, erf x = '/У andx 1-- = s2 to get mfx + u2-s2 V x /•00 1 e~u2du. Js /2W_1/4,l/4(*2). D.4.4) An asymptotic expansion for erf x can be derived from this formula and D.3.6). (c) The incomplete gamma function is defined by r-x /-oo (a,x)= e-'t"-ldt = Г (a) - / e^t^dt = Г (а) - Г(а,х). Jo Jx D.4.5) After expanding e~' as a series in / and term-by-term integration, it is clear that y(a,x) = —iFi(a;a+l;x). D.4.6) a The reader may also verify that T(a, x) = e-xl2x°-^Wa^a_(x). (АЛЛ) (d) The logarithmic integral li(x) is defined by dt о bg/
198 4 Bessel Functions and Confluent Hypergeometric Functions Check that If x is complex, take |arg(—logx)| < ж. Additional examples of Whittaker functions such as the integral sine and cosine and Fresnel integrals are given in Exercise 4. (e) The parabolic cylinder functions are also particular cases of the Whittaker functions. To see how these functions arise, consider the Laplace equation д2и д2и д2и ^+^+^=°- DA8) The coordinates of the parabolic cylinder |, r], z are defined by x = ^2-r12), у = $4, z = z. D.4.9) Apply the change of variables D.4.9) to the Laplace equation. The result after some calculation is д2и д2и\ д2и This equation has particular solutions of the form U(^)V(r])W{z), which can be obtained by separation of variables. The equation satisfied by U, for example, has the form where a and X are constants. After a slight change in variables, this equation can be written as = 0. D.4.10) Equation D.4.10) is called Weber's equation. It can be checked that ?>n(x) = 2t+?x-2W§+i_i(x2/2) (|argx| < Зтг/4) D.4.11) is a solution of D.4.10). The constant factor is chosen to make the coefficient of the first term in the asymptotic expansion of Dn (x) equal to one. Dn (x) is called the parabolic cylinder function. When л is a positive integer, Dn(x) is e~'tx2 times a polynomial, which, except for a constant factor, is Hn(x/^/l), where Hn(x) is the Hermite polynomial of degree n. These polynomials will be studied in Chapter 6.
4.5 Bessel's Equation and Bessel Functions 199 (f) In the study of scattering of charged particles by spherically symmetric poten- potentials, we can take (see Schiff [1947, Chapter V]) the solution of the Schrodinger equation h2 2 V и + Vu = Eu 2(i to be of the form ОС - ч Ей(г) P|(COS#), t=0 where Pi is the Legendre polynomial of degree t and yi satisfies the equation By a change of variables we can take к — I. The Coulomb potential is given by U(r) = 2r]/r, so the equation for у is d2y г 2г, Щ + + 1 r Comparison of this equation with Whittaker's equation D.3.1) shows that уi = rt+le"iriFi(i + 1 - irj; 21 + 2; 2ir). The function <J>t(r),r) := e~"\F\(l + 1 - irj; 21 + 2; 2ir) is called the Coulomb wave function. 4.5 Bessel's Equation and Bessel Functions Bessel functions are important in mathematical physics because they are solutions of the Bessel equation, which is obtained from Laplace's equation when there is cylindrical symmetry. The rest of this chapter gives an account of some elementary properties of Bessel functions. When к = 0 and m = a in Whittaker's equation D.3.1), we get d2W Г 1 1/4-a2 dp [ 4 If we set y(x) = ^/xWBix), then у satisfies the equation dx2 xdx
200 4 Bessel Functions and Confluent Hypergeometric Functions This equation is called Bessel's equation of order a. It is easily verified that Г(а +1) ya+ 1 is a solution of D.5.1). Ja(x) is the Bessel function of the first kind of order a. From D.1.12), we have another representation of Ja{x). This is Equation D.5.1) is unchanged when a is replaced by —a. This means that /_a (x) is also a solution of D.5.1). One can check directly that when a is not an integer, Ja(x) and J-a(x) are linearly independent solutions. When a is an integer, say a = n, then /_„(*) = (-1)"/„(*). D.5.4) Therefore /_„ (x) is linearly dependent on /„ (x). A second linearly independent so- solution can be found as follows. Since (—1)" = cos пж, we see that Ja (x) cos na — J-a(x) is a solution of D.5.1), which vanishes when a is an integer. Define /a(x)cos7ra - J-a(x) Ya(x) := : . D.5.5) Sin7TO! When a = n is an integer, Ya (x) is defined as a limit. By L'Hopital's rule, D.5.6) а-и ж [ да да Note that оо This implies that Ja (x) is an entire function in a. Thus the functions j? and ^jj2- in D.5.6) are meaningful. Moreover, as functions of x, Ja (x) are analytic functions of x in a cut plane. Thus we can verify that Yn (x) is a solution of Bessel's equation D.5.1) when a = n is an integer. We can conclude that D.5.5) is a solution of D.5.1) in all cases. Ya(x) is called a Bessel function of the second kind. Substitution of the series for Ja(x) in D.5.6) gives, after simplification, 2k -n 1 °° Г-П* - - X) i— №& + к + 1) + ф(к+ l)](x/2Jk+n. D.5.7) Here л is a nonnegative integer, |argx| < ж, and ifr(x) = Г'(х)/Г(х).
4.5 Bessel's Equation and Bessel Functions 201 Note that Bessel's equation can be written Suppose a is not an integer. It is easy to deduce from D.5.8) that /-„(*)— x— - Ja(x)— x— = 0 dx \ dx ) dx \ dx ) or x[j-a{x)J'a(x) - Ja(x)J'_a(x)] = С = constant. To find C, let x -> 0 and use the series D.5.2) and Euler's reflection formula. The result is С = 2 sin ал /тт. Thus the Wronskian W(Ja(x), J-a(x)) = Ja{x)J'_a{x) - J_a(x)J^(x) is given by W(Ja(x), /_ц(х)) — —2sina7r/7rx, for а ф integer, and W(Ja(x), Ya(x)) = 2/nx not only when а ф integer, but also for a = n by continuity. Many differential equations can be reduced to the Bessel equation D.5.1). For example, u=x"Ja(bxc) satisfies и = 0. D.5.9) X [ Xz J When x = 1/2, b = 2/3, с = 3/2, and a2 = I/a, this equation reduces to u" + xu=0. D.5.10) This is the Airy equation and it has a turning point at x =0, so solutions oscillate for x > 0 and are eventually monotonic when x < 0. As such, solutions of the Airy equation can be used to approximate solutions to many other more compli- complicated differential equations that have a turning point. For example, the differential equation after F.1.12) has x = \/2n + 1 as a turning point. Airy functions can be used to uniformly approximate Hermite polynomials in a two-sided neighborhood of the turning point. See Erdelyi [1960].
202 4 Bessel Functions and Confluent Hypergeometric Functions 4.6 Recurrence Relations There are two important differentiation formulas for Bessel functions: d r«Kr\ у-(-1)"B* dx a(> ^ Г(и + а n=l) nx and ' 2 and, similarly, —x~aJa(x) = -x~aJa+i(x). D.6.2) From the series for cosine and sine, D.6.3) cosx. D.6.4) Rewrite D.6.1) and D.6.2) as aJa(x) +xJ'a(x) = xJa-i(x) and -a/a(x)+x/^(x) = -xJa+l(x). Elimination of the derivative J'a gives 2a Ja-i(x) + Ja+i(x) = —Ja(x). D.6.5) Elimination of Ja(x) gives Ja-i(x) - Ja+l(x) = 2J^(x). D.6.6) It follows from D.6.1) and D.6.2) that and /1 И \n I I (x~a Ja(x)) = (— l)nx~a~nJa+n(x). D.6.8) \x dx J When these are applied to D.6.3) and D.6.4), we obtain 9 /1 И \ п / \ X J D.6.9)
4.7 Integral Representations of Bessel Functions 203 and / 2 ,, /1 d \" /cosx\ J-n-ipix) = J — xn+l (-—) ). D.6.10) V 7ГХ \x dx J V x J The following two formulas can now be proved by induction (the details are left to the reader): Г'"" i-l)kin + 2k)\ ГТI = J — < sin(x - ИЛ-/2) itx ' ^i2k)Mn-2k)M2x) D.6.11) J-n-\n(x) = \l^{ COSQt + Л7Г/2) V^ Ц— 1 VttxI ' f^o i2k)\in - 2k)li2xJk ¦ , „ч "v^ (— 1)*(и + 2k + 1)! I — SinQt + Л7Г/2) >^ 2А+Г f " D.6.12) 4.7 Integral Representations of Bessel Functions Set у — xau in Bessel's equation „ 1 , 9 9 / + -/ + A - cr/x2)y = 0. x Then м satisfies the equation хм" + Bа+ 1)м' + хм=0. D.7.1) Since equations with linear coefficients have Laplace integrals as solutions, let u = A / ext fit)dt, Jc where A is a constant. Substitute this in D.7.1). Then 0 = / fit)ixt2 + Ba + 1)/ + x)ex'dt Jc f \ 2 9 ] xt ~ Jc l V dt " T = [extit2 + l)fit)]c + / ext\ --[it2 + l)/@] + Ba + l)tf(t) \dt Jc V & J
204 4 Bessel Functions and Confluent Hypergeometric Functions after integration by parts. This equation is satisfied when [ex'(t2 + \)f(t)]c = 0 D.7.2) and ^1 D.7.3) at Equation D.7.3) holds when f(t) = (t2 + 1)"~1/2. Replace t with *J^At so that D.7.2) holds when С is the line joining —1 and 1 and Re a > —1/2. The above calculations imply that /•i y = Axa I elx'(l - t2)a-llldt (A.IA) is a solution of Bessel's equation. We assume that arg(l — t2) = 0. To see that D.7.4) gives an integral representation of Ja(x) for Rea > —1/2, expand the exponential function in the integrand as a series and integrate. The result, after an easy calculation involving beta integrals, is An application of Legendre's duplication formula (Theorem 1.5.1), 22кГ(к + 1)Г(к + 1/2) = т/пBкI, gives y(x) = А^Г(а + \/2JaJa(x). Therefore, )a f eixt(\ - t2)"- D.7.5) when Rea > —1/2. Put t = cos# to get the Poisson integral representation Ja(x) = — ]2{х/2)а I eiXCOS9s[n2aede = —= (x/2)a [ cos(jccos6»)sin2a6»rf6l, D.7.6) у/лГ(а + 1/2) Jo for Re a > — 1 /2. An important consequence of D.7.5) is Gegenbauer's formula, which gives a Bessel function as an integral of an ultraspherical polynomial. The formula is D.7.7) for Re v > —1/2. When v ->• 0, we get Bessel's integral D.9.11) for Jn(x).
4.7 Integral Representations of Bessel Functions 205 To prove this, take a = v + n in D.7.5) and integrate by parts n times to get т , ч апх/2у "' 7-i dt" By Rodrigues's formula B.5.13), d"(\-t2y+"-l/2 _ (-2)"п!Г(у + п + 1/2)ГBу) _ 2 IT" ~ r(v + l/2)rBv + n) Use of this in the previous integral gives Gegenbauer's formula D.7.7). Condition D.7.2) is also valid when С is a closed contour on which e'x'(t2 — l)<*+i/2 returns to its initial value after t moves around the curve once. We take С as shown in Figure 4.1, and write an integral on С as / Ja d+,-1-) f(t)dt. Here 1+ means that 1 is circled in the positive direction, and —1— means —1 is circled in the negative direction. We are interested in the integral У(х) Ja (l+.-i-) eixt{t2-l)"-1/2dt, which is denned for all a, since С does not pass through any singularity of the integrand. When Re a > — 1 /2, we can deform С into a pair of lines from — 1 to 1 and back. We choose arg(r2 - 1) = 0 at A; then y(x) = xa U eixt[(l - fte-'T-^dt + J eix'[(l - , = хаИйп(]--а\п f eixt(l-t2y-1/2dt V2 / 7-i Mx). Figure 4.1
206 4 Bessel Functions and Confluent Hypergeometric Functions This gives Hankel's formula: 1.7.8) when а ф ^p, n = 0, 1, 2,..., and arg(r2 - 1) = 0 at A. We now prove another formula of Hankel, r / n Г(\/2-а)е™{х/2)а 2я1 Ji00 eix'(t2 - If , D.7.9) when Rejc > 0, —Зтг < arg(r2 — 1) < я, and a + \ ф 1, 2, The contour in D.7.9) is shown in Figure 4.2. It is assumed that the contour lies outside the unit circle. To prove D.7.9) we need the following formula of Hankel (see Exercise 1.22): 1 1 / Г (г) 2ТГI J. @+) l /•(<>+) = — \ e'rzdt, 2ТГI J-ooe.fi D.7.10) for|0|<;r/2. The expansion of (r2 — I)"/2 in powers of 1/r converges uniformly on the contour. We have (_„ k[ -^ < argr < |. So *¦ • Jioo D.7.11) Figure 4.2
4.7 Integral Representations of Bessel Functions 207 Figure 4.3 Figure 4.4 Since Re* > 0, we have |argjt| = \в\ < \n. Set и = ei7l/2xt. Then K-1+.1+) /-@+) / J—o = (-l)ke-°"Tix2k-2a- e и 271 i ГA -2а+2к)' Substitute this in D.7.11) and apply Legendre's duplication formula. This proves D.7.9). Now modify the paths in D.7.7) and D.7.8) to those shown in Figures 4.3 and 4.4 respectively. Take Rejc > 0 in D.7.7) and D.7.8). When the horizontal parts of these curves are made to go to infinity we get Ja(x) = Г(\-а)(х/2)а 2л i /•()+) /Ч-1-) / + / eixt(t2 - \)a~l'2dt Jl+ioo J— 1+ioo D.7.12)
208 4 Bessel Functions and Confluent Hypergeometric Functions and f + + [ eixt(t2 - lf-1/2dt Jl+ioo J—l+ioo D.7.13) In D.7.12) arg(?2 -1) is 0 at A and it at B, whereas in D.7.13) arg(r2 -1) is 0 at A and -it at B. To make aig(t2 - 1) = it at В in D.7.13), we multiply (t2 - I)"/2 in the second integral by the factor e-2(«-!/2)^ pormuia D.7.13) may now be written (after reversing the direction of the contour in the second integral) as + e-"ai f eixt(t2-lf-Ut]. D.7.14) J— l+ioo J The form of D.7.12) and D.7.14) suggests the following two functions, which are called Bessel functions of the third kind or Hankel functions: ^-al« Ja(x) - J-a{x)] D.7.15) sinajr and sinajr \eaniJa(x) - J-a(x)l D.7.16) These are more simply written in terms of Ja {x) and of Ya (jc), the Bessel function of the second kind. Thus, H$\x) = Ja{x) + iYa(x), D.7.17) HB\x) = Ja{x) - iYa(x), D.7.18) H$\x) = V2_ /(x/2H1 / eix'(t2 - l)a-l/2dt, D.7.19) •Jititi Jl+ico and eix'(t2 - l)a-1/2dt. D.7.20) J- — l+ioo The integral formulas for H^\x) and H^2)(x) hold when Rejc > 0, a + \ ф 1,2, Moreover,arg(r2 - 1) = — it at l + j'ooandarg(?2 — 1) = it at— 1+гоо. Note also that H%(x) = J—(cosx + i sinjc) = J—elx = HJ%(x) D.7.21) ' V itx V itx '
4.8 Asymptotic Expansions 209 and D.7.22) References to the work of Bessel, Poisson, Gegenbauer, and Hankel can be found in Watson [1944, Chapters 2, 3, and 6]. 4.8 Asymptotic Expansions Set t = 1 + iu/x in D.7.19). The integral becomes /•@+) / , Joe e-u{_u)a-l/2 Ш a-1/2 du. D.8.1) Compare this with D.3.5) and D.3.6) to obtain the asymptotic expansion Hankel introduced the notation it! Da2 - I2)Da2 - 32) • ¦ • Da2 - Bk - IJ) Then D.8.2) can be written as + A similar argument gives Since .7=0 and yeW = ^ 2j we have from D.8.3) and D.8.4) Ja(x) an 7Г cos x — — sin jc — атг тг D.8.2) D.8.3) D.8.4) D.8.5)
210 4 Bessel Functions and Confluent Hypergeometric Functions and Ya(x)~J — ait ait „ , % ,. ..,v~,-, ¦ ^ D86) when |argjc| < it. Note that D.6.11) and D.6.12) are special cases of D.8.5). 4.9 Fourier Transforms and Bessel Functions Many special functions arise in the study of Fourier transforms. In Chapter 6, we shall see a connection between Hermite polynomials, which were mentioned in Section 4.4, and Fourier transforms in one variable. Here we consider a connection with Bessel functions and as a byproduct obtain a generating function for Jn(x). Start with the Fourier transform in two dimensions. We have 1 ГСС /«00 F(u, u) = — / / f(x,y)ei{xu+yv)dxdy. D.9.1) 2?r J J /-OO rill / Introduce polar coordinates in both (jc, y) and (и, v) by jt=rcos0, y=rsin6; m = Then 1 /-OO ri F(u,v)=— / / 2it Jo Jo Expand / as a Fourier series in в, 00 f {r cosd, r sin6) = J2 f"ir)eine, to get /-2л- oo /-oo г i /-2л- F(u,v)=Y / Mr)r\— е1пве"К D.9.2) The relation with Bessel functions comes from the inner integral. Since the inte- integrand is periodic (of period lit) it is sufficient to consider the integral Fn(x) = — [ eixcoseeinede. D.9.3) 2л- Jo
4.9 Fourier Transforms and Bessel Functions 211 Expand the exponential and integrate term by term to get ^ OO .? ? „27Г Fn(x) - — Y^ '—?- / cos* BeinBdd. D.9.4) ^ k=o ¦ Jo Now 2k cos* в = (ew + e-w)k = em - So, writing к — n + 2m we have ~ in+2mxn+2m {n + 2m F»(x) = У -, m=0 = /ИЛ(^) D.9.5) This relation is interesting as it gives the Fourier expansion of e'xcose: i"Jn(x)ein9 OO = J0(x)+2^2 i"Jn(x) cos пв. D.9.6) n=l The last equation follows from У_„ (x) = (— 1)" У„ (jc). Equating the real and imag- imaginary parts gives 00 cosQtcostf) = J0(x) + 2'^2(-l)nJ2n(x)cos2ne D.9.7) n=l and oo sin(jccos6») = 2E(-1)"-/2n+i(^)cosBn + 1H. D.9.8) For an interesting special case, take в = л/2 in D.9.7) to get D.9.9) n=l It is worth mentioning Miller's algorithm at this point. The series D.9.9) shows that for any given x = xo and sufficiently large n, J2n (xo) is small. So take J2n (x0) to be 0 and 7г(п-1) C*o) = c, which is to be determined. Use the recurrence relation D.6.5) to compute Л(п-2)(^о) and so on down to Уг(^о) and Jo(*o) as multiples of c. Since D.9.9) can be approximated by n k=\
212 4 Bessel Functions and Confluent Hypergeometric Functions we obtain an approximate value of с and hence also of the values of the Bessel functions J2k(xo). This is an example of Miller's algorithm. See Gautschi [1967, p. 46]. There is another way of looking at D.9.6). Put t = ie'9. Then oo exp(jc(r - 1/0/2) = JZ •/»(JC)f"- D.9.10) n=—oo Thus,exp(jc(? — 1/0/2) is the generating function for Bessel functions of integer order. Replace 0 with | — 0 in D.9.3); then use D.9.5) and the periodicity of the integrand to get Bessel's formula 1 Г Jn{x) — — / exp(—inB +ix smB)dB 2тг J_n 1 Г \ Г — — / exp(—inB + ix sin 0)^0 H / exp(/n0 — ix 2л Jo 2jr Jo = -/ cos(«0 -x sin6)dB. D.9.11) In Section 4.7, we obtained this from Poisson's integral formula. We can go in the opposite direction and derive (x/2)" Г *A/2)B Jo ¦71 :_2n, Jn(x) = ; ' I cos(jccos6»)sin/n вd6 D.9.12) from D.9.11) by using Jacobi's formula given in Exercise 2.21, d"'1 sin2" в (-1)" dy"~ -2"(l/2)nsinn0, y= In D.9.12), и is a nonnegative integer. But this restriction can be removed. First multiply both sides of D.9.12) by B/х)"Г(п + 1). Then both sides are bounded analytic functions of n for Re n > —1/2. By Carlson's theorem, we can conclude that D.9.12) is true for these values of и. We end this section with a proof of the inequalities: Forx real, |/o(*)l < 1, and \Jm(x)\ < 1/V2 form = 1, 2, 3,.... D.9.13) The first inequality follows from D.9.12), but we have another proof which verifies them all at once. Change t to —t in the generating function D.9.10) to get oo exp(-*(f - 1/0/2) = Y, (-1)"•/»(*)*". D.9.14)
4.10 Addition Theorems 213 Multiply D.9.10) by D.9.14) to get n~—oo m=—oo Equate the coefficients of the powers of t and use У_„ (je) = (— 1)" Jn (x) to obtain 00 00 Jn(x) = 1 D-9Л5) n=—oo n=l and oo (~l)mJm(x)Jn-m(x) =0 forn ^ 0. D.9.16) The inequalities in D.9.13) follow from D.9.15). Remark 4.9.1 Observe that, in Bessel's formula D.9.11), n has to be an integer, for when n = a, a noninteger, the integral on the right side of D.9.11) is no longer a solution of Bessel's equation D.5.1). However, Poisson's integral formula D.9.12) holds for all n as long as Re и > —1/2. We also remark that Jacobi obtained the direct transformation of D.9.12) to D.9.11) by the argument given here in reverse. For references and details of Jacobi's proof, see Watson [1944, §§2.3-2.32]. Note also that Jacobi's formula mentioned after D.9.12) is really a consequence of Rodrigues's formula B.5.13') when applied to Chebyshev polynomials of the second kind. 4.10 Addition Theorems In this section, we prove a useful addition theorem of Gegenbauer. First we show that OO Jn(X+y)= J2 Jm{x)Jn-m(y)- D.10.1) m=—oo This follows immediately from the fact that exp((* + y)(t - 1/0/2) = exp(x(t - l/r)/2)exp(y(r - 1/0/2), for this implies
214 4 Bessel Functions and Confluent Hypergeometric Functions The result in D.10.1) is obtained by equating the coefficients of t". Observe that D.9.16) follows from this addition formula. To state the second addition theorem, suppose a,b, and с are lengths of sides of a triangle and c2 = a2 + b2 — lab cos в. Then oo Mc) = J2 Jm{a)Jm(b)eime. D.10.2) m=—oo Set d = aew - b. Then c2 — dd, so с and d have the same absolute value. Thus there is a real \jr such that с = {aeie - Ь)е'ф. A short calculation shows that the last relation implies с sin ф = a sin@ + \jr + ф) - b sin{\jr + ф). By D.9.11), 2^ Jo {•27T 2П Jo Since \jf is independent of ф and the integrand is periodic, by D.9.10), У„(С) = _L f * еЦапп(в+ф)-Ь*шф]аф 2л" Jo = У Jm(a)eime— / e"'*™*^» m=—со ° = V Ут(аУтв— / е'^тфе-п 2л- Jo oo = У Jm(a)./mB>yme. m=—oo The last equation comes from D.9.11). Since J-m(x) = (—l)m7m(x),wecan write the addition formula in the following form: oo Me) = Ma)Mb) + 2 J2 Jm(a)Jm(b) cosmB. D.10.3) m=l
4.10 Addition Theorems 215 Observe that Id Id с dc ab sin в d6' then apply this operator to D.10.3), and use D.6.2) to get J\ (с) ^л sin тв ^ = 2 V«7(aO00 D.10.4) / *v v fit \ / fit V / " /1 m=l Rewrite this as Me) _ C m=0 Apply D.10.4) to the last formula; use D.6.2) again to get m=0 In general, we have the following result for derivatives of ultraspherical polyno- polynomials: — C*(cos6>) = -2X. sin вC^{(cos6). dO Now apply induction to see that —— = 2aT(a) / (m +a)~—— m C"(cos6), D.10.5) m=0 when a — 0, 1, 2, By F.4.11), C?(cos#) is a polynomial in a; hence by the remarks we made after formula D.9.12) the two sides of D.10.5) are bounded analytic functions in a right half plane. Carlson's theorem now implies the truth of D.10.5) for values of a in this half plane. By analytic continuation, D.10.5) is then true for all a except a = 0, — 1, —2,.... Equation D.10.5) is called Gegenbauer's addition formula. We state without proof the following result of Graf: /a -be~w\a/2 °° J^ci nf) = Y, Ja+m(a)Jm(b)e""e D.10.6) V a - be10 I z-^ 4 ' m=-oo when b < a. When a, b, and в are complex, we require that \be±w/a\ < 1 and с —>¦ a as b —>¦ 0. Graf's formula contains D.10.1) and D.10.2) as special cases. See Watson [1944, §11.3]. Exercise 29 gives a proof of D.10.6) when a is an integer.
216 4 Bessel Functions and Confluent Hypergeometric Functions 4.11 Integrals of Bessel Functions Expand the function F{u,v) in D.9.2) as a Fourier series: F(u, v) = J2 Fn{R)e n=—oo OO 1 fin /OO 1 fin /•oo i />2jt oo = / _L / «,'¦*'»•« V- fn{r)j«9+*>derdr Jo 27t Jo „fr^ /•oo i r2n °° °o h 2л- Л mt^o „fr^ Hence ( [by D.9.6)] oo г „oo -| = V j" / Jn(Rr)fn(r)rdr\einf. D.11.1) /•CO -i)nFn(R)= fn(r)Jn{Rr)rdr. D.11.2) The inverse Fourier transform of D.9.1) is 1 ГОО ГОО f(x,y) = — / Z7t J-oo J-oo If a calculation similar to D.11.1) is performed, we obtain /•OO fn(r) = (-i)n Fn(R)Jn(Rr)RdR. D.11.3) Jo The integral in D.11.2) is called the Hankel transform of order n of the function fn(r). Then D.11.3) is called the inverse Hankel transform. For a function f(x) that is smooth enough and vanishes sufficiently fast at infinity, we have more generally the Hankel pair of order a: /•00 F(y)= f(x)Ja(yx)xdx D.11.4) Jo and /•00 /(*)= / F(y)Ja(xy)ydy. D.11.5) Jo
4.11 Integrals of Bessel Functions 217 To obtain an interesting integral, multiply Gegenbauer's formula D.10.5) by C" (cos в) and use the orthogonality relation which follows from B.5.14), namely The result is 21~апТ(п + 2а) Ja+n(a)Ja+n(b) ГJj^-Can Jo c01 aaba D.11.6) where a, b, and с are sides of a triangle, that is, c2 = a2 + b2 — 2abcosO. Rescale a, b, and с and take n = 0 to arrive at „2ariJn лТBо!) Ja(ax)Ja(bx) Уо " s J\a sin 2а~1Г(а) ; Rewrite this as fa+b [(a + bJ - c2]a-'2[c2 - (a - bJ]"~l2 Ja(cx)cdc \a-b\ C" a + l/2)Ja(ax)Ja(bx)(ab)a xa Then, by the Hankel inversion formula, for Re a > —1/2, f°° ,_ [c2 - (a - ^21«- / Ja(ax)Ja(bx)Ja{cx)x adx = — Jo 2ia 23а-1^/пГ(а+1/2)(аЬс)а D.11.8) for \a — b\ < с < a + b. The value of the integral is 0 otherwise. If the formula for the area of a triangle (denoted by Д) in terms of its sides is used, then the right side of D.11.8) can be written 2«-1д2а-1 There are important generalizations of integral D.7.6) due to Sonine. These are contained in the next theorem. Theorem 4.11.1 For Re \i > -1 and Re v > -1, J 7M+v+1(x) = 2v^ + J J^x sin0) sin"+1 в cos2v+16Ш D.11.9) and D.11.10)
218 4 Bessel Functions and Confluent Hypergeometric Functions The integrals D.11.9) and D.11.10) are referred to as Sonine's first and second integrals. Proof, (i) The proof is simple. Expand J^ (x sin в) as a power series and integrate term by term. Thus I 7M(xsin6')sinM Л 0cos2v+10de. ¦ ifi m=0 ~ 4r~ ' The last integral is a beta integral equal to Substitution of this in the above series gives the result. (ii) In this case expand both J^xsind) and Jv(y cos 6) in power series and integrate term by term. The details are left to the reader. Observe that D.11.9) is a special case of D.11.10). Divide both sides of D.11.10) by yv and let у -> 0. The result is D.11.9). ¦ Corollary 4.11.2 For Re a > -1 /2, (x/2)a Г , J (x) = — / cos (x cos 0) sin 6dd. D.7.6) Г (a + l/2)V5r Уо Proof. Take /n = -l/2,v+l/2 = aand recall that J-1/2KX) = \ V 7ГХ Sonine's first integral D.11.9) can also be written as rv+l roo / гмA - t'y.Jpixfydt. D.11.11) ^'l^t l) Jo By Hankel inversion, 2T(v + 1) y J^X) J,Axt)xdx = ^A - t2y+ D.11.12) for Re pi > -1, Re v > — 1. We now turn to the computation of the Laplace transform of Bessel functions. Hankel evaluated the transform of t>J'~1Jct(yt) in terms of a 2^1 function. For special values of a and д, the 2^1 reduces to more elementary functions. We
4.11 Integrals of Bessel Functions 219 consider this class of integrals next. The simplest integral of this kind was found by Lipschitz. His result is the following: For Re (x ± iy) > 0, OO 1 —xt - - - ¦ e-xtJ0(yt)dt = D.11.13) о V(* + У ) From the asymptotic expansion for Bessel functions D.8.5), it is clear that Re(x ± iy) > 0 is sufficient for the convergence of the integral. Use D.9.11) to see that /OO 1 /•OO /*7Г e-xtJ0(yt)dt = - / e~xt / eiytcosededt 71 Jo Jo _ l Г c dQ_ 0 л — iy cos в 1 The more general result, which gives the Laplace transform of t^ 1 Ja{yt)-, is due toHankel[1875]. Theorem 4.11.3 For Re (a + рь) > 0 and Re (x ± iy) > 0, f°° _ , _, (^/2х)аГ(а + д) / e Ja(yt)t^ dt = Jo х^Г(а + 1) a + 1 ' ) Proof. First assume that \y/x | < 1. Substitute the series for Jv (yt) in the integral to get m(yqr+2m Г i-\)m(yq 2m Г l) Jo (-l)m(y/2)a+2m Г(« + [i + 2m) m\T(a + m + 1) xa+n+2m Since \y/x\ < 1, the final series is absolutely convergent. This justifies the term- by-term integration. So we have D.11.14) under the restriction \y/x\ < 1. The complete result follows upon analytic continuation, since both sides of D.11.14) are analytic functions of у when Re (x ± iy) > 0. This proves the theorem. ¦ When fj, = a +1 or a +2, the 2^1 in D.11.14) reduces to a i^o, which can be sum- summed by the binomial theorem for \y/x\ < 1. The results are in the next corollary.
220 4 Bessel Functions and Confluent Hypergeometric Functions Corollary 4.11.4 For Re (x ± iy) > 0, D.11.15) and D.11.16) Corollary 4.11.5 For Re (x ± iy) > 0, Г°о r/ 2 , у2ч1/2 _ x-ia Г e-xtJa(yt)rldt = LV" ' " 7 ^-, whenRea > 0, D.11.17) Л аУа and /•со г/„2 i 2\ 1 /2. v"|<* / e~x'Ja{yt)dt = ^ 2 1/2' w^nRea>-l. D.11.18) Proof. Apply Exercise 3.39. ¦ The formulas in the two corollaries are limits of some formulas for Jacobi polynomials introduced in Chapter 2. Recall that these are defined by Theorem 4.11.6 For real a and /J, lim п-аР<ай( cos-) = lim п~аР^ A Д ^ у л->оо ^ 2пг Proof. This follows easily from Tannery's theorem (or the Lebesgue dominated convergence theorem). Suppose a is not a negative integer. Tennwise convergence is easily checked. Moreover, domination by a convergent series is seen from the fact that, for large n, n~a(n + a + p+ l)t(a + 1)» ^ Bn + a + 0)k(a + 1)яи-° ^ С Л!(и - ifc)!(a + 1L2*и2<: ~ ifc!(a + \)k{2n)kn\ ~ Ща + \)k' where С is a constant that holds for all k, 0 < к < и. When a = — ? is a negative integer, use the fact that to obtain the desired result.
4.11 Integrals of Bessel Functions 221 The integral formulas D.11.15) to D.11.18) are limits of generating-f unction formulas for Jacobi polynomials, three of which will be proved in Chapter 6. The corresponding formulas are ) 2 D.11.19) Bn + a + p + 1)Г(я + a + 0 + и n=0 « =2a(l-r+ /?)"", D.11.21) where Л = A - 2xr + r2I/2, and (l -r + R)-a(l + r + RyP. D.11.22) л=0 A proof of a result more general than D.11.21) is sketched in Exercise 7.31. The other generating-function formulas are proved in Chapter 6 (Section 6.4). The next theorem gives another infinite integral of a Bessel function due to Hankel. Theorem 4.11.7 For Re (д + v) > 0, /•oo / Mat) t^e'^dt -- Jo Proof. The condition Re (\i + v) > 0 is necessary for convergence at zero. The asymptotic behavior of Jv(x) given by D.8.5) shows that the integral converges absolutely. Thus the integral can be evaluated using term by term integration. This gives Jv(at)
222 4 Bessel Functions and Confluent Hypergeometric Functions Since the integral on the right-hand side equals 2 the result follows. ¦ Corollary 4.11.8 For Re (д + v) > 0, Г Щ) Л Л 2pT(v+l) \ v+l '4p2J D.11.24) Proof. Apply Kummer's first i Fi transformation D.1.11) to Hankel's formula in Theorem 4.11.7. ¦ An important particular case we use later is / Jv(at)tv+le-pVdt = ——-—e~a2/4p\ Rev > -1. D.11.25) Jo Bргу+1 See Watson [1944, Chapters 12 and 13] for references. 4.12 The Modified Bessel Functions The differential equation where x is real, arises frequently in mathematical physics. It is easily seen that Ja(ix) is a solution of this equation. Moreover, for x, real e~an'/2Ja(xe7"/2) is a real function. We then define the modified Bessel function of the first kind as Ia(x) = e~a7lil2Ja(xenil2) (-it < aigx < тг/2) ^n < argx < tz = (jt/2)« у ^l . D.12.2) When a is not an integer, Ia{x) and /_a(x) are two independent solutions of D.12.1). When a = n is an integer, then In(x) = /_„(*).
4.13 Nicholson's Integral 223 To deal with this situation, define the modified Bessel function of the second kind: Ka(x):=—-—[I-a{x) - Ia(x)l D.12.3) /sinew It is immediately verified that hn(x)=\ —sinhx and I-m(x) = \ —coshx. D.12.4) У И V 71X Thus Jx- D-12.5) We see that Ja(x) corresponds to the sine and cosine functions whereas Ia(x) corresponds to the exponential function. Perhaps this is why the nineteenth century British mathematician, George Stokes, took Ia(x), rather than the Bessel function, as the fundamental function. The asymptotic expansions for Ia(x) and Ka(x) can be obtained in the same way as those for Ja{x) and Ya{x). Thus ? (-Л-/2 < argx < Зтг/2), D.12.7) and (-Зтг/2 < argx < Л-/2). D.12.8) Here (a, n) - (-l)»(o + 1/2)„(-а + 1/2)„/л!. 4.13 Nicholson's Integral Integral representations for modified Bessel functions can be obtained from those for Bessel functions. Similarly, there are formulas for integrals of modified Bessel functions. As one example, take у = i and Rex > 1 in D.11.18) to get Jo
224 4 Bessel Functions and Confluent Hypergeometric Functions Set x = cosh /8; then D.13.1) can be written as /•00 Jo в sinh/8 Now, since D.13.2) Ka{t)~\hre-' asf^oo, D.13.3) we see from D.13.2), on replacing a with —a, that /*00 / л—' cosh fi jf Jo when Re (cosh fi) > — 1. sin got sinh f5 D.13.4) Let a -> 0 to get -^—. D.13.5) o sinh j8 When fS — in 12, we have n D.13.6) Nicholson's formula, #(*) = — / ?0B* sinhf) coshBaf) dt = J^(x) + Y%(x), Rex > 0, т Jo D.13.7) generalizes the trigonometric identity sin2 x + cos2 x — 1 as can be seen by taking a = 1/2 and applying D.13.6). We present Wilkins's [1948] verification of D.13.7). This is done by showing that both sides of D.13.7) satisfy the same differential equation and then analyzing their asymptotic behavior. We show first that N{x) ~ — as x -+ oo, D.13.8) nx where N(x) denotes the left side of D.13.7). It is sufficient to prove that 8 f°° 2 lim xN{x) = lim —- / xK0Bx sinht) coshtdt = —. D.13.9) Jt-i-oo jc->oo я2 Jo П The second equation follows from D.13.6). For the first equation, we show that F(x, t) = xK0Bx sinh t) (cosh 2at - coshf)
4.14 Zeros of Bessel Functions 225 converges boundedly to 0. The dominated convergence theorem then implies D.13.9). From D.13.3) we have \F(x, f)| < A0(x csch fI/2e-2jcsinh'|cosh2af - coshf| < Ao(x/t)l/2e-2xsivb'B\a\ + I)f(sinh2|a|f + sinhf). The second inequality follows upon an application of the mean value theorem to cosh 2at — cosh t, recalling the fact that csch t < l/t. Let x > 1. Since sinh t > t and (xt)l/2e~xt is bounded, we have \F(x, t)\ < A(sinh2|a|f + smht)e~xsiaht < A(sinh2|a|f+ sinhf)e~sinh'. This proves D.13.9). For the next step, check that the product of any two solutions of y"+py'+qy = 0 satisfies the equation у'" + Ъру" + Bp2 + p'+ 4q)y' + Dpq + 2q')y = 0. Apply this to Bessel's equation to see that {tfvA)(.x)}2, {H^2\x)f and J2(x) + Y2(x) are independent solutions of Using differentiation under the integral sign, the reader should verify that satisfies this differential equation. The differential equation satisfied by K0(x) is also required in the calculation. Thus we have N(x) = A{J2(x) + Y2(x)} + B{H^\x)f + C{HB\x)}\ Let x ->¦ oo and use D.8.3) to D.8.6) to obtain 1 = A + e2'^-!"*-Wb + е-ъ(х-±ал-±я)с + oA)_ Hence В = С = 0, A = 1, and Nicholson's formula is proved. 4.14 Zeros of Bessel Functions It is easily seen that all nontrivial solutions of the Bessel equation D.5.1) have simple zeros except possibly at zero. The first derivatives of such solutions also have simple zeros except possibly at zero and ±a. From D.8.5) we can conclude that for real a, Ja (x) changes sign infinitely often as x —> oo. This implies that Ja (x) and J'a (x) have infinitely many positive zeros. The conclusion for J'a (x) follows from the mean value theorem.
226 4 Bessel Functions and Confluent Hypergeometric Functions Suppose that jati, ja<2,... are the positive zeros of Ja(x) in ascending order. Then, fora > — 1, 0 < ja,l < ja+l,l < ja,2 < jc+1,2 < ja,3 < •¦ D.14.1) From D.6.1) and the mean value theorem it follows that between two zeros of xa+xJa+\(x) there is a zero of xa+1Ja(x). Similarly, D.6.2) implies that between two zeros of x~aJa(x) there is a zero of x~a Ja+\{x). This proves D.14.1). When a < — 1, the zeros of Ja (x) and Ja+i (x) are still interlaced by the above argument, but the smallest zero of Ja+i (x) is closer to zero than that of Ja (x). It can also be proved that for —2s < a < — Bs + l),s a positive integer, Ja(x) has As complex zeros all with nonzero real parts. In contrast, when —Bs + 1) < a < —2(s + l),sa nonnegative integer, Ja (x) has As + 2 complex zeros, two of which are purely imaginary. See Watson [1944, p. 483] for a proof. Theorem 4.11.6 and Theorem 5.4.1 imply the following theorem about Bessel functions. Theorem 4.14.1 Letx\n > x2n > • • • be the zeros of P^^(x) in[-\, \\andlet Xkn = cos#/tn, 0 < вк„ < п. Then for a fixed к, lim пвкп = ja,k- n—>oo In particular, Ja (x) has an infinite number of positive zeros. In the next chapter, we prove that all zeros of P^a-^(x) lie in (—1, 1) when a, ft > — 1. This, combined with Hurwitz's theorem, shows that x~a Ja(x) has only real zeros for a > — 1. For Hurwitz's theorem one may consult Hille [1962, p. 180]. Another method of obtaining the reality of the zeros of /„ (x) for a > — 1 is to establish the formula (b2-a2)[ tJa(at)Ja(bt)dt=x[ja(bx)J'a(ax)-Ja(ax)J'a(bx)}. D.14.2) To prove this, note that Ja(ax) satisfies the differential equation Multiply this equation by Ja(bx) and multiply the corresponding equation for Ja(bx) by Ja(ax); subtract to get d ( dJa(ax)\ d ( dJa(bx)\ , , Ja(bx)—l x "; - Max)—I x  = (b2 - a2)xJa(ax)Mbx) dx\ dx J dx \ dx J
4.14 Zeros of Bessel Functions 227 or ^-[xJa{bx)J'a(ax) - xJa{ax)J'a{bx)} = (b2 - a2)xJa(ax)Ja(bx). Formula D.14.2) is simply an integrated form of this. Now if a is a complex zero of Ja(x), then so is a. Take x = 1, b = a in D.14.2) and note that the integrand tJa(at)Ja(at) > 0. Hence the left side of D.14.2) is nonzero but the right side is zero. This contradiction implies that Ja (x) has no complex zeros. An argument using differential equations can also be given to show that Ja(x) has an infinity of positive real solutions for real a. This technique goes back to Sturm. The version of Sturm's comparison theorem given below is due to Watson [1944, p. 518]. Theorem 4.14.2 Let u\{x) and u2(x) be the solutions of the equations ^ ф{H ^ф() 0 such that when x = a Mi(fl) = u2(a), Mj(fl) = M2(a). Lef 0i(jc) an<i 02(*) fee continuous in the interval a < x < b, and u\{x) and u'2(x) be continuous in the same interval. Then, ifф\{x) > ф2(х) throughout the interval, \u2(x)\ exceeds J и i (jc ) | as long as x lies between a and the first zero of ui(x) in the interval. Thus the first zero ofu\{x) in the interval is on the left of the first zero of u2(x). Proof. Without loss of generality, we assume that mi(*) and u2(x) are both positive immediately to the right of x = a. Subtract u2 times the first equation from mi times the second to get Mi—— -и2—-г- = (ф\(х) -ф2(х))ихи2 > 0. dx1 dx1 Integration gives du2 du\ Mj— u2—— dx dx Since the expression in the brackets vanishes at a, we have du2 dux mi— И2-7— > 0. dx dx Hence d(u2/ui) dx >0
228 4 Bessel Functions and Confluent Hypergeometric Functions or which implies that u2{x) > u\ (x). This proves the theorem. ¦ Suppose \a\ > \,a real, and take ф\ (x) = 1 - (a2 — 1/4) /x2 and ф2 (x) = 1 — (a2- l/4)/c2. Thenfor* > c, we have фх (х) > 02(*)-NotethatMi = xl/2Ja(x) is a solution of u'( + ф\{х)и\ = 0. Denote the general solution by x 1^2Ca(x). It is clear that u2 = A coscox + В sinwx, where w2 = 1 — (a2 — l/4)/c2. It follows from Theorem 4.14.2 that if с is any zero of Ca (x) then the next larger zero is at most с + n/w. When \a\ < 1/2, take фг(x) = w2 < 1. Thus for real a, Ja(x) has an infinite number of real zeros. Essentially, Sturm's theorem says that the greater the value of ф, the more rapid are the oscillations of the solutions of the equation as x increases. Theorem 4.14.2 can also be used to prove that the forward differences of the positive zeros of/ff O) are decreasing for \a\ > 1 /2 and increasing for |a\ < 1/2. Suppose|a| > l/2andlet7Q,n_i < jan < ^n+ibethreesuccessivepositivezeros of Ja(x). Now set фх(х) = 1 - (a2 - 1/4)Д2 and ф2(х) = ф\{х - k), where к = 7ffn-7ff,n-i. Now 0i (л) is an increasing function, so 0i (л) > ф2 (х). Consider the interval [jan, ja,n+\\- At x = jan, u\ = Ja(x) = 0 and u2 = Ja(x - k) — 0. By Sturm's theorem, u\ oscillates more rapidly and hence y'a.n+i — ja,n < ja,n < ja,n-i- A similar argument applies to the case |a| < 1/2. It should be clear that the same argument works for the general solution of Bessel's equation. We end this section with an infinite product formula for Ja (x), a real. For large x, the asymptotic formula D.8.5) for Ja{x) suggests that the asymptotic behavior of the zeros is given by x ~ (m + Ba + 1)/4)тг. D.14.3) Since the zeros of /„ (x) are real and simple, one expects that the number of zeros of x~aJa(x) between the imaginary axis and the line Re x = (m + (a + 1)/4)тг, for large x, is m. This is true. See Watson [1944, §15.4]. It follows that the entire function x~a Ja (x) has the product formula OO 1 - */¦/«.») еЧ>(*//«.»)} П{A + */¦>«¦»> еЧ>(-*//«.»)}- DЛ4.4) Л=1 П = \ This result continues to hold when a is not real. See Watson [1944, §15.41].
4.15 Monotonicity Properties of Bessel Functions 229 4.15 Monotonicity Properties of Bessel Functions Sturm's comparison theorem for differential equations stated in the previous sec- section gave information about the zeros of the solutions of the differential equation _ 0. ,4.15.1, Sturm also used his theorem to prove that the second forward differences of the positive zeros of any nontrivial solution of the above equation are all positive if |v| < 1/2 and all negative if |v| > 1/2. Lorch and Szego [1963] have greatly extended this result to higher-order differences. In this section we present one of their theorems. For further generalizations and other related results, the reader should see the references section. Consider the differential equation y" + f(x)y = 0, D.15.2) where x e I, an open interval. Let A > — 1 and let y(x) be an arbitrary solution of D.15.2) with zeros at x\, x2,. ¦. in ascending order. Set Mk = JXM \y(x)\xdx, k=\,2,.... D.15.3) Observe that when A = 0, Mk = Axk, the difference of the successive zeros of y(x). When A = 1, then Mk gives the area under the arch formed by y(x) from xk We state the theorem of Lorch and Szego without proof. The reader may consult the original paper for a proof, which is somewhat lengthy. In the statement of the theorem, the notation An\xk is used to denote the nth forward difference of the sequence {fj,k}. Thus, А"цк = An~l/j,k+i — Ап~1цк and А°[л,к = ц,к. Theorem 4.15.1 Let y\ and уг be two independent solutions of the differential equation D.15.2) in a closed interval I. Suppose that (-D"^-f[MW]2 + Ы*)]2} > 0 for n = 0, 1,..., N, dxn where the Nth derivative exists in the open interval I, and the lower-order deriva- derivatives are continuous in I. Then {-l)nA"Mk>Q forn=0,...,N; к =1,2,.... In particular (-l)"-lA"xk>0 forn = l,...,N + l; к =1,2,....
230 4 Bessel Functions and Confluent Hypergeometric Functions Moreover, ify(x) denotes another solution of D.15.2) with zeros at x\, xi,.. ¦ and ifx\ > x\, then (-1)иДи(х*-х*)>0 for n = 0,...,N; ?=1,2,.... This theorem yields results on Bessel functions when applied to equation D.15.1). Two independent solutions of this equation are */xJv(x) and */xYv(x). Let */xCv(x) denote the general solution. To apply Theorem 4.15.1, we have to study the expression p(x)=x[Jv(x)]2+x[Yv(x)]2, D.15.4) which can be represented by Nicholson's integral D.13.7). We need the following formula below: I-OO K0(x) = / e~xcoshtdt. D.15.5) Jo For a proof, see Exercise 11. Now, by D.13.7), 8 Г°° = —j / [K0Bx sinht) + 2x smhtK^x sinht)] cosh2vtdt. л Jo Integrate the second term by parts to get Q p'(x) = — [Ko Bx sinh Otanhf cosh 2vi]g° 8 Г° / d Л—~ / ABjc sinh t) ( cosh 2vt (tanh t cosh 2vt) ) dt. л1 Jo \ dt ) The first term on the right is zero, for by definition D.12.3), it follows that behaves like log x as x -*¦ 0, while D.12.6) gives the behavior of Ko(x)asx -> oo. Thus p'(x) = —j / K0Bx sinh t) tanh t cosh 2vi[tanh t - 2v tanh2vt]dt. л1 Jo It is easy to check that the expression in brackets is negative for | v | > 1/2 and the rest of the integrand is positive. So p'{x) < 0. Similarly, it can be shown that 8 f°° p(")(x)= / Kg- D Bx sinh f)B sinh r)"~ 'tanh t cosh 2vt л1 Jo ¦ {tanh t - 2v tanh 2vt}dt. It is clear from D.15.5) that (-1)" K{Qn) (x) > 0 for x > 0, и = 0, 1, 2,.... Thus, the conditions of Theorem 4.15.1 hold for Equation D.15.1) when |v| > 1/2. So we have the following corollary:
4.16 Zero-Free Regions for i F, Functions 231 Corollary 4.15.2 Let cvk, cvk denote the kth positive zeros in ascending order of any pair of nontrivial solutions of Bessel's equation D.15.1) with |v|>l/2. Suppose Л > — 1 and set Mk= [*'Ш xx>2\Cv(x)\xdx. Then, for к = 1,2,..., (-1)"Д"М* >0 forn = 0,1,..., (-l)"-lA"cvk>0 forn = 1,2,..., t-l)"A"(c,m+t - cvk) > 0 for n = 0,1,... with m a fixed nonnegative integer, provided that cVitn+\ > cv\. In particular, (-I)"'1 A"jvk > 0, (-l)"-1Anyvk>0 forn = 1,2,... and (-l)nAn(jvk - yvk) > 0 for n = 0, 1,..., where jvk, yvk denote the kth positive zeros of Jv(x) and Yv(x) respectively. Remark 4.15.1 Lorch, Muldoon, and Szego [1970] have extended Theorem 4.15.1 to a study of higher monotonicity properties of = f Ш W(x)\y(x)\xdx, Jxk Mk Jxk where W(x) is a function subject to some restrictions. As an example, it is possi- possible to take W(x) = x~l/1 when у is a solution of D.15.1). This implies the monotonicity of (_1)»д» ГШ \Cv(x)\dx, Jcvk where the integral is the area contained by an arch of a general Bessel function instead of xl/2 times a Bessel function. For further extensions of the results see also Lorch, Muldoon, and Szego [1972]. 4.16 Zero-Free Regions for iFi Functions We end this chapter with some results of Saff and Varga [ 1976] on zero-free regions for sequences of polynomials satisfying three-term recurrence relations. These polynomials can be partial sums of i F\ functions, so we may invoke a theorem of Hurwitz on zeros of an analytic function which is the limit of a sequence of analytic functions to obtain zero-free regions for i F\ functions. Saff and Varga's basic theorem is the following:
232 4 Bessel Functions and Confluent Hypergeometric Functions Theorem 4.16.1 Let {pk(z)}o be a finite sequence of polynomials satisfying k=l,2,...,n, D.16.1) h where p~\{z) := 0, po(z) = po ф 0, and the bk andck are positive real numbers. Let a :=min{bk(l -bk-X/ck) : к = 1,2,...,n], b0 = 0. D.16.2) Г/геи, г/а > 0, f/ге parabolic region Pa = {z=x+iy :y2 < 4a(x + a), x > -a} D.16.3) contains no zeros of pk(z), к = 1, 2,..., п. Proof. Suppose г е Pa is a fixed complex number that is not a zero of any pk(z), к = l,...,n. Set := zpk-i(z)/bkpk(z) fork=l,...,n. The proof depends on the following two facts: 1. The polynomials pk(z) and pk-\{z) have no zeros in common for each k, k=l,...,n. 2. Re/u,k <l for/fc= l,...,n. Assume these results for now and suppose that for some k, pk(z) is zero at a point zo e Pa. Observe that кф\, for, p\ (z) = po(z + b\)/bi has a zero at -bu which by D.16.2) is < —a and hence cannot be in Pa. Suppose 2 < к < п. Since pk(zo) = 0, it follows from D.16.1) that (zo/bk + l)Pt-i(zo) = (zo/ck)Pk-i(zo)- Fact 1 implies that pk-i(zo) ф 0 because pk and pk-\ have no common zeros. So we can divide by pk-\ (zo) to get Ck . . , , ZOpk-2(Zo) (zo + h) = -— = /j,k-i(zo). D.16.4) bk-\bk The second fact and the continuity of ц, give Re/x^_i(zo) < 1. Then by D.16.4) Rezo < -bk{\ - bk-i/ck) < -a, which contradicts the assumption that zo€ Pa- This means that pk(z)(k = 1,..., л) have no zeros in Pa. It remains only to prove our two assumptions. It is evident from D.16.1) that none of the polynomials pk(z), к = 0,1,... ,n vanish at 0. So suppose pk(zo) = Pk-i(zo) = 0, where zo ф 0, for some к > 1. By a repeated application of D.16.1) we get po(zo) = 0, which contradicts the assumption that po ф 0. Thus pk(z) and pk~\{z) have no common zeros.
4.16 Zero-Free Regions for i F, Functions 233 We now prove that Re jut(z) < 1 by induction. Clearly, z z + bi Since z e Pa, Re г > —cc > —b\. Thus Re/xi < 1. Now, it follows easily from D.16.1) that z —\ z + h- Ькск or where Tk{w) is a fractional linear transformation defined by z + bk -bkck bk-iw The function. Tk hajs a pole at the point u>k = (z + bk)/(bkCklbk-i) whose real part, by D.16.2) and D.16.3), is seen to be >1. So remaps Re w < 1 into abounded disk with center f^ '= 7\B — п>к), where 2 — wk is the point symmetric to the pole u>k with respect to the line Re w = 1. By the definition of Tk Moreover, the point 0 — 7\(oo) lies on the boundary of this disk so its radius is lit |. Thus the real part of any point in the disk does not exceed „<_.,<_, Re? + |2| -h^cp) ~ 2(Rez where the first inequality follows from D.16.2) and the second from D.16.3). Now, by the induction hypothesis, Re/^-i < 1; thus Re^i* = Re Tk{iik-i) < 1- This proves the theorem. ¦ Corollary 4.16.2 For an infinite sequence of polynomials {pkiz)}™ satisfying the three-term relation D.16.1), suppose that a- inf \bk(l -bk-!c71)} >0. k>l l v '' Then the region Pa defined in the theorem is zero-free for the polynomials Pk(z). Moreover, if pk(z) —>¦ f(z) ф 0 uniformly on compact subsets of Pa, then f(z) is also zero-free in Pa. Proof. The first part is obvious. For the second part use Hurwitz's theorem. ¦ The next corollary applies to the polynomial squence obtained from the partial sums of a power series.
234 4 Bessel Functions and Confluent Hypergeometric Functions Corollary 4.16.3 Suppose sk(z) := ?^=o djZj have strictly positive coefficients and > > 0, where a-i/ao = 0. ak ak-\J J Then the polynomials sk(z),k = 1,2,..., и /?ave ио zeros ш Р^. Proo/ First observe that V() V * = 1, 2,..., n, ak-i J ak-i when s-i = 0. Then apply Theorem 4.16.1 to obtain the required result. ¦ Note that a consequence of the above results is that the partial sums sn (z) = J2o zn/n\ofthe exponential function have the parabolic region, y2 < 4(x+l),x > — 1, as a zero-free region. This region is sharp, both because of the zero at z = — 1 for s\(z) = 1 + z and asymptotically as n ->• oo in sn(z). The next corollary concerns the more general i F\ confluent hypergeometric function. The proof is left to the reader. Corollary 4.16.4 Suppose sn(z) is the nth partial sum of \F\{c; d; г). Then $n(z),n =0, 1, 2,... have no zeros in the region (i) Pd/C, ifO<d< c, (ii) Puifl <c<d, (Hi) Pa, a = Bc-d+ cd)/(c2 + с), г/0 < с < 1 and с < d < 2c/A - c). Moreover, i F\ (c; d; z) has no zeros in the corresponding interior region. For other applications of Theorem 4.16.1 see de Bruin, Saff, and Varga [1981]. Exercises 1. Show that 1 1V ' ' ' Г(а)Г(с-а)Л v ' when Re с > Re a > 0. 2. Let \Fi(a~) = i_Fi(a — 1; c; jc) and define iFi(a+) etc. in a similar way. Prove the contiguous relations: (a) (c-a)lFi(a-) + Ba-c + x)iFi -alFl(a+)=0, (b) c(c — 1) i F\ (c—) — c(c — 1 + x) i F\ + (c — a)x \ F\ (c+) = 0, (c) (a-c+l)iFi- a iF,(e+) + (c - 1) iF^c-) = 0, (d) ciFi-ciF1(e-)-'ifi(c+)=O, (e) c(a +x)iF\ — (c — a)x \F\(c+) — ас \ F\(a+) = 0, (f) (a-l+jc)^! +(c-a)iF!(a-)-(c- 1),^(с-) = 0.
Exercises 235 3. Prove the formulas (a) -—iFi(a;c;x) = —^-iFl(a+n;c+n;x), dxn (с)„ (b) ^-[e-'iFiia; c; x)] = (-1)" {° ~ a)n e~* ^(a; с + n; x). dxn (с)„ 4. Express the following functions in terms of Whittaker functions: (a) The sine integral Si(x) — jQx f sin? Jf. (b) The cosine integral Ci(x) = - /л°° f"' cosrdt. (c) The Fresnel integrals rx C(x) = / f/2 cos tdt/Jbz, Jo S(x) = f r1/2sintdt/V2n. Jo (d) The exponential integral /•00 -/ Ei(x)= —dt. Jx t 5. Use D.3.6) and D.4.4) to derive an asymptotic expansion for erf x. 6. Prove that ,.00 e~sxxc \Fi{a; c; x)iFi(ai; c; Jo = T(c)(s- l)~a(s - Xya'sa+ai~c2Fi(a,ai;c;X/[(s - l)(s-k)]). 7. Show that, for the parabolic cylinder function Dn(x) given by D.4.11), the following properties hold: (a) Dn{x) = V^V^ViC-wA 1/2; х2/2)/Г(A -п)/2) 2-n)/2\ 3/2;х2/2)/Г(-и/2). (b) Dn{x) = (-l)nex l4-r-^(e~x212). 8. Prove the formulas D.6.11) and D.6.12) for Jn+i/2(x) and /_n_i/2(x). 9. Show that for Re a > -1/2, Г (a + l/2)Ja(x) = —=(x/2f / cos(x si \ln Jo and Г (a + 1/2)/Дх) = 4=(x/2f / e~xt(\ - t2)" л/Л J-\ Deduce that \JAx)\ < \х/2\аем/ Г (a + 1), where x=u + iv.
236 4 Bessel Functions and Confluent Hypergeometnc Functions 10. Use D.9.11) to obtain Neumann's formulas (Watson [1944, p. 32]): 1 fn 1 Гп ji(x) = — J2nBxsme)de = — / J0Bx sinO) cos 2n0de. л Jo n Jo 11. Show that, for |argx| < jt/2, Deduce that, when Re a > —1/2, ¦'" + , Hence 3711") / «""(I - f2)" Г (о + 1/2) jo 12. Prove that for x > 0 and a > -1/2 2°T(a + l/2) f00 cosxt 13. Show that for a > — 1 and с > О лA) 2л-г 14. Prove the following result of Sonine and Schafheitlin: [ Ja.fi(at)Jr-x(bt) о := -o at Л and S = —- /a,a-y + l a1 provided the integral is convergent.
Exercises 237 Consider the particular cases (a) p = О, у - a = 1; (b) у — 3/2, a+P =1/2; (с) у = 1/2, a + P = - 1/2; (d) у = 3/2, a + P = 3/2; (e) у = 1/2, a + ?=1/2. See Watson [1944, §13.4]. 15. Show that when a = b in Exercise 14, the result is -a-p)T(a) = tr-"-» 2ГA-р)Г(у-а)Г(у-р) ' provided that Re a > 0 and Re (y — a - P) > 0. 16. Show that Г(Р+ 1)Г(а + 1) y^ (-lyW-w, -a-n;p + l; b2/a2)(ax/2Jn Deduce that lJn(x/2) 2" ^ (о + \)Лр + 1)„(а+р 17. Show that for a, ? > 0 and -1 < Rea < 2Re? + 3/2 x"+lJa(bx) g'-W v ^ К (Note that $™ e-^+^'tPdt = (J}^+, for Re p > -1.) 18. Show that fr/T\ff /"O ^ io 19. Prove that for a > 0, b > 0, >> > 0, and Rep > -1 ЪЧ \а2 + ЪЛа~*~\ Consider the case a = 1/2, p = 0. 20. Prove the following formula for Airy's integral: 1 f°° -I Ai(x) := - / 7Г Jo See Watson [1944, §6.4].
238 4 Bessel Functions and Confluent Hypergeometric Functions 21. Let ф(х) be a positive monotonic function in Cl(a, b) and let y(x) be any solution of the differential equation у" + ф(х)у=0. Show that the relative maxima of |>>|, as x increases from a to b, form an increasing or decreasing sequence accordingly as ф (х) decreases or increases. [Hint: For f(x) = {y(x)f + {у'(х)}2/ф(х) show that sgn/'(x) = —sgn ф'(х).] (Sonine) 22. Suppose that k(x) and ф(х) are positive and belong to С (a, b). If y{x) is a solution of the equation then show that the relative maxima of |>>| form an increasing or decreasing sequence accordingly as к(х)ф{х) is decreasing or increasing. (Butlewski) 23. Show that и = xaja (bxc) satisfies the differential equation u" + —и' + Lex-1J + a2~a2c2] u = о r r2 x |_ x j 24. Take 0(x) = 1 + '^. Use D.8.5) and Exercise 21 to prove that { s/2/n if-l/2<a < 1/2, (. finite and > ^/2/n if a > 1/2 . For Exercises 21, 22, and 24 and the references to Sonine and Butlewski, see Szego[1975, pp. 166-167]. 25. Let a=A— 1/2,0 < Я < 1. Denote the positive zeros of Ja{x) by j\ < h < h < • • • and the zeros of the ultraspherical polynomial C^(cos#) by в\ < 02 < • ¦ • < в„. Use Theorem 4.14.2 to show that + K), k=\,2,...,n. [Note that и = (sin#)^C^(cos#) satisfies the equation I. sin' v and compare this with the equation satisfied by *f9Ja{(n + X)9}.] 26. Suppose -1/2 < a < 1/2 and mn < x < (m + 1/2)тг, w = 0, 1, 2,.... Show that Jv(x) is positive for even m and negative for odd m. [Note that when x = (m + 9/2O1 with 0 < 9 < 1, 2(jt/4)« Z2^ cosGrM/2) Г (a + 1/2)VttBw + 0)" y0 {Bw _(
Exercises 239 and show that sgn Ja(mn + вл/2) = sgn [(-l)m{v'm + (vm - vm.i) + (vm-2 ~ Vm-г) H }] = sgn(-l)m, where r2r cos(ttm/2) and r _ rl -!)»„' = f2m+e ^{ли/Ddu m J2m 12m \\^m -TV) —U-y- 27. Show that tJl{t)dt = X- [х{У^(х)}2 - /ff(x)^-{x^(x)}l. /o 2 [ dx a J J Deduce that /(x) = AJa(x) + BxJ'a{x) ф 0 has no repeated zeros other than x = 0. (This result is due to Dixon. See Watson [1944, p. 480].) 28. Let f{x) = AJaix) + BxJ'aix) and gix) = CJa(x) + DxJ'aix) with AD - ВС Ф 0. Prove that the positive zeros of fix) are interlaced with those of g(x). (Show that 0(x) = fix)/gix) is monotonic.) 29. Prove Graf's formula D.10.6) when a is an integer by using the identity ea(t-l/t)/2e-b(te-">-l/(te-w))/2 _ ec(tu-l/(tu))/2 where и = (a — be~w)/c.
Orthogonal Polynomials Although Murphy [1835] first defined orthogonal functions (which he called recip- reciprocal functions), Chebyshev must be given credit for recognizing their importance. His work, done from 1855 on, was motivated by the analogy with Fourier series and by the theory of continued fractions and approximation theory. We start this chapter with a discussion of the Chebyshev polynomials of the first and second kinds. Some of their elementary properties suggest areas of study in the general situation. The rest of this chapter is devoted to the study of the properties of general orthogonal polynomials. Orthogonal polynomials satisfy three-term recurrence relations; this illustrates their connection with continued fractions. We present some consequences of the three-term relations, such as the Christaffel-Darboux formula and its implications for the zeros of orthogonal polynomials. We also give Stieltjes's integral represen- representation for continued fractions which arise from orthogonal polynomials. In his theory on approximate quadrature, Gauss used polynomials that arise from the successive convergents of the continued fraction expansion of log( 1 +x)/ A - x). Later, Jacobi [1826] observed that these polynomials are Legendre poly- polynomials and that their orthogonality played a fundamental role. We devote a sec- section of this chapter to the Gauss quadrature formula and some of its consequences, especially for zeros of orthogonal polynomials. We also prove the Markov-Stieltjes inequalities for the constants that appear in Gauss's formula. Finally, we employ a little elementary graph theory to find a continued fraction expansion for the moment-generating function. In the past two decades, combina- combinatorial methods have been used quite successfully to study orthogonal polynomials. 5.1 Chebyshev Polynomials We noted earlier that the example of the Chebyshev polynomials should be kept in mind when studying orthogonal polynomials. The Chebyshev polynomials of the first and second kinds, denoted respectively by Tn{x) and Un(x), are defined 240
5.1 Chebyshev Polynomials 241 by the formulas _ B")! T (x) _ B">! g and A/2>1/2) Bл+ 2)! )] E.1.2) where x = cos#. The orthogonality relation satisfied by Tn{x) is given by /•+1 / Tn{x)Tm(x){\ - x2)/2Jx = 0, when m ф п. For x = cos в, this is the elementary result: Г cosmdcosnOde—0, when тфп. Г / h /о Similarly, the orthogonality for E.1.2) is contained in / sin(« + 1H sin(w + 1H d0 — 0, when тфп. Jo To motivate our later discussion of orthogonal polynomials, we note a few results about Chebyshev polynomials. The three-term recurrence relation, for example, is given by 2хГт(х) = Tm+1(x) + Tm.i(x), E.1.3) which is equivalent to 2cos#cosm# = cos(w + 1H +cos(w - \H. E.1.4) The last relation is contained in the linearization formula 2cosm6 cosn0 = cos(w + nH + cos(m — nH E.1.5) or Tm(x)Tn(x) = -(Tm+n(x) + Tm.n(x)). E.1.6) In a more general context, one is interested in the problem of determining the coefficients a(k, m, n) in m+n Pm{x)pn{x) = У?2а(к,т,п)рк(х), E.1.7) 4=0
242 5 Orthogonal Polynomials where {pn(x)} is a sequence of polynomials with pn(x) of degree n exactly. A simple but important special case of this is ml __ v It is usually difficult to say very much about the coefficients a(k, m, n). Later we shall see an important example which generalizes E.1.5) and the next formula about Chebyshev polynomials of the second kind Un{x): sin(m + 1)9 sin(« + 1)9 sine sin(w + n + 1 - 2k)9 k=0 where m л п = min(m,n). This formula is easily verified by noting that sin(m + n+l-2k)9sin9 = ±[cos(m +n -2k)9 -cos(w + n -2k + 2)9]. The dual of E.1.8) is given by sin(n + 1)9 sin(n + 1)ф = / sin(n + l)fdf, E.1.9) 2 Je-ф whereas the dual of E.1.5) is essentially the same formula, that is, cos«#cos«(/> = -(cos«(# +ф) + cosn(9 — ф)). In Fourier analysis, one represents a periodic function f(x) by a series of sines and cosines. This involves analyzing partial sums of the form 1 cosm9 +bm s\nm9), m=l where 1 г2ж am = — / /@) л Jo and b 1 Г2ж = — / л Jo Therefore, 1 2a m=\ ¦2n ^ Jo (cosm9 cosw0 + sinm9 si m=\ m=\ By using the trigonometric identity 2sin(#/2)cosm# = sin(m + 1/2)9 - sin(w
5.1 Chebyshev Polynomials 243 it can be verified that 1 JU sin (n + Ш k =¦¦влв)- EЛЛ0) Thus, the sum inside the last integral is Dn{9 — ф). The function Dn{9) is called the Dirichlet kernel. Define the Chebyshev polynomials of the third kind by Vn(x) = 2DnF), where x=cosfl. E.1.11) It is easy to check that the sequence Vn(x) is orthogonal with respect to A — х)г A + x)~ 2 on (— 1,1). Polynomials equivalent to Vn (x) were studied by Viete. See Edwards [1987, p. 8]. A generalization of E.1.10) is given by Ecos(n + 1N cos«0 — cos ив cos(« + 1H 2cosmflcosm0 = —. E.1.12) _ cos 6 — cos 0 This is in fact a particular case of the Christoffel-Darboux formula, which holds for general orthogonal polynomials. Note that the three-term recurrence E.1.4) implies 2 cos тв cos тф (cos 6 — cos0) = [cos(w + 1N +cos(w — 1N] cos тф — [cos(m + 1H +cos(w — 1H] cos тв. E.1.13) Adding this equation for the various values of m gives E.1.12). To understand the reason for the factor 2 in the sum E.1.12), observe that !°rW^!!' E.1.14) for m = 0. The normalized function is, therefore, <. -cosmd when тфО and 4= when у л ' л/7Г w = 0. The Poisson kernel for the Chebyshev polynomials defined by cos nx is given by the sum oo 1 + yjBcosm# cosm0)rm =: Pr(co&6, cos0). E.1.15) m=\ When 0 = 0, we have 1-r2 E.1.16) m=\
244 5 Orthogonal Polynomials which shows the positivity of the sum Pr(cos9, 1) for \r\ < 1. This implies the positivity of Pr(cos9, cos0) in \r\ < 1, since 2cosw#cosot0 = cos m (9 + ф) + cosm(9 — ф). We end this section with some results concerning the zeros of Tm(x), which can be given explicitly since we are dealing with cos m9. Since cos m9 = 0 when 9 = Bn+ \)n/2m, it follows that Tm{x) has m simple zeros in (—1, 1), given by cosBn + \)n/2m, for n = 0, 1, 2,..., m — 1. Moreover, the zeros of Tm(x) and Tm+\(x) mutually separate each other. Also observe that between two successive zeros of Tm{x), cos[Bfc + Ъ)п/2т] and cos[Bfc + 1Oг/2от], there is a zero of Tn (x) for n > m. This follows from the fact that we can always find a nonnegative integer I < n — 1 such that 2k + 1 21 + 1 2k + 3 < < . 2m 2n 2m These properties of the zeros of Tn (x) have generalizations to general orthogonal polynomials. 5.2 Recurrence Let a{x) denote a nondecreasing function with an infinite number of points of increase in the interval [a, b]. The latter interval may be infinite. We assume that moments of all orders exist, that is, Ja x"da(x) exists for n = 0, 1, 2,.... Definition 5.2.1 We say that a sequence of polynomials {pn (x)}g°, where pn (x) has exact degree n, is orthogonal with respect to the distribution da{x) if Pn{x)Pm(x)da{x) = hn8mn. E.2.1) / Ja In effect the next theorem says that {pn{x)} satisfies a three-term recurrence relation. Theorem 5.2.2 A sequence of orthogonal polynomials {р„(х)} satisfies pn+i (x) = (Anx + Bn)pn (x) - CnPn-\ (x) for n > 0, E.2.2) where we set p~i(x) = 0. Here An, Bn, and Cn are real constants, n — 0, 1, 2, ..., and An_\AnCn > 0, n = 1,2, If the highest coefficient of р„(х) is kn > 0, then . _ kn+i _ An+\ hn+\ " ~ ~J~' +1 ~ ~A h~' where hn is given by E.2.1).
5.2 Recurrence 245 Proof. First determine An so that pn+i (x) — Anxpn (x) is a polynomial of degree n. Then n pn+i(x) - А„хр„(х) = ^2/bkPkix) E.2.3) k=0 for some constants bk. Note that, if Q(x) is a polynomial of degree m < n, then by E.2.1) р„(*)б(*)</а(*)=0. This implies that ^ = 0 for к < n — 1, as can be seen by multiplying both sides of E.2.3) by Pk(x) and integrating. This proves E.2.2). It is clear that An — kn+\/kn. To derive the final result, multiply E.2.2) by pn-\ (x) and integrate to get 0 = An pn(x)xpn-i(x)da(x) - Cn / pl_x{x)da{x). J a J a Since Xpn-\(X) = -—pn{x) + 2__ " k=0 we get A - Cnhn-X = 0. An-\ This proves the theorem. ¦ Corollary 5.2.3 hn = (A0/An)ClC2 ¦ ¦ ¦ Cnh0. This follows from the equation hn = An-iCnhn-i/An by iteration. Corollary 5.2.3 shows that the L2 norm of р„(х) can be computed from the recurrence relation. A converse of Theorem 5.2.2 also holds. If a sequence of polynomials [pn{x)} satisfies E.2.2) then {pn(x)} is orthogonal with respect to a positive measure. This is usually called Favard's theorem. See Szego [1975, §3.2] orChihara[1978,p. 21]. Remark 5.2.1 The form of the recurrence relation given in E.2.2) is the most useful when finding pn+i(x) from pn(x) and pn-i(x). However, there are other reasons for considering the three-term recurrence relation satisfied by orthogonal polynomials. Another useful form is xpn(x) =anpn+\(x) +bnpn(x) + cnpn-i{x),
246 5 Orthogonal Polynomials where а„,Ь„, and с„ are real. A similar calculation gives the relation an-\hn = cnhn-\. This implies that an~\cn > 0, n = 1,2, The L2 norm of р„(х), that is, hn, now has the form An important consequence of the recurrence relation in Theorem 5.2.2 is the following result, called the Christoffel-Darboux formula. Theorem 5.2.4 Suppose that the р„ (х) are normalized so that rb K= Pi J a 2{x)da{x) = 1. Then Е, ч , ч к" Pn+l(x)Pn(y)- Pn + l(y)Pn(x) Рт (у)Рт (X) = , E.2.4) т=0 *л+1 X — У where kn is the highest coefficient of pn(x). Proof. The recurrence relation E.2.2) implies that Pn{y)Pn+\{x) = (А„х + В„)рп(х)р„(у) - Cnpn-i(x)pn(y) and р„(х)р„+х(у) = (А„у + В„)р„(у)р„(х) - Cnpn-i(y)pn(x). Subtract and divide by An (x — y) to get 1 Pn (y)Pn+i (x) - р„ (x)pn+i (y) An x — у ,ч ,Ч| 1 Pn-l(y)Pn(x) - Рп-\(Х)Р„{У) = Pn (x)Pn (у) + ¦ E.2.5) We have used the fact that С„ = An/An-\, since hn = 1. Repeated application of E.2.5) gives the required result when we observe that An = kn+i/kn. ¦ Remark 5.2.2 \ihn ф 1, then E.2.4) takes the form V^ Pm (y)Pm (X) к„ pn+i (X)pn (y) - pn+l (у)р„ (х) ?^0 hm kn+i (x - y)hn The following theorem gives the confluent form of E.2.4), that is, when x — y.
5.2 Recurrence 247 Theorem 5.2.5 When hn = 1, then " к J>2(x) = T-MK+iW^W - Pn+i(x)p'n(x)). E.2.6) Proof. Write the right side of E.2.4) as К (р„+\ (х) - pn+i (y))pn (y) - (р„ (x) - pn (y))pn+i (y) K+\ (x - y) and let у —> x. The result follows. ¦ Corollary 5.2.6 p'n+l(x)pn (x) - pn+x (x)p'n (x) > 0 for all x. To conclude this section, we show how the three-term recurrence relation for Jacobi polynomials can be found. Earlier, we derived this formula from a contigu- contiguous relation, but this is hardly a practical idea. The methods given below can be extended to other hypergeometric orthogonal polynomials. Consider the polynomials n — 0,1, 2, Write the recurrence relation as A -x)pn(x) = Anpn+i(x) + В„р„(х) + Cnpn-i(x), n =0, 1,..., where p-\(x) = 0. To obtain An, equate the coefficients of A — x)n+l. It remains to find Bn and С„. Take x = 1 to get 0 = An + Bn + Cn or Bn = -{An + Cn). From Remark 5.2.1, we see that Cn = An_ihn/hn_u where hn is the L2 norm of pn (x) and its value follows from the L2 norm for Jacobi polynomials given in B.5.14). Thus the recurrence relation is obtained. The next method simultaneously yields the recurrence and the orthogonality of Jacobi polynomials. It is clear that 1 — x\ ' р„(х) = An+ipn+i(x) + Anpn(x) + An-lpn-i(x)+ ¦¦¦ +Aopo{x). E.2.8)
248 5 Orthogonal Polynomials Set x = 1 to get An = -(An+i + А„_! + А„_2 + • • • + Ao). E.2.9) This implies that ' 1 - An _,(/?„ (x) — pn-i (x)) + other terms. A short calculation shows that Therefore, 1-лЛ ' 2 An_12/V 1 2 a + 1 "»-^"V « + 2 ' 2 + other terms. Equating the highest power of ^- gives "+1 ~ B The next highest power gives A n(n Now one finds that with these values of An and А„_! the sum of the first two terms on the right equals the polynomial on the left. Thus An_2 = А„_3 = • • • = Ao = 0, and we have the three-term recurrence relation. Moreover, we have also proved the orthogonality of these polynomials, since by Favard's theorem such recurrence relations are satisfied only by polynomials orthogonal with respect to some positive measure. 5.3 Gauss Quadrature The need to approximate an integral that cannot be evaluated exactly has existed since the start of calculus. Newton used the method of interpolating a function at n points and then integrating the interpolating function. He used polynomials to do the interpolation.
5.3 Gauss Quadrature 249 We use Lagrange interpolation polynomials here. Suppose x\ < X2 < ¦ ¦ ¦ < xn в a set of n numbers in an increasing sequence and y\,y2,...,ynisan arbitrary set of numbers. Definition 5.3.1 The Lagrange interpolation polynomial is a polynomial of de- decree n — 1 that takes the value yt at x,- for i = 1, ..., n. This polynomial is given by j^ P'(Xj)(X - Xj) *here P(x) = (x - xi) ¦ ¦ ¦ (x -xn). We write It is clear that Thus if f(x) is a continuous function whose values /(x,) are known at the points t, . i = 1, 2,..., n in an interval [a, b], then (x)f(xj) E.3.3) is a polynomial of degree < n — 1 which interpolates the function / in [a, b]. Formula E.3.3) can be applied to approximate integration. We have г f(x)da(x) 'а where fb Xj := / ?j(x)da(x). E.3.5) Ja It is evident that E.3.4) is exact if /(x) is a polynomial of degree < n - 1. For, in this case Ln(x) = /(x). There are a number of ways to measure how well the quadrature method approximates the integral. The most obvious is to see how large the difference is. There is another way that has been very fruitful: Require that the quadrature method be exact for as large a class of functions as possible. For the interpolation method above, there are 2n parameters, A.* and xk. When the xk are given in advance, it is easy to determine kk so that there is equality in
250 5 Orthogonal Polynomials E.3.4) for all functions /(x) that are polynomials of degree at most n — 1. One simply requires that the approximating polynomial agree with / at n points, so that the two are identically equal when / is a polynomial of degree at most n — 1. This is as far as one can go by means of Lagrange interpolation at fixed points. However, if one does not require that the points xk be fixed, there is a possibility of increasing the degree of the polynomial by one for each хь which is allowed to vary. The maximum degree should be In — 1 when all the x^s are allowed to vary. This appears to be a difficult nonlinear problem, for we seem to need to solve In equations that are linear in the A^s but nonlinear in the x*s: pb pb l xjda(x), j = The solution is contained in the next theorem, known as the Gauss quadrature for- formula. Before stating it we introduce some notation. Suppose \Pn (x)} is a sequence of polynomials orthogonal with respect to the distribution da(x), that is, ? b Pn(x)Pm(x)da{x) = 0 for m ф п. E.3.6) Let Xj = Xjn = Xj<n, j = 1, 2,..., n, denote the zeros of Pn(x). We prove in the next section that these zeros are simple and lie in the interval [a, b] used in E.3.6). We saw an example of this in the Cheybshev polynomials of the first kind, Tn (x), which had n simple zeros in [—1, 1]. Theorem 5.3.2 There are positive numbers X\, Яг,..., Я„ such that for every polynomial f(x) of degree at most 2n — 1 /b E.3.7) where Xj, j = 1, ..., n are as defined after E.3.6), and Xj = Xjn = Я7,п. Proof. Let /(x) be an arbitrary polynomial of any degree. Then by the division algorithm, f(x) = Pn(x)Q(x) + R{x), where Pn(x) is as in E.3.6) and deg R < n — 1. Since Xj are the zeros of Pn (x), we have f(Xj) = R(xj) for./= 1,2,..., и
5.3 Gauss Quadrature 251 and ь л « f(x)da(x) = Pn{x)Q(x)da(x) + Y,*-jf(xj), E-3.8) where Xj is defined in E.3.5). Now E.3.7) is exact if jf Pn(x)Q(x)da(x)=0. E.3.9) Since E.3.8) is true for deg Q{x) < n — 1, it follows that E.3.7) is exact for poly- polynomials f(x) of degree < In — 1. We have only to show the positivity of Xj. For this, note that tj — tj is a polynomial of degree In — 2 which vanishes at xk, к = 1,2, ...,n. So, where deg Q < n - 2. Thus pb pb / (t) - lj)da(x) = / Pn(x)Q(x)da(x) = 0 J a J a. and /"* , fb ¦> Xj = I lj(x)da(x)= / lj(x)da(x) > 0. This proves the theorem. ¦ Now, if /(x) is not a polynomial of degree < 2n — 1, then E.3.7) is not exact, but we can use the right side as an approximation for the left side. Here the question of the error involved is of great importance. We do not go into this question in depth but merely prove that the right side of E.3.7) tends to the left side as n ->• oo if /(x) is a continuous function. Theorem 5.3.3 If /(x) is continuous on a finite interval [a, b], then n ^ rb Hm 2jA>/(xjn) = / f(x)da(x), "^°° j=l ¦>« where Xjn andx]n are as in Theorem. 5.3.2. Proof. First note that by Weierstrass's approximation theorem (see Exercise 1.40), for every e > 0, there is a polynomial p(x) such that |/(x) - p(x)\ < e/BS) for all x € [a, b].
252 5 Orthogonal Polynomials Here S = 2^knk= / da(x). For notational convenience, denote g{x)da(x) and In(g) = V^jfi;,) 7=1 where g is any continuous function in [a, b]. Then \ПЛ-1(Р)\< I \f(x)-p(x)\da(x)<e/2 Ja and \Ш) - In(p)\ < ^Я7„|/(х7„) - p(xjn)\ < e/2, 7 = 1 so that \Ш) - Hf)\ < |/и(Л - /Я(РI + |/h(p) - Up)\ + \Hp) Take 2n - 1 > deg p(x) so that In(p) — I(p) and The conclusion of the theorem follows immediately from this inequality. ¦ Remark 5.3.1 Gauss considered the case where da(x) = dx in Theorem 5.3.2. The orthogonal polynomials are then Legendre polynomials given by _ p@,0)/ ч _ for the interval [— 1, 1]. Remark 5.3.2 When da(x) = dx/*J\ — x2, we get the Chebyshev polynomials of the first kind. In this case, one can prove that A-i = A.2 = ¦ • • = A.,, and E.3.7) reduces to COS— 7Г , 2n when / is a polynomial of degree < 2n - 1. A converse of this result is also true. See Natanson [1965] for this and for Exercises 3-10.
5.4 Zeros of Orthogonal Polynomials 253 5.4 Zeros of Orthogonal Polynomials We have seen that the Chebyshev polynomial Tn (x) has n simple zeros in [ — 1, 1 ]. More generally, one can prove the same about Jacobi polynomials by using the representation C(l-x)~a(l+x)^?^{(l-x)"+a(l+x)"+^} and Rolle's theorem together with induction. The next theorem shows that a similar result is true for orthogonal polynomials in general. Theorem 5.4.1 Suppose that {Р„(х)} is a sequence of orthogonal polynomials with respect to the distribution da(x) on the interval [a,b]. Then Pn(x) has n simple zeros in [a, b]. Proof. Suppose Pn(x) has m distinct zeros хь хг,..., xm in [a, b] that are of odd order. In that case Q(x) = Pn(x)(x - xO(x - x2) ¦ • ¦ (x - xm) > 0 E.4.1) for all x in [a, b]. If m < n, then by orthogonality I ¦b Q{x)dx = 0. E.4.2) However, the inequality in E.4.1) implies that the integral in E.4.2) should be strictly positive. This contradiction implies that m = n and that the zeros are simple, yielding our result. ¦ For the next theorem we denote the zeros of Pn (x) in increasing order by x\n < X2n < " ' • < Xnn. Theorem 5.4.2 The zeros of Pn (x) and Pn+\ (x) separate each other. Proof. From Corollary 5.2.6, Pn+l{x)P'n(x) - Pn(x)P'n+x(x) < 0. Since Xjt,n+i is a zero of Pn+i(x), we get l(xk,n+l) > 0. The simplicity of the zeros implies that P^+l(xkin+\) and P^+x{xk+\in+\) have different signs. It follows that Pn(xk,n+i) and Pn(xk+iill+1) have different signs. By the continuity of Р„, we know it has a zero between Xk,n+i and x^+i,n+\ for к = 1,2,... ,n, and our result follows. ¦ We can obtain an extension of Theorem 5.4.2 by using the Gauss quadrature formula. Theorem 5.4.3 Let m < n. Between any two zeros of Pm(x) there is a zero of Pn{x).
254 5 Orthogonal Polynomials Proof. Suppose there is no zero of Pn(x) between xkm and xk+i<m. Consider the polynomial Pm(x) g(x) = (x - xkm){x - xk+Um) It is clear that g(x)Pm(x) > 0 for x g (xkm,xk+\_m). By the Gauss quadrature formula b " g(x)Pm(x)da(x) = y^g(xjn)Pm(xjn). " 7 = 1 Since g(xJn)Pm(xjn) > 0 and cannot vanish for all j = 1,..., n, and Xj > 0 for all j, we see that the sum is positive. The integral, however, is zero by orthogonality. This contradiction proves the result. ¦ We conclude this section with the Markov-Stieltjes inequalities for the sums Z)t=i ^ь where j < n. Once again we let xj, j = 1,2,... ,n, denote the zeros of Pn (x) in increasing order. Theorem 5.4.4 The Markov-Stieltjes inequalities j da(x) < У^Хк = 1 "" k=\ hold for j = 1,2, ...,n. Proof. The Gauss quadrature formula is fb ТГ- / f(x)da(x) = 2,^kf(xk), E.4.3) J" k=i where / is a polynomial of degree < 2л — 1. We have already noted that f(x) = 1 gives da(x). If E.4.3) were exact for the step function k=i Ja fjix)-\0, x>Xj, we would have the value of Хд=1 ^ as f*' da(x). But fj(x) is not a polynomial of degree < 2n - 1, so we use the following idea. Define polynomials фп(х, j)
5.4 Zeros of Orthogonal Polynomials 255 a x1 Figure 5.1 and Фп(х, j) (see Fig. 5.1) of degree <2n —2 such that 0, k = j,...,n; E.4.4) О, fc = j + !,...,«; and E.4.5) E.4.6) We first assume the existence of these polynomials to derive the Markov- Stieltjes inequalities; we then prove that they exist. Use ф„(х, j), a polynomial of degree < In — 1, in the Gauss quadrature formula E.4.3), to get 7-1 л $^л* = / фп(х, j)du{x) k=l ¦'" < / fj(x)da(x)= f ' J a J a Use of Ф„(х, У) gives к=\ da(x). These inequalities may be written as J (X and we have the Markov-Stieltjes inequalities. To show the existence of фп(х, j), note that the conditions E.4.4) together with ф'н(хк, j) = 0, кф], E.4.7)
256 5 Orthogonal Polynomials determine a polynomial of degree < 2n — 2. It is now sufficient to prove that Фп(х, j) touches but does not cross the line у = 1 and crosses у = 0 at xj and does not cross it again after that. Observe that, if фп(х, j) crosses у = 1 at Vi points and у — 0 at v2 + 1 points, then by E.4.4) and E.4.7), ф„(х, j) - 1 has at least 2G — 1) + vi zeros for x < Xj and ф„(х, j) has at least 2(n - j) + v2 + 1 zeros for x > Xj counting multiplicity. Thus ф'п (x, j) has at least 2(j — 1) + vi — 1 + 2(и — j) + V2 = 2n — 3 + vi+V2 zeros. Since ф'п is of degree In — 3, vi and v2 must be zero. This proves one part of the inequality E.4.6). The other part is done in the same way and the theorem is proved. The second part is also a consequence of the first part; replace xk with b - xk and ф with 1 — ф. ш 5.5 Continued Fractions Continued fractions of a certain type are closely connected with orthogonal poly- polynomials. This connection has been extensively studied. We shall merely touch on this topic and prove an interesting result of Stieltjes. Suppose {an}f and {bn}8° are sequences of complex numbers. One notation for an infinite continued fraction is We shall denote the nth convergent of this continued fraction by Cn. So Co = b0 =: —, Ci = bo + — = =: —, ? b tS a\ a2 a\ bo{b\b2 + a2) + a\b2 t2 = Do + -.—- j- = t>0 + 0+0 T;7Г = t>l+a2/t>2 Definition 5.5.1 We say that the continued fraction E.5.1) converges, if at most a finite number of Cn are undefined and limn^oo Cn exists. (In the above example, C2 is undefined if B2 = b\b2 + a2 = 0.) The sequences {An} and {Bn} defined above satisfy the three-term recurrence relations given in the next lemma. Lemma 5.5.2 Forn > 1, An = ЪпАп-Х + а„Ап_2, A_i = 1 E.5.2) and Bn = bnBn-i + anBn_2, B-i = 0. E.5.3) Proof. Since a\ a2 an C b +
5.5 Continued Fractions 257 and „ , , a\ a2 an an+l Ln+i =t>o+ -— -—— ¦ ¦ ¦ t— t>+t>+ O+ t>2+ On+On+l we see that Cn+\ is obtained from Cn by replacing an with anbn+\ and bn with bnbn+\ + an+i. Suppose the result of the lemma is true up to n (it is clearly true for и = 1); we have An+l = (bnbn+i +an+i)An-i + anbn+lAn-2 = bn+i(bnAn-i +anAn-2)+an+lAn-i = bn+\An + an+iAn-i. This proves the result for An by induction; the proof for the sequence {Bn} is similar. The lemma is proved. ¦ We prove the following lemma by a method similar to the proof of the Christoffel- Darboux identity. Lemma 5.5.3 Proof. Multiply E.5.2) by Bn-X and E.5.3) by An_i and subtract to get AnBn-i - BnAn_Y = -an(An_iBn-.2 - Bn-.xAn-2). Now iterate to get the result. ¦ The recurrence relation E.2.2) satisfied by a sequence of orthogonal polynomials {Р„{х)} when compared with E.5.3) suggests the consideration of the continued fraction Ao C\ C2 E.5.4) Aox + BQ- Aix + #i- A2x + B2- In this case the nth convergent is a rational function whose denominator is Pn(x). We denote the numerator by P*(x). The sequence {P*(x)} satisfies the same recursion, namely CiW = (A"x + Bn)P*{x) - CnP*_x{x), n > 1, E.5.5) but P*(x) = 0, P?(x) = Ao- Suppose that the sequence {Pn (x)} is orthogonal with respect to the distribution da(x) on [a, b]. The next result relates P*(x) to Pn{x).
258 5 Orthogonal Polynomials Theorem 5.5.4 With Pn(x) and Р„{х) as defined above, we have P*n{x) = § f" Pnix) ~_Pn{t) da{t), n > 0, E.5.6) Ja X t where S is a constant. Proof. The result holds for n = 0 because in that case Pq(x) = constant. For n = 1, P\{x) = k\x-\- constant. Thus the result holds in this case as well, if we adjust S. If n > 2, then denote the right side of E.5.6) by Rn{x) and observe that Rn(x) - (AB_i* + Bn^)Rn^{x) + Cn b Pn(x) - Pn(t) - (An-ix + fi X - t Cn-x{Pn-2{x) - Pn-2i,t)) H da(t) x -1 X — t (A,.,* + Bn-X)Pn-j{t) fb = 8An-i / Ja This means that /?n(x), which is the right side of E.5.6), satisfies the same recur- recurrence relation as P* with identical initial values. This proves the theorem. ¦ The next result is an application of the Gauss quadrature formula. Let jc* ,k = 1, 2,..., n, denote the zeros of Pn(x). Theorem 5.5.5 Using the notation of Theorems 5.3.2 and 5.3.3, we have P^1=S±^^, E.5.7) where S is the constant appearing in E.5.6). Proof. The rational function P*(x)/Pn(x), expressed as a partial fraction, is PnW fr[ Р!г(Хк)(х - Xk) (Note that the degree of P* (x) is less than the degree of Pn (x).) By Theorem 5.5.4, it follows that P (t) nK da(t)=SXk. E.5.8)
5.6 Kernel Polynomials 259 The last equality follows from the Gauss quadrature formula in Theorem 5.3.2. The result is proved. ¦ Theorem 5.5.5 is due to Stieltjes [1993, paper LXXXI]. The next theorem was presented by Markov [1895, p. 89]. Theorem 5.5.6 Let [a, b] be a finite interval. For any x g [a, b] E.5.9) Proof. For any x g [a,b], the function ^ is a continuous function of t in [a, b]. This observation taken together with Theorems 5.3.3 and 5.5.5 imply the result. ¦ Remark 5.5.1 Since Xk > 0, it is an immediate consequence of E.5.8) that the zeros of P* and Pn alternate. Remark 5.5.2 In Theorem 5.5.6, we may take x to be a complex number that does not lie in [a, b]. If we denote the right side of E.5.9) by F{x), then the inversion formula of Stieltjes is given by 1 fd a(c) - a(d) = lim / Im{F(M + iv)}du. Ж v->0+ Jc Thus, the distribution can be recovered from F. 5.6 Kernel Polynomials In Section 5.1 we saw that the partial sum of the Fourier series of a function when expressed as an integral gave us the Chebyshev polynomials of the third kind: Vn(x) — sin(n + 1/2N/sin(в/2), where x — cos6. More generally, we get the kernel polynomials when we study partial sums involving orthogonal polynomials. Let {pn(x)} be a sequence of polynomials orthogonal with respect to the dis- distribution da(t) on an interval [a, b]. As before, — oo < a < b < oo. Let / be a function such that fa f(t)pa(t)da(t) exists for all n. The series corresponding to the Fourier series is given by aoPo(x) + aipi(x) H h anpn(x) -\ , E.6.1) where fl»=/ f(t)pn(t)da(t) / f {pn(t)}2da(t). E.6.2)
260 5 Orthogonal Polynomials In this section we assume that the denominator of an is one, that is, the sequence {pn(x)} is orthonormal. Then the nth partial sum Sn(x) is given by f(t)pk(t)da(t) = / f(t)Kn(t,x)da(t), E.6.3) Ja where n J2 E.6.4) *=0 Definition 5.6.1 For a sequence of orthonormal polynomials {pn(x)}, the se- sequence {Kn(x0, x)}, where n Kn(xo,x) = у ^ Pk(xp)pk(x), k=0 is called the kernel polynomial sequence. Lemma 5.6.2 If Q(x) is a polynomial of degree < n, then Q{x)= / Kn{t,x)Q{t)d<x{t). Ja Proof. Clearly, k=0 for some constants ak. Multiply both sides by pj(x) and integrate. Orthogonality gives us / Q{t)Pj(t)da{t)=aj. Ja The lemma follows immediately. ¦ Theorem 5.6.3 Suppose xq < a are both finite. The sequence {Kn(xQ, x)} is orthogonal with respect to the distribution (t — Xo)da(t). Proof. In Lemma 5.6.2, let Q(t) = (t — x0)Qn-\{t), where Qn-\ is an arbitrary polynomial of degree n — 1. The theorem follows. ¦ Remark 5.6.1 A similar result is obtained when b < xq are both finite.
5.6 Kernel Polynomials 261 Remark5.6.2 In the case ofthe Chebyshev polynomials Tn(x) with xo = a = —1, we see that for x = cos в, Kn(-l,cosd) = cos#+ cos2#- ••• + (-l)"cosn# cos(n+1/2H 1 ' cos(#/2) " The polynomials cos (n + k)Q Wn(x) = i—^-,x=cos<? cos I are the Chebyshev polynomials ofthe fourth kind and Theorem 5.6.3 implies that the sequence {W^x)} is orthogonal with respect to the weight function л/yzf ¦ If we choose xq = b = 1, then we get the Chebyshev polynomials of the third kind, Vn(x), which are orthogonal with respect to the weight [A — x)/(l + x)]1/2. Remark 5.6.3 Another straightforward consequence of Theorem 5.6.3 is that if {Pn(x)} is orthogonal on [a, b], then pn{x) pn+l(x) _ г~г ГТ" — An4n\X)\x a), pn(a) pn+\{a) where {qn{x)} is orthogonal with respect to (t — a)da{t), for we have Pk(x)Pk(a) k=0 * Anpn(a)pn+i(a) \pn+i(x) pn{x) {x-a)hn [pn+i(a) pn(a) - ИпЧп(х), for a constant fxn. The last equation also implies that , л , . р„(х)рп(а) пп These results will be used in Chapter 6, Section 6.4. The Christoffel-Darboux formula (Theorem 5.2.4) gives a compact expression for the kernel polynomials: kn pn+i(x)pn(x0) - pn+\(xo)pn(x) /CzrC4 ^„(xo,x) = . E.6.5) K-n+l X — Xq If we choose x0 to be the kth root of pn(x), that is, x0 = xk, then we can write „ , , kn pn+i(xk)Pn(x) ,„ , ,. К„(хк,х) = — . E.6.6) «71+1 X — Xk This expression for Kn suggests a connection with the Gauss quadrature formula. In fact, we have the following theorem:
262 5 Orthogonal Polynomials Theorem 5.6.4 The numbers Xk (or Xkn) occurring in the Gauss quadrature formula are given by X k^\ Xk = --^ \ ; kn pn+\(xk)p'n(xk) their reciprocal is E.6.7) \/Xk = Kn(xk, xk) = ^(pk(xk)f. E.6.8) k=o Proof. The expression for Xk in Gauss's formula is /b pn(t)da(t) p'n{xk){t -xk)' By E.6.6) and Lemma 5.6.2, f Kn{ n(Xk) Ja kn Pn+\(Xk)P\ _ kn+i 1 kn рп+\{хк)р'п{хкУ This proves E.6.7). To derive E.6.8), let x ->• xk in E.6.6). Thus, К K(xk,Xk) — Pn+l(Xk)Pn(Xk)- kn+i This proves E.6.8). ¦ The kernel polynomials also have a maximum property, as contained in the next theorem. Theorem 5.6.5 Let xq be any real number and Q{x) an arbitrary polynomial of degree < n, normalized by the condition rb \ (Q{t)fda{t) = \. Ja The maximum value of(Q(xo)J is given by the polynomial Q(x) = ±Kn(x0,x)/^/Kn(x0,x0) and the maximum itself is Kn(xo, xq). Proof. Since Q(x) is of degree < n, we have Q(x) = aopo(x) + aipi(x) -\ \- а„р„(х).
5.7 Parseval's Formula 263 The normalization condition gives al + a\-{ bfl?= 1. By the Cauchy-Schwartz inequality n J2 = *«(x°' Xo)" k=0 Equality holds when ak = Apk(xo), where A is determined by k=0 This proves the theorem. ¦ 5.7 Parseval's Formula Let L?(a, b) denote the class of functions / such that rb / \f\"da(x) < (X). Ja As always, we assume that f x"d<x(x) < oo for n > 0. In this section, we are interested in the space L2a(a, b). By the Cauchy-Schwartz inequality, we infer the existence of / f(x)x"da(x), Ja for n > 0. Theorem 5.7.1 Suppose f € L2a(a, b). Let Q(x) be a polynomial of degree n, such that n Q(x) = ^akPk{x), k=0 where {pn(x)} is the orthonormal sequence of polynomials for da. The integral I U(x) - Q(x)]2da(x) E.7.1) Ja becomes a minimum when fb ak= f{x)pk{x)da{x). E.7.2) Ja
264 5 Orthogonal Polynomials Moreover, with ak as in E.7.2), Proof. Let с* = fa f (x)pk{x)da(x). By the orthonormality of {pn(x)}, we get /¦ 0< / [/(x) - /•ft « n = / [f(x)]2da(x) - 2 J] fltct + ^ a,2 ^a k=0 k=0 = f [f(x)fda(x) -J24+ J2(ak - ckf. The last expression assumes its least value when ak = ck. This proves both parts of the theorem. ¦ Corollary 5.7.2 For f e L2a{a, b) and ak as in E.7.3), we have Х^«„2< / U(x)]2da(x). E.7.4) Proof. The sequence of partial sums sn — YH=oak is increasing and bounded. ¦ The inequality E.7.4) is called Bessel's inequality. We now seek the situation where equality holds. Assume that [a, b] is a finite interval. We shall use the following result from the theory of integration. Lemma 5.7.3 For f e L2a{a, b), and a given € > 0, there exists a continuous function g such that fb [f(x)-g(x)]2da(x)<€. / Ja Theorem 5.7.4 Let [a, b] be a finite interval. With the notation of Theorem 5.7.1, we have Parseval's formula: oo „b al= U(x)]2da(x). E.7.5) k=o Ja Proof. Suppose g is as in Lemma 5.7.3. By Weierstrass's approximation theorem, for a given e > 0 there exists a polynomial Qn (x) such that / Ja \g(x) - Qn(x)]2da(x) < €.
5.7 Parseval's Formula 265 This implies rb [/(*) - Qn(x)]2da(x) < 4e. E.7.6) / Ja By Theorem 5.7.1, we may choose Qn(x) = J2l=o <*kPkM> where a* is given by E.7.2). As in the proof of Theorem 5.7.1, it follows from E.7.6) that Since € is arbitrary, we have proved the theorem. ¦ Corollary 5.7.5 Suppose f e L2a(a, b), where [a, b] is a finite interval. If I ь f(x)xnda(x) = 0 for all integers n > 0, E.7.7) then / = 0 almost everywhere. Proof Since ak = 0 for all к, it follows from Parseval's formula E.7.5) that [ [f(x)]2da(x)=O. Ja This implies the result. Exercises 24-28 give results similar to Corollary 5.7.5 for infinite intervals. Stone [1962] contains proofs of these results. Remark 5.7.1 We could also argue as follows: With Qn (x) as in E.7.6) it follows from E.7.7) and the Cauchy-Schwartz inequality that [f(x)]2da(x)\ = (j /MUM ~ Qn(x)]da(x) f [f(x)]2da(x) f [f(x) - Qnix)]2daix). Ja Ja So and the result follows. jf Corollary 5.7.6 With f as in Corollary 5.7.5 andsnix) = J2l=o akPkix), where ak is given by E.7.2), it follows that rb \\sn(x)-f(x)\\22= / [sn(x)-f(x)]2da(x)^0 Ja
266 5 Orthogonal Polynomials Proof. This result is contained in the proof of Theorem 5.7.4. ¦ The results of Theorem 5.7.4 and its corollaries are in general false when [a, b] is not finite. As an example (see Exercise 1.20), one may take da(x) = exp(—xM cosixn)dx, f(x) = sin^^ sin/Kjr), 0 < ц, < 1/2. There are, however, important examples of orthogonal polynomials over infinite intervals, such as the Laguerre polynomials on @, oo) and Hermite polynomials on (—oo, oo). We prove in the next chapter that Theorem 5.7.4 continues to hold in these cases. 5.8 The Moment-Generating Function In this section we obtain a continued fraction expansion for the moment-generating function J2n>o /J-nX", where Mn = (l,f«) = / t"da(t). E.8.1) Ja The treatment here follows Godsil [1993] and the reader should consult this book for further information on the methods of algebraic combinatorics in the theory of orthogonal polynomials. We assume a minimal knowledge of graph theoretical terminology. Let G be a graph with n vertices. The adjacency matrix A = A(G) is the n x n matrix defined as follows: If the j'th vertex is adjacent to the 7 th vertex, Ац = 1; otherwise it is zero. An edge {г, j} in G is considered to be composed of two arcs, (г, j) and (j, i). A walk in a graph is an alternating sequence of vertices and arcs where each arc joins the vertices before and after it in the sequence. If the first vertex is the same as the last one in the sequence, then it is a closed walk. The number of arcs in a walk is called the length of the walk. The following result is easily checked by induction: The number of walks in G from vertex г to vertex j of length m is given by (Am),7, that is, the entry in the jth row and 7th column of the matrix Am. We shall have to consider graphs with weighted arcs so that the entry (A)^ is the weight of the arc (г, j). We continue to denote this matrix by A = A(G). Let 0(G,x) = det(x/- and n>0
5.8 The Moment-Generating Function 267 Thus Wjj (G, x) is the generating function for the set of all walks in G from vertex i to vertex j. Let W(G, x) be the matrix whose entries are Wtj(G, x). Then n>0 From the fact that A adj (A) = det( A) /, where adj (A) is the adjoint of A, it follows that ,-K-l W(G,x)=x-l4>(G,x~l) This implies that Wu(G,x)=x-l4>(G\i,x~l)/4>{G,x-1), E.8.2) E.8.3) where G\i is the graph obtained from G by removing vertex i. The connection of the above discussion with orthogonal polynomials is obtained as follows: Suppose {pn(x)} is an orthogonal polynomial sequence satisfying the three-term recurrence pn+i(x) = (x- an)pn(x) - bnpn-x{x), и > 1. E.8.4) It is assumed that the polynomials are monic. Let A denote the matrix Ъ\ \ a\ b2 1 a2 Ъъ \ I where the rows and columns of the matrix are indexed by the nonnegative integers. Let An denote the square matrix obtained from A but taking the first n rows and columns. Observe that when det(xl — An) is expanded about the last row, we get det(x/ - An) - (x -an_! = pn(x). Thus pn(x) is the characteristic polynomial of An. Observe that matrix A is the adjacency matrix of a particular weighted directed graph G whose vertex set is indexed by the nonnegative integers. If only the first n vertices of G are taken, then the adjacency matrix of the subgraph is An. Denote this subgraph by Gn. We need the next lemma to derive the continued-fraction expression for the moment-generating function 5Zn>o(^' ?")x"> which is understood in the sense of a formal power series. We assume that /xq = A, 1) = 1.
268 5 Orthogonal Polynomials Lemma 5.8.1 For nonnegative integers n, Proof. First note that (Ak)oo = (A*)oo for ifc < 2л + 1, because no closed walk starting at 0 and of length < 2n +1 can include a vertex beyond the nth vertex. This implies(pn(A))oo = (pn(An))oo.Wehavealreadynotedthatforn > 1, pn(x)is\he characteristic polynomial of An. By the Cayley-Hamilton theorem, pn(An) = 0, and hence (I, pn) = (Pn(A))oo for n > 1 and for n = 0 by definition. Now x" is a linear combination of po, pi, ¦ ¦¦, pn, so the result follows. ¦ Theorem 5.8.2 With an and bn as in E.8.4), 1 x2b\ x2b2 , tn)xn = 1 — xuq— 1 — xa\ — 1 — xaz— Proof. Let An^ be the matrix obtained from An by removing the first к rows and columns. Set qn-k{x) =det(I -хАпЛ). Observe that 0(Gn, x) = det(xl - An) = x" det(/ - дс"^^) = ^«„(дГ1) and 0(Gn\O, x) = det(xl - АпЛ) = xn~l det(/ - x~l АпЛ) = ^-^„-itc). By E.8.3), we can conclude that x Woo(Cjn,x ) = — — x ——. E.8.5) <p(Gn,x) qnix-1) Expansion of det(/ — xAn) about the first row gives qn{x) = A -xao)qn-i(x) - x2biqn-2(x) or qn-\(x) qn(x) 1 - xa0 - x2biqn-2(x)/qn-.i(x)' E.8.6)
Exercises 269 By Lemma 5.8.1, V У2 = WooiCx) = lim ^zl^l. This combined with E.8.6) proves the theorem. Exercises 1. (a) Prove the positivity of the Poisson kernel for Tn(x) in the interval — 1 < r < 1 by showing that 1 -r2 rm = r. 1 - 2r cos в + r2 m = \ (b) Compute 00 sin(m + 1N sin(m + 1H m sin6> sin0 m=0 which is the Poisson kernel for Un(x). Observe that it is positive in the interval — 1 < r < 1. (c) Show that the Poisson kernel for sin(n + 1 /2N is r" sin(n + \/2)в sin(n + 1/2H A - r) sin@/2) sin@/2)[(l -rJ +4r(l - cos@+.p))/2cos@ - 0)/2] + 0)/2+r2][l -2r cos@ -0)/2+r2] 2. Suppose / has continuous derivatives up to order n in [a, b] and x\ • ¦ ¦ < xn are points in this interval. Prove the following Lagrange interpolation formula with remainder: fix) = Ln{x) + J—^(x-x,)(x-x2) ¦ ¦ ¦ (x-xn), where a < min(x, x\, хг,..., xn) < | < max(x, x\,..., х„) < b. Here Ln (x) is the Lagrange interpolation polynomial (denned by E.3.1)) that takes the value /(*,) at д:,-, i = 1,2,... ,n. A discussion of the results in Exercises 3-10 can be found in Natanson [1965]. This book also contains the references to the works of Hermite and Fejer mentioned in the exercises. 3. With the notation of Exercise 2, suppose that n Ln(x) = ^2Ak(x - x\)(x - x2) ¦ • ¦ (x - xk). k=0
270 5 Orthogonal Polynomials Show that Ак-г = ' Xj X\) • • • \Xj Xj — \){Xj X}+\) ' ' ' \Xj Xk) J Now let Xj¦ = a + (j — \)h for j = 1, 2,..., n. Show that Ak~l = /^ where and Aef(Xj) = 4. Let tj(x) be defined by E.3.2) with P(x) = (x - xi)(x - x2) ¦ ¦ ¦ (x - xn). Check that t'j(xj) = P"(xj)/2P'(Xj). Now show that the function H(x) defined by H(x) = E yj [l - |^(x - *,)] t){x) + XJy'jix ~ Xj)l){x) (where y\, уг,..., yn, y[,..., y'n is a given set of 2n real numbers) satisfies H(xj) = yj and H'(xj) = y'j, where H' denotes the derivative of H. Prove also that if / is as in the previous problem with derivatives of order 2n, then where yj = f(Xj) and y'j = f'(Xj) in the definition of H. Again f lies in the same interval as before. 5. Apply Gauss quadrature to the formula for f(x) in the previous problem to obtain rb n fQ-n), ч rb / f(x)da(x) = J2 hf(xk) + n / P^(x)da(x), a<r]<b. Ja ?_[ l^"). Ja Here {Pn(x)} is a sequence of polynomials orthogonal with respect to da(x) on [a, b] and the leading coefficient of Pn(x) is one. 6. (a) Prove that, for the Legendre polynomials, 2"n! dxn
Exercises 271 (b) Use the Christoffel-Darboux formula to prove that 1 PH(t)PH-i(x)-PH(x)PH-i(t) 2 dt = —. i t — x n (c) Deduce from (b) that if xk, к = 1,..., n are the zeros of Pn(x), then p"(r)-* = —2 1-х t - xk (d) Use Exercise 5 and the above to obtain the formula rl — 2f(xk) , 22и+1(п!L/Bи)A) J-x f^nPn-l where — 1 < § < 1. 7. Use Chebyshev polynomials to prove the following formula, which is similar to Exercise 6{d): dx -lff (co, Bk-Dn\ n with — 1 < § < 1. Note that all the Xk equal ^ in this case. (Hermite) 8. (a) Show that the roots of the Chebyshev polynomials Un (x) are cos -^, к = 1, 2,..., n and those of Vn(x) are cos ^j, k=l,2,... ,n. [Recall rn(cos#) = cosnd and Un(cos6) = si (b) Prove the quadrature formulas fl I г л v^ . , кл ( I \/\-x2f{x)dx = > sm2 /cos кл ' /r\ \ in?»_1_ 1 J \S/' Bn)!22"+1 and 1-х 4л" ^—^ 2 кж ( 2кл \ f(x)dx = > sin f cos 1 + x J 2n + 1 f^ In + Г V 2n + \) ' Bn)!22«J Vb/' where —1 < ^ < 1. Obviously the various f are not necessarily the same.
272 9. Prove that 5 Orthogonal Polynomials Tn(x)dx -i т;(Хк)(х-хк) >o, B1с)ж where xk = cos Hint: Write the integral as sinflt Г n Jo cos пв cos 0 — cos 6k sin Odd, where #? = B*2n1>7r • Apply the Christoffel-Darboux formula and integrate term by term. (Fejer) 10. Prove that -хк) where Un(xk) = 0. 11. Suppose {Pn(x)} is an orthogonal polynomial sequence. Let*b k = 1,2,..., n, denote the zeros of Pn(x). Suppose that Pn-i(x) ak is apartial fraction decomposition of Pn-\{x)/Pn{x). Prove thata^ > 0. 12. With the notation used in Lemma 9.5.2, show that provided bt ф 0, В, фО A < i < n). 13. Show that if {Р„ (*)} satisfies andF_!(x) = 0, then Bo 1 0 0 C2 A2x + B2 Pn(x) = for и 0 0 1 0 0 0 Cn-\
Exercises 273 14. Suppose An = 1 for all n in Exercise 13 and let Cn = \dn |2 = dndn. Then the zeros of Pn (x) are the eigenvalues of the matrix: Bo di J D *1 — " 1 0 J2 0 0 0 ••• d2 ••• -B2 rf3 0 0 0 dn-2 0 0 0 0 -Bn-2 dn-\ 0 0 0 : dn-\ —Bn-\ 15. Prove the following recurrence relations for Laguerre and Hermite polynomi- polynomials respectively: (а) (и (b) \)Lan{x) - (n +a)Lan_l(x), n =0, 1,2, 3,... = 2xHn(x)-2nHn-1(x), и =0,1,2,..., = 1, Я_!(х)=0. 16. Let {pn(x)}§° be an orthonormal sequence of polynomials with respect to the distribution da(x). Let rb i«= x" J a da(x), и = 0, 1,2, Show that pn(x) =Cn tin ¦ ¦ ¦ Mn+1 1 X X1 where Cn is a constant given by Cn = (Dn-\Dn) 1/2, when Dn is the positive- valued determinant [/X/t+mb,m=o,i H. 17. With the notation of Exercise 16, prove that pn(x) = Cn ц,ох - Mi Mi* - M2 - М2 Ц-2Х - M3 finx — Mn+i fln-lX — fln Ц,пХ — Mn+1 ••• M2n-2* — M2n-1
21A 5 Orthogonal Polynomials 18. With the notation of Exercise 16, prove that i=0 П (Xi - XjJda(xo)da(xi)---da(xn-i) and ¦da(xn). 1 f^ f^ D" = , , 14, / • ¦ • / П (Xi - x,-) (For the reference to Heine, see Szego [1975, p. 27].) (Heine) 19. Let 1 > x\(a, P) > хг(а, /J) ¦ ¦ ¦ > xn(a, fi) > -1 be the roots of the Jacobi polynomial P^a-^(x). Show that, fora > -1 andjS > — 1, да Эуб Proceed as follows: (a) Take a = -\,b = 1, and da{x) = A - x)"(l + x)fi in the Gauss quadrature formula (Theorem 5.3.2). Take the derivative with respect to a to get fi Лу-VI r1aC1 -I- r\P 1пяП rWr -1 (b) Take /(x) = {P^a-fi)(x)}2/(x - xk) to show that -f k x - xk дхк Н a \ dx Now observe that the expression in curly braces and x — xk have opposite signs. Prove Щ > 0 in a similar way. (Stieltjes) 20. This result generalizes Exercise 19. Let <w(x, r) be a weight function depen- dependent on a parameter r such that <w(x, r) is positive and continuous for a < x < b, t\ < т < T2- Assume that the continuity of the partial derivative Щ fora < x < b, x\ < т < T2 and the convergence of the integrals I dt occur uniformly in every closed subinterval r' < г < г" of (ть гг). If the
Exercises 275 zeros of Pn(x)=Pn(x, r) (the polynomials orthogonal with respect to w(x, r)) arexi(r)>X2(r)> • • • >xn(t), then the kth zero xk(x) is an increas- increasing function of т provided that ff/w is an increasing function of x, a<x<b. (Markov) 21. Let w(x) and W(x) be two positive, continuous weight functions on [a, b]. Let W(x)/w(x) be increasing. Prove that if {xk} and {Xk} denote the zeros of the corresponding orthogonal polynomials of degree n in decreasing order, then xk < Xk, к = 1,..., и. Hint: Take w(x, r) = A - x)w(x) + xW(x) in Exercise 20. See Szego [1975, §§ 6.12 and 6.21] for Exercises 19-21 and for references. 22. Use the result of Exercise 20 to show that if the parameters a and /3 of the Jacobi polynomials lie in [-1/2, 1/2], then the zeros of P%*'p)(x) satisfy 2k-\ 2k , 7Г < xk < п, к = 1, 2,.... п. 2и + 1 ~ 2и + 1 23. Suppose {?>п(*)}о° is an orthonormal sequence of polynomials with respect to the distribution da(x). Let where ck = f(x)pk(x)da(x). k=o Ja Prove that rb f(x0) - sn(x0) = / [/(xo) - f(x)]Kn(x, xo)da(x). Ja 24. Suppose / : [0, oo) —> Л is continuous and lim^^oo /(x) = 0. Show that / can be uniformly approximated by functions of the form e~ax p(x) where p(x) is a polynomial, when a is a fixed positive number. 25. Prove that if / e Lp@, oo), p > 1 (or / is a bounded measurable function), and (for a given a > 0) Г Jo f(x)e-axxndx=O for и = 0,1,2,..., о then/(x) =0а.е. 26. Suppose / : (—00, 00) —> R is continuous and lim^^ioo /(x) = 0. Prove that / can be uniformly approximated by functions of the form e~a x p(x) where p(x) is a polynomial. 27. Show that if / e Lp{~oo, 00), p > 1, and 2jr2xVx = 0, и =0,1,2,..., then /(x) = 0 a.e.
276 5 Orthogonal Polynomials 28. Show that k=0 (п + 2)Г(п+а+р x x-y 29. (a) Show that the Legendre polynomial Pn(x) is a solution of A - x2)y" - 2xy' + n(n + 1)у = 0. (b) Show that Qn(x) = \ /ij ^fzj-dt, x g [—1,1] is another solution of the differential equation in (a). (c) Show that Qn(x) = Pn(x)Q0(x) - Wn-ito, where Wn-{(x) is a poly- polynomial of degree n — 1 given by „. , , 1 /"' Pn(x)-Pn(t) Wn-\(x) = - dt. 2j-i x-t For -1 < x < 1, define Qn(x) = Pn(x)Q0(x) - Wn-i(x) with Goto = (d) Prove the following recurrence relations: Bn + \)xPn{x) = (n + l)Pn+l(x)+nPn-i(x), л = 0, 1, • • •, Bи + l)xG«to = (и + l)Gn+iCO + «Gn-i(x), и = 1, 2,.... (e) Prove that n i=0 " J | 1)[6«+itoG«(>')-6«(xNn+i(>')" and 1 " 1- V^r?i--4- ШГ>,ГгЛ12 Bk+l)[Qk(x)]2 (f) Show that Gn(^) has n + 1 zeros in — 1 < x < 1. See Frobenius [1871].
Special Orthogonal Polynomials Special orthogonal polynomials began appearing in mathematics before the signi- significance of such a concept became clear. Thus, Laplace used Hermite polynomials in his studies in probability while Legendre and Laplace utilized Legendre polyno- polynomials in celestial mechanics. We devote most of this chapter to Hermite, Laguerre, and Jacobi polynomials because these are the most extensively studied and have the longest history. We reproduce Wilson's amazing derivation of the hypergeometric representation of Jacobi polynomials from the Gram determinant. This chapter also contains the derivation of the generating function of Jacobi polynomials, by two distinct methods. One method, due to Jacobi, uses Lagrange inversion. The other employs Hermite's beautiful idea on the form of the integral of the product of the generating function and a polynomial. This generating function is then used to obtain the behavior of the Jacobi polynomial Fn(" ^'(x) for large n. We quote a theorem of Nevai to show how the asymptotic behavior of P^"'^ (x) gives its weight function. It is important to remember that the classical orthogonal polynomials are hypergeometric. We apply Bateman's fractional integral formula for hypergeo- hypergeometric functions, developed in Chapter 2, to derive integral representations of Jacobi polynomials. These are useful in proving positivity results about sums of Jacobi polynomials. We then use Whipple's transformation to obtain the lineariza- linearization formula for the product of two ultraspherical polynomials. This clever idea is due to Bailey. One of the simplest examples of linearization is the formula cosmd cosnO = -[cos(m +п)в + cos(m — п)в]. We observe that a linearization formula for a set of orthogonal polynomials is equivalent to the formula for an integral of the product of three of these poly- polynomials. We briefly discuss the connection between combinatorics and orthogonal poly- polynomials. In recentyears, this topic has been studied extensively by Viennot, Godsil, 277
278 6 Special Orthogonal Polynomials and many others. We content ourselves with two combinatorial evaluations of an integral of a product of three Hermite polynomials. This chapter concludes with a brief introduction to ^-ultraspherical polynomials. This discussion is motivated by a question raised and also answered by Feldheim and Lanzewizky: Suppose f(z) is analytic and \f(rew)\2 is a generating function for a sequence of polynomials pn(cos6). Do the pn(cos0) produce an orthog- orthogonal polynomial sequence other than the ultraspherical polynomials? The answer involves an interesting nonlinear difference equation that can be neatly solved. 6.1 Hermite Polynomials The normal integral f^°ooe~x2dx, which plays an important role in probability theory and other areas of mathematics, was computed in Chapter 1. The integrand e~x has several interesting properties. For instance, it is essentially its own Fourier transform. In fact, e~x = — / e-'e2ixtdt. F.1.1) This can be proved in several ways. (See Exercise 1.) The integral is uniformly convergent in any disk \x \ < r and is majorized in that region by the convergent integral -= / e-'2e2r'dt. Thus the integral can be repeatedly differentiated with respect to x, and we have ^— = ЩИ f e-t2tne2ixtdt. F.1.2) dxn Vя J-oo The polynomials orthogonal with respect to the normal distribution e~x are the Hermite polynomials. They can be defined by the formula Hn(x) = (-l)"e*2 * . F.1.3) It is easy to check that Hn (x) is a polynomial of degree n. By F.1.2), it is seen that (-2i)nex2 f°° i ,. Hn(x) = =— / e ' tne2lx'dt. F.1.4) Let us first prove the orthogonality property of Hn(x), namely / e-x2Hn(x)Hm(x)dx = 2nn\*Jn8mn. F.1.5) J — OO
6.1 Hermite Polynomials 279 Consequent from the definition F.1.3), we can write this integral as Suppose n > m and integrate by parts n times. This shows that the integral is zero. The m = n case will be considered a little later. The Hermite polynomials have a simple generating function. Observe that the term tn in the integrand suggests that we consider Fл 6) ^i- The integral can be computed by F.1.1). The result is the generating function for Hn(x): F17) This generating function is useful for deriving several properties of Hermite polynomials. For example, we have the following expression for these polynomials: This can be obtained by writing ^ p\ ^-^ Bq)\ and equating the coefficient of r11 on each side. From F.1.8), it follows that 2»!. dxn We can now complete the proof of F.1.5). Integration by parts shows that F.1.9) where F.1.9) is used in the final step. We learned in the previous chapter that orthogonal polynomials satisfy three- term recurrence relations. To find this relation for Hermite polynomials, note that F(x,r)= e2xr^2
280 6 Special Orthogonal Polynomials satisfies dF — -Bx-2r)F=0. dr Substitute the series in F.1.7) for F to get Hn+i(x)-2xHn(x) + 2nHn_1(x)=0, и = 1,2,.... F.1.10) This is the recurrence relation for Hermite polynomials. Another recurrence rela- relation comes from dF Тх-2гР-°- This implies H'n{x) = 2nHn_x(x), и = 1,2,.... F.1.11) Eliminate #n_i(x) from F.1.10) and F.1.11) to obtain Яп+1(х)-2хЯ„(х)+Я»=0. Differentiate this equation and use F.1.11) again to get Щ(х) - 2xH'n(x) + 2nHn(x) =0, и = 0, 1, 2,.... Thus the Hermite polynomials Hn(x) satisfy the second-order linear differential equation u" - 2xu' + 2nu = 0. F.1.12) It is also worth noting here that the function V(x)=e'x2/2Hn(x) satisfies the differential equation V" + Bn + l -x2)V =0. As another application of the integral representation F.1.4) for Hn(x), we derive a closed expression for the Poisson kernel for Hermite polynomials, namely ^Lrn = A _ r2rl/2eP4<r-UJ+>V]/U-'2). FJ.13) By F.1.4), Hn(y) = -
6.1 Hermite Polynomials 281 So for \r\ < 1, the left-hand side of F.1.13) becomes e*2+y2 г00 г00 x +y r°o roo / / -s2-t2+2iys+2itx-2str Л J-oo J-oo Now use the formula /•00 f J — e —oo twice in the double integral and F.1.13) follows. The various formal processes can be justified by the absolute convergence of the integrals involved. A different approach to the proof of F.1.13) using the three-term recurrence is as follows: Denote the series on the left of F.1.13) by K(r, x, y). By F.1.10) and F.1.11), дх 2»-i(n- 1)! ' y, Hn(x)Hn(y) *-( 2nn\ n=0 — = 2ry* Эх 00 и=0 n=l э# 2»- (х)Яп+1 2"и! :*)д,-1< Thus and by the symmetry in x and j ал ол =2гхК -г . ду Эх The last two equations imply that 1 ЪК _ 2ry - 2r2x К Эх I — г2 and so 2rxy — r2x2 log К 1- g(y, r) or A" = h(y, r)e{2rxy~r x )/A-r \ Again by the symmetry in x and y, we can conclude that K _ с(гче[2д:у/--(дг2+У2)г2]/A-г2)
282 6 Special Orthogonal Polynomials To find c(r), set x = у = 0 to get n=0 From F.1.8), - and Я2п+1@)=0. F.1.14) Therefore, n=0 , ^ This proves F.1.13) once again. Remark 6.1.1 An interesting formula for Fourier transforms can be formally obtained from formula F.1.13). Multiply both sides of F.1.13) by Hn (y)e~y2 and integrate over (—oo, oo) to get e-y2+[2xyr-(x2+y2)r2]/(l~r2) == Hn{y)dy = Hn(x)r". The validity of this formula for \r\ < 1 can be proven. Let r —> i and we have, at least formally, L Г е1ХУе-У212Hn{y)dy = ine~x2/2Hn(x). F.1.15) 27Т J-oo This equation embodies the self-reciprocity of Hermite polynomials. It gives e-x /2 ^ ^ as an gjgenfunction of the Fourier transform with eigenvalue /". There are various ways of proving F.1.15). See Exercise 11 for one method. Also see Exercise 12, which reproduces de Bruin's [1967] proof of Heisenberg's inequality using F.1.15). 6.2 Laguerre Polynomials The Laguerre polynomials are orthogonal with respect to the gamma distribution e~xxadx, where a > — 1. The definition of the Hermite polynomials and the proof of their orthogonality given by F.1.5) suggest consideration of the polynomial dxn
6.2 Laguerre Polynomials 283 An application of Leibniz's rule for derivatives shows that this expression is a polynomial of degree n. The Laguerre polynomial L" (x) is defined by the formula Lan(x):=^f~(e-xxn+a), forn>0. F.2.1) It is easy to check that KM = J1—rL 2J / Г „ = ^r^ i^i(-«;« +1; *)• F-2-2) The orthogonality relation for the Laguerre polynomials is contained in [°° a о a -г Г(а+и + 1) / Lm(x)Ln(x)x e ax = ; omn, a > —1. F.2.3) Jo The integral on the left is dn (e~xxn+a)La (x)dx n\J0 dx"{ )Lml) Suppose n > m and integrate by parts n times to see that its value is 0. For n = m, first observe that, by F.2.2), so that n integration by parts gives F.2.3) after an evaluation of the gamma integral at the last step. The generating-function formula for L" (x) is given by (-n)kxk h h 00 , 4t 00 • , -i ч и (a+k+\)nrn (-*#•)* = A - r)-" exp(-xr/(l - r)). F.2.4) Denote the generating function in F.2.4) by F(x,r). It is readily verified that dF A - r2)— + [x - A + a)(l - r)]F = 0. or
284 6 Special Orthogonal Polynomials This gives the three-term recurrence relation (n + \)Lan+l{x) + (x-a-2n- \)Lan{x) + (n + a)Lan_,{x) = 0, F.2.5) where n = 1, 2, 3, Before deriving the differential equation for Lan{x), we obtain an interesting formula for the derivative of Lan (x). The formula is dLa(x) xdhT = nL"(x) ~ (n+ff)L"-iw' for" - L F-2-6) This arises from the identity dF — aX which implies dL°{x) dLan_x(x) dx dx Eliminate Lan_x (x) from F.2.5) and F.2.7) to get + Lan_l(x)=0, forn>l. F.2.7) +0, + I) dx dx for n > 0. Replace n by n — 1 in this equation and eliminate (d/dx)L"_l(x) by means of F.2.7) to get F.2.6), the required result. Now differentiate F.2.6) and then apply F.2.6) and F.2.7) to arrive at d2La(x) dLa(x) x /У+(a+l~x)—^-+nLan(x) = 0, for«>0, F.2.8) dx1 dx Thus и = L" (x) satisfies the second-order linear differential equation xu" + (a + 1 - x)u' + nu=0. F.2.9) Because the normal integral is a particular case of the gamma integral, it should be possible to express Hermite polynomials in terms of Laguerre polynomials. Such a relationship exists and is given by H2m(x) = (~l)m22mrn\L-l/2(x2) F.2.10) and H2m+l(x) = (-l)m22m+]mlxL]J2(x2). F.2.11)
6.2 Laguerre Polynomials 285 To prove that #2m(x) = CL~1/2(x2) for some constant C, it is sufficient to show that, for any polynomial q(x) of degree < 2m — 1, /•0 / J— A general polynomial is the sum of an even and an odd polynomial. When q is odd, the integral is obviously zero. When q is even, it can be written as q(x) — r(x2), where л is a polynomial of degree < m — 1. Then for у = x2, the above integral becomes /•O / Jo ) F.2.12) a ) J /о by the orthogonality of L~1/2(y). The value of С can be found by setting x = 0. Relation F.2.11) can be proved in the same way, or by differentiating F.2.10). There is another way in which the normal integral is related to the gamma integral. The normal integral is a limit of the gamma integral. This gives another connection between Laguerre and Hermite polynomials. By Stirling's formula, r°° /r\a Иг r°° / (?) e-u-«) " _^^г= / e-**dx asa^oo. Jo \a) V2^ i-oo A change of variables x = a + t/*/2a gives /•oo У e The orthogonality relation for L"(x) implies Г° а 2 a -x Г( Jo L"(X)X € dX = ~ n\ Setx = a + \p2au. Then, by Stirling's formula, \ „--Jbiu lim (-) L«(a + V2^x) = (-1)"^^. F.2.14) ^o \a / n\ n! A comparison of F.2.12) and F.2.13) suggests that
286 6 Special Orthogonal Polynomials This may be verified by using the generating functions for the Laguerre and Hermite polynomials and is left to the reader. Recurrence relations or the definitions F.2.1) and F.1.3) can also be used to give easy derivations. We started the treatment of Hermite polynomials by expressing e~x as an integral. A similar approach can be taken for Laguerre polynomials. It follows from D.11.25) that e x" *n+" = / Jo By D.6.1), this leads to the integral representation of Lan{x) given by Lan(x) = — / t"+0l/2JaB*Jxt)e-'dt, F.2.15) n\ Jo fora > — 1. The reader should verify that the generating function for Laguerre polynomials follows easily from this. Also, since /2 /2 Jm(x) = \ —sinx and J-m(x) = \—cosx, ' V ТГХ ' V ПХ <2„х poo L-l/2(x) = —— / e-'2t2ncosBVx~t)dt one gets and e-t {2n+l sjj Compare these with F.1.4) to get alternative proofs of F.2.10) and F.2.11). We note a few elementary formulas that are useful: —L"(x) = -L"+|(x), F.2.16) dx n ?[x«^(x)]=(n+a)x«-1L«-1W, - F.2.17) ^ [e-xLan{x)} = -e-xLan+l{x), F.2.18) = (n + X)x<*-le-xLan-\(x). F.2.19)
6.2 Laguerre Polynomials 287 We now prove F.2.19); the others can be proved in a similar way. By F.2.2) and Kummer's transformation D.1.11), we have dxl J n\ dx )nd F a + dx и! dx (а + 1)„ ^ (л + а + !)*(-!)*(* +«)**4в-1 a (a + 1)» „_, ±1ха-1е-, F (_„ _ ! } n\ Formulas F.2.17) and F.2.18) can be written as integrals and then extended as fractional integrals. Formula F.2.17) extends to fx-tr-^-\F,{a;b;t)dt, F.2.20) o for Re д > 1. (See B.2.4).) Write this as when ft > a. Formula F.2.18) extends in a slightly different way. The extension is given by 1 f°° / (t - 1 f e~xLan(x) = — / (t - xf-^e-'L^dt, F.2.22) when ft > a. We give a proof below that uses F.2.21) and the orthogonality and the completeness of Lan (x). A proof of completeness is in Section 6.5. Observe that /•OO / Jo - г,Лг<1^'+1) Г *«г[ F.2.23)
288 6 Special Orthogonal Polynomials The orthogonality relation F.2.3) applied to F.2.23) implies By F.2.3), F.2.23), and F.2.24) for n = 0,1, Now the completeness of L" (t) gives F.2.22). The formula for the Poisson kernel for Laguerre polynomials is given by F.2.25) when \r\ < 1, a > —1, and /„ is the modified Bessel function of order a. A simple proof of F.2.25) is obtained by using the generating function F.2.4) and the derivative formula F.2.19). Write the left side of F.2.25) as k{x)r". F.2.26) To find a closed form for the inner sum, start with the generating-function formula OO ^L«+*(x)r" = е~хг/а-г)/A - r)a+k+l. я=0 Multiply both sides by xk+ae~x, take the &th derivative, and apply F.2.19). The result is я=0 1 dk Г k+a -x/d-г i i \n dk rn+k+a ( l> dxkX A -r)nTl\ + « +1;« +1; -*/(i - »•))• F-2.27)
6.2 Laguerre Polynomials 289 Apply Kummer's transformation D.1.11) to the 1F1 to see that the expression F.2.27) is equal to k\Lak(x/(l-r))e-x/(l-r\ Use this for the inner sum in F.2.26) to get n\Lan{x)Lan{y)r" 00 E n=0 The sum in the last expression can be written as i-r ~Jxyr r)). The last equation follows from the series expansion for the modified Bessel function Ia(x). This proves F.2.25). We can obtain the Hankel transform and its inverse from F.2.25). The argument given here can be made rigorous. See Wiener [1933, pp. 64-70] where the Fourier inversion formula is derived from the Poisson kernel for Hermite polynomials. Let Then F.2.25) can be written as H(x,y,t) := — 1 -t F.2.29) я=0
290 6 Special Orthogonal Polynomials Let /(x) be a sufficiently smooth function that dies away at infinity. Then /(x) has the Fourier-Laguerre expansion я=0 Multiply F.2.29) by /(y) and integrate to get /oo °° /(у)Я(х,у,г)^ = У>„(* r. /л я=0 Let t —>¦ e~ni to arrive at /oo f{y)emcIa{y/xye-*i'2)dy = я=0 =: g(x). F.2.30) Thus, /•oo «W=/ f(y)Ja{y/xJ)dy. F.2.31) Jo Now g(x) has a Fourier-Laguerre expansion that, by the definition of g(x), implies that poo /*oo / g(x№n(x)dx = (-1)" / f(x)irn(x)dx. Jo Jo By a derivation similar to that of F.2.31), we have / g{x)Ja(y/xy)dx = y2(-l)n Jo n=0 Jo n=0 n=0 = /(x). F.2.32) We can write F.2.31) and F.2.32) as the Hankel pair /•OO / f(y2)JAxy)ydy = g(x2), J° F.2.33) /oo v ' g(x2)Ja(xy)xdx = f(y2). This may be the place to point out that F.2.21) contains Sonine's first integral D.11.11) as a limiting case. This follows from the fact that lim n-aLan(x/n) = x-ff/2yaBVx). F.2.34)
6.2 Laguerre Polynomials 291 If in the Hankel inverse of Sonine's first integral D.11.12), we change x to x/t and, in the formula thus obtained, change t to \/s, we get /O 1 when Л > a > — 1. The Hankel inverse of this is Г(Л- for д < Л < 2д + 3/2. Write it as 1 ' -Pi,„2 iyL-M-1, = — / Г(Л - /л) Jx for-1 < /л <Л <2д+3/2. This is theanalog of F.2.22).Two analogs ofSonine's second integral are + y) = Y^Lak(x)Lfin_k{y) F.2.35) and L«+n@) Г (a - /3) F.2.36) Formula F.2.36) is due to Feldheim [1943]. Formula F.2.35) is an immediate consequence of the generating function F.2.4), and F.2.36) is proven by using the series representations of Laguerre polynomials, the value of the beta integral, and the Chu-Vandermonde sum. When у = 0, F.2.35) is equivalent to = у Г(п-к + р-а) This is an easy consequence of the generating-function formula F.2.4). The details are given in Section 7.1. This formula is equivalent to / Li{x)Lak{x)xae-xdx Jo t~"~uil F.2.38)
292 6 Special Orthogonal Polynomials This can be used to show that the Fourier-Laguerre expansion of xa~^Lf(x) in terms of L& (x) is given by the formula xae-xLak{x) + 1) „ p _x F.2.39) for a > (P — 1 )/2. To understand this condition on a and ft, needed for convergence, see Theorem 6.5.3 and the remark after that. Note that F.2.39) is the inversion of F.2.37). Just as the Sonine integral and its inversion can be used to solve dual integral equations, F.2.37) and F.2.39) help to solve a dual sequence equation involving Laguerre polynomials. Theorem 6.2.1 Let a, X, с be given, such that с > (Л — 2a — l)/2, а, Л > —1. Then ifan, bn are given (and are small enough) and if pec an= xcf(x)Lan(x)xae-xdx, n=O,l,...,N, Jo /•OO bn= f(x)L\{x)xxe-xdx, n = N + 1, N + 2,..., Jo and iffi — a + c, then ^ T{n + \) Proof. By F.2.37), we have ¦a) f° 1 - a) Jo However, by F.2.39) we have Г (A: - n + Л - Р)Г(п + ft + 1)Г(к + 1) Г (к - п + 1)Г(А, - Р)Г(п + 1)Г(к + k + l) k Jo Now use the Fourier-Laguerre expansion of f(x) to get F.2.40). This makes more than merely formal sense if the bn s are small enough. This proves the theorem. ¦
6.3 Jacobi Polynomials and Gram Determinants 293 Similarly, F.2.21) and F.2.22) can be used to solve dual-series equations in- involving Laguerre polynomials. Theorem 6.2.2 Let a > 8 > — 1, and let a < min(<5 + 1, 28 + 1). Suppose that /(*) = and Then n\ an = Г(п+8 g(x) = n=0 -а x < у < oo. a(x - t)s-adt]Lsn(x)e-*dx J /OO 1 Г /-0O -I / g(t)e-'(t-xr-s-4t\Lsn(x)xsdx Г (а -8) \_JX J 6.3 Jacobi Polynomials and Gram Determinants If rb Cn= Xn J a da{x) are the moments with respect to a given distribution da(x), then the polynomials Pn(x) = CO Cl Cn-1 1 Cl C2 Cn X C3 Cn+1 X2 с„ • ¦ • cn+i ¦ • • c2n-i x" F.3.1) are orthogonal with respect to da(x). It appears at first sight that it would be difficult to obtain a more useful representation of the polynomials from the determinant. Wilson [1978, 1991] showed that, in many interesting cases, it is possible and quite easy to get a hypergeometric representation of pn (x) from the determinant. In this section, we give his derivation of the hypergeometric form of the Jacobi polynomials. We start with the following lemma, which generalizes the result contained in F.3.1).
294 6 Special Orthogonal Polynomials Lemma 6.3.1 Suppose {</>n}o° is a sequence of independent functions. The se- sequence of functions {pn(x)\ given by pn(x) = Cn AH.o Дл-1,0 Дл-1,1 ¦¦¦ Дл-1,л F.3.2) where ,-,_,- = / (j>i(x)(l>j(x)da(x), Ja k=0 and where Cn is a constant, satisfies the relation / pn(x№m(x)da(x) = 0, form<n. Ja (Here a(x) need not be a positive measure but all the integrals are assumed to exist.) Proof. Expand the determinant F.3.2) as pn(x) = Cn Then fb " F.3.3) If от < n — 1, then the right-hand side of F.3.3) represents a determinant in which two rows are identical; the value of such a determinant is zero. This proves the lemma. ¦ Corollary 6.3.2 Suppose {</>n}o° and {^п}о° are two sequences. Define pn(x) as in F.3.2) with rb jda(x). lU — / &Ф]С Ja Then f" / Pn Ja (x)^m(x)da(x) = 0, form <n — 1. We now specialize to the situation where the weight function is a'(x) = A — x)a A + x)P over the interval (—1, 1). Our aim is to find pn(x). For this we choose
6.3 Jacobi Polynomials and Gram Determinants 295 = A— x)k andi/aW = A +x)k. Other choices of ф^ and ^ as polynomials of exact degree к would work, but these make the calculations simpler. In this case l ( f"\Cl V{a F.3.4) Theorem 6.3.3 The polynomial F.3.2) with /x,j given by F.3.4) is a constant multiple of the Jacobi polynomial n\ -n,n +a + ft + 1 1-х a + l ' 2 F.3.5) Proof. From each row, j = 0,..., и — 1, pull out the first factor in F.3.4) and absorb it in the constant Cn. We will continue to call it С„, so it may differ from one line to the next. Then factor out 2J (a + 1)_,- from each of the columns, j = 0,1,... ,n. The resulting determinant is Cn Мод Ai.o Ai.i An-1,0 An-1,1 MO.n Аи-1,и where and <j>j(x) = a-xy 2J(\+a)j' Now expand about the last row to obtain the expression 1 -x F.3.6) Set A = a + p + 2. The problem now is to compute the determinant 1 A(A,n,k) =det - , A(A,0,0) = l. Observe that the first column in this determinant consists of Is when к ф 0. We shall consider this case first. Subtract row n — 1 from row n, then row и — 2 from row и — 1, and so on. This makes the first column zero except for the first entry,
296 6 Special Orthogonal Polynomials which is one. Expand the determinant along the first column to get Д(А,иД) = det But since 1 1 1 (A + i + i (A + i)j+i (A + i)j+2 -O' + D = -I/ + 1) (A + i)j+2 (A + /)(A + i + 1)(A + i + 2),' [(A + j) - (A + i + j + 1)] then A(A,n,k) = det -0" + 1) (A + i)(A + i + 1)(A + i + 2)j J o<.-<n-2 0<j<n-l Now —(j + 1) is a common factor in the yth column and 1/f (A + /)(A + i + 1)] is common in the /th row and so these factors can be taken outside the determinant. Thus A(A,n,k) = n-\ c-d-1 По- y=0 ln-2 i=0 ¦2,и-1Д-2) "\\ "I, А(А + 2,п-1Д-1). Repeat this process /c times to get А(А,иД) k-l s=0 (A 2ik, и-ik,0). F.3.7) Recall that we had assumed к ф 0. The above equations holds trivially for к = О, provided we assume that the product in the braces is one. In the determinant A (A + 2k, n — k, 0), the column index j is not 0 so that j goes from 1 to n — k. Therefore, =det = det 1 (A + 2/c + /)(A + 2k + i A(A + 2k+ l,n-k,n-k) (A + 2k)n-k '
6.4 Generating Functions for Jacobi Polynomials 297 Combine this with F.3.7). The result after some rearrangement is '"' k\(n-k)\ The product depends only on n so it can be absorbed in С„. Substitute this value of the determinant in F.3.6) to arrive at « + Dt k\{n-k)\ Choose Cn = (a + 1)„. This gives the hypergeometric representation of P^a'^(x) and the theorem is proved. ¦ An immediate consequence of the hypergeometric representation F.3.5) of Jacobi polynomials is the formula for the derivative: {P^(X)} (л + a + P + 1)Р(х). F.3.8) We have seen in Chapter 3 that iFiia, b\ c; x) is a solution of d2y dy x(l - x)-l + {c - (a + b + \)x}-f- - aby = 0. dxz dx Therefore, и = Р^а^\х) is a solution of the differential equation A - x2)u" + {/3-a-(a + l3+ 2)x\u' + n(n + a + fi + \)u = 0. F.3.9) Another straightforward result from F.3.5) is that the coefficient of xn in is 2"n\ F.3.10) 6.4 Generating Functions for Jacobi Polynomials Generating functions are of great importance in the theory of orthogonal polynomi- polynomials. We have already used them to study Hermite and Laguerre polynomials. In this chapter, we derive Jacobi's form of the generating function by two methods. One is due to Jacobi and uses Lagrange inversion; the other is due to Hermite. A different generating function, particularly useful in studying ultraspherical polynomials, is also derived. One way of finding the generating function for Jacobi polynomials is to use Lagrange inversion. We use the following lemma. Its derivation and some appli- applications are given in Appendix E.
298 6 Special Orthogonal Polynomials Lemma 6.4.1 Suppose that ф (у) is analytic in a neighborhood ofy = x, oo r — = У^an(y - x)" with ax ф 0, F.4.1) п=1 and f is analytic in a neighborhood of y = x. Then f(y) can be expanded in powers of r: 00 r" d"~l fiy) = fix) + Х^^Т(ЛХ)«КХ))Я)- F.4.2) n=l Theorem 6.4.2 The generating function for the Jacobi polynomials P^a^\x) is given by F(x, r) = 2a+fiR-l(l -r + R)-a(l +r + R)-0, F.4.3) when tf = (l -2xr + r2)l/2. Proof. Take ф(у) = (у2 - l)/2 in F.4.1). Then 1 A-2ХГ + Г2I/2 1 R y = = . r r r r The derivative of F.4.2) with respect to x is For f'{x) — A - x)a{\ + x)P this becomes R 00 rn d" + Е-7УТ«1 -^d +^(^2- D72"). ^—' и! ax" n=l Use Rodrigues's formula B.5.13'), A -x)a{\ +xfP^\x) - (-^-?-{(l -x)n+a(\ +x)"+P}, to arrive at A short calculation shows that \-y 2 and 1-х \~r + R l+x \+r + R This completes the proof of the theorem. (See Jacobi [1859].) ¦
6.4 Generating Functions for Jacobi Polynomials 299 If one were to take the definition of Pjf •^) (x) as the polynomials that satisfy F(x,r) = 2a+liR-l{\-r + R)~a(\ + r , F-4-4) then one would face the problem of proving orthogonality. When a = fi = 0, we have the Legendre polynomials Р„(х). For these polynomials Legendre observed that 1 л/rs I — *Jrs -L dx - 2xr + r2v 1 - 2xs + s2 Pn(x)Pm(x)rnsmdx. This implies orthogonality. Chebyshev [1870] applied the same method to the general Jacobi polynomials. His proof is a tour de force and shows his skill in handling formulas. A simpler proof was given by Hermite [1890]. He treated the Legendre case, but his argument extends in an obvious way to Jacobi polyno- polynomials. Theorem 6.4.3 If{P^a^}(x)} is given by F.4.4), then the sequence is orthogonal with respect to the weight function A — x)a{\ + x)& and conversely. Proof. Consider the integral /•l /m = / xmF(x,r){\ -x)a{\+xfdx " / xmP^\x)(\-x)a(\+xfdx. n=0 Set or A -2xr +r2I/2= 1 -ry Then У + r(l-y2)
300 6 Special Orthogonal Polynomials Clearly, Im is a polynomial in r of degree m. So we must have /•i / хтР^\х)(\-х)а{\+х)Ых = 0, п = т + 1,т + 2, This is equivalent to -x)a(\ +xfdx = 0, тфп. -l For the converse, consider lm again. The same change of variables gives -r + l-/ This is clearly a polynomial of degree m in r if F(x, r) = C(l - r + 1 - r>)-a(l + r + 1 - ry)^(l - ry)~\ where С is a constant. To find C, take x = 1 to get n=0 n=0 = A - r)-"'1 = C2-a~P{\ - r)-a~\ огС = 2а+Р. m Remark 6.4.1 The last theorem gives another way of finding the polynomials orthogonal with respect to the beta distribution A — x)a(\ +x)$dx. These polyno- polynomials must have F(x, r) in F.4.4) as their generating function. Then by Lagrange inversion the polynomials must be -x)~a{\ +x)-^{(l -x)n+a(l The ultraspherical polynomials are the important subclass of the Jacobi poly- polynomials р(а'Р)(х), when a = /J. These are defined by a different normalization than P(a'a\x), giving a simpler generating function than F.4.4) when a = fi. The new generating function is useful in obtaining properties of ultraspherical polynomials. To motivate this generating function consider the Poisson kernel oo ? P^p\x)P^0)(y)rk/hk (a, ft > -1), F.4.5) k=0
6.4 Generating Functions for Jacobi Polynomials 301 where = I [P^\x)f{\ - x)aA + xfdx 1)Г(к Bk + a + p + 1)Г(к + a + p + 1)Г(к + 1) Take у = 1 in F.4.5) to get the sum 00 ">k +a + в + l)(a + В + I) Observe that the last series, except for a factor, is obtained from by differentiation. Rewrite this series with the hypergeometric expression for P%"'P)(.x) in powers of A + x)/2. We have (n-k)\k\(l+p)k _ v^ A +<* + P), /1 (A + « + /3)/2)t(B + « + P)/2)k2k(x + \frk We used the binomial theorem in this computation, along with the fact that (aJk = 22k(a/2)k((a The final result is + 1}« F.4.7)
302 6 Special Orthogonal Polynomials This is the other generating function we were looking for. When a = /3, this gives Ba + l)nP^a\x)rn ^ 1 / 2(l+x)r A+J r2y°'-L2. F.4.8) It is reasonable to define polynomials C«W := а 12поъ pWW-^Oc) F.4-9) with the generating function oo A -2xr + г2Гх = ]Pc*(Jt)r". F.4.10) n=0 When к = 0, C\ (x) = 0. So the case к = 0 has to be considered as к -*¦ 0 in the formulas involving C%(x). The polynomials C^(x) are called ultraspherical polynomials or Gegenbauer polynomials. They are orthogonal with respect to the weight function A - х2)л"A/2) when к > -A/2). An important expression for С„(х) follows immediately from the generating function F.4.10). Let x = cos в, factor the left-hand side using 1 -2rcos0+r2 = A ~rem)(\ -re~w), expand by the binomial theorem, and equate the coefficients of r". The result is The second equation is true because С„ (cos 0) is real and the real part of e'(" is cos(n - 2k)9. When к > 0, F.4.11) implies Another hypergeometric representation for C*(x) is obtained by taking a different factorization: 1+r2 (A.)* Bxr)k k\ (l+r2)*+*
6.4 Generating Functions for Jacobi Polynomials 303 ifc! и! n,k n,k oo Thus, F.4.12) Even though ultraspherical polynomials are special cases of Jacobi polynomials, their great importance compels us to note and prove some of their properties. When к —> 0, we get the Chebyshev polynomials of the first kind: 1, n=0, F.4.13) 2Г„(х), и = 1,2,..., Ck(x\ When к ->¦ oo, we have The Rodrigues formula B.5.13') takes the form к^-'г)^"г- FА14) and the formulas for the derivative and the three-term recurrence relation are —— Cn(x) = 2kCn^i (x) F.4.15) and n&n{x) = 2(n+k- \)xCxn_y{x) - (л + 2k - 2)Ckn_2(x) F.4.16) for и > 2 and C#(jt) = 1, C\(x) = 2kx. It should also be noted that uF) — (sin#)xC^(cos#) satisfies the differential equation = 0. F.4.17) sin
304 6 Special Orthogonal Polynomials A proof of F.4.15) from the generating function for С„{х) is as follows: The derivative of F.4.10) with respect to x gives oo . У —Cx(x)rn = 2Xr(l - 2xr + r2y^+» ^—' ax n=0 n=0 and hence F.4.15). To get F.4.16) take the derivative of F.4.10) with respect to r. The result is oo 2X(jc - r)(l - 2xr + r2Yk = A - 2xr + r2) ^2 пС^(х)г"'\ n=0 The left side is 2X(x - r) Y^Lo Cnix)r"- Relation F.4.16) follows on equating the coefficients of rn on each side. We now state a property of the relative extrema of ultraspherical polynomials. Let y^l, к = 1,..., n — 1 denote the zeros of the derivative of Р^а'а)(х). Order the zeros so that ук,„(а) < Ук-\Аа) and set Уо,п(а) = Ь Уп,п(а) = -1- Define )A), * = 0,1,...,я. F.4.18) These numbers satisfy the inequality fiktn(a) </jLktll-i(a), a > -1/2, к = 1, 2,..., n - 1, и = 1,2,..., F.4.19) but the inequality is reversed for — 1 < a < —1/2. For a = —1/2,1лкп(а) = 1. For а = 0, F.4.19) was observed by Todd [1950] after studying graphs of Legendre polynomials. Todd's conjecture was proved by Szego [1950]. We prove F.4.19) with a = 0 by an argument that generalizes to give F.4.19). It is left to the reader to prove the general case. We begin by stating some necessary results about Jacobi polynomials. The following two identities follow directly from Remark 5.6.3: (n + a + 1)Р„("'«(х) - (л Bn+a + 6+2)(l -x) F.4.20) - (n + р)Р?Хш(х). F.4.21) For the reader's convenience, we again note that (r/'+1) F.4.22)
6.4 Generating Functions for Jacobi Polynomials 305 and pW\-x) = (-l)nP^a)(x). F.4.23) Following convention, we denote the Legendre polynomial Pn@0) (jc) by Pn (jc). By F.4.20), F.4.23), and then F.4.21), we have [Pn(x)]2 - [Pn+i(x)]2 = [Pn(x) - Рй+1(х)][Р„(х) + P = (l-jc)PnA'O)(Jc)(l+Jc)Pn(O-1)(x) 2 2 (n 2(и 2(л where the last step follows from F.4.22). Thus (I — x2) \ d ~[2 f + ' \ /(x) := [Рп(х)У + j—~^ \—Pn(x) (и + IJ The zeros of /'(•«) contain the zeros of P^(x) = Pn(i'11)(x)/2 as well as the zeros of Pj,+i(x) = Р^Л)(х)/2. Since /(x) is of degree In, we must have By comparing coefficients of highest powers, we see that Because of the separation of the zeros of PnA1)(x) and P^ (x), they take opposite signs for ykttl <x < Ук,п+1- Thus /(x) decreases in this interval and F.4.19) is proved for a = 0. It is also easy to prove that й,»<й-1,- * = l,2,...,lii/2J, F.4.24) where ixk n = /x,t,n@). Consider the function /(x) defined in Exercise 40 with a = p = 0. Then , A -x2)[P'(x)l2 /(x) = [Pn(x)]2 + П " П . F.4.25) n(n + 1)
306 6 Special Orthogonal Polynomials Compute f'(x) and check that /(x) is increasing for 0 < x < 1. This implies F.4.24). Szasz [1950] proved F.4.19). His argument also works for the case — 1 < a < — 1/2, though he did not make this observation. Later he proved a similar result for Hermite functions. See Szasz [1951]. Exercise 10 contains a statement of his result. We close this section with the observation that the Hermite and Laguerre poly- polynomials are limits of Jacobi polynomials. There are a number of ways to obtain these limits. One uses a generating function. Observe that and we conclude that lim X-"/2C^(x/X) = Hn{x)/n\. F.4.26) A->0O Another method is to use the hypergeometric representations ш ц± ± ( /3->oo " - " ' ?->oo n\ ^ (a - F.4.27) This means that it is possible to derive the properties of Laguerre and Hermite polynomials from those of Jacobi polynomials. However, it is usually easier to deal with these polynomials directly as we did in Sections 6.1 and 6.2. 6.5 Completeness of Orthogonal Polynomials The problem of expanding an arbitrary function in terms of orthogonal polynomials was briefly considered in the previous chapter. Here we consider expansion by means of Jacobi, Laguerre, and Hermite polynomials. The latter two are the most interesting as they involve integration over infinite intervals. Just as in the case of Fourier series, the result can be nicely stated for functions in L2(a, b), that is, the Hilbert space of square integrable functions. We reproduce Hewitt's [ 1954] proof of the completeness of the Hermite and Laguerre polynomials. This proof depends on the uniqueness of Fourier transforms of integrable functions. For completeness, we give a complex variables proof of the latter result due to Bak and Newman [1982, p. 228]. Theorem 6.5.1 If f is integrable on (—oo, oo) and if /•OO fix) = / f(t)eix'dt = 0, J —oo then / = 0 almost everywhere.
6.5 Completeness of Orthogonal Polynomials 307 Proof. Clearly, f f(t)eix('-a)dt = 0. J — 00 So, if a is real, then ra /*oo / f{t)eix{'-a)dt = - / f{t)eixi'-a)dt. F.5.1) J — oo </a Define two functions of z — x + ry: L(z) = / f(t)e^'-a)dt, R(z) = - / f(t)eU('-a)dt. J —oo J a It is clear that L(z) exists for Im z < 0 and is analytic in Im z < 0. Similarly, R(z) exists for Im z > 0 and is analytic in Im z > 0. Moreover, by the dominated convergence theorem and F.5.1), we have lim L(x + iy) = I f(t)eix('~a)dt = lim R(x + iy). У-*0 J-oo У^° This implies that | ltf(z), Imz>0, is a bounded entire function. By Liouville's theorem, F(z) is a constant. Again, by the dominated convergence theorem, /•OO lim F(iy) = lim - / f(t)e~y('~a)dt = 0. ¦y^OO J^OO _/a Thus F(z) = 0 and, in particular, F@) = 0. This means that for all real a (since a is arbitrary) / f(t)dt = O. J-oo This implies that / = 0 almost everywhere and the theorem is proved. ¦ Let p{t) denote a square integrable function that dies out exponentially at infi- infinity, that is, p(t) = 0(e~aW) for some a > 0 as \t \ ->¦ oo. F.5.2) Theorem 6.5.2 Let -oo < a < b < oo. Let p(t) € L2(a, b), with p(t) different from zero almost everywhere, and let p{t) satisfy F.5.2), if a = —oo orb — oo. Iffe L2(a,b)and / tnf(t)p(t)dt = O forn = 0,1,2,... Ja then / = 0 almost everywhere.
308 6 Special Orthogonal Polynomials Proof. Let z = x + iy and define rb F{z) = / elzlp(t)f(t)dt. Ja If —oo < a < b < oo, then F is an entire function; otherwise, F is analytic in —a < у < a. Therefore, F<">(Z) = i» f e^tnp(t)f(t)dt. Ja By hypothesis F(n)@) = 0 for n = 0, 1, 2,.... This implies F(z) s 0 in -a < у < a. In particular, fb ¦ F{x) = / eMp{t)f(t)dt = 0. Ja Since p(t)f(t) is integrable on (a, b), the uniqueness of the Fourier transform gives p(t)f(t) = 0. Since p(t) is different from zero almost everywhere, f(t) = 0 almost everywhere. This proves the theorem. ¦ For the next theorem, let da(x) denote either xae~xdx or e~*2dx. In the former case (a,b) = @, oo) and in the latter (a, b) = {—oo, oo).Let0n denote either the nth Laguerre or Hermite polynomial, normalized so that f" 2dax -1 Ja For any function / such that fb / f(xJda(x) < oo, Ja set rb c« = / Ja Theorem 6.5.3 Suppose sn — Y?k=o сьФк- Then b oo /b n=0 and fb 2 lim / [f(x)-sn(x)]2da(x) = 0.
6.5 Completeness of Orthogonal Polynomials 309 Proof. For n > m, it is clear that m+l / m+1 Also, implies that k=0 \ck\z < f f(x)zda(x). F.5.4) Ja It follows from F.5.3) and F.5.4) that {.?„(.*)} is a Cauchy sequence in L2a(a, b). (Here L2a is the set of all square integrable functions with respect to the measure da.) There is therefore a function g e L2a such that lim / b 2 [g(x)-sn(x)fda(x)=0. F.5.5) Now for n > k, rb 1 Ja g(xL>k(x)dot(x) - ck = f [gW - sn(x)](t>k(x)da(x) < / [g(x)-sn(x)fda(x), J a J a by the Cauchy-Schwartz inequality and the fact that the norm of фк is 1. Let n ->• oo to get f" ck f" = J a By Theorem 6.5.2, it follows that f = g almost everywhere. By F.5.5), we arrive at /b rb s2n(x)da{x) = / f(xJda(x), J a so that rb \2, 2_j\Cn\ = I f(xYda{x). n=0 This proves the theorem.
310 6 Special Orthogonal Polynomials Remark In the last theorem, the series ^o° с"Фп converged to / in the L2 sense. Pointwise convergence can be obtained, for example, by assuming that / is smooth or piecewise smooth. In the latter case the series converges to ^[f(x + 0) + f(x — 0)], when x is a point of discontinuity. We can use Theorem 6.5.2 to prove the following result: Suppose {pn(x)} is a sequence of polynomials orthogonal with respect to the weight function w(x) = O(e~c^) on @, oo). Then the sequence {pn(x)} is complete. We can prove this by taking another sequence {qn(x)} orthogonal with respect to xw(x) on @, oo). Define a sequence {rn(x)} by Г2п{х) = pn(x2), f2n+i(x) = xqn(x ). This sequence is orthogonal with respect to |jc|i«(jc2) on (—oo, oo). This weight function satisfies the conditions for Theorem 6.5.2. Hence the sequence {rn (x)} is complete, which implies the completeness of {r2n(x)} for even polynomials. Thus the result is proved. Compare this result with the comments after Corollary 5.7.6. 6.6 Asymptotic Behavior of P^ix) for Large n Suppose f(r) = YlT=oa"r" *s an analytic function in a neighborhood of zero with only a finite number of singularities on the circle of convergence. Suppose, for convenience, that the radius of convergence is one. We would like to have an estimate of an for large n. Let us see how an can be found approximately by knowing the singularities of /. Assume that the singularities are poles that we take to be of order one for simplicity. Let the singular part of / on the unit circle be a\ ot2 oik S= I-far + I-far +'"+ I-far' These functions can be expanded by the binomial theorem, so we know the coeffi- coefficient of r", which we call bn. Now / — S = ?(а„ — bn)r" has a larger radius of convergence than /. This means that (a, - bn)r" = o(l) for some r > 1, or an-bn= o{r~n). Since bn is known, we have an estimate of an. This is the idea behind what is known as Darboux's method for finding the asymptotic behavior of an. The application to orthogonal polynomials is possible because the generating function has the nth
6.6 Asymptotic Behavior of p("-P> (x) for Large n 311 polynomial as a coefficient of rn. In this case, there is also a need to consider algebraic singularities. Take as an example A — 2xr + r2)~1/2, the generating function for the Legendre polynomials. With x = cos в, we have fir) = A - 2xr + r2)-1'2 = A - rewr1/2il - re-wr1/2. The singularities are at r = ew and r = e~'e. In the neighborhood of r = e'e, the behavior of fir) is like 8 = A - е2>в. In this case fir) — gir) = /г(г)A — re~w)l/2, where/г (r) is continuous at ew. So f — g still has an algebraic singularity, but it is now continuous. Thus it is possible to say something about an — bn. The precise result is contained in the theorem below. Before stating it, we consider the case If X > 1, then subtracting a term like g does not make f — g continuous. More terms have to be subtracted. These can be determined by expanding A — re'e)~x in powers of A — re~w) and vice versa. We have, about r = e'e, f{r) - A - If for an integer n, n — к > 0, we can take Theorem 6.6.1 Let /(z) = X^o° anz" ^e anafytic in \z\ < r, r < oo, and have a finite number of singularities on \z\ = r. Assume that giz) = ^o° bnz" is also analytic in\z\ < r and that f — g is continuous on\z\ = r. Then an—bn=o(y~n) as n —> oo. Proof. By Cauchy's theorem and the hypothesis on / - g, 1 /* fGЛ — рGЛ 1 С 2ni Jk]=r zn+l 2nrn Jo The Riemann-Lebesgue lemma for Fourier series implies that the last integral tends to zero as n ->• 00. This proves the theorem. ¦
312 6 Special Orthogonal Polynomials In fact, in the above theorem it is not necessary to assume continuity of / — g on \z\ = r. The same conclusion can be obtained by assuming that / — g has a finite number of singularities on \z\ =r and at each singularity Zj, say, where <ry is a positive constant. For these refinements and further examples, see Olver [1974, §8.9] or Szego [1975, §8.4]. The generating function for Pjf-^ix) is -r + \/l-2rx+r2)-a(l +r + v/l-2rjc+r2)-/i(l - 2rx + r2y1/2. Take x = cos в. The above function has singularities at r = е±1в. In the neighbor- neighborhood of r = ew, the generating function behaves as шГа- = 2^A - ешГа-Ц1 +e This implies that Write where Therefore, pic \A\ \ri\ '¦«(c A 2 :os0) = 0 + — I l)/2 [2a the v< conjugate] ¦ 1 l-jcr+J( A/2) n\ n "I =|/ Observe that the denominator of \A\2 is A — jc)"A + x)&, except for the factor 2"+^+1(l — x2I/2. This is exactly the weight function for Jacobi polynomials. This is not a coincidence. To state the theorem in a form due to Nevai [1979, pp. 141-143], suppose Pn+i(x) = (Anx + Bn)Pn{x) - CnPn_i(x), n = 0,1,..., P_1=0, P0 = l, and An^AnCn > 0 forn = 1,2,....
6.7 Integral Representations of Jacobi Polynomials 313 Theorem 6.6.2 If the series Bn , ( Cn+i \ 8 I — + -— -- > converges, i A.. \ A.. A.. , 1 / 7 I n=0 then dxjf can be expressed in the form Here ir'(x) is continuous and positive in (—8, 8), supp {\jr') = (—<5, 8), and i/fj (jc) is a step function constant in (—<5, S). Furthermore, the limiting relation 2 holds almost everywhere in supp(di/r). The singularity in the generating function for Laguerre polynomials is more complex. This makes it difficult to apply Darboux's method. Fejer, however, has shown how this can be done. See Szego [1975, §§8.2-8.3]. 6.7 Integral Representations of Jacobi Polynomials Integral representations for hypergeometric functions imply the existence of such representations for Jacobi polynomials. A few important and useful integral repre- representations are given in this section. Recall Bateman's [1909] fractional integral formula: ) Г(с)Г(д) Jo \ + ) Г()Г(д) J \ с ) where Re с > 0, Re /x > 0, and |jc| < 1, if the series is infinite. This formula is a particular case of B.2.4) and we use it to prove the next theorem, which is called the Dirichlet-Mehler formula. Theorem 6.7.1 For 0 < в < л, the nth Legendre polynomial is given by 2 f cos(n + iH _ 2 Г sin(n + \)ф BCOS0-2COS0I/2 Proof Takea = -n,b = n + \,c = /x = 1/2, д: = sin2 (в /2), and t = sin2@/2) in Bateman's formula. This gives Р„(СО5в) _ 1 fe ^ Jo (sin2@/2) - sin2@/2))!/2
314 6 Special Orthogonal Polynomials The 2 F\ in the integral is the hypergeometric form of the fourth Chebyshev poly- polynomial given by cos(n + i)</> cos(</>/2) To get the other form of the integral change в to n — в and ф to л — ф. Then use the fact that Pn{-x) = {-\)nPn{x). ¦ One way to use this theorem is to show, as Fejer did, that the sum of Legendre polynomials, X^=o Pk(x), is greater than or equal to 0 for 0 < x < 1, since sin(w + l/2H ; - • PtA) s For the reference to Fejer and other related results, see Askey [1975, Lecture 3]. Theorem 6.7.2 For yc > 0, -1 < x < 1, (c) (d) Г(а + 1)Г(д) Л A + у)п+а+1Л+1 х{у-хУ~хс1у, a>-\; Г* A+ /_! A - у)« > -1. Proof. To obtain (a) use the hypergeometric representation of the Jacobi poly- polynomials and apply Bateman's formula with an appropriate change of variables, (b) now follows from (a). Apply Р^^{-х) = (-l)nP^'a)(x).
6.7 Integral Representations of Jacobi Polynomials 315 Finally, (c) and (d) are derived from (a) and (b) respectively by an application of the Pfaff transformation: Note that when this is applied to Bateman's formula we get The theorem is proved. We also note that Bateman's formula and the results in the theorem are all particular cases of Theorem 2.9.1. ¦ An important integral of Feldheim [1963] and Vilenkin [1958] can be obtained from (c) by using the quadratic transformation and and taking /3 = ±1/2. The result is Cvn(cos9) sin2"-1 в Cvn{\) 2r(v + i) Гв 2, [cosV-cos2^-^1 C^(cos0) Ф' 7Г 1 < -, V > > --. A change of variables gives Corollary 6.7.3 Forv > k> -1/2, 0 < в < ж CvJcos0) 2f(v + i) fn'2 ,. , ,. , 99 -!L^ = —, ^-—^ / sin21 ^cos2"-2^1 ф[\ - sin2 в cos2 ф]п Cvn{\) T(k + \)T{v-k) Jo C^(cos6»(l -sin2 в cos2 </>Г1/2)
316 6 Special Orthogonal Polynomials Here Cn°(cos0) C*(cos0) 2= lim —^ = cosn6>. с*A) The final theorem of this section is the Laplace integral representation for ultra- spherical polynomials. It is due to Gegenbauer [1875]. Theorem 6.7.4 For X > 0, C*(cos 6>) = 22X_]_,/W,442 / [cos в + i sin в cos ф]п sir" Proof. Recall that Rewrite this using the beta integral. Then k=0 Set у = sin2 i/f to get 2Г(п + 2к) Г'2 / i,w,442 / fc°s 0 + i sin 0(cos2 ^ - sin2 n-\\ W) Jo x sin21 f cos21 fdf. Now let ф = 2i/f to get the result in the theorem. ¦ 6.8 Linearization of Products of Orthogonal Polynomials The addition theorem for cosines implies the formula cos тв cos пв = \ cos(n + т)в + \ cos(n — т)в. In the previous chapter, we noted that this result pertains to Chebyshev polynomi- polynomials of the first kind, /)(~1/2'~1/2)(x), where x = cos#. This is called a linearization formula because it gives a product of two polynomials as a linear combination of
6.8 Linearization of Products of Orthogonal Polynomials 317 other polynomials of the same kind. More generally, given a sequence of polyno- polynomials {pn{x)} one would like to know something about the coefficients a{k, m, n) in m+n Pm(x)pn(x) = ^2a(k,m,n)pk(x). F.8.1) /t=0 If the pn(x) are orthogonal with respect to a distribution da(x), then 1 r a{k,m,ri) = — pm(x)pn(x)pk(x)da(x). F.8.2) "k Ji Thus the problem of the evaluation of the integral of the product of three orthogonal polynomials of the same kind is equivalent to the linearization problem. As another example of a linerization formula, recall the identity sin(n + 1N» sin(w + 1N» m^) sin# sin 6* -^ sin 6* k=0 This comes from the addition formula for sines. The addition formula is contained in an important special case of F.8.1), that is, m n m+n Л л — А whenx = e'°. One way of obtaining linearization formulas would be to look for those poly- polynomials for which the integral F.8.2) can be computed. A simpler integral would involve only the product of two polynomials; but this would yield the orthogonality relation. As we have seen, using the generating function is one way of obtaining orthogonality in some cases. The simplest generating function is for a Hermite polynomial, since it involves only the exponential function, which can be multi- multiplied by itself without resulting in something very complicated. For example, to get orthogonality, note that ~ dx _oo-'—' mini 00 m,n roo = 1 e2"-r2+2*°- J — oo [¦OO J — CO S2 2 x dx -oo V = V^r2" Therefore, 2 Hm(x)Hn(x)e-x dx =
318 6 Special Orthogonal Polynomials Similarly, to find the integral of the product of three Hermite polynomials, consider t\m\n\ /•CO _ / 2xr-r2+2xs-s2+2xt-t J — oo -'2-*2dx COO e-(x-r-s-tJ <ya+b+c ~a+b fb+c „a+c u а\Ъ\с\ ' Й,О,С This shows that He(x)Hm(x)Hn(x)e~x2dx = (e+~_n, (m+n-e'\]~G+i-m\ ' F-8-3) when ?+m + n is even and the sum of any two of ?, m, n is not less than the third. In all other cases the integral is zero. Theorem 6.8.1 min(m,rt) Hm(x)Hn(x)= J2 Proof. This follows from the integral formula F.8.3) and the orthogonality rela- relation for Hermite polynomials. ¦ Remark 6.8.1 An important feature of the coefficients in the linearization formula is their positivity. This property is shared by the integral F.8.3). At this point we have linearization formulas for Р^/2л/2)(х), .Pn(~1/2~1/2H0, and the Hermite polynomials. Symmetric Jacobi polynomials are ultraspherical or Gegenbauer polynomials and the Hermite polynomials are limiting cases, since = lim This suggests the possibility of the existence of a linearization formula for the Gegenbauer polynomials. Of course, there are linearization formulas for f^1/2.-*/2) x(x) and pt~l/2'l/2){x), which are essentially the Chebyshev polynomials of the third and fourth kind, but these are related to P^I2'll2\x) and Р^^~^2){х) by quadratic transformations. The idea of using the generating function A — 2xr+r2)~x to obtain a lineariza- linearization formula for Gegenbauer polynomials does not appear very hopeful, because
6.8 Linearization of Products of Orthogonal Polynomials 319 the product of three of these does not seem tractable. One useful idea, due to Bailey [1933], is to express Cx(x) as a hypergeometric series and then use the Whipple transformation for a very well poised yFe to handle the product C^(x)Cx(x). The resulting linearization formula is due to Dougall [1919]. It is interesting to note that a more general result was known to Rogers [1895]. See formula A0.11.10). Theorem 6.8.2 min(m,n) Ckm(x)Cx(x) = J2 a(k,m,n)Cxm+n_2k{x), F.8.4) *=o when (m + n + X- 2k){X)k{X)m_k(X)n-k{2X)m+n-k a(k, от, n) = (от + n + X - k)k\(m - k)\{n - k)\{X)m+n.k (m+n-2k)\ BX)m+n-2k Proof. Recall that ^() F.8.5) n\ \\ -n - X Apply Euler's transformation (Theorem 2.2.5) to get ~ *, 1 " 2A. - Л! e(ley2FA от! \ \—X — m Multiply the two equations above to obtain . F.8.6) \X)m(X)n ^i(jn+nH ^ -2i0\l m\n\ (-n)k(X)k 2Ш ^ A-Я)Д1 -2X-m)j 2jW \ ^ !AЯ) ' f^ A -X-n)kk\ When s = j + k, the two sums can be rewritten as [-2k- m)s _2sW ^ (~n)k(X)k(m +X-s)k (l-X-m)s в f^k\(l-X-n)k(X-s)kBX E(l -A),(l - lk-m)s j>0 (\-X-m)s 4 3Vl -X-n,X-s,2X -n,X,-s,m+X-s ^\ _2sW F.8.8)
320 6 Special Orthogonal Polynomials This 4F3 is balanced. Recall Whipple's formula (Theorem 2.4.5), which transforms a balanced 4F$ to a very well poised jF^: a + l-b-c,d,e,-s \ (a + 1 - d)s(a + 1 - e)s l-c,d+e-a-s' ) (a + l)s(a + 1 - d - e)s a,l+a/2,b,c,d,e,-s Л a/2, a + \ -b,a + l -c,a + l-d,a + l -e, a + l+s' )' when s — 0, 1, 2, Take a — —k—m—n,b = -m, с = l—2X — m — n + s, d — k, and e = —n. Then F.8.8) is transformed to - 2k - m - n)se ~1M s\{l-k-m-n)s {-k-m-n)k{\-{k + m+n)/2)k{-m)k f k\(-(k+m+n)/2)k(l-k-n)k A-2А.-Ш-И+ s)k(k)k(-n)k(-s)k (k — s)k(l —2k — m — n)k{\ — к — m)k{\ — к — m — n + s)k Reverse the order of summation, set s = к + ?, and simplify. Put this in F.8.7) and use F.8.6) for the inner sum to get an identity that reduces to Dougall's identity F.8.4). ¦ The limit к —> 0 gives the identity for cos тв cos пв and к = 1 is the identity for sin(w + 1)9 sin(n + 1)9. When к -> оо, Dougall's identity reduces to The next corollary was given by Ferrers [1877] and Adams [1877]. Corollary 6.8.3 For Legendre polynomials Pn (x), min(m,n) - _ , ., x-^ 2m +2n + 1 — 4-k ) g Proof. Take к = 1/2 in Dougall's identity. ¦ Remark 6.8.2 The coefficients a(k,m,n) in F.8.4) are positive for к > 0. Moreover, Theorem 6.8.2 implies a terminating form of Clausen's formula. See Exercise 3.17(d) for the statement of Clausen's formula.
6.8 Linearization of Products of Orthogonal Polynomials 321 Corollary6.8.4 ForX > -1 /2andX^O 1 Cxt(x)Cxm{x)Cxn{x){\ -x2) „ VJ I r,, F810) (s - ?)\(s - m)\(s - n)\(k)s [T(\)]2s\(s+\) when ? + m + n = 2s is even and the sum of any two of?,m,n is not less than the third. The integral is zero in all other cases. This is straightforward from Dougall's identity. It contains F.8.3) as a limiting case. Integrals involving products of some orthogonal polynomials also have com- combinatorial interpretations. In Section 6.9, we show how F.8.3) can be computed combinatorially. The coefficients a{k, m, n) in F.8.1) can also be computed in terms of gamma functions when pn(x) = P^a^\x) and a, /3 differ by one. This once again covers the cases of the third and fourth Chebyshev polynomials. Hsu [1938] showed how to use the result in Theorem 5.11.6 to go from F.8.10) to the corresponding integral of Bessel functions: Ja(at)Ja(bt)Ja(ct)t1-adt ( 0 if a, b, с are not sides of a triangle, i of a triangle of area A. This integral was evaluated by a different method in Section 4.11. It is possible to define a second solution of the differential equation for ultra- spherical polynomials that converges to the second solution Ya(x) of the Bessel equation. In fact, there is an analog of the second solution for general orthogonal polynomials {pn{x)}. For simplicity, suppose that the orthogonality measure (or distribution) dce(t) has support in a finite interval [a, b] and that, on [a, b], da(t) = co(t)dt, where со(t) is continuously differentiable and square integrable. Define the function of the second kind qn outside [a, b] by qn(z)= [ —a>(t)dt, zeC,z?[a,b], F.8.11) J Zt and on the cut (a, b) by qn(x) = lim \(qn(x + iy) + qn(x - iy)) = / ^-co(t)dt, F.8.12) 3-^0+ 2 Ja X -t
322 6 Special Orthogonal Polynomials a < x < b. Note that, on the cut, qn is the finite Hilbert transform on (a, b) of the function copn. The ultraspherical function of the second kind, Dx (x), is defined by A - x2f-l'2D"n{x) = -qn(x) = - /' ^A - t2f-dt. F.8.13) It can be shown that ^) ^^^^^ F-8.14) Askey, Koornwinder, and Rahman [1986] have considered the integral I Охп(х)С^(х)С^(х)A-х2Jх-Ых. F.8.15) It vanishes if the parity of ? + m + n is even. It also vanishes when I + m + n is odd and there is a triangle with sides I, m, n. In the other cases its value is 1/2)]2BА)яBЩ2А)т \)]2n\l\m\ M(n+m+e+i)/2((n +m -I - l)/2)!(-A)(n-m-?+i)/2((n -m -I + l)/2)< They also evaluate a more general integral in which Dx(x) is replaced by a func- function of a different order, D^{x). The proof of these results used Whipple's 7F6 transformation. Corresponding integrals for Bessel functions are also derived. For these results and for references, the reader should see their paper. This work arose from a special case studied by Din [1981]. For general Jacobi polynomials, P^a^\x), the linearization coefficients can not be found as products. Hylleraas [1962] found a three-term recurrence rela- relation for these coefficients from a differential equation satisfied by the product p(a'P\x)P^P\x). In addition to the case a = ft, Hylleraas showed that when a = p + 1, the linearization coefficients are products. For many problems, only the nonnegativity of these coefficients is necessary. Gasper used Hylleraas's recurrence relation to determine the values of (a, /3) with all of the linearization coefficients nonnegative. For many years, the best representation of these coefficients was as a double sum. Finally, Rahman [1981] showed that these coefficients can be written as a very well poised 2-balanced gFg. These series satisfy three-term contiguous relations and comprise the most general class of hypergeometric series that satisfy a three-term recurrence relation. Although Rahman's result was unexpected, in retrospect it would have been natural to expect such a result. If Jacobi polynomials are normalized to be positive at x = 1, as they are when a > — 1, then the linearization coefficients are nonnegative when a>p> — \, whena+ye>0, -1 < P < 1/2, and for some a < -ySwhen-1 <p< -1/2. For
6.9 Matching Polynomials 323 the first two of these regions, there is a general maximum principle for hyperbolic difference equations that implies nonnegativity. See Szwarc [1992]. 6.9 Matching Polynomials Orthogonal polynomials have connections with some combinatorial objects. These connections have been studied intensively in recent years. Here we shall define matching polynomials of graphs and show their relation to Hermite polynomials. This relationship will then be used to evaluate integrals of products of Hermite polynomials; in particular, it will give their orthogonality. We begin with some definitions from graph theory. Let G be a graph. We can think of G as an ordered pair (V, E), where У is a set of vertices (points) and ? is a set of edges that join pairs of vertices. We denote the number of vertices by | v |. A graph G is complete if every pair of vertices is joined by an edge. We denote a complete graph on от = \v\ vertices by Km. The complement of a graph G is another graph G that has the same set of vertices as G but those and only those edges of Кщ that are not in G. See Figure 6.1. A k-match on G is a set of k disjoint edges of G. By disjoint we mean that no two edges meet at the same vertex. Example The set of edges {{1, 2), {3,4)} in Figure 6.2 is a 2-match in G. There is no 3-match in this example. Let p(G,k) denote the number of &-matches in G. We take p(G,0) = 1 and p(G, — 1) = 0. It is clear that p(G, 1) is the number of edges in G and that for G Figure 6.1 Figure 6.2
324 6 Special Orthogonal Polynomials к > [m/2] (the greatest integer in m/2 = \v\/2), p(G, k) = 0. A match that uses every vertex of G is called a complete match and is denoted by pm{G). For a graph G, we define the matching polynomial of G, a(G), by [m/2] a{G) = a{G, x) = ^ (-l)kp(G, k)xm~2k, F.9.1) k=0 where m = |u|. Theorem 6.9.1 a(Km,x) = 2"m/2#m(x/V2). First Proof. We proved earlier that ft—0 It is therefore enough to show that 1 m\ p(Km, k) = -r 2* r. 2* k\(m — 2k)! We have to find the number of к matchings in a complete graph with m vertices. The number of ways of choosing 2k vertices for the к matching from m vertices is (^). From a given set of 2k vertices any particular vertex can be joined to 2k — 1 vertices to give a match. Any vertex of the remaining 2k — 2 unmatched vertices can be matched up with 2k — 3 vertices and so on. This implies that p(Km,k) = (™\Bk - 1)B* - 3) ¦ • • 3 • 1 1 m\ 2kk\(m-2k)\ This proves the theorem. ¦ Second Proof. A more interesting approach is to show that a (Km ,k) satisfies the same recurrence relation and initial conditions as Hem(x) := 2~m/2Hm(x/^/2). From the recurrence relation F.1.10) for Hm(x), we find that Hem+i(x) =xHem{x) -mHem_i(x), Heo(x) = 1, Hei(x) = x. We first prove that p(Km+l,k) = p{Km, к) + тр{Кт_и к - 1). Take a vertex и e Km+\. This vertex can be a part of a fc-match in m ways, and the number of ways to complete each &-match is p(Km~i, к — 1). If v is not in
6.9 Matching Polynomials 325 a fc-match of Km+u then the number of such ^-matches is p{Km,k). This proves the recurrence relation for p. Now = V (-l a(Km+i,x)= k=0 Km+l)/2] = ? (-1 k=0 Jfc=0 If [(m + l)/2] > frn/2] in the first sum, then p(Km, [{m + l)/2]) = 0. So the first sum is [m/2] k=0 In the second sum, change ktok + l.lt becomes Km-l)/2] This proves the theorem. ¦ Our next objective is to give a combinatorial evaluation of the integral /•OO I{nun2,...,nk)= / Hni(x)Hn2(x)-Hnt(x)e-x2dx. J — oo For the case к = 2, its evaluation will give the orthogonality of the Hermite poly- polynomials. A related integral is /»OO J(ni,n2,.-.,nk)= / Heni(x)Hen2(x) ¦ ¦ ¦ Hent(x)e~x2/2dx, J-oo where Hem(x) = 2~m/2#m(x/V2). A change of variables shows that /(«!, «2, . ... Л*) = 2(+-+"'-1)/27(l!b «2, • • • , Л*). It is convenient to shorten the notation by writing Я = (n\, m, ¦ ¦ ¦, и/t). Also, let .., л,-_ь и, - 1, и,+ь ..., л*). Here the Jth parameter и, is reduced by 1. Similarly, J~'lj) will mean that the ith and 7th parameters are reduced by 1. The next lemma gives a recurrence relation
326 6 Special Orthogonal Polynomials Lemma 6.9.2 J^ = Y?i=2"< Jn"l) and J° = л^1"'• Proof. First observe that the Rodrigues-type formula F.1.3) for Hermite poly- polynomials gives Hem(x) = (-!)" dxm Also, F.1.11) implies H'em = —-Hem(x) = mHem-x{x). dx Applying integration by parts, we get f°° d h= \ (-1Г ~П^е~Х /2HemW ¦ ¦ ¦ Hemk(x)dx J—CO ^ = (-1)--1 / J—с (=2 к i=2 Since Jq is the normal integral, ЛОО / J-c the lemma is proved. A combinatorial object that satisfies the same functional relation as in Lemma 6.9.2 is obtained as follows: Let Vi, V2, ¦ ¦ ¦, Vk be a disjoint set of vertices. Let V = Vi U V2 U • • • U Vk. Let | Vt\ = n, so that | V| = Ya=i "<"• Construct a graph G from V by putting an edge between every pair of vertices that does not belong to the same V,¦. G is called the complete fc-partite graph on Vi U V2 u • • • U Vk. Let P(ni,ri2,. ¦ ¦ ,nk) = Pn denote the number of complete matches on G, that is, the number of matches that use all the vertices of G. We set Pg = 1, in accordance with the earlier convention. It is clear that if X^=i ni is an 0<ld number, then Р„ = О. We also define P%'J) similarly to J%J).
6.9 Matching Polynomials 327 Lemma 6.9.3 Рц = Y^i=2niP^'l) and ^o = !• Proof. Choose a specific vertex in V\. This vertex can be matched with any of the n, vertices in V,-, i ф 1. Once one such match is made, the rest can be completed in pi1(> ways. This implies that к рп = ^2щР^и\ i=2 and the proof of the lemma is done. ¦ Since Jn and Pn satisfy the same recurrence relation we have the following: Theorem 6.9.4 h = л/ЪгРц, h = B"'++ ¦+Я*яI/2РЯ. Theorem 6.9.5 P(m, n) = m\Smn. Proof. In this case V = V{ U V2, \Vi\ = m, and \V2\ = n. If m ф n, then the vertices of V\ cannot be matched with the vertices of V2 to give a complete matching. So P(m, n) = 0 for m ф и. If m = n, then the number of complete matchings is m! and the theorem is proved. ¦ This theorem implies the orthogonality of the Hermite polynomials, for we have the well-known result о Hm(x)Hn(x)e~x*dx = 2mm\-y/nSmn. Now suppose V = V\ U V2 U V3, where Vj, V2, V3 have ?,m,n elements respec- respectively. If ?+m+n is odd, or if ? > ш+и, then it is easy to see that P{?,m,n) —0. The next theorem considers the other situations. Theorem 6.9.6 Suppose l+m+n is even ands = (?+m+ n)/2. Suppose also that the sum of any two of ?, m,n is greater than or equal to the third. Then ?\m\n\ P(?,m,n) = (s - ?)\(s - m)\(s - n)\ Proof. Without loss of generality, we assume m > n. After all vertices in Vi are matched with vertices in V2 and V3, the same number of vertices must be left over in V2 and V3 for a complete matching to be possible. This means that there are m — n more matchings of Vi into V2 than V\ into V3 in a given complete match. So if x denotes the number of V\, V2 pairs and у the V\, V3 pairs, then x + у = ? and x — у = m — n. Therefore, x = s — n and у — s — m. This implies that there are (s — ?) V2, V3 pairs. There are (s_n) ways of choosing elements in V\ to pair with elements in V2. The remaining elements in V\ then pair with elements in V3. Moreover, for any given 2(s — n) elements, taking s — n from Vi and s — n from
328 6 Special Orthogonal Polynomials V2, there are (s — и)! ways of doing the pairing. All this means that P(l,m,n)=( l )( m \( П )(s-nV.(s-l)\(s-m)\ \s — n) \s — I) \s — mj t\m\n\ (s - l)\(s - m)\(s - n)\' The theorem is proved. ¦ The theorem implies that Г Ht(x)Hm(x)Hn(x)e-x2dx = 2<«+'"+")/277- J-oo (— ){ )( 2 ) when I + m + n is even and the sum of any two of I, m, n is not smaller than the third. Otherwise the above integral is zero. It is possible to compute P(k, I, m, n) as well, but the result is a single series rather than a product. The reader should read the paper of Azor, Gillis, and Victor [1982] for this and other results. For further results on matching polynomials see Godsil [1981]. We now give a different approach to the theorem that 7я = s/brPn- First observe that the matching polynomial of G, a(G, x), can be written as where a runs through all the matchings of G and \a\ — the number of edges in the matching a. Let the disjoint union of two graphs G\ and G2 be denoted by Gi U G2. See Figure 6.3. Lemma 6.9.7 a(Gi U G2) = a(Gi)a(G2). Proof. Suppose Gi has m vertices and G2 has n vertices. Then
6.9 Matching Polynomials 329 The last relation follows because every matching у breaks up uniquely into a matching a of G\ and fi of G2. The lemma is proved. ¦ Let ф be a linear operator on polynomials defined by 1 ф(х") = = / \lLit J-00 If и is odd, then ф{хп) = 0. When n is even, say n = 2m, then We have seen that this is also the number of perfect matchings on Km. We denote this quantity by pm(Kn). Let Vt (i = 1,2,..., k) have и, vertices and У = V\ U V2 U • • • U Vk be their disjoint union. Let Kv be the complete graph on V. An edge of Kv is called homogeneous if it joins two vertices in the same V|; otherwise it is inhomogeneous. With this terminology, P% is the number of perfect matchings of Kn with no homogeneous edges. Lemma 6.9.8 where a runs over all matchings ofG = KVl U KVl U ¦ • • U KVt. Here n = X)*=i Proof. By the previous lemma and the above remarks L~n = ф(Нещ(х)НеП2(х) ¦ ¦ ¦ Hent(x)) = ф(а(Кщ)а(КП2)---а(КПк)) This proves the lemma. ¦ The expression Х)„(—1)'а'/мп(?п-2|а|) can also be written as where a, is a matching in KVt. Finally, we can rewrite this as «Ь —.«*,}'
330 6 Special Orthogonal Polynomials where у runs through all the complete matchings of Kn-2\a\ with \a\ = \cti\-\ h \ak\. The matchings a\, a2,..., ak, y, taken together, give a complete matching ofKv. To complete the final step, we need one more lemma that uses the concept of a colored complete matching of Kv. For each matching ct\, a2,..., ak, y, color the edges in each a; red and the edges in у blue. Thus all the red edges are homogeneous and the blue edges are either homogeneous or inhomogeneous. The set of all matchings ct\,..., ak, у in the summation is the set X of all matchings of Kv in which only the homogeneous edges are red. Let Y c. X, where Y is the set of all matchings in which there are no red edges and all the blue edges are inhomogeneous. These are the complete matchings in the к -partite subgraph in Ky. If r(a) denotes the number of red edges in a, then by Lemma 6.9.8 we have shown that Also, by definition a&Y The next lemma will complete the proof of P^ = Ь„. Lemma 6.9.9 ]T (-iy«"> = o. a&X-Y Proof. First define an involution в on X — Y. Number the edges of К у arbitrarily. For any a e X — Y, consider the set of all homogeneous edges of a. This set is nonempty. Consider the smallest edge in this set and change its color from red to blue or from blue to red. This gives a new matching a' = в (a) in X — Y. Clearly в (в(a)) = a. It is also clear that (-l)r<a> + (-l)r<0(«)) = o. This proves the lemma and the theorem. ¦ The above proof follows DeSainte-Catherine and Viennot [1983]. Also see Viennot[1983]. 6.10 The Hypergeometric Orthogonal Polynomials The hypergeometric representations of the Jacobi, Laguerre, and Hermite poly- polynomials, which we have extensively studied in this chapter, are respectively
6.10 The Hypergeometric Orthogonal Polynomials 331 given by n,n+a + p + l l-x n\ * V a + l 2 ) n\ and -n/2,-(n-l)/2 _)_ — ' x2 In Chapter 3, the Wilson polynomials were introduced. These polynomials can be represented as 4F3 hypergeometric functions: Wn(x2;a,b, c,d) (a + b)n(a + с)„(а + d)n f-n,n+a + b + c+d-l,a + ix,a-ix \ ,,,.,, = 4^3 . , , , , ; 1 • F.10.1) \ a+b,a+c,a+d ) We saw that Jacobi polynomials are limiting cases of Wilson polynomials and in turn the Laguerre and Hermite polynomials are limits of Jacobi polynomials. A question arises as to whether there are hypergeometric orthogonal polynomials at the 3F2 level. In fact there are such polynomials. A few are treated in this section and others are given in the exercises. For a more complete treatment the reader should see Koekoek and Swarttouw [1998]. It is easily seen that Wn(x2;a,b,c,d) ( -n,a + ix,a — ix lim ' (a + b)(a + c)F( ' ; rf^oo (a + d)n V a+b,a+c =: Sn(x2;a,b,c) F.10.2) and Wn((x + tf; a - it, b - it, с + it, d + it) t^So (-2)" л! ,n(a + c)n(a +d)n f-n,n+a+b + c + d -l,a+ix = 1 3Г2 I ; 1 n\ \ a + c,a + d =:pn(x;a,b,c,d). F.10.3) The polynomials Sn{x2; a, b, c) and pn(x; a, b, c, d) are called the continuous dual Hahn and continuous Hahn polynomials respectively. Their orthogonality and recurrence relations can be obtained from those of Wilson polynomials, which were derived in Chapter 3. We restate them here for convenience. When Re(a, b, c, d) > 0 and the nonreal parameters occur in conjugate pairs, the orthogonality is
332 given by 1 Г 6 Special Orthogonal Polynomials Г (a + ix)T(b + ix)T(c + ix)T(d + ix) ГB/дс) ¦Wm(x2; a, b, с, d)Wn(x2; a, b, с, d)dx — (n+a + b + c+d- l)nn\ T{n+a+b)T(n+a + c)T(n+a + d)T(n+b+c)T(n+b+d)T{n+c+d) rBn+a+b + The recurrence relation is -(a2 + x2)Wn(x2) = AnWn+{(x2) - (An + Cn)Wn(x2) where F.10.4) F.10.5) An = (a+b)n(a+c)n(a+d)n' (n+a + b + c + d- 1)(л + a+ b)(n +a+ c)(n + a + d) and Cn = {2n + a + b + с + d - 1)Bл + a + b + с + d) n(n + b + c- 1)(л + b + d - 1)(л + Bn+a+b + c + d- 2) Bл + a + b + These polynomials also satisfy a difference equation that is a dual of the recurrence relation. This is given by n{n + a+b + c + d- l)y(x) = B{x)y(x + i) - [B(x) + D(x)]y(x) where and B(x) = D(x) = + D(x)y(x-i), = Wn(x2;a,b,c,d), (a — ix)(b — ix)(c — ix)(d — ix) F.10.6) 2ixBix - 1) (a + ix)(b + ix)(c + ix)(d + ix) 2ixBix+ 1) The corresponding results for the continuous dual Hahn polynomials are Г (a + ix)T(b + ix)T(c + ix) Sm(x2)Sn(x2)dx T{2ix) = Г (л + a + b)Y(n + a+ с)Г(п +b + c)n\Smn. F.10.7)
6.10 The Hypergeometric Orthogonal Polynomials 333 Here Sn(x2) = Sn(x2; a, b, c) and a, b, с are either all positive or one is positive and the other two are complex conjugates with positive real parts, -(a2 + x2)\{x2) = AnSn+l(x2) - (An + Cn)Sn(x2) + C~Sn-\(x2), F.10.8) where ~Sn(x2) = Sn{x2)/[(a + b)n(a+c)n], An = (n + a + b)(n +a + c), and Cn =n(n + b + c- 1); ny{x) = B{x)y(x + i) - [B{x) + D(x)]y(x) + D{x)y{x - i), F.10.9) where y(x) = Sn{x2), {a- ix){b - ix)(c - ix) 2ix{2ix — and = 2ix{2ix + 1) In the case of the continuous Hahn polynomials, the results are 1 f°° — Г(а + ix)T(b + ix)T(c - ix)T(d - ix)pm(x)pn(x)dx 2n J_oo Г(и + a + с)Г(и + a + d)T(n + b + с)Г(п +b + d) = omn, (o.lO.lO) Bи + a + b + с + d - \)T(n + a + b + с + d - 1) when Re(a, b, c,d) > 0, с = a, and d — b; {a + ix)pn{x) = Anpn+i{x) - (An + Cn)pn(x) + Cnpn-\(x), F.10.11) where n\ Pn(x) = ———— —Pn(x; a, b, c, d), in(a+c)n(a + d)n (n + a + b + c + d — \)(n + a + c)(n + a + d) Cn = + a + b + c+d- 1)Bл + a + b + n(n + b + с - 1)(и + b + d - 1) Bn+a + b + c + d- 2){2n +a + b + c
334 6 Special Orthogonal Polynomials and n(n+a+b + c + d- l)y(x) = B(x)y(x + i) - [B(x) + D(x)]y(x) + D(x)y(x - i), F.10.12) where У(х) = pn(x;a,b,c,d), B(x) = (c - ix)(d - ix), and D(x) = (a + ix)(b We observed earlier that Jacobi polynomials are limits of Wilson polynomials. In this case, however, the difference equation for the Wilson polynomials becomes the differential equation for Jacobi polynomials. See the exercises for other ex- examples of hypergeometric orthogonal polynomials. For recent developments on some polynomials considered here and their extensions and applications, see Nevai [1990]. 6.11 An Extension of the Ultraspherical Polynomials The generating function of the ultraspherical polynomials is given by the product of A — re'e)~x and its conjugate. More generally, Fejer [1925] studied a sequence of polynomials defined as follows. Let f(z) = YlT=o anZn be a function that is analytic in a neighborhood of z = 0, with real coefficients. The generalized Legendre polynomials or the Legendre- Fejer polynomials are defined by n=0 k=0 n=0 k=0 F.11.1) л=0 Feldheim [1941a] and Lanzewizky [1941] independently asked whether the pn(cose) give rise to orthogonal polynomials other than the Gegenbauer poly- polynomials. We know that if the pn{x) are orthogonal with respect to some positive measure, then they must satisfy xpn(x) = Anpn+l(x) +Bnpn(x) + Cnpn-i(x), n =0,1,2,... F.11.2)
6.11 An Extension of the Ultraspherical Polynomials 335 with AnCn+\ >0 and A, Bn, Cn+\ real. We can normalize to take p-i(x) = 0, Pq{x) = 1. The converse is also true, although we did not prove it. So, to find polynomials pn (x) that are orthogonal, it is enough to derive those that satisfy the three-term recurrence relation. Note that if n pn(cosd) = 2_\пкап-к cos(n — 2к)в, then by в -> в + л we obtain Therefore, if р„(х) satisfies F.11.2), it must, in fact, satisfy 2xpn(x) = Anpn+i(x) + Cnpn-\{x), with An, Cn real, AnCn+i > 0, n = 0,1, 2, This implies that n n+1 2COS# 2_,ak0n-k COS(n — 2к)в = An ^^OkOn+l-k COS(n + 1 — 2к)в k=0 t=0 n-1 + Cn ^atan_i_tcos(n - 1 - 2к)в. F.11.3) Now use the trigonometric identity 2 cos6» cos(n - 2к)в = cos(n + 1 - 2к)в + cos(n - 1 - 2к)в to write the left side of F.11.3) as пп-к cos(n + 1 — 2kH + 2^ пкпп-к cos(n — 1 — 2k) в, i=0 k=0 Substitute this in F.11.3) and equate the coefficient of cos(« + 1H to get or The coefficient of cos (n — 1 — 2к)в gives пп—к On Qk+l On—к an-k-\ an+\ ak an-t-i
336 6 Special Orthogonal Polynomials Take k = 0 and 1 to obtain an equation for the variables a. To simplify this equation, set sn = an/an-i.We obtain the nonlinear difference equation Sn+1 Sa-i - or ~ Sn-i + S\- S2) = S\Sn ~ S2Sn-\. For further simplification, set sa = tn + s\. The equation becomes tn+i(tn - /„_! - t2) = -t2tn-uh = 0. Write tn = t2un; then un+\(un - и„_1 - 1) = -м„_ь u\ = 0. For linear difference equations we get polynomial solutions ^ Anq". Such a so- solution is not possible here and as there is no general method for solving nonlinear equations, we try the simplest rational expression as a possible solution, keeping in mind that и \ = 0. Set U"= withal < l,foiun(q,A,B) = un(q~\ A/(Bq), B~l). Then A(\-gn) 1 - Bq"+l A(l - q"'1) - A - Bq") 1 - Bq" - qn) - A - Bqn+l) 1 — Bqn+l 1 — Bq"'1 For this to be true, we must have В = 1 and A - q"-l)(A - 1 - (A - q)q"-1) = A - Чп~2){А - 1 - (A This is identically true for A — 1 = q. Thus A + q){\ - qn~]) So l-q" (l+eXl-?11-1)^-*!) F.11.4) 1 -q" which shows that for some a and / sn - 1 -^r"
6.11 An Extension of the Ultraspherical Polynomials 337 This gives 1 - qn+l A" = ^T^ FJL5) and after some simplification Cn = n . F.11.6) The recurrence relation for pn (x) is given by 2xa(l - Pqn)Pn = A - qn+l)pn+l +oc2{\ - p2qn-l)pn-u with ~ , > 0, n = 0,l,2,.... We have taken \q\ < 1. If q = 1, then the value of un in F.11.4) is defined by the limit as q -» 1. This case also gives rise to orthogonal polynomials. For example, if p = qk in F.11.5) and F.11.6), then An = (n + l)/(n + A), Cn = (n + 2X — l)/(n + A.) (with a = 1), and the recurrence relation for ultraspherical polynomials is obtained. Clearly, there are other cases where division by zero may be involved in F.11.5) and F.11.6). These do not lead to orthogonal polynomials of all degrees unless q is a root of unity. Consider the situation where this problem does not arise and let us see what polynomials we get. We need an expression for an. We have an = i = a-p)a-Pq) •••(!-fig"-1) GO An.iAn-2---A0 a (l-q)(l-q2)--(\-qn) Therefore, fire )=«oL which suggests that we take uq = a = 1. In this case, the polynomial is COS(n — 2k) в. This expression may appear a little strange at this point. As pointed out before, taking p = qx and letting q —*¦ 1 gives the ultraspherical polynomials. One should keep this procedure in mind. In Chapter 10, we give an introduction to objects of
338 6 Special Orthogonal Polynomials this kind. By the methods developed there, it can be shown that the generating function is given by n=0 - re-wqn) This may appear like a much more complicated expression than the generating function for the ultraspherical polynomials that it is supposed to extend. But there is one sense in which it is simpler. Recall that the singularities of the generating function can be used to get information about the asymptotic behavior of the poly- polynomials and the weight function (see Theorem 6.6.2). The generating function for the ultraspherical polynomials has algebraic singularities, whereas the singulari- singularities here are simple poles. These are easier to deal with. The poles closest to the origin are at r = e'e and r = e~'e. Near r = e'e, the generating function behaves like e2Wq")(l - qn+l) A - re~m)' So, „_o Write the infinite product as Re'*. Then a n\ n=0 П Ln=o - Pe2Wqn)(l - -e2Wq")(l -e-2 e2Wq")(l as n —*¦ oo. By Theorem 6.6.2, we expect the weight function to be шр{соьв) = „_o -emq")(\ -e~2Wq") (l-2cos2eq"+q2n) - 2P cos n —> oo. 1/2 These infinite products are well known in the theory of elliptic functions. In Chap- Chapter 10, we hope to convince the reader that they are quite natural and tractable, unwieldy though they may appear now.
Exercises 339 Exercises 1. Evaluate the integral POO POO I = / e-x2e2ix'dx = 2 e~x2 cos2xtdx J-oo J0 (a) by contour integration, (b) by expanding cos 2xt in powers of x and integrat- integrating term by term, and (c) by showing that the integral satisfies the differential equation dl — = -2tl. dt 2. Prove that F{x, r) := e2xr~r2 = J2T=o ^f-r" by showing that = Hn(x). \r=0 3. Prove that n/2 lim ( - > ) (r^ n\ Hint: You can use generating functions, recurrence relations, or Rodrigues's formula. 4. Prove that un = e~x 1гНп{х) satisfies the equation < + Bn + 1 - x2)un = 0. Deduce that — (u'num - u'mun) + 2(n - m)umun = 0. Hence prove the orthogonality of Hermite polynomials, that is, pOO I umundx=0 form ф n. J—oo 5. Use the generating function for Hermite polynomials in Exercise 2 to prove that Hn(x cosu+y sinu) = и! V cos* u sinn->c u 6. Let n be a nonnegative integer. Show that and r2n _ Bn)! ^- H2k(x) X = ! А 22«+i Z_^ Bk + l)\(n -k)\ k=0
340 6 Special Orthogonal Polynomials 7. Define sgnx = {-i, x<a Show that у (-1)" H n=0 8. Use the generating function F.2.4) for Laguerre polynomials to prove that 00 jj f. -r" = A + 4r2)/2(l + 2xr + 4r2 9. Obtain the generating function F.2.4) from the integral representation F.2.15) of Laguerre polynomials. 10. Let ф„(х) = е~х2/2Н„(х)/^2пп\, n = 0, 1, 2,.... Denote the relative max- maxima of \фп(х)\, as x decreases from +oo to 0, by До,п, Ai,n, A2,n, • • ¦ • Prove that Ar,n > Mr,n+b П >Г >0. Deduce that |</>„(д;)| < max \<fo(x)\ = 1. (See Szasz [1951].) 11. Show that the Fourier transform of un(x) = e~*2/2Hn (x) is г"м„ (х) by filling in and completing the following steps: Г un(x)eixydx = Г e-x'—eixy+x2'2dx J-oo J-oo dxn = цуе^2— Г dyn J-o 12. Let yjfn{x) = т^рщжщHn(^/2nx)e~7tx . Suppose / is square integrable on (—oo, oo) and g is its Fourier transform. Let ¦2_^anfn{x), g{x)' n=0 n=0 oo oo xf(x) ~ 2_]сп^п(х), xg(x) ~ у dn\/fn(x). n=0 n=0 (a) Show that an = i"bn. (b) Use the recurrence relation for Hermite polynomials to obtain fn(x) = Vn + 1 tyn+i(x) + Jn fn-\{x).
Exercises 341 (с) Use (a) and (b) to show that cn = Vn + \an + \fnan-\, n = i~n~l[y/n + \an - *Jnan-\]. (d) Deduce that roo poo i poo / x2\f(x)\2dx + x2\g(x)\2dx > — / \f(x)\2dx J— oo -/—oo -^^ J—oo with equality only if /(x) is almost everywhere equal to a constant mul- multiple of exp(—jrx2). (e) Rescale to show that (d) implies that (for p > 0) /¦OO pOO 1 /-OO p2 / x2\f(x)\2dx + p-2 x2\g(x)\4x > — / |/(*)|2Jx. J—OO J—OO -^^ J—OO 13. 14. IS. (f) Show that (e) implies Show Show j a /„- Prove [y°°x2|/(x)|2Jx] that that (-1)пГ(п+а л/тгГ(а + 1/2) that Heisenberg' 1/2 г ,оо / xl \.J-oo I (\ s inequality: -,1/2 \g(x)\2dx\ Г(п+а + 1) n\ 2 „-1/2 (See de E " 1)" sc+n+l j— Э0 oo Bruin [1967].) (-1)" ^ Hlk{x)Hln_lk{y) 2 16. Prove that 17. For Re(a + 1, /S) > 0, prove that and
342 6 Special Orthogonal Polynomials 18. Prove (а) УA)т!г?;Уя~к = (У + Ъя— (b) У 19. Prove the identity \+xy 20. Show that for a > -1, r > 0, and x > 0, \al1 —r \—^ ^* 21. Prove that for Legendre polynomials Р„(х), 00 Ern\X) „ xr - . Г. 7 r = e Jn(v 1 — x2r). n=U n! More generally, Л > -1/2. 22. Suppose Pn (x) is the Legendre polynomial of degree n. Then Turan's inequal- inequality states that [Pn(x)]2-Pn-i(x)Pn+i(x)>0, п>1, -1<дс<1. This exercise sketches a proof of Turan's inequality. See Szego [1948] on which this is based. (a) Show that if the polynomial has all real roots, then U2n_x - UnUn-2 > 0. (b) The following is a result from entire function theory: Suppose 00
Exercises 343 is an entire function with the factorization n=0 where a > 0, /S and fin are real, and J2™=o Pn *s convergent. Then Sn (y) has all real roots. To obtain Turan's inequality, use Exercise 21 and D.14.3). 23. (a) Use Exercise 19 to show that Sn(y) in Exercise 22(a) with uk = Pk(x) has all real roots and thus obtain another proof of Turan's inequality, (b) Extend Turan's inequality to the polynomials Hn(x), L"(x), and C%(x). Prove these inequalities by two different methods. 24. For the Legendre polynomial Р„ (x), prove the following results: (a) / Р„(х)е "xdx=i "\ — Jn+y2(t). 7-i (b) %^( J-°o 10 if x > 1 orjc < — 1. (c) For a nonnegative integer k, 25. Suppose a, /3 > — 1. Show that , d2y dy A - хг)-\ + [p-a-(fit + P + 2)x]-f- +Xy = 0 dxl dx has a nontrivial polynomial solution if and only if к has the form n{n +a + /3 + 1), where n is a nonnegative integer. This solution is CP^a^\x), where С is a constant. 26. Prove the following results for ultraspherical polynomials: (a) (b) where 00 ,. n=0 ^ lim 1/2)» яд ^-"Свх(дс) = X-l/ (А)„ n' /г = (i -2xr (с) *=o
344 6 Special Orthogonal Polynomials 27. Use Rodrigues's formula to prove when x ф ±1, and when the contour of integration is a simple closed curve, around t = x in the positive direction, that does not contain t = ± 1. J(l-yf( (b) 2nJ(l-yf(l+ 28. Prove that (а) -m,m+X Л+ 1/2 ; (b) -2m+V A+l/2 ;1 The following problems define some important hypergeometric orthogonal polynomials. For a given nonnegative integer N appearing in the definition of a discrete orthogonal polynomial, we use the notation jfc! 29. The Racah polynomials are defined by n,n + a + P+l,x,x + y+8+l a + l,/J + «+l,y + l for n = 0, 1,2,..., N, and where Л(х)=х(х+у +5+1) and one of the bottom parameters is —TV. Show that the orthogonality relation is given by " (y + S + l)x((y + 5 ^ x\((y + 5 + l)/2),(y + S - a + l),(y - /8 + 1),(S ¦Rm(X(x))Rn(k(x)) = M(n + <* + P+ 1Ш + !)»(" ~ Д + 1)и(" + <g - У (a + /8 + 2Jn(« + l)B(j8 + 5 + 1)„(У + 1)„ \
Exercises 345 where M = 2)N(-p) ifa = -N, 2)N(8-a)N if у + 1 = -N. Show that the recurrence relation is given by X(x)Rn(X(x)) = AnRn+i(X(x)) - (An + Cn)Rn(X(x)) + CnRn where (n + a + P + \)(n + a + l)(n + P + 5 + l)(n + у + 1) Л„ = and С„ = {In + a + /в + 1)Bи + а + /в + 2) и (и + /в) (и + a + /в - у)(и + a - 8) Show that the difference equation satisfied by y(x) = Rn(X(x)) is n{n + a + l)y(x) = B(x)y(x + 1) — [B(x) + D(x)]y(x) + D(x)y(x — 1), with (x + a + l)(x + p + 8 + l)(x + у + l)(x + у + 5 + 1) B(x) = Bx + у + 8 + l)Bx +y+8 + 2) and Bx + у + 5)Bx + у + 5 + 1) 30. The Hahn polynomials are limits of Racah polynomials defined by lim Rn(X(x); a, p, -N -1,5)= Qn(x; a, p, N) or lim Rn(X(x); a, p,y,-p-N-l) = Qn(x; a, p, N). y->OO Show that a + l,-N ' ;1 , n = 0,l,...,N. (b) x=0 x\(N -x)\ (-l)nn\(p +!)„(« NlBn + a + p „(x;a,p,N)
346 6 Special Orthogonal Polynomials (c) -xQn(x) = AnQn+i(x) - {An + Cn)Qn(x) + CnQn-i(x), where _ (n + a + p + l)(n + a + l)(N - n) "~ Bn + a+p + l)Bn + a + p + 2) ' and c = n(n+p)(n + a+p + N + l) " Bn+a + P)Bn+a+P + l)' (d) Qn(x) satisfies the difference equation n{n + a + p + l)y(x) = B(x)y(x + 1) - [B(x) + D(x)]y(x) + D(x)y(x - 1), where В(х) = (х-Л0(х+а+1), D(x) = x(x - P - N - 1). (e) Use (b) to show that Qn(x; a, p, N) = CnQn(N - x; p, a, N), where Cn = (-l)"(a + l)n/(P + 1)„. Deduce Corollary 3.3.2. 31. Define the dual Hahn polynomials by Rn(\(x); y, S, N) := lim^ RnDx); -N - 1, p, y, 8) and deduce properties corresponding to (a) through (d) in the previous prob- problem. Observe that dual Hahn is obtained from the Hahn by interchanging n andx. 32. Show that, for the Hahn polynomials Qn(x) defined in Exercise 31, the fol- following limit formula holds: xQn(;a,p,) 33. The Meixner polynomials can be defined by - c) \ lim Qn[x;b-l, '-^ -, N =: Mn(x; b, c). N->oc \ С J Show that (a) Мя(х; b, c) = 2Fl(-n, -x; b; 1 - 1/c). ^ (b)x c~"n\ (b) > —-cxMm(x;b,c)Mn(x;b, c) = —— -Smn, f^0 x! (b)n(\ -c)b b > 0 and 0 < с < 1. (с) (c- (Note that an application of the Pfaff transformation (Theorem 2.2.5) shows that Meixner polynomials satisfy a three-term recurrence, which,
Exercises 347 by Favard's theorem, implies that they are also orthogonal with respect to a positive measure when с > 1. The reader is encouraged to find this orthogonality relation, which is obtainable from (b).) (d) n(c - 1)М„(х) = c(x + b)Mn(x + 1) -[* + (*+ b)c]Mn(x) + xMn(x - 1). Observe the duality in n and x exhibited by relations (c) and (d). 34. A way of defining Krawtchouk polynomials is given by Kn(x; p, N) := lim Qn(x; pt, A - p)t, N). r->oo Prove the following relations: (a) Kn{x;p,N)=2Fx{-n,-x;-N;l/p), n = 0,l,...,N. ^ Л ( x )PX^ - p)N-xKm{x; p, N)Kn(x; p, N) (-irn\f\-p\"s (-N)n V P (c) -xKn(x) = p(N - n)Kn+l(x) - [p(N -n)+ n{\ - p)]Kn(x) + n(l-p)Kn-l(x). (d) -nKn(x) = p(N - x)Kn(x + l)-[p(N -x)+x(] - p)]Kn(x) + x(l - p)Kn(x - 1). Note the relationship of (c) and (d) as in the case of the Meixner polynomials given in the previous exercise. In fact Kn(x; p, N) = Mn{x; -N, p/(p - 1)). 35. Define the Charlier polynomials by С„(х; a) := lim М„(х; b, a/{a + b)) b—юо or С„(х; a) = lim Kn(x\ a/N, N). Deduce that 00 x (b) ^2 -^cm(x; a)Cn(x; a) = n\a~neaSmn, a > 0. (c) -хС„(х) = aCn+l(x) - (n +fl)CB(jc) + лС„_1(дс).
348 6 Special Orthogonal Polynomials (d) -nCn(x) = aCn(x + 1) - (x + a)Cn(x) + xCn(x - 1). Compare (c) and (d) as in the previous two exercises. 36. Prove that Urn Ba)n/2Cn(V2ax +a\ a) = (-1)"Я„(х). a—>oo 37. The hypergeometric representation of the Meixner-Pollaczek polynomials is These polynomials can be obtained as limits of continuous dual Hahn (or continuous Hahn) polynomials. Show that , Sn{{x (а) Р„(х;ф) = hm n\(t/sin<p)n /•00 (b) / Bф^ -лч2^ ! 3m"' Л > 0' 0<ф <7T. (c) (и + 1)Р„л+1 (х) - 2[х sin ф + (и + X) cos 0]Р„*(х) + (и + 2Х-1)Р„л_1(х)=0. (d) в1'*(А. - гх)Р„А(х + 0 + 2/[xcos0 - (и + A.) si 38. This problem gives generating functions for some orthogonal polynomials, (a) Wilson polynomials: fa + ix, b + ix \ / с — ix,d — ix and ((a+b+c+d-l)/2, (a+b+c+d)/2,a+ix,a-ix -At 4 3\ a+b,a + c,a+d ' A - tJ
Exercises 349 A simple corollary is that dk fa+x,b + x \ /c-x,d-x for —oo < x < oo when a,b,c,d > 0 and 0 < t < 1. (b) Continuous dual Hahn: and , „ (a + ix, a-ix \ ^ Sn(x2; a,b,c)t" (c) Continuous Hahn: a + ix \ fd-ix \ ^ pn(x;a,b,c,d)tn г 1^*1 ' гМ = У > г' 11^11 \a + c J[ \b and (a + b + c + d- l)/2, (a + b + с + d)/2, a + гх -4Г a + c,a+d ' A -О2 (d) Meixner-Pollaczek: 00 A _ егф1)-к+1х{\ - е-'ф1Гх-'х = E Pn& Ф)*" n=0 and 'X + ix ,.-,. . ^ BV"ein* (e) Meixner: t and , _ (-x (l-c)t\ ^ Mn{x;b,c)
350 6 Special Orthogonal Polynomials (f) Charlier: и л=0 For more examples of orthogonal polynomials and their properties, see Chihara [1978, Chapter 6]. 39. Prove the inequalities in F.4.19) for a > —1/2. Prove the corresponding result for —1/2 > a > — 1. 40. Suppose a, fi > — 1. Show that [ \P?"-P)(xi)\ ~ VVn when max(a, /8) < -1/2. Here x\ is one of the two maximum points nearest (/в — a)/(a 4-/6 + 1) = (Compare with Exercise 4.18. Take n(n + a + p and show that f'(x) can change sign only at x0.) 41. Show that f&k ifX>0, max \c^(x)\ = { "¦ -l^x-1 { \C^(xi)\ if Л < ОД nonintegral. Here xi is one of the two maximum points nearest 0 if n is odd; x\ = 0 if n is even. 42. Show that for a fixed с and и —»• oo, |-0-«-1/2О(и-1/2) if c/n < ^ < я/2, Pn(a'«(cos#)= <^ - - [ О(иа) ifO <#<c/n. [Use the result on the asymptotic behavior of Pn(a^\x) in Section 6.6. Also apply Exercise 4.18, with y(x) = and 1/4 — a2 1/4 — B + (и 4sin2(x/2) 4cos2(x/2) 43. Show that the sequence formed by the relative maxima of \L" (x)\ and \L" @) | is decreasing for x < a + 1/2 and increasing for x > a + 1/2. (Consider the function n{L«(x)}2 ^2
Exercises 351 44. Prove that the successive relative maxima of \Hn(x)\ is a decreasing or in- increasing sequence according as x < 0 or x > 0. 45. Use Theorem 6.7.2(c) and Gegenbauer's integral in Theorem 6.7.4 to obtain Koomwinder's Laplace-type integral for Jacobi polynomials: P^*'P){x) _ 2Г(«+ 1) 2 r 1" у1 Г Г1+х-A-х)м2 . г 1 / / ; HV1-*«COS0 Jo Jo L 2 J a > /в > -1/2. 46. Note that v(x) = e~x2/2Hn(x) is a solution of y" + {In + l)y = x2y. (a) Hence verify that for s = \Jln + 1, v(x) = An[cos(sx - ИЛ-/2) + /fn(x)], where Rn{x) = ~ f t2y(t)sin(s(x-t))dt, sAn Jo and (n\/k\, n = 2k, А„ = < l(n!/Jk!)B/s), n = 2*+l. (b) Use Schwartz's inequality to prove that \Rn(x)\ <C|x|5/2n-1/4, where С is a constant. (c) Deduce that Hn{x) ~ 2<"+1>/V/V/V2/2cosГ« - ^ as n ^ oo. 47. Show that y(x) = e~x/2L"(x) satisfies the equation (a) Deduce that, for N - (a + 2n + l)/2, У(х) = j yi(x) + ^N I ^г)а+1У@[У1@У2(х) - yi(x)y2(t)]dt, where У\(х) = .
352 6 Special Orthogonal Polynomials (b) As in Exercise 46, but after much more work, it can be shown that the integral divided by Г(n + a+I)/n\ tends to zero as n -» oo. Thus prove that +a + l) as n -> oo. (c) Use D.8.5) to conclude that as n —»• oo. 48. Let rk denote the number of ways in which к rooks can be placed in different rows and columns on an m x n chessboard. Let Rmn = Y17=o rkXk ¦ Prove that, when a is an integer, Rn,n+a — n\x"L^(—l/x). 49. Use the methods of Section 6.9 to evaluate the integral -== / Hea(x)Heb(x)Hec(x)Hed(x)e-x2/2dx. Then evaluate it by a different method. 50. Suppose f(x) is expandable in terms of Jacobi polynomials. Let л=0 where hn is defined in F.2.5). Let g(x) be an average of f(x) denned by g(x) = A - х)-а-]A - лгГ" / /@A - t)a(l + tfdt. Jx Suppose for simplicity that a@) = 0. If b{n) and a{n) are the Jacobi coefficients of g and / respectively, show that b(n) = + а+^ + 1 Г(л + 1) ^ a(k)[2k + a + fi + 1] Г(*+1) \ht\ k[k+ a + fi + l] Г(* + а + 1)' for n = 0, 1,.... A sufficient condition for the result to hold is the convergence of the series Ti\a(n)\nl+a. Similar results hold forLaguerre and Hermite polynomials. 51. This exercise gives a proof of the following theorem of Hardy [1933]: If / and its Fourier transform g are both 0(\x\me~x /2) for large x and some m, then each is a finite linear combination of functions of the form e~x2/2Hn (x), where Hn{x) is the Hermite polynomial of degree n.
Exercises 353 Note that it is enough to prove the theorem for self-reciprocal (/ = g) and skew-reciprocal functions (/ = —g). Take / to be self-reciprocal and show that: (a) For Re s = a > — 1, X(s) = Г e-s*2'2f{x)dx Jo satisfies X(s) = s~ (b) When ix(s) = y/(s + l)k(s), fi(s) = The function ix(s) may have a singularity at s = — 1 but is analytic at all other points including infinity. Hence, л=0 (c) X(s) = 0(\s + \\~p) forsome p, near.? = —1. This can be proven by the argument below. Let а +1 = т. On the unit circle \s\ = 1, т = \s +112/2. For |s| < 1 X(s)= o( Пe-Tx2'2xmdx) = Ods + ll"™). (d) / e-«2V'/2u-M-'--' 1Ч" ' ^-^ 2 /•00 /•00 (e) / е-/2(/-Ф)Л = 0, where Ф is an appropriate linear combination of functions of the form 52. Hardy's theorem given in the previous exercise extends to the following result ofRoosenraad[1969]: Set and define generalized Laguerre functions by Ca2n(x)=caJx\a+l/2e-*2?2Lan(x2), Ca2n+X(x) = can+x\x\a+ll2e~xll2xLa+\x2), л = 0, 1, 2,....
354 6 Special Orthogonal Polynomials For a function / defined for all real numbers, set If00 ( = -J f(x)U (xt) i^ Note that 1Za is the sum of an even and an odd Hankel transform. Check that ?%, are eigenfunctions of TZa, that is, naCam=im?am, m =0,1,2,.... Theorem Iff and ga = Haf are both O(xm+a+l/2e~x2/2) for large x and some m > 0, then each is a finite linear combination of the functions C"(x). As before, it is sufficient to consider the cases where lZaf = ±f.TakelZaf = f. Show that, for Re s > — 1, /•00 = / xa+l/2e-sx2/2f(x)dx Jo satisfies So, if n(s) = A +i)a+1A.(i), then (x(s) = fx(l/s). Now complete the proof as in the previous exercise.
Topics in Orthogonal Polynomials As we have seen before, we can gain insight into Jacobi polynomials by using the fact that they are hypergeometric functions. In this chapter, we reverse our procedure and see that Jacobi polynomials can shed light on some aspects of hyper- hypergeometric function theory. Thus, we discuss the connection coefficient problem for Jacobi polynomials. We also discuss the positivity of sums of Jacobi polyno- polynomials. We mention several methods but here, too, there are situations in which the hypergeometric function plays an important role. Finally, for its intrinsic interest, we present Beukers's use of Legendre polynomials to prove the irrationality of f C), a result first proved by Apery. It is evident that the Jacobi polynomial P^Y'S\x) can be expressed as a sum: YH=o cnkPk (*)¦ The significant point is that the connection coefficient cnk is expressible as a 3 Fj hypergeometric function. This 3 F2 can be evaluated in terms of shifted factorials under conditions on the parameters a, /3, y, and <5. Surprisingly, this leads to an illuminating proof of Whipple's 7 F6 transformation. We have seen that, with the exception of Gauss's 2^1, most summable hypergeometric series are either balanced or well poised. A puzzling fact is that at the 5 F4 and higher levels, the series are very well poised. The above-mentioned proof of Whipple's transformation sheds light on this fact by showing that very well poisedness arises from the orthogonality relation for Jacobi polynomials. Fejer used the positivity of the series YH=o sin (fc + \)в to prove his famous theorem on the Cesaro summability of Fourier series. The positivity of some other trigonometric series have also been important in mathematics. It turns out that these inequalities are generalizable to inequalities for sums of Jacobi polynomi- polynomials. The soundness of this generalization is illustrated by the usefulness of the inequalities. A dramatic example is an inequality proved by Gasper that played an unexpected but significant role in de Branges's proof of the Bieberbach conjecture. For this interesting story, see Baernstein et al. [1986]. We also state and prove a trigonometric inequality due to Vietoris. 355
356 7 Topics in Orthogonal Polynomials 7.1 Connection Coefficients Suppose V is the vector space of all polynomials over the real or complex numbers and Vm is the subspace of polynomials of degree < m. Suppose po(x), p\(x), рг(х),... is a sequence of polynomials such that pn(x) is of exact degree n; let qo(x), q\ (x), qj(x),... be another such sequence. Clearly, these sequences form a basis for V. It is also evident that po(x),..., pm (x) and qo(x),... ,qm(x) give two bases of Vm. It is often necessary in working with finite-dimensional vector spaces to find the matrix that transforms a basis of a given space to another basis. This means that one is interested in the coefficients cnk that satisfy ^ G.1.1) k=0 The choice of pn or qn depends on the situation. For example, suppose pn(x) =xn, qn(x) = x(x- l)---(x -n + 1). Then the coefficients cnk are Stirling numbers of the first kind. If the roles of these pn and qn are interchanged, then we get Stirling numbers of the second kind. These numbers are useful in some combinatorial problems and were defined by Stirling [1730]. Usually, little can be said about these connection coefficients. However, there are some cases where simple formulas can be obtained. For example, 7%^y cos(n - 2к)в G.1.2) gives an expansion of Ри(а'а)(х) in terms of Pk~ ' ¦ ' \x). This formula was derived in the previous chapter from the generating function for C*(x). See F.4.11). Another example is = t~^T^). GЛ.З) This can be obtained from the generating function for L%(x). We have = A -r)-^1exp(-xr/(l -r)) n=0 -ЕЕ
7.1 Connection Coefficients 357 Notice that in both these cases the polynomials are similar, in that they are orthog- orthogonal on the same interval and their weight functions are closely related. The next lemma is a basic result of this section and gives the connection coeffi- coefficient cnk when qn{x) = P?'*>{x) and pk{x) = Рк(аф)(х). Lemma 7.1.1 Suppose P^-S) (x) = ?Lo cnkР1"ф) (х). Then (n + у + S + l)k(lc + y + l)n-kBk + a + p _ Cnk~ (п-к)\ГBк+а + Р + n+k,n + k + y + 8 + l,k + a + l\ k + y + l,2k + a + p + 2 ' )' Proof. From the orthogonality of Jacobi polynomials, Cnk = ank/hk, where [ +Р+1 а + 1)Г(к Bk + a + P + 1)Г(к +а + Р + 1)Г(к and апк= / P^&\x)Pla'P)(x)(\-x)a(\+xfdx 7-1 ( ivt Г1 dk = Чг- / Pi^GO—r[(i-*)B+*O+*)/'+t]<fr 2*Л! У_! dxk .-x)a+k(l+x)p+kdx. We have seen earlier that G.1.4) dx-n v»/- 2 '«-i (*)• F-3.8) Therefore, n(y ,8) /.^.4 ^ * D vK~i~^i^~r^) / \ Г ' I у I — r i ( у I
358 7 Topics in Orthogonal Polynomials Use this in the integral for ank to get ank = 22kk\ y+8+ l)k(y + k + !)„_* y^ (-и + k)j(n 22kk\(n-k)\ • Г (l-x)a+k+i(l+x)p+kdx (п + у + S + \)к{к + у + l)B-tr(fc + а + 1)Г<*г This is equivalent to the claim in Lemma 7.1.1. ¦ In general, the 3 F2 in the lemma cannot be summed. If we take у = a, then the 3 F2 reduces to a terminating 2 F\, which can be evaluated by the Chu-Vandermonde formula (Corollary 2.2.3). The 3F2 can again be summed if S = p. For in this case we get a balanced 3F2 whose value is given by the Pfaff-Saalschiitz identity (Theorem 2.2.6). Finally, the 3F2 can be summed by Watson's identity if а = p and у — S (Theorem 3.5.5). It is, however, sufficient to do the а = у case in Lemma 7.1.1 as the other two cases are consequences of this one. Theorem 7.1.2 П ft V / (fy l_ 2)„ 2^ jfc=0 Proof. Take у = a in Lemma 7.1.1. The 3 F2 reduces to (a + p + 2k + 2)n_k Use this in Lemma 7.1.1 and simplify to get the result. ¦ The case S — fi is a corollary. Theorem 7.1.3 (a + p+2)n ¦^ (y — а)„_^(а + P + l)k(a + P + 2k + 1)(/? + у + n jfc=0 '" " « и
7.1 Connection Coefficients 359 Proof. Use Theorem 7.1.2 and the fact that P^\x) = (-1)" P^'a)(-x). Ш Theorem 7.1.4 p(.Y.Y)(x\ - ^ + ^ rm *• л> ~ B у + l) ? B« 3/2)m_afc(y - a). (a,g) m y ?j (a + l)m_2ifc(« + 3/2)m_t(a Proof. Replace x with 2x2 - 1 and ft = ± 1 /2 in Theorem 7.1.3. Then (« + 3/2)я (К - a)n-k(a + l/2)tBfc + « + 1/2)(и + у + 1/2). (,/) k ( G.1.5) (a,-i/2) 2 (n-Jk)!(l/2)t(o+l/2)(n+o + 3/2)t k (X and 2)Bx2 l) C/2)B (« + 5/2)„ уЧ (у - а)„-П« + 3/2)tBfe + а + 3/2)(и + у + 3/2). (аЛ/1) 2 _ G.1.6) Now use the quadratic transformation formula from Chapter 3 (see C.1.1)) to obtain p(a,a)( = (« p 2й (о+1)„Bл)! " and (aa) («+lJn+iw! (l/2J _ ,ч r? , ox Take m = In and combine G.1.5) and G.1.7) and simplify to get p(Y,Y)(x\ = (y + ^m By + l)m (У ~ <*)f-*B« + 1)ц(« + 3/2)a(y + l/2)f +k (tt.g) 2* The formula in the theorem for even m follows from this by reversing the order of summation, that is, by changing к to у — к. The odd case is obtained similarly from G.1.6) and G.1.8). ¦
360 7 Topics in Orthogonal Polynomials We can also write the previous theorem in terms of ultraspherical polynomials: Theorem 7.1.4' Remark 7.1.1 The relation in G.1.2) is obtained from the above formula by letting ц, -» 0. Surprisingly, it is easy to obtain Theorem 7.1.4' directly from G.1.2). Proof of Theorem 7.1.4' Note that Differentiate G.1.2) with respect to 9 and divide by — sin в to get a+i, m V^ W»-*W*(* 2fc) sin(n - 2к)в „ Ucosd) = > . G.1.10) "~1V f^ (nfc)!fc! sin0 f (n к—0 If /t is replaced by n — к, the expression in the summation does not change. So for n odd the terms of the sum can be paired and for n even the same can be done, because the term when n = к is zero. Thus, after changing n — 1 to n and X to A.- 1 in G.1.10), we get Here we used the fact that , sin(n + 1N Repeat this process, that is, differentiate with respect to x and change n — lion and X to A. — 1 to get Щ(*2)я + 22* t=0 v-zn-ic-v By induction it follows that for jx = 1, 2, 3, It is evident from G.1.2) that C^_2jt(xIS a polynomial in д. Hence the right side of G.1.11) is a rational function of д. Since G.1.11) is true
7.1 Connection Coefficients 361 for infinitely many values of ц, it is identically true. This completes another proof of Theorem 7.1.4'. ¦ It is possible to obtain Dougall's sum of a very well poised 7F6 from Theorem 7.1.3. This evaluation of Dougall's identity gives an insight different from those suggested by earlier evaluations. To start, take x = 1 in Theorem 7.1.3 to get (y + l)n(a + p+2)n = (y-a)n P Note that ^+0+1 = (|g+^+t)/2) • GЛЛ2) So /a + 0 + 1, (a + P + 3)/2, и + у + p + 1, a + 1, -n Set a + p + 1 = a, a + 1 = ?, and n + y + /J+l=cto rewrite this formula as яо + р+1 = a, a -t- i =o, ana n + y + p-t-i=:cio rewnie mis rormi / a, a/2+1, ft, с, -л; \ _ (а + 1)„(а + 1-ft - 5 4 Va/2,a + l -ft, a + 1 - c, a+ 1+n' У (a + 1 - b)n(a + 1 - hi« iHpntitv oivpc fhp cum rvf я tprminatino upn/ \x/p11 nni«pH * Т?л Tt i« intp.rp с)„' This identity gives the sum of a terminating very well poised 5F4. It is interesting that after Kummer's sum of the well-poised 2^1 at x = —1 and Dixon's sum of the well-poised 3F2 at x = 1, most well-poised series that can be summed have the additional feature of a numerator and denominator parameter differing by one. This makes the series very well poised. In the above series, this followed from G.1.12), which in turn came from the orthogonality relation for Jacobi polynomials G.1.4). This partially explains why the summable well-poised series after 3F2 are very well poised. The above 5F4 is only a particular case of Dougall's 7F6. To find the general case, note that evaluation of a function at a point is an example of a linear operator. This was the operator applied to the identity in Theorem 7.1.3 to get the 5F4. A more general operator is an integral with respect to a measure. To obtain attractive formulas, one chooses the measure suitably. The Jacobi polynomials can be written as 2^1 hypergeometric series and we know that the integration of a hypergeometric function with respect to a beta distribution introduces two new independent parameters into the series. This shows how we may obtain the
362 7 Topics in Orthogonal Polynomials generalization of the 5F4 formula. Write Theorem 7.1.3 as (Y + 1)п р f-n, y+p+n+1 ^ ; i211 , n\ V У + а + ? + 2)й^ (л-*)! (а + j8 + 2A: + l)(y + p + n + l)k(a + \)k ;ut\. G.1.13) Integrate this with respect to two independent beta distributions. The result is '-n, у +Р +n + I,a, b \ _ (P + l)n + 1)„п\ Л y + hc,d (у - a)n-k(a + p + l)t(o + P + 2k + l)(y +p + n + l)k(a + \)k k=Q (n - k)\(P + l)t(o + P + 1)(а + Р + П + 2)kk\ ,-k,a + P + k+l,a,b у \ a + l,c,d ) The 4F3S here cannot be summed without restriction on the parameters. Observe that the 4F3 on the left-hand side is balanced ifa + b + p + 1 = с + d. This is also the condition for the 4F3S on the right to be balanced. So assume that the parameters are chosen to balance the 4F3S. If we further let b = a + 1, then the 4F3 on the right side is reduced to a balanced 3F2, which can be summed by the Pfaff-Saalschiitz formula. The result is the formula +P + n + l,a+ I, a +0-c + 2,c-a a-y-n + l ( + p+ l)/2 + p + n+2p + l +a + p + 2' when a+P+a+2 = c + d. This is just Whipple's transformation formula from which Dougall's identity can be obtained, as we have seen earlier. Remark 7.1.2 The coefficients in Theorem 7.1.4 are nonnegative when у > a > — 1. This fact is useful in the proof of the positivity of a certain 3 F2 function. This played a significant role in the first proof of the Bieberbach conjecture. We prove the inequality in a later section. The nonnegativity of the coefficients in Theorem 7.1.3 holds under the same condition, that is, у > a > — 1. In Theorem 7.1.2, nonnegativity occurs when S — p = — 1, —2,..., 8 > — 1. For general a, p, y, S
7.2 Rational Functions with Positive Power Series Coefficients 363 the problem of the nonnegativity of the coefficients reduces by Lemma 7.1.1 to that of a certain 3F2. One way to deal with this is to use three-term contiguous relations for 3F2S. For details the reader may look to Askey and Gasper [1971]. 7.2 Rational Functions with Positive Power Series Coefficients We start by showing that m - к + a \ (n - к + a \ fk - a - 2\ m-k ) \ n-k ) \ к ) ~ for a > 0. Lorentz and Zeller [1964] used this to obtain a new proof of a theorem of Hardy and Bohr. The above inequality is not directly related to orthogonal polynomials but its proof gives a nice introduction to the method of generating functions. This method will be used again in this section to prove an inequality involving Laguerre polynomials. Moreover, G.2.1) is really an inequality for a 3F2, a topic we discussed in the previous section. Note that G.2.1) can be written as (« + 1>«(« +DVa (-».-». "«"I. Л, О G.2.2) m\n\ \— m — a, — n — a for a > 0. We prove a more general result, which is the content of the next theorem, due to Askey, Gasper, and Ismail [1975]. Theorem 7.2.1 If 0 < a < min(?, y), then min(m,n) i , o\ / ;i\/; <->\ El m — к + p\ /n — k + y\ / к — a — 2\ ы ( -* )( .-* )( * J0' m,n =0,1,2,.... G.2.3) Proof. Observe that k=0 xk. Thus G.2.3) must be the coefficient of some term in an expansion obtained as the product of three binomial expansions. In fact, G.2.3) is the coefficient of rmsn in the product
364 7 Topics in Orthogonal Polynomials Verify that we can rewrite this product as -5> S V rn-k J\ n-k A - rs)a+1 -r)Y+l = A -г)<*^A -s)"-y The two factors A — r)a~P and A — s)a~y have nonnegative power series coeffi- coefficients when ft > a and у > a. This shows that it is sufficient to prove the case where a = fi = у. Note that 1 - rs 1 1 (l-r)(l-s) 1-r 1 n=\ Thus the expansion of A — rs)/[(l — r)(l — s)] has positive coefficients and it follows that any positive integer power of this rational function also has positive power series coefficients. Now write 1-rs r+1 Г 1-rs ]M+1r 1-rs (l-r)(l-s)J Since 0 <a — [a] < 1, we need only consider the case a = f} = у andO < a < 1. Observe that the 3F2 in G.2.2) when written out is mn(a + 1) mirn — l)n(n — l)(a + \)a (m+a)(n + a)V. (m+a)(m+a- l)(n + a)(n + a - 1J! m(m - \){m - 2) л (л - 1)(л - 2) (a + l)a(a - 1) (m + a)(m + a - \){m + a - 2)(л + a)(n +a- l)(n + a - 2K! + •¦ There are both positive and negative terms in this series, so the sign of the sum is not immediately evident. To show that the series is positive, we transform it into another series, all of whose terms are positive. For this purpose apply to G.2.2) Thomae's formula (Corollary 3.3.4), F (а'Ь'с.Л r(rf)r(g)r(s) fd-a,e-a,s 3 2\ d,e ' ) r(a)r(s + b)T(s+cK 2\ s + b,s + c ' where s — d + e — a — b — с The result, after a little simplification, is a(a + 1) / 1— m, 1— n, 1— a ^ ; (m + or) (л +a) \ 1 — m — a, 1 — n — a
7.2 Rational Functions with Positive Power Series Coefficients 365 It is clear that every term of this 3F2 is positive when 0 < a < 1. This proves the theorem. ¦ Remark Theorem 7.2.1 is equivalent to the statement that 3F2f~IB'"n'"a;l>) >0, m, л = 0,1,2,... G.2.4) \-m-p,-n-y ) when 0 < a < min(/), y). The condition 0 < a < min(yS, y) is necessary. Take m = 1 in G.2.4) to get Let и —>• oo to see that a < fi. By symmetry a < у. The next problem is to show the positivity of the coefficients A(k, m,n) in the power series expansion of the rational function G.2.5) The A(k, m,n) satisfy a finite-difference equation that approximates a two- dimensional wave equation. Friedrichs and Lewy wished to use the positivity of A(k, m, n) to prove the convergence of solutions of finite-difference approxima- approximations to solutions of the wave equation. Szego [1933] gave a proof using Bessel functions. He also translated this problem into an equivalent problem about the positivity of integrals of products of Laguerre polynomials. We follow this direc- direction here. Rewrite the left-hand side of G.2.5) as 1 1 Jo e-xn\-s) e-x/a-t) dx- \-r \-s \~t Recall the generating function for the Laguerre polynomials L"(x), n=0 Then
366 7 Topics in Orthogonal Polynomials Thus G.2.6) is equal to V" / Lk{x)Lm(x)Ln{x)e-^dxrksmtn Jo E k,m,n and /•oo ,m,n)= I Lk(x)Lm(x)Ln(x)e~3xdx. Jo A more general situation can be treated in a similar way. Note that if f(x) (x - r)(x -s)(x- t) then the left-hand side of G.2.5) is 1//'A). Write k,m,n=0 so that 1 f°° Aa(k,m,n)= / Lak(x)Lam(x)L°(x)xae-3xdx. G.2.7) l (a + i) Jo Theorem 7.2.2 Fora > -1/2, Aa(/t, m, n) > 0. For a > 0, йе inequality is strict, that is, Aa{k, m,n) > 0. Proo/ In Chapter 6, we computed integrals of the products of three Hermite or three ultraspherical polynomials. This also gave their nonnegativity. These integrals were obtained from corresponding linearization formulas. That method does not work here. But recall that Thus it is reasonable to consider the positivity of the integral a+j)(x)(l -x)a(l +х)а+ъЫх. G.2.8) /•l / We already know that / Pt'a)(x)P^a)(x)P^a)(x)(\-x2)adx >0 fora >-1/2. G.2.9) The question is: Can one increase the second parameter f$ in P[, \x) and still retain positivity in G.2.8)? In fact, we have the formula ) 2{n +1) p{a^(x^ i 2(n + ^ +1} p(a-e)() G.2.10) Verify this by noting that the right side vanishes when x — — 1 and that Jacobi polynomials are orthogonal. Since the coefficients in G.2.10) are positive, we get
7.2 Rational Functions with Positive Power Series Coefficients 367 the nonnegativity of G.2.8) from G.2.9) and G.2.10). This in turn implies the nonnegativity of Aa(k, m, n) fora > —1/2. The strict positivity for a > 0 comes from l Г l This implies ^oo [Г((а + l)/2)]2 / Lak(x)Lam(x)Lan(x)xae-3*dx Jo к т п = Г (a + 1) 53 53 YlI(k ~ a' m ~ b> n ~ c)/(a' fo' c)' G-2л!) a=0 6=0 c=0 where (i, j, k) = / Lf-1)/2(x)Lf - Jo When a > 0, all terms in G.2.11) are nonnegative. So it is enough to find one strictly positive term. The positivity of the term a=k,b = m,c = 0 follows from the next lemma, which proves the theorem. ¦ Lemma 7.2.3 For a > — 1, e > 0, we have (¦OO e-"Lan{x)Lam{x)xae-xdx > 0. G.2.12) Proof. Consider the generating function Jo = Jo From the last expression it is clear that if 0 <e < 1, then the coefficient of r"sm is positive. The result may be extended to larger values of e by iteration as follows. Since e~(XL"(x) is smooth and integrable, we can expand it in terms of Laguerre polynomials. Let k=0
368 7 Topics in Orthogonal Polynomials For 0 < € < 1, Ct(e) > 0 by our previous remarks. Now k=0 So e 2exL"(x) can be written as a sum with positive coefficients. Iteration of this process completes the proof of the lemma. Another proof is in Exercises 6 and 7. ¦ A consequence of Lemma 7.2.3 is the next result. The proof is left to the reader. Corollary 7.2.4 Let a > — 1, and suppose that f(x) is represented by its Laguerre—Fourier expansion. Suppose also that the coefficients of the expansion are positive, that is, poo an= f(x)Lan(x)xae-xdx>0, п = 0,1,.... Jo Then /•oo an(€)= f(x)e-"Lan(x)xae~xdx>0, n = 0, 1, 2,..., e > 0, Jo unless f(x) = 0, x > 0. Theorem 7.2.2 is surprising when considered from a different point of view, as we shall see below. Consider the following result of Sarmanov [1968]. Theorem 7.2.5 If oo f(x, y) = J2anK(x)Lan(y)/Lan@) > 0, 0 < x, у < oo, G.2.13) n=0 then an= f rn Jo where dfi(r) is a positive measure. In fact, the positivity of G.2.13) for an = r", 0 < r < 1, is a consequence of F.2.28). Now Laguerre polynomials satisfy a differential equation in x and a difference equation in и. It frequently happens that a dual result can be obtained by interchanging n and x. For example, Lemma 7.2.3 is the dual of the positivity of G.2.13) when an = rn. A dual of Szego's positive integral G.2.7) would be z) > 0, 0 < *, j, z < oc, n=0
7.2 Rational Functions with Positive Power Series Coefficients for some sequence а„. However, Theorem 7.2.5 shows that 369 anLan(z)Lan(O) = / r"dn(r,z), Jo with dfi(r, z) > 0 for all z, 0 < г < oo. This is possible only when dfi(r, z) is a point mass at r = 0, and when ao = с > 0, а„ = 0, и = 1, 2,..., since ?"(г) is negative for some z > 0. Sarmanov's paper contains aproof of Theorem 7.2.5. Another proof that makes more explicit use of special functions is in Askey [1970]. There are some extensions of Theorem 7.2.2 for some a. Theorem 7.2.6 If a > a0 = (-5 + ч/Г7)/2, then 3 xaLak(x)Lam(x)Lan(x)e-2xdx > 0, a > a0, / Jo k,m,n = 0, 1,.... The only case of equality occurs when к = m = n — 1 a = ao. For a proof, see Askey and Gasper [1977]. The case a = 0, 1,... is outlined in Exercise 10. The fact that Theorem 7.2.6 implies Theorem 7.2.2 when a > ao follows from Lemma 7.2.3. Theorem 7.2.7 IfO < a < 1, a + b = 1, and a > 0, then xaLak{ax)Lam{bx)Lan(x)e-xdx > 0, Г Jo k,m,n = 0, 1,.... For a proof, see Koornwinder [1978]. We sketch a proof of Koornwinder's inequality in Theorem 7.2.7 for nonnegative integer values of a after a discussion of MacMahon's Master Theorem. The theorem of MacMahon [1917-1918, pp. 93-98], known as the Master Theorem, makes it possible to give combinatorial interpretations of coefficients of series expansions of rational functions in several variables. MacMahon's Master Theorem can be stated as follows: Suppose an - \jx\ an Я21 a22 - l/x2 a\n а„„ - l/xn
370 7 Topics in Orthogonal Polynomials Then the coefficient of xx coefficient of the same term in k2 x2 xk" in the expansion of 1 / Vn is the same as the (aux\-\ 1- alnxn)kl • ¦ ¦ (я„1*Н h annxn)k". As an application of this theorem, consider the following example. An easy calculation shows that 1 s+t) + -rst = -rst 1/2-1/r -1/2 -1/2 -1/2 1/2 - 1/s -1/2 -1/2 -1/2 1/2 -l/t The Master Theorem implies that the coefficient of rksmt" in the series expan- expansion [1 - (r + s + 0/2 + rst/2\~l is the same as the coefficient of rksmt" in (r-s-t)k(-r+ s-t)m(-r -s + t)n/2k+m+n. The combinatorial interpretation of this result is as follows. Take three boxes with k, m, and и distinguishable objects in them. Rearrange these objects among the boxes so that the num- number of objects in each box remains the same. Then the coefficient of rksmt" in (r — s — t)k(—r+s — t)m (—r — s + t)" represents the number of rearrangements where an even number of objects has been moved from one box to a different box minus the number of rearrangements where an odd number of objects has been moved to a different box. By Exercise 10 (where this coefficient has been obtained as an integral of a product of three Laguerre polynomials), this coefficient must be positive. Thus, we see that in the above combinatorial situation, the number of "even" rearrangements exceed the number of "odd" rearrangements. As another application of the Master Theorem, we outline Ismail and Tamhankar's [1979] proof of Koorwinder's result in Theorem 7.2.7 for a = 0, 1,2,.... Let poo Ba(k, m,n)= / Lak{x)Lam((\ - k)x)Lan{kx)xae-*dx. Jo A simple calculation, using the generating function for Laguerre polynomials, shows that Ba(k m n)rksmt" - kmn - О " *)Г - ^ - krt - A - k)St + rst]«+l ¦ Since a is a nonnegative integer, B°(k, m, n) > 0 implies that Ba(k, m, n) > 0. So we take a = 0. To apply the Master Theorem observe that 1 - A - k)r - ks - krt - A - k)st + rst = -rst
7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 371 By the Master Theorem, B°(k, m, n) is the coefficient of rksmtn in [A — k)r — \/k(l — k)s — Vkt]k[— -\AA — k)r + ks - Vl -kt]m[-Vkr - Vl -ks]n. By applications of the binomial theorem, Ismail and Tamhankar show that В (к, w, ft) = к A — A) -I 2 > 0. G.2.14) This proves Koomwinder's theorem for nonnegative integer values of a. Note that Koomwinder's inequality implies that for 0 < к < 1, a > 0, m+n Lam{kx)Lan({\ - k)x) = with ак,т,„ > 0. This relation can be iterated to give G.2.15) G.2.16) k=0 with ak > 0 when a > 0, J]/=i A.,- = 1, and A, > 0, i = 1, 2,..., j. Several proofs of the Master Theorem now exist. Perhaps the proof that best explains its combinatorial significance is due to Foata [1965]. This proof was later simplified by Cartier and Foata [1969]. A readily accessible treatment of this argument is given by Brualdi and Ryser [1991]. A short proof using a multiple complex variables integral was given by Good [1962]. 7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality Fejer used the inequality ^ 1 - cos(n + 1N» sin2((n + 1N»/2) 1/2H= n .\Z = ¦ „ 1 ' > 0, 2 sin в /2 sin6»/2 to prove that the Fourier series of a continuous function is (C, 1) summable to the function. This inequality can be expressed as ~ - -
372 7 Topics in Orthogonal Polynomials In Section 6.7, we saw that a similar inequality holds for Legendre polynomials, that is, k=0 k=0 Fejer used this to study the summability of a series of spherical functions. He also conjectured that G.3.3) These sums are partial sums of the Fourier series Л - which was studied because it illustrates the Gibbs phenomenon. It is possible that the graphs of the partial sums suggested the conjecture to Fejer. We can write G.3.3) as E pA/2.,/2)m >°> 0<в<я. G.3.4) The earliest proofs of G.3.3) are due to Jackson [1911] and Gronwall [1912]. Recall that G.3.2) was obtained from G.3.1) by using Mehler's integral in Section 6.7. There are other integrals in that section that give extensions to sums involving It is possible to obtain positive sums with terms of either the form \\) or P^\x)/P^'a\\). Without some applications in mind, it is difficult to determine which extension is going to be useful. The inequality in G.3.1) suggests that sums of P^a-P)(x)/P^-a)(I) may be important. Here we consider a problem in quadrature that provides some confirmation of this. Let {Pn(x)} be a sequence of polynomials orthonormal with respect to the distribution da(x) on (a, b). As in Gauss quadrature discussed in Chapter 5, interpolation is done at the zeros of the polynomials Р„{х), but now the integration may be with respect to a different distribution. Let xv,v = 1,..., n, denote the zeros of Pn{x). Let f{x) be a continuous function and let the interpolation polynomial be given by E, . , ., . V"^ Pn(x)f(xv) tv{x)f(xv) := ^ • G-3.5) v=l v=l r"(Xv)(X Xv)
7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 373 Then we have the approximate formula where I ^ = jb Pn(xW(x) Ja P^(XV)(X-XV = [" Ja v=l G.3.6) G.3.7) Pn(x)Pn+i(xv) - Pn(xv)Pn+1(x) Pn+l(xv)P^(xv)(x-xv) dp(x) К ±i к Р' 1 rb " - I / J2 'n(xv) Pn+\(XV) Ja k=Q f2Pk(xv)Pk(x)dp(x). Here kn is the coefficient of x" in Pn{x) and the last equation follows from the Christoffel-Darboux formula (Theorem 5.2.4). Now write k=0 Pk(x) Then K(xv) К P;(xv)Pn+l(xv) G.3.8) G.3.9) If Ли is positive, then it can be shown that the sum on the right-hand side of G.3.6) converges to the integral as n —> oo. The proof given for Gaussian quadrature in Chapter 5 works here as well. Take the case in which Pn(x) = Р^ю(х), dfi(x) = dx, and (a, b) = (-1, 1). Then K(x) = k=0 РГ'р'(х) / Pp Ни where h"'p is given by G.1.4). Write PJ;a'p\t) in hypergeometric form and inte- integrate term by term to get -k- l,k+a+p
374 7 Topics in Orthogonal Polynomials This 2F\ can be evaluated by the Pfaff-Saalschiitz identity (Theorem 2.2.6). The result is A)Г(* + ?+1I (M Г(р)Г(к + 2) \ к This sum is intractable when written as E^k k p(aj k=0 Гк It can, however, be written in the form [ P rk G.3.10) Assume kn > 0 in G.3.9). By Corollary 5.2.6, it follows that Thus to show that Xv > 0 in G.3.9), it is sufficient to prove that ВД>0, -1<дс<1. It is easy to check that Bk + a + p + 1)Г(к + a + p) Ck = (кП)\ satisfies 0 < ck+i < Ck when 0 < a + p < 1. So the nonnegativity of Kn(x) in G.3.10) will follow for a, P>0,a+p<l upon summation by parts, provided that ^>0, -1<,<1. G.3.11) k=0 Гк У1) Observe that D^'p)(—l) = 0 when n is odd. Thus G.3.11) is sharp in the sense that equality holds at some point in [—1, 1] for infinitely many values of и. Proof of the inequality G.3.11) for some values of (a, f}) is given in the next section. Now suppose that dp(x) = A - x)a-y(l + x)P~sdx. We shall look at kv for some specific a, fi, y, 8. When a = 1/2, fi = —1/2, у = 1, S — 0, the positivity
7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 375 of Xv, using the expression G.3.9), reduces to the positivity of the sum 1/2H. The two cases or = 0 =-1/2, у = 1/4, S = -1/4 and a = p = 1/2, у = 3/4, S = 1/4 lead to the respective sums cos kx, \c* sin fct, G.3.12) k=0 k=l where ^^, * = 0,1,2,.... G.3.13) Vietoris [1958] proved the strict positivity of these sums for 0 < x < n. A proof of these inequalities is given below. The inequality > 0, 0<x<7t, G.3.14) 4=1 extends the inequality G.3.4) of Jackson and Gronwall. To see this let 9 -*¦ ж in G.3.4). The result is which vanishes when n is even. So it might appear that the inequality cannot be improved. Now suppose that all we assume about the series in G.3.14) is that 1 = c\ > c2 > съ > • • •. Divide the series G.3.14) by sinx and let x -* n. We obtain 1 - 2c2 + 3c3 - 4c4 + • • • + (-1Г+1исй. G.3.15) For nonnegativity of this series for all n, we require that c2 < 1/2. Take the largest value of c2 = 1/2. Then сз < c2 = 1/2 and the largest value of сз = 1/2. With these values of c\,c2, and c3, we have 4c4 < 3/2 or c4 < 3/8. So take c4 = 3/8. If we continue in this manner, we get the sequence q as defined in G.3.13). As a first step in the proof of Vietoris's inequality, we show that the two sums in G.3.12) are the partial sums of a Fourier series just as G.3.3) is.
376 7 Topics in Orthogonal Polynomials Proposition 7.3.1 Ifck is the sequence defined by G.3.13), then oo oo , j ч 1/2 ck ik V^ k I t(/2) oo , j ч 1/2 = V^ ck coskx = I - cot(;t/2) 1 for 0 < x < 7Г. G.3.16) 4=1 Proof. For \z\ < 1, z ф 0, we have Jt=O It follows that _ckz, |z|<l,z#±l. k=0 Setz — elx,0<x < n, and take the real and imaginary parts to get the result of the proposition. ¦ Proposition 7.3.2 For m>\, G.3.17) \ m 1 ¦Jinn Proof. Set '2m \ 2^\m)- It is easily seen that am < am+i for m > 1. Now observe that lim am = lim m—>oo /n—>oo 1 1 The proof of the next proposition is from Brown and Hewitt [1984]. Proposition 7.3.3 For ck defined by G.3.13), we have > Vsin0-2cn+b G.3.18) 4=0 n 2 sin@/2) Y^ ck sinkO > Vsin0 - 2с„+ь G.3.19) 4=1
7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 377 Proof. Observe that for m > n m m 2sin@/2) Y ckcosk0= Y c*[sin(fc + 1/2H - sin(fc - 1/2H] k=n+l k=n+\ m-\ = —cn+i sin(n + 1/2H + VJ (ck — Ck+i) sin(k + 1/2H k=n+l + cm sin(m + 1/2H < cn+i(l - sin(« + 1/2H) - cm(l - sin(m + 1/2H) < 2cn+i. By Proposition 7.3.1, 00 Vsin0 = 2 sin @/2) Y ck COS ke k=Q = 2sm@/2) > acosA:0+ > ctcoskO \ \ k=0 k=n+l / < 2sin@/2J_,ckcoskO + 2cn+i- k=0 This proves G.3.18), and the proof of G.3.19) is similar. The proposition is proved. ¦ We are now in a position to prove Vietoris's inequalities, which are explicitly stated in the next theorem. Theorem 7.3.4 If /2k C2k = C2k+l = Тп then and rn(x) = Nc* cos fct > 0, 0 < x < n. G.3.21) k=Q Proof Consider G.3.20) first. The result is clearly true for n = 1. So let n > 2. We need a separate argument for each of the three intervals: 0 < x < n/n,n/n < x < 7Г — n/n, and 7Г — n/n < x < ж. an{x) = 2_,ck sin/be > 0, 0 < x < jz, G.3.20) k=l
378 7 Topics in Orthogonal Polynomials The positivity of an (x) for 0 < x < л/п is obvious since each term in the sum is nonnegative and the first term is strictly positive. When л — л/п < х < л, set x = л — у so that 0 < у < л/п. Suppose n is even, say n = 2m. Then 2m m Gn(x) — /_](— l)*~'cjt sin^y = y_4c2,t_i sinB/c — l)y — сг* sin2/cy] /t=l k=l [sinBk — l)y sin2ky~\ . *-i -I The last term in square brackets is positive because sin /// is decreasing in @, л] and 2ky < 2my = ny < л. So <т„(х) > 0. When n is odd there is an extra term in the sum, cn = sinwy, which is positive for 0 < у < л/п. Thus an(x) > 0 whether n is even or odd. Now note that sin и > и — м3/6. For the interval л/п < x < л — л/п, which is nontrivial for n > 3, we then have sinx > sinGr/«) > (л/п)(\ — л2/6п2). By G.3.19) 2sin(x/2)<Tn(x) > [Gг/и)A - тг2/6и2)]1/2 - 2сй+1. G.3.22) An easy calculation shows that the term in square brackets decreases for n > л /а/2. So for n > 3, and by the definition of cn, the right-hand side of G.3.22) is positive for n = 2m — 1, if it is positive for n = 2m. For the latter value of n, G.3.17) implies that the right-hand side of G.3.22) is at least equal to -тг2/24/и2I/2-2л/2]. A simple computation shows that this expression is positive for m > 2. This proves the inequality in G.3.20). The inequality G.3.21) is clearly true for n = 0 and 1. Moreover, rj(x) = - cos 2л: + cosx + 1 7 1 ( П2 1 = COS X + COS X + - = I COS X + - I + - > 0. Assume that n > 3. For 0 < x < л/п, и n У kck sin kx < 0, 0 < x < л/п. dx ti
7.3 Positive Polynomial Sums from Quadrature and Vietoris's Inequality 379 So rn(x) is decreasing in 0 < * < n/n, and its value at n/n is positive. Note that [n/2] rn(n/n) = - cn-k)cos(kn/n) > 01. Thus rn(x) > 0 for 0 < x < n/n. Now let ж — n/(n + 1) < x < ж and set у = ж — x so that c2k[cos2ky - cosB? en, /t=0 where en = 0 if и = 2m — 1 and en = съп cos2my if и = 2ra. The expression in the sum is positive because cos* is decreasing in 0 < x < n. This implies that for n = 2m - 1, rn(x) > 0 for 0 < у < ж /п. When n — 2m, we have rn(x) > cosy + cos2y — cos3y + • • ¦ + cos2my) cos 2* + • • • = c2m Re iBm+l)x _ - e~ix'2 sin(w + 1/2)* cos mx ~C2m sin(*/2) cos(m + l/2)ycoswy ~ C2m cos(y/2) It follows that rn(x) > OforO < (m+l/2)y < n/2, that is, for 0 < у < п/(п + \). The rest of the argument can be completed as before. Suppose that n > 3 and n/(n+ 1) < * < ж —n/(n + l). As in the case of an(x) опж/п < * < ж —n/n, it is sufficient to show that 1/2 1 - ,. " ... ) - 2cn Ж n + l 6(n + IJ +i 0. Again it suffices to consider even values of n, say n = 2m. The inequality can be directly checked for m = 2 and 3. For m > 4, apply G.3.17) to see that the following inequality is stronger: ж 2m + 1 6Bm + IJ /nm This is true for m = 4, and when the left side is multiplied by фп, it is an increasing function of m. Thus the inequality holds form > 4, and the theorem is proved. ¦
380 7 Topics in Orthogonal Polynomials The next theorem, which is apparently a generalization of Theorem 7.3.4, is in fact equivalent to it. It is also due to Vietoris. Theorem 7.3.5 Ifa0 > fli > • • • > an > 0 and 2ka2k < Bk - 1)а2цД > 1, then sn(x) = ]Pfl* sinkx > 0, 0 < x < л, G.3.23) k=\ and n tn(x) = Y^ak coskx > 0, 0 < x < л. G.3.24) Proof. For ck as defined in Theorem 7.3.4, let a* = ckdk. Then d0 > d\ > d2 > ¦ ¦ ¦ > dn > 0 and summation by parts gives sn(x) ~ k=l n-\ = ^2(dk - dk+i)ak(x) + dnan(x) > 0, 0 < x < л, by G.3.20). This proves the theorem since G.3.24) can be done in a similar way. ¦ The Jackson-Gronwall inequality is a consequence of G.3.23). Just take ak = 1/k. There is a nice application of these inequalities of Vietoris to the problem of finding sufficient conditions on the coefficients of trigonometric polynomials to force all the zeros to be real, and then also yields information about the distribution of these zeros. Szego [1936] proved the following theorem. Theorem 7.3.6 If Xo > Xi > X2 > ¦ ¦ ¦ > Xn > 0, andsk and tk denote the zeros of р(в) = k=Q and n-l k=l
7.4 Positive Polynomial Sums and the Bieberbach Conjecture 381 respectively, with their order such that they are increasing in size on (О, л), then = 1,...,и G.3.25) ±J I \ z/ V <V / \ z/ [п + -], к = кп Пп+-\ <tk <(k+l)nU + -\, k=l,...,n-l. G.3.26) If the Xk are not only increasing but satisfy the following convexity-type condi- condition: 2Xq — X{ > X\ — X2 > X2 — ^.з^'-'^ Xn— i — Xn > Лл > О, G.3.27) then the right-hand sides of G.3.25) and G.3.26) can be replaced by kn/n and (k + \/2)n/n respectively. The Vietoris inequalities can be used to obtain two other trigonometric inequal- inequalities. See Exercise 17. These two inequalities along with the conditions Bk - 1)Л*_1 > 2kXk > 0, ?=1,2,..., G.3.28) lead to the following different improvements in G.3.25) and G.3.26): k = l,...,n, G.3.29) kn^n + -)<tk<(k+-jn/\[n+-), k=l,...n-L G.3.30) For a proof of these inequalities see Askey and Steinig [1974]. 7.4 Positive Polynomial Sums and the Bieberbach Conjecture In the previous section, we saw the significance of showing the positivity of the sums The positivity of some of these sums has turned out to be important. We illustrate this for some specific a and p, though much more is known. For more information, see Askey [1975]. The strict positivity of G.4.1) for a = fi = 0, — 1 < x < 1 was proved in Chapter 6, Section 6.7. This implies, after summation by parts, that for Legendre
382 7 Topics in Orthogonal Polynomials polynomials Pk(x), G.4.2) when ak > пк+\ > 0, uq > 0,k = 0, 1,..., n — 1. The next result is due to Feldheim [1963] and gives the positivity of G.4.1) when a = fi > 0. Theorem 7.4.1 For 0 < в < n and v > 1/2, Proof. By the Feldheim-Vilenkin integral (Corollary 6.7.3), we have for v > 1 /2, ^ 2Г(у + 1/2) Г/2 . 2v_2 Take ak = [1 - sin2 0 cos2 0]*/2. Then ak > a^+i > 0 and aQ = 1. So by G.4.2) the integral is positive and the theorem is proved. ¦ The Jackson-Gronwall inequality E sinkO > 0, 0 < в < тг, G.3.3) is a corollary of Theorem 7.4.1 when v = 1. The positivity of G.4.1) for fi = 0 and a = 0, 1, 2,... was needed in the first proof of the Bieberbach conjecture on univalent functions. See de Branges [1985]. More generally, take a > — 1. Since Pn(Oa)(l) = 1, the sum we are interested in is k{afi\x), a>-\. G.4.4) The first step in the proof, due to Gasper, of the positivity of G.4.4) is to express
7.4 Positive Polynomial Sums and the Bieberbach Conjecture 383 it as a hypergeometric series. Thus (-\У((\-х)/2у j^. (k-j)l (a + lJj((x - l)/2)J yl (a + 2y + 1), Since the inner sum is (a + 2j + 2)n_j (n — j)\ we have + 2)„ л,„+о + 2,(а+1)/2 1д: ^ ;G-4-5) n\ *\ (a + 3)/2,a+l ' 2 There is a formula of Clausen that gives the square of a iF\ as a 3 F2 (see Exercise 3.17). The formula is ЗГ21 - ¦ '- ¦ 1/2,2a This 3F2 is nonnegative because it is a square. The 3F2 in G.4.5) is fairly close to this but different in one numerator and one denominator parameter. We have seen before that by fractional integration of a pFq, it is possible to get a p+\ Fq+\ with the necessary extra parameters. So use formula B.2.2) to write the 3F2 in G.4.5) as / —n, n + a + 2, (a + iF2\ (a + 3)/2,a+l ; st G.4.7) for a > — 1. The 2F1 in the integral is really the ultraspherical polynomial CaJ2(l-2st)/CaJ2(l). This has zeros in @, 1) and so we do not have a positive integrand in G.4.7). To get an idea about what should be done, write the 3F2 in Clausen's formula with
384 7 Topics in Orthogonal Polynomials 2a = -k, 2b = к + a + 1. Then Г (а + 1) G.4.8) The proof would be complete if it is possible to write C2 (x) in terms of C(" ~1)/2 (x) using a positive coefficient. However, this is obtainable from the connection co- coefficient formula G.1.9). Thus we have proved the next theorem. Theorem 7.4.2 Fora > -1, P^0)(x) > 0, -1 < x < 1. G.4.9) *=o The integral in Theorem 6.7.2(b) can be applied to G.4.9) to give the positivity of G.4.1) for -1 < x < 1 when 0 > 0, a + 0 > -1. For a > -0, -1/2 < 0 < 0, Gasper [1977] has shown the positivity for —1 < x < 1, except when a = —0 = —1/2, in which case this sum reduces to G.3.1), when there are cases of equality as well as nonnegativity. 7.5 A Theorem of 1\iran In the last section we proved the Jackson-Gronwall inequality. There is a theorem of Turan [1952] that shows another way of doing this. Theorem 7.5.1 IfY^jLo \aj I converges and a,sin(y + 1/2H >0, 0<ф<п, G.5.1) > 0, 0 < в < it, G.5.2) then ^ sing + pa 7=0 J unless aj = 0, j = 0, 1, 2, Proof. In the integral formula given by Theorem 6.7.2(d), take a = 1/2, 0 = — 1/2, and (i=l. This gives sin(n + 1N» [*/г sinBn 2(и + l)(sin@/2)J"+2 "
7.5 A Theorem of Turan 385 Multiply by 2an(sin@/2)Jn+2 and sum to obtain sin(n + \)в fn/1 y> /sin@/2)\2n+2 an sin(n + \/2){2ф)<1ф Write the integrand as sin Ф „=o sin2 The strict positivity of this sum for \в < ф < \ж follows from the fact that n=0 2 Z171 = - / G.5.4) This formula is obtained by using the orthogonality of the sine function. Now note that the closed form of the Poisson kernel, rnsin(n + - Usin(/i + -W n=0 \ / \ / A — r)sin i^sin |V [0 ~ rJ + 4r(l — cos \{ф + if)cos \{ф — if))] [l — 2r cos i(<? + VO + r2] [l — 2r cos |@ — if) + r2] G.5.5) gives its strict positivity for 0 < r < 1. This and G.5.1) make the integrand in G.5.4) nonnegative, and the theorem is proved. ¦ There is a generalization of G.5.5) to Jacobi polynomials due to Watson. The result states that n + 2)/2)m+n((a + fi + 3)/2)m+n (a + l)m(/3+ l Dr sin2 ф sin2 в)тDг cos2 ф cos2 0)" A+rJm+2n •
386 7 Topics in Orthogonal Polynomials where h^ is given by G.1.4). We base our proof on a result of Bailey. For the result, see Bailey [1935, p. 81] and for the reference to Watson see Bailey, p. 102, Example 19. Bailey's formula is the following: 2Fi(a,/3;]/;*JFi(«,/?;« +/3 + 1 -y\y) = y- (a)m+n(fi)m+n[x(l - y)T[y(\ -x)T tt (y)m(a + p + I - y)nmln\ We sketch a proof of G.5.7) leaving some details to the reader. Start with the double series h G.5.8) Expand in powers of s and t and show that the coefficient of smt" is - /)mO + I ~ У)п(У ~ l)»-n When y + )/' = a + j6 + l, the last factor in the numerator cancels the last factor in the denominator and so G.5.8) is equal to i* rn\n\(y)m(y')n = 2Fi(a, Y-P;r, sJFi(P, y' - a; y'\ t) , P; y; -s/(\ - j)JFi(a, /3; /'; - The last step follows from Pfaff's transformation given in Theorem 2.2.5. This proves G.5.7). The left side of G.5.6) can be written as rk. G.5.9) First consider the simpler series Replace the Jacobi polynomials by their hypergeometric representations and apply
7.5 A Theorem of Turan 387 G.5.7) to get к -k,k+a+P + l 2 p+i ;cos + p + i). (-k)m^,(k +a + fi + l)m+n к[ tT ( + DO + D (sin 0 sin 0Jm (cos 0 cos 0) OS20)" + y, (q + ^ + l)^+2m+2n m+n y, (q + ^ + l)^+2m+2n . (r sin2 в sin2 0)m (r cos2 в cos2 0)" ¦ (a + ^ + lJm+2n(l + r)-2"-2"—"-1. Now multiply G.5.10) and the last expression by r(a+^+D/2 and take the derivative. This introduces the factor Bk + a + fi + 1) needed on the left side of G.5.9). The right side of Watson's formula follows after an easy calculation. This proves G.5.6). Theorem 7.5.2 If p > a > -1 and Yak \ ; ' >0, -\<x<\, unless cik = 0, & = 0, 1,..., n. Proof. As in the proof of Theorem 7.4.1, use the Feldheim-Vilenkin formula to get •л/2 2Г(/} + ° Г si -a)r(a + l)J0 з(",а)/ /,,1 -2 п „„„2 J,\ — 1 ¦ 2O 2 ,lk/2Pk (COS в (I -Sin2 в COS2 sin20cos2^2^ —— k=o rk
388 7 Topics in Orthogonal Polynomials Let r = A - sin2 в cos2 ф)^2 and и = cos6A - sin2 в cos2 0)~1/2. Then for 0 < 0 < л and 0 < ф < л/2, we have 0 < r < 1 and |w| < 1. We can conclude from G.5.6) that the sum inside the integral is strictly positive unless f(x) = 0. We know this because ёг^л hf Now observe that orthogonality of the Jacobi polynomials implies that f(x) = 0 for -1 < x < 1 if and only if a* = 0, /fc = 0, 1,2,..., и. This proves the theorem. ¦ 7.6 Positive Summability of Ultraspherical Polynomials The (C, 1) means of the formal series 1+2 J2b=i cosk6 are v^ / к \ 1 fsini(n+lH) » =1 + 2? 1 cos?<9 = I 2 . } > 0. G.6.1) Fejer used this positivity to prove the (C, 1) summability of the Fourier series of a continuous function. The generating function for the sequence <7n' in G.6.1) is A — r)~2( l + 2Vcosn<9 rn]=—— r. G.6.2) v \ ^ ) \-r 1-2rcos0+r2 The last equality follows from E.1.16). Now observe that , n > 0, 1, и = О, and the generating function for {^^C^cos в)} is ^ ^—Cnv (cos 0)r" = — Yl^I- G.6.3) This follows from the generating function for ultraspherical polynomials given in Chapter 6, namely, v СЦсо%в)гп = -— 1 G.6.4) ' A -2rcos# + Г2)" n=0 Kogbetliantz [1924] proved the following generalization of G.6.1). For a discus- discussion of Cesaro summability of infinite series, see Appendix B.
7.6 Positive Summability of Ultraspherical Polynomials 389 Theorem 7.6.1 The (C, 2v + 1) means of the formal series -Cv(x) -1 < x < 1 v > О are positive. That is, , (k -I- d) -C(jc)>0, -1<jc<1,v>0. G.6.5) The proof depends on the following lemma. Definition 7.6.2 A function is called absolutely monotonic if its power series has nonnegative coefficients. Lemma 7.6.3 Thefunction 1/[A —rJv(l —2xr+r2)v] is absolutely monotonic for-I <x < 1. Proof. Denote the function by [g(r)]v and let x = cos в. Then A(r) :=log*(r) = -21og(l -r) -log(l -rew) - log(l - re"'6*) 2cosn#) n n=l Thus A (r) and 00 vn[h(r)]n „=o и! are absolutely monotonic. ¦ Proof of Theorem 7.6.1 The generating function for G.6.5) is 1-r2 1-r2 1 A - rJ(l - Ixr + r2) A - rJv(l -2xr+ r2)v' The first factor is absolutely monotonic by G.6.1) and G.6.2). The second factor is absolutely monotonic by Lemma 7.6.3. This proves the inequality G.6.5) and the theorem. ¦ The lemma has the following corollary. Corollary 7.6.4 For v > 0, -1 < x < 1,
390 7 Topics in Orthogonal Polynomials Proof. This follows from G.6.4) and Lemma 7.6.3. ¦ Similar results have been obtained for Jacobi series. The results are very im- important, but the proofs involve quite complicated formulas. See Gasper [1977] for many of these inequalities. For example, Gasper proved the following extension of the inequality in Exercise 14: f(--y _l<x< 0<A.<a + /6 and «>?>() or « + ?>() and ?> -1/2. Proofs of this inequality use many of the formulas given here for hypergeometric series, as well as some others. Note the particular case A. = 0: G.6.8) for a + fi > 0, fi > —1/2. Inequality is strict in -1 < x < 1, when a = 1/2, fi = —1/2 are excluded. These inequalities have implications for Bessel functions. Using Theorem 5.11.6, it is easy to see that for a > fi - 1 ton - X] * (/,) = 2"г^ + !> / *~PJ«(t)dt. G.6.9) f0 / The condition a > ft — 1 is needed for convergence of the integral at t = 0, but it is not needed for the sum. In an appendix to the paper of Feldheim [1963], Szego considered the limit when a = /3, and also the resulting integral. He showed that / raJa(t)dt>0, x>0 G.6.10) Jo when a > a and when a is the solution of ГЛ..2 Г" Ja(t)dt = 0 G.6.11) with ja<2 the second zero of Ja(t). A similar result holds for rpJa(t)dt, -1<а<1/2. Jo /0 From G.6.8), we have Г t~fi Ja(t)dt > 0, x>0 G.6.12) /o when a + ? > 0, -1/2 < /3 < 0. In fact, G.6.8) also holds when a + /? > -1, /3 > 0. Thus G.6.12) holds when a > /S - 1 and /3 > 0. Gasper [1975] has
7.7 The Irrationality of t, C) 391 shown that the inequality G.6.12) is strict for these a, ft except when a = 1/2, /5 = -1/2. Gasper [1975] derived the following interesting identity when /J = —1/2 and a > -3/2 in G.6.12): 1/2j (t)dl _ Lr(f + 4)] 2a+1x G.6.13) This series is clearly positive when a > 1/2, nonnegative when a = 1/2, and negative when x is a zero of JBa+i)/4(x/2), where —3/2 < a < 1/2. Gasper also has a similar result for f (x- t)a'l/2taja(t)dt, a > -1/2. Jo See Exercise 25. This and G.6.13) are derived by using the identity Y2(v + lJ2v+1 ^ (n + v)r(n +2v) x = > /„.„(*)¦ G.6.14) See Watson [1944, §5.1] for G.6.14). An approach to G.6.12) via differential equations can be found in Makai [1974]. For more references, see Askey [1975] or Gasper [1975]. 7.7 The Irrationality of <C) The topic of this section is unrelated to the earlier parts of this chapter. However, it involves an interesting application of Legendre polynomials and so has been included here. In Chapter 1, we gave Euler's formula for ?Bn) when n is an integer >1. This showed that ?B«) is irrational. In spite of repeated attempts, Euler was unable to evaluate ?B« + 1). Later mathematicians have not had better luck. It was only as recently as 1978 that R. Apery proved that ?C) is irrational. A simpler proof using Legendre polynomials was given by Beukers [1979]. We follow Beukers's exposition. The basic lemma is the following: Lemma 7.7.1 There exist two sequences of integers {An} and {Bn} such that 0 < \An + BBf C)| < 3(9/10)". G.7.1) This immediately implies the theorem.
392 7 Topics in Orthogonal Polynomials Theorem 7.7.2 ?C) is irrational. Proof. If f C) = p/q, where p and q are integers, then the sequence of nonzero rational numbers \An + В„?C)| > \/q. The second inequality in G.7.1), how- however, implies that this sequence becomes arbitrarily small as n increases. This contradiction implies that ?C) is irrational. ¦ The proof of Lemma 7.7.1 depends on the following lemmas. Lemma 7.7.3 For nonnegative integers r ands, /7/^(ш)Ер) r=s o Jo ^ху { frtk ) = rational number whose denominator divides df, where dr = lcm(l, 2,..., r), when r > s. Proof. Note that for о > 0 and r > s к к l-xy У f^{k + r+o + \){k -s \k+s + a + 1 k + r+a + G.7.2) r—s{s + l+a r + а Differentiate with respect to a and then set a = 0 to get 1*1 /*i \nn v,, л ( \ j The second equality of the lemma follows from this. Now take r = s in the first equation of G.7.2) and again differentiate with respect to a and set a = 0. The result is ri /.l f1 f'logxy^ / / x у dxdy = к Jo i-xy ¦" ¦" This is the first equation of the lemma, which is now completely proved. For the next lemma, let Рп{х)=-.~{ХП(\-Х)п}, n\ dxn which is essentially the Legendre polynomial on @,1).
7.7 The Irrationality of t, C) 393 Lemma 7.7.4 There exist integers An and Bn such that Jo Jo 1 - xy Here dn = lcm(l, 2,..., и). Proof. Since pn (x) is a polynomial with integer coefficients, the equality follows from Lemma 7.7.3. Now observe that ~ i = / ~i 7i z~dz. L-Xy Jo 1-A-. Then the integral in the lemma can be written as / / / 1 7, —,dxdydz- Jo Jo Jo l ~ Integrate by parts n times with respect to x to see that this integral is equal to dxdydz. fff Jo Jo Jo A - A - xy)z)n+l Set 1-z со = and rewrite the last integral as / f f (l-x)n(l-co)n- f"(y) dxdyd Jo Jo Jo 1 - A - xy)co A A A ^(! -x)y(l -y)co{\ -CO)}" -111 Jo Jo Jo dxdydco. G.7.3) -A -xy)co}"+1 У У The last equality uses integration by parts n times. It is clear that the final integral is nonzero and so the lemma is proved. ¦ Lemma 7.7.5 For An and Bn as in Lemma 7.7.4, 0 < \An + Bni;{3)\d~3 < 2(V2 - 1L4C). Proof. The integral in Lemma 7.7.4 is equal to integral G.7.3). We find the maximum value of the integrand in G.7.3). Let x(l - x)y(\ - y)co(\ - со) f(x, у, со) = . 1 — A — xy)co By solving the equations V = V = 9/ =0 дх ду дсо
394 7 Topics in Orthogonal Polynomials it is easy to see that at the maximum, x = уапйш = l/(l+x). Thus / is bounded byx2(l -xJ/(l +xJ, which is maximized at x = л/2 — 1. Thus integral G.7.3) is bounded above by 1 Г1 1 L Jo Jo 1 - xy = 2(v/2-lLnfC). This proves the lemma. ¦ We need the following result from elementary number theory due to Chebyshev. Let n{n) denote the number of primes less than n. Then тг(л) < 1.06n/logи. G.7.4) See Ingham [1932, p. 15]. Proof of Lemma 7.7.1. Observe that dn = IT n[lo8"/logp] < TT pl°g"/l°gp — пя<")_ p<n p<n p=prime By G.7.4), it follows that dn <ni-°6"/iog« =ei06n < 3« An upper bound for ? C) is given by dx 3 Therefore, Lemma 7.7.5 gives (V2+1L Apery's proof used some sequences satisfying three-term recurrence relations. For a lively account of Apery's proof, see van derPoorten [1979]. These sequences were analyzed in terms of contiguous relations by Askey and Wilson [1984]. See Exercises 28 and 29.
Exercises 395 Exercises 1. Let n x(x — 1) • • • (x — n + 1) = yj.s(n, r)xr r=0 and n x" = ^2 S(n, r)x(x - 1) • • • (x - r + 1). r=0 The integers s{n, r) and S(n, r) are called Stirling numbers of the first and second kind respectively. Show that (n, r)s{r, m) = Snm. Use this to prove that an = \ s(n, r)br, n > 1, r if and only if bn = 2 S{n, r)ar, n > 1. r 2. Derive formulas G.1.14) and G.1.15) as indicated in the text. 3. Verify formula G.2.7). 4. Prove that 2 ( Deduce G.2.10). 5. Supposew(x) isapositive integrable function on (—1,1); {pn(x)} a sequence of polynomials orthogonal with respect to w(x); and {^„(x)} orthogonal with respect to A + x)w(x). Let /?„A) > 0 and qn(l) > 0. Prove that (l+x)qn(x) = Anpn+i(x) + Bnpn(x), where An and Bn are positive. Determine An and Bn in the case illustrated in G.2.10). 6. Let {pm{x)} be a sequence of polynomials orthogonal with respect to a dis- distribution da(x) in @, oo). Let ?i, ?2, •••.?* be zeros of pm(x) and let fk(t)= e-xt№-x)---($k-x)}-l{pm(x)}2da(x), t > 0. Jo (a) Show that f Jo f Jo (b) Deduce that fk(t) > 0 for t > 0.
396 7 Topics in Orthogonal Polynomials 7. With {pm(x)} as in the previous problem, assume that pm@) > 0. Prove that e-xtpm(x)pn(x)da(x) > 0, t > 0. For Exercises 6 and 7, see Karlin and McGregor [1957, pp. 507-509]. 8. Deduce Lemma 7.2.3 from the previous problem. 9. Prove Corollary 7.2.4. 10. (a) Prove that Г LUx)Lam(x)Lan(x)x"e-2"dx E rksmt" Г (a + 1) k,m,n=0 = [2 - (r + s + t) (b) Use the multinomial theorem to prove that, except for a constant factor, the right side of (a) also equals ^- k\m\n\ k,m,n>0 ( -k,-m,-n \ '3 2 \ (-a - ifc - m - л)/2, A-а-к-т-п)/2' )' (c) Let A: < min(w, n). Reverse the order of summation in the above 3F2 to get {-\fm\n\2lkT(m + n + a + 1 - к) (m - k)\(n - k)\T{m +n+k + a + l) / -k, (a + I + m + n - k)/2, (a + 2 + m + n - k)/2 \ '32V m+l-k,n + l-k ' )' (d) Apply Kummer's transformation (Corollary 3.3.3) to the 3F2 in (c) to show that the expression in (c) is positive for a — 0, 1, 2, This proves G.2.13) for these values of a. 11. Prove that (a) Vy ? ein(it A 12. (a) Deduce from G.3.1) that n 1H >0 (b) Use the results in the previous problem to prove that n 'Yjri + 1 - k) sin(k + 1H > 0 forO<0<7r. k=0
Exercises 397 13. LetSn(x) = Еь=1 п^- Use induction and the fact that the extrema of Sn(x) lie at Ikn/n, where к is an integer, to prove that Snix) > 0 forO < x < it. 14. Use Theorem 7.5.2 and Corollary 7.6.4 to prove that J^-л Bv)n-kC2v)k Clix) > -. > 0, — 1<х<1, A>v>0. Z-/ (n — jHli-l Cx(l} ~ _ _ > _ / n V IV ^ 1IV 1 \s К \ I } Deduce that > 0, 0 < a < 2, 0 < в < л. k=o ^ - Consider the cases a = 1, 2. 15. Define the difference operator ДА by (ДА/)@ = (f(t+h)-f(t))/h. Show that (-/0*4ДА/)@ = Е?=о(~1)?(*)/(*+?А)> where Akh is the Ath iterate of Ah. Suppose / is continuous on [0, 1] and /A) = 1. Prove that the following statements are equivalent: (a) / is absolutely monotonic in [0,1], that is, fit) = Eo° a«f"' й„ > 0, f e [0, 1]; (b) / G C°°@, 1) and /№)@ > 0 for к = 0,1, 2,... and t e @, 1); (c) (Д^/п/)@) > 0 for n = 1,2,... and 0 < m < n. One way to show that (c) =»¦ (a) is to use the Bernstein polynomi- / which uniformly approximate / in [0,1]. To prove the last equality, use the result at the beginning of this problem. 16. Let ao > a\ > • • • > am > 0 and 1 < n < m. Show that m n — \ ^atcos^sin(e/2)>^atcosA:esin(e/2) - (an/2)(l + sin(n - k=0 ft=0 and m n—\ ^atsinJfc0sin@/2)>^aifcsinJfc0sin@/2) - (an/2)(l - cos(n - k=0 k=0 (Vietoris) 17. Let ck = 2~2kBk). Show that for 0 < x < 2n t ck sinik + l/4)x > 0 and 2^ c*С08^ + г/4)х > 0. k=0 k=0 Deduce that the inequalities hold if ck is replaced by ak, satisfying Bk - l)ak_i > 2kctk > Oforifc > 1. 18. Let ak satisfy the conditions given in the previous problem. Prove that if 0 < v < 1/4 and 0 < x < 2л- or if -1/4 < v < 1/4 and 0 < x < n, then ak cos(fc + v)x > 0. k-0
398 7 Topics in Orthogonal Polynomials 19. With ak as in Exercise 17, show that if 1/4 < v < 1/2 and 0 < x < 2n or if 1/4 < v < 3/4 and 0 < x < n, then У^ v)x > 0. /fc=0 20. Show that J^ cos fcx l+> —-—> 0, 0<х<л\ 21. With c;t as in Vietoris's inequalities, show that for v > 0 and 0 < x < 1. 22. Show that if *:=0 then Observe that this implies the terminating form of Theorem 7.5.1. 23. Prove that if k=0 then n p(a-li,P+li)( ч 24. (a) Show that [о^(с; x)]2 = iF2(c - 1/2; 2c - 1, c; Ax). (b) Prove the formula of Bailey that /2x рл/2 ha(t)dt = 2x / [Ja(x sin<p)f sin<pd<t), a > -1/2 Jo (c) Show that /* Ja(t)dt > 0, a > -1. 25. (a) Use F.14) to show that Г >¦ м Jo Г (A. + 1)Г(а + ii + 1)Г2(у + 1)л4у_а ^ Г -n,n + 2v, v+ l,(a + fi+ l)/2, (a + д + 2)/2 1 ^5^4 [v + 1/2, а + 1,(а + А. + д + 2)/2, (a + A. + ц + 3)/2' J Bv + 1)„ 2n+2v 2 n\ n + 2v n+v\2J'
Exercises 399 when a + д > —1, A. > —1, and 2v Ф —1, —2,... and the factor Bл + 2v)/(n + 2v) is replaced by 1 at л = 0. (b) Take д = Л + 1/2 and set v = (a + A. + 1/2)/2 so that the 5F4 reduces to a balanced 4 F3. (c) Take A. = 0 to get F.13). (d) Take A. = a - 1/2 to get f\x-t)a-xl2taJa{t)dt Г(а + 1/2)ГBа з,, Г(За + 3/2) l)/4)n(Ba - l)/4)n Ba + 1)„ 2и + 2a 2 /xn n! n + 2a n+2a\2J' fora > —1/2. (Gasper) 26. Suppose that {/?n(x)} and {qn(x)} are orthonormal polynomials associated with the weights co(x) and ш\ (х) respectively. Prove that if n qn(x) = У^уск,прк(х), ifc=0 then 00 ш(х)рк(х) = У^ск,пд„(х)йI(х). n=k 27. Use Exercise 26 and Theorem 7.1.4' to show that for —1 < x < 1 and H > (A. - l)/2, where Ck,n 2k + A.)(w + 2^)!Г(п + 2д)Г> + fc + Х)Г(к + A - д) ~ Г(А.- д)Г(у)л!А:!Г(л + А: + д+ 1)Г(л + 2A: + 2A.) Note that c^ > 0 for (A. — l)/2 < д < A.. Deduce the special case 00 CM sin(" + 2k + Vе k=0 when д > 0, д ^ 1, 2,..., and n + *)!Г(л + 2ц)Г(к + 1 - д) 28. Show that
400 7 Topics in Orthogonal Polynomials satisfies the three-term recurrence relation пъЪп + (л - 1K2>„_2 = C4n3 - 5In2 + 27n - 5J>„_,. (Apery) 29. A *+* A satisfies a three-term recurrence relation. Find it. 30. Complete the proof of Bailey's formula G.5.7). 31. Use G.5.7) to prove Brafman's [1951] generating-function formula for Jacobi polynomials, ^ (y)n(tt +1 - у + l^f = 2F1 (y, a + P - у + 1; a + 1; A - r - R)/2) ¦ 2Fdy,<x + P - у + 1; $ + 1; A + r - R)/2), where R = A -2xr + r2I/2. The case у = a is interesting. 32. Complete the proof of G.2.14). 33. Show that, if a > -1Д, w, n = 0, 1,..., then /¦00 -Vx > 0. /¦0 / 34. If a > 0, j,k, m, n — 0, 1, 2,..., then prove that Laj(x)Lak(x)Lam(x)Lan(x)xae-2xdx > 0. / Jo
8 The Selberg Integral and Its Applications Dirichlet's straightforward though useful multidimensional generalization of the beta integral was presented in Chapter 1. In the 1940s, more than 100 years after Dirichlet's work, Selberg found a more interesting generalized beta integral in which the integrand contains a power of the discriminant of the n variables of integration. Recently, Aomoto evaluated a yet slightly more general integral. An important feature of this evaluation is that it provides a simpler proof of Selberg's formula, reminiscent of Euler's evaluation of the beta integral by means of a functional equation. The depth of Selberg's integral formula may be seen in the fact that in two dimensions it implies Dixon's identity for a well-poised 3F2. Bressoud observed that Aomoto's extension implies identities for nearly poised 3-F2- After presenting Aomoto's proof, we give another proof of Selberg's formula due to Anderson. This proof is similar to Jacobi's or Poisson's evaluation of Euler's beta integral in that it depends on the computation of a multidimensional integral in two different ways. The basis for Anderson's proof is Dirichlet's multidimensional integral mentioned above. A very significant aspect of Anderson's method is that it applies to the finite-field analog of Selberg's integral as well. We give a brief treatment of this analog at the end of the chapter. Stieltjes posed and solved an electrostatic problem that is equivalent to obtaining the maximum of an n variable function very closely related to the integrand in Selberg's formula. Stieltjes's remarkable solution showed that the maximum is attained when the n variables are zeros of a certain Jacobi polynomial of degree n. We devote a section of this chapter to Stieltjes's work and show how his result can be combined with Selberg's formula to derive the discriminants of Jacobi, Laguerre, and Hermite polynomials. Siegel used the discriminant of the Laguerre polynomial to extend the arithmetic and geometric mean inequality. Siegel's result, which we include, contains an interesting inequality of Schur relating the arithmetic mean and the discriminant. 401
402 8 The Selberg Integral and Its Applications 8.1 Selberg's and Aomoto's Integrals The theorem given below contains Selberg's [1944] extension of the beta integral. Theorem 8.1.1 Ifn is apositive integer and a, /3, у are complex numbers such that Rea > 0, Re? > 0, andRty > -min{l/n, (Rea)/(n - 1), (Rtp)/(n - 1)}, then / f /о Jo п Sn(a,p,y)= f JO Г (a + U ~ 1)У)Г@ + U ~ 1)У)ГA + jy) 7 = 1 where А(х)= The conditions on a, fi, у are needed for the convergence of the integral. For a discussion of this, see Selberg [1944]. Note, however, that the condition on у is related to the first occurrence of a pole of the function on the right-hand side of the integral formula. Selberg's proof of this formula appeared in 1944, but for more than three decades it was not well known. More recently, Aomoto [1987] found a simpler proof which depends on a recurrence relation satisfied by a slightly more general integral. Theorem 8.1.2 With the same conditions on the parameter a, ft, у and with к < n, -Д- (a + (n- j)y) Anderson [1990] proved a finite-field analog of Selberg's formula and then noted (Anderson [1991]) that the idea could be carried over to the continuous case. 8.2 Aomoto's Proof of Selberg's Formula To motivate this proof, recall the two basic steps of the proof of the formula
8.2 Aomoto's Proof of Selberg's Formula 403 Step 1. Obtain the functional equation fi(a, B) = ^fi(a, P + 1). Though we did this differently in Chapter 1, it can be done as follows: For Re a > 0, Re p > 0, /1 g — [xa(l-xf]dx dx r\ 1-Х = a\ xa~\\ -xfdx-B / xa(\-xf-ldx Jo Jo a-\l-xfdx-p xa-\\-xf~xdx Jo Step 2. Iterate the first step n times to get B(a,p+n). Then apply a change of variables in the integral for B(a, /3 + n) and let n -> oo to obtain the necessary result. We apply the same basic procedure to prove a generalization of Selberg's for- formula. Let w(x) = xif-1 J] and h = (8.2.1) (8.2.2) 1=1 where Cn is the n-dimensional cube and dx = dx\dx2 ¦ ¦ ¦ dxn. By symmetry the product nf=i xi таУ be replaced by the product of any к distinct variables without changing the value of the integral. Let Iq denote the integral without the factor To obtain the functional equation start with к dx С i=2 = «/A- Jcn + 2y±[ A- j=2 JC» к w(x)dx Xl)^ - X, (8.2.3)
404 8 The Selberg Integral and Its Applications The third term in (8.2.3) is derived from the fact that \\c \rlC if jc \x\ c\xrsgnx ax x The next lemma shows that the third integral can be written in terms of h and h-\- Lemma 8.2.1 L 0 if2<j<k, L L с„ m-xj K.\h-\ ifk<j<n. \h if 2 < j < k, lCn x\ - xj у ik if к < j <n. Proof. (a) In the first case, where 2 < j < к, the transposition x\ ¦«->¦ xj changes the sign of the integrand, so the integral vanishes. In the second case the same transposition leads to Xl Xj X\ X\ — Xj Xj — X\ X\ — Xj so Г UkiXiW(x)dx f Л / -1 =/ \\xiw(x)dx = Ik-i. JCn XX - Xj JCn ^ (b) For 2 < j < к, the transposition x\ *> Xj gives Л xxx) = X\Xj X\ — Xj Xj — X\ X\ — Xj which proves the first part of (b). For the second part observe that x\ _ XXXj — x\ + X\ - Xj X\ - Xj and the last term changes sign in the transposition x\ *> Xj. So its presence in the integral makes that part zero. The other part coming from x\ gives the necessary result. Thus the lemma is proved. ¦ Use this lemma to rewrite (8.2.3) as 0 = or/t_, - (« + 0)Ik + Y{n - k)h-x - YBn -k This gives a + fi + {In - к - \)y
8.2 Aomoto's Proof of Selberg's Formula 405 Iterate this functional relation to arrive at к / П Aln Г=1 + ^ + ( n ~ l ~~ The last integral is Selberg's integral, which we write as Sn(a, fi, y). The problem now is to evaluate this integral. Fortunately, it is possible to apply the functional equation (8.2.4) to Selberg's integral itself. Note that Sn(a + 1, в, у) = I I Sn(a, в, у). (8.2.5) nv .^>/y 11 a _(_ ^ _|_ Bn — у — l)y 'f''-" v -» Symmetry in a and ft and iteration give Sn(a, Д y) = -— ljL^5n(«, уб + к, у) пК ,Р,П 11 (P + (n-j)y)k 'P 'У> х + Р + Bп — j — \)у)к 08 + (и - У п к к" Let /: -> oo and use the limit definition of the gamma function to get /¦00 /-00 / /¦00 /-00 " •/ ¦••/ П^е П \xi-*i\lYdx. (8.2.6) Denote the integral in (8.2.6) by Gn (a,y). Then by symmetry in a and ft, relation (8.2.6) implies that П^г(« + <1.-яу) П-=1 Thus we can write '(a + (n - j)y)F(P + (n - j)y) Г(а+^ + Bя_^_1)у) W (8.2-7)
406 8 The Selberg Integral and Its Applications To compute Dn (y), first note that by the symmetry in the variables х\,хг, ...,х„ f /"' /"' Г1 / w(x;a,p,y)dx=n\ / •••/ w(x;a, fi,y)dxx- ¦ -dxn. (8.2.8) JCn J0 Jxn Jx2 Since /•i lim a I ta~xf(t)dt = /@) «^o+ Jo for a continuous function / (for a proof see Exercise 1 at the end of this chapter), multiply (8.2.8) by a, let a -> 0+, and apply (8.2.7) to get JCn-\ , = 1 \<i<j<n- r(e + Bnjl)y) 7=1 Again by (8.2.7), the last integral also equals Mj-ну) ГBу + уб + (n + ] - 3)у) This gives the functional relation This implies Thus, Selberg's formula and Aomoto's extension are both proved. Applying the change of variables and limiting procedure for obtaining (8.2.6) to Aomoto's integral formula, we get: Corollary 8.2.2 With the conditions on the parameters a and у as in Theorem 8.1.1, roc roo к п /00 /-00 * " •••/„ Ibll*"'-* П ^-- JV ;=1 i=l \<i<j<n к п — 1 |(« + (П — j)y) Y\
8.3 Extensions of Aomoto's Integral Formula 407 To derive another corollary from Selberg's formula, take a = ft and jc, = A + f,/V2a)/2, let a ->• oo, and apply Stirling's formula. Corollary 8.2.3 For Re у > -1 /n, П I*-* Remark 8.2.1 One can use Carlson's theorem to prove (8.2.7) without having to let /? go to infinity. It follows from (8.2.5) that (8.2.7) is trae when a and ft are positive integers. Moreover, both sides of (8.2.7) are analytic functions of a and p for Rea > 0 and Re? > 0 and they are bounded for Rea > 1, Re/3 > 1. 8.3 Extensions of Aomoto's Integral Formula Aomoto's formula involves the introduction of the extra factors [}*=1 xt, к < п. One may ask whether extra factors of the type ПA — xj) can be inserted in the integrand. The simplest integral of this type occurs when there is no common variable among the two different kinds of factors. Let -. j J+k B(j,k):= JJxt Y[(l-Xi)w(x)dx, where j + к <п and The formula is А Г (a + (и - i)y)VC + (n- i)y)r(iy + 1) Ul П Г(« + ^ + Bи-/-1)у)Г(у + 1) n/=iC» + (и - i)Y] ULlfi + (и - i)y] This is easy to verify. Denote the right side of (8.3.1) by C(j, k). Observe that Aomoto's integral implies B(j, 0) = CO", 0) and fi@, к) = C@, k); (8.3.2) moreover, both В and С satisfy the same recurrence relation B(j - 1Д - 1) - B(j, k-l) = B(j - 1, k). (8.3.3)
408 8 The Selberg Integral and Its Applications This proves (8.3.1). Now let i,k,t)= I T\xi T] A-х, so that j represents the number of extra x, factors, к the number of extra A —л:,-) factors, and I the number of variables that overlap among the extra factors. Here I < j,k <n and j + к - ? < п. Theorem 8.3.1 Bn - i - ПЙi [« + B+Bn-i- where Sn(a, fi,y) is the value of the Selberg integral. Proof. First consider the case where к = ?. This integral, after renumbering variables, can be written as i)w(x)dx. , j к j, k,k) — I TT Xj TTA — Xj Jcn ~t ~t It satisfies the functional relation that generalizes (8.2.4): (a+P+Bn-j-k- l)y)B(j + 1 ,k - 1 ,k - 1) = (a + (n-j - \)y)B(j, к - 1 ,k - 1). (8.3.4) To prove (8.3.4), start with r Q j k 0= / ~-T[xir\(\-xi)w(x)dx j к j к ~fi Y[xiY[(l -Xi)w(x)dx •'C- i=l 1=2 » i=2 i=l •'C- i=l 1=2 The functional equation (8.3.4) now follows from the following lemma.
8.3 Extensions of Aomoto's Integral Formula 409 Lemma 8.3.2 к т-г; тк (a) V / ^i=lXi^i=l(l~Xi)w(x)dx = 0. ~~~o JCn x\ ~~ xm (c) ]T / 'x x —w{x)dx ¦n—j-\-\ n = (n - j) — ~—! B(j, k-l,k- m=j The proof of this lemma is left to the reader. A direct consequence of (8.3.4) is that R..,,. W [a+fi + Bn-i-k-l)y]Of B(j,k,k)= \\ :—— — B(n,k,k) >Д, [« + (n - i)y] k (P + (n fj^ (a + y6 + Bn - г The second equation follows from (8.3.1) and the fact that fi(n ,k, k) is the same as B@, k, k) except that a is replaced by a +1. Equation (8.3.5) is, in fact, equivalent to * ( + p + in-i- l)y) + 1 + Bn - i - (n - i)y) nLi( Sn(a,P,y). (8.3.6) This proves Theorem 8.3.1 for the case к = 1. By writing x\ as 1 — A —x\) in the integral B{j,k,t) one verifies that ВЦ + \,кЛ) = В(],кЛ)-ВЦ,к + \,1) forj>?. (8.3.7) The right side of Theorem 8.3.1 also satisfies this recurrence relation. Thus The- Theorem 8.3.1 is proved. The above proof is due to Shaun Cooper. ¦ We end this section with the statement of another beta integral of Selberg.
410 8 The Selberg Integral and Its Applications Theorem 8.3.3 Let Dn = {(*,,..., xn) \ x,¦¦ > 0, ?"=i xt < 1}. Then •'D»,-=l 1=1 A proof is outlined in Exercise 5. Historical Remark Hardy and P61ya independently proved the following theo- theorem on entire functions: Let f(z) be an entire function of exponential type less than log 2. If f(n) is an integer for n = 0, 1, 2,..., then f(z) is a polynomial. This theorem can be restated as follows: 2Z is the smallest transcendental entire function taking integral values at the positive integers. Gelfond generalized this theorem as follows: Iff(z) is an entire function such that are all integers and f is of exponential type less than p log(l + e(l~p)/p), then f is a polynomial. The case p = 1 is the result of Hardy and Polya. Selberg discovered his integral formula when he generalized Gelfond's theorem. He proved thatp log(l+eA~/')//') can be replaced by log {min [|f=1 A+y,)}. where yi > 0, У1У2 •¦¦yP = el-p, and | Ili</<,-<,(? ~ ^)l = L To see that this is a generalization, note that since the yj are distinct, \ + yd > {\ + 1/у,У2- ¦ ¦ yP)p = 1=1 For references, see Boas [1954]. Corollary 8.2.3 was conjectured by Mehta and Dyson in the mid-1960s. They considered a gas of N point charges at Xi, хг,. ¦., xN, which are free to move on the infinite straight line — oo < x < oo. The potential energy of this gas is given by 2 1 = 1 l<i<j<n where the first term represents a harmonic potential that attracts each charge inde- independently toward the point x = 0 and the second term represents an electrostatic repulsion between each pair of charges. An important role in the thermodynamical study of this system is played by the partition function ЫР) = I ¦••/ e-Pwdxx-.-dxn. J — 00 J — OO
8.4 Anderson's Proof of Selberg's Formula 411 It was the value of this integral that was conjectured by Dyson and Mehta. See Mehta [1991]. Mehta's book also contains Selberg's original proof of Selberg's integral formula. 8.4 Anderson's Proof of Selberg's Formula In Chapter 1, we gave two essentially different proofs of the formula В (a, fi) = Г{а)Гф)/ Г(а+уб). One was done by constructing a functional equation and the other by evaluating a suitable double integral in two different ways. The second method applied to the finite field analogs of the beta and gamma integrals, that is, to the Jacobi and Gauss sums. Anderson [1991] found a proof of Selberg's formula which involves the computation of a (In — l)-dimensional integral in two ways. This proof carries over to a formula for the finite field analog of the Selberg integral. In fact, Anderson [1990] obtained a proof of the latter result first. Anderson's proof depends on Dirichlet's generalization of the beta integral given in Chapter 1: For Re a, > 0, ¦dpn-i = (8.4.1) where У is the set p, > 0, YTi=o A = 1 • The formula is used after a change of variables. To see this, first consider Selberg's integral, which may be written as nlAn(a,P,y):=n\ [ f " ¦ ¦ ¦ Г \F@)\a-'\F(l)^-l\AF\*dxi ¦ ¦ -dxn, • Jo Jo Jo where 0 < X\ < X2 < • • • < xn < 1, F{t) = (t-xi)(t -x2)---(t-xn) = tn - Fn^t"-1 + ¦¦¦ + (-l)nF0) and AF is the discriminant of F, so that We now change the variables from x\, xi, ¦ ¦ ¦, xn to Fo, F\,,.., Fn_\, which are the elementary symmetric functions of the x, s. Lemma 8.4.1 where the integration is over all points (Fq, F\, ..., Fn-\) in which the F, are elementary symmetric functions ofx\,..., xn with 0 < x\ < • • • < xn.
412 8 The Selberg Integral and Its Applications Proof. It is sufficient to prove that the Jacobian = |Af |1/2. (8.4.2) Observe that two columns of the Jacobian are equal when jc, = Xj. Thus Ylt<j(Xi — Xj) is a factor of the determinant. Moreover, the Jacobian and П,<;(*( — Xj) are homogeneous and of the same degree. This proves (8.4.2) and the lemma. ¦ We make a similar change of variables in (8.4.1). To accomplish this, set and let V={(t- xx){t - x2) ¦ ¦ ¦ (t - xn) | fc_i < x,¦< fc; i = 1,..., n). (8.4.3) Lemma 8.4.2 For all F(t) = t" - Fn^tn~l + h (-1)" Fo € V, the map /¦7-17-1 j-i \ / V5O/ VbW/ \ / . тлп-1-l (Fo, F\,..., Fn_!) h-> I — ,..., — I = (po, px,..., pn) e R + , where Z'(t) denotes the derivative of Z(t), is a bijection and pj > 0 with Ym=o Proof. Observe that since the numerator and denominator have exactly n — j negative factors. Now let z (t) - Z@ By Lagrange's interpolation formula F(t) = j^PjZjit) = ^ ~\Р%). (8.4.4) One can directly verify this by checking that both sides of the equation are poly- polynomials of degree n and are equal at n + 1 points t — fo, ?ь • • ¦, t,n ¦ Equate the coefficients of tn on both sides to get
8.4 Anderson's Proof of Selberg's Formula 413 Now for a given point (po, Pi, ¦ ¦ ¦, Pn) with Hpj = 1 and pj > 0, j — 0,..., n, define F(t) by (8.4.4). The expressions @ - &-i)(& - O+i) • and show that F(f;) and F(f,+1) have different signs and F vanishes at some point xi+i between f,- and ff+i. Thus F e 2? by (8.4.3). This proves the bijection. ¦ We can now restate Dinchlet's formula (8.4.1). Lemma 8.4.3 With the notation of Lemma 8.4.2 and Re a, > 0, *[[\F(b)\ai-ldF0---dFn_l = ^ Proof. We need to verify that the Jacobian 3(po, ••• ,Pn-\) 9(F0,...,Fn_,) i=0 Since the Jacobian is dpi IteOl The numerator is a Vandermonde determinant and the result follows. ¦ The final step is to obtain the Bn — 1)-dimensional integral. Let F(t) and G(t) be two polynomials such that and G(t) = (t - yi)(t ~ y2) ¦ ¦ ¦ (t - yn), 0 < Ух < Xi < y2 < • • ¦ < *n-i < Уп < 1- (8.4.5)
414 8 The Selberg Integral and Its Applications The resultant of F and G, denoted R(F, G), is given by \R(F,G)\ = п 1 = 1 И—1 п n-\ (Xi) (8.4.6) The absolute value of the discriminant of F can be written as \R(F, F')\. The (In — l)-dimensional integral is J(F,G) ;G)\r-1dFo---dFn-2dGo---dGn-i dFo ¦ ¦ ¦ dFn-2( / (F,G) ¦ ¦ ¦ dGn-\. (8.4.7) Here the integration is over all F and G defined by (8.4.6). Lemma 8.4.4 Selberg's integral An{a, fi, y) of Lemma 8.4.1 satisfies the recur- recurrence relation „{а, р, у) = Г(а)Г(Р)Г(пу) (а + Р + (п- \)у) , y). Proof. Integrate the Bn — l)-dimensional integral (8.4.7) with respect to dFo • dFn-2 and use Lemma 8.4.3 with G(t) instead of Z(t) to get "-'iGd)!"- Y-2 dG0 ¦ ¦ dGn-\ Г(пу) = An(a,p,y) Г(пуУ To compute (8.4.7) in another way, set F(t) = t(t — Xi)(t — x2) ¦ • ¦ (t — х„), «о = a, an = P, cij = у for j = 1,..., n — 1, x0 = 0, and xn = 1 so that (8.4.7) is equal to / n — 1 Y-\ dG0 • ¦ ¦ dGn-idF0 ¦ ¦ ¦ dFn-2 (f.G) j=l f[\G(.xi)\ai-idGo---dGn-1dFo---dFn-2. (8.4.8)
8.5 A Problem of Stieltjes and the Discriminant of a Jacobi Polynomial 415 Now integrate (8.4.8) with respect to dGo ¦ ¦ ¦ dGn-\ and use F(t) instead of Z(t) in Lemma 8.4.3 to obtain K-l/2 n-1 \F'@)\a~i'2\F'(l)f~1'2dF0---dFn-2 Since and n в —1 в —1 the last integral can be written as Equate the two different evaluations of the Bn — 1)-dimensional integral to obtain the result in Lemma 8.4.4. ¦ Selberg's formula is obtained by iterating Lemma 8.4.4 n — 1 times. R. J. Evans has shown that Aomoto's extension (8.3.1) can also be proved by this method. The idea is sketched in Exercise 21 at the end of this chapter. 8.5 A Problem of Stieltjes and the Discriminant of a Jacobi Polynomial There is a problem of Stieltjes that connects up Jacobi polynomials, the hyper- geometric differential equation and Selberg's integral in a very interesting way. It may be stated as a two-dimensional electrostatics problem. Let p > 0 and q > 0 be fixed. Suppose there are charges of size p at 0 and q at 1 and unit charges at x\, xi,..., х„, where 0 < x,- < 1, i = 1,..., n. Assume the potential is logarithmic to get the energy of the system as T(xux2, ...,xn) = -p^logXj -g^log(l -xt) - ^2 logl*i-•*/!• i = l i = l l<i<y<n (8.5.1)
416 8 The Selberg Integral and Its Applications The problem is to find the location of the charges so that they are in electrostatic equilibrium. The latter occurs when the energy is a minimum, so we either minimize (8.5.1) or maximize n H(x1,x2,...,xn):=l[x?(l-xiL Д \Xi-xj\. (8.5.2) i = l \<i<j<n Theorem 8.5.1 The maximum of (8.5.2) occurs when X\,xi, ¦ ¦ ¦, xn are the zeros of the Jacobi polynomial pO-p-iM-\)(\ _ 2x). Proof Since Я is a continuous function of X\, xi, ¦ ¦ ¦, х„ for 0 < x, < 1, i = 1,..., n, it has a maximum value at some point. If any of the x,- is 0 or 1, the value of H is 0. So the minimum of T (or the maximum of H) occurs where — =0, i = l,2,...,n. Эх,- Therefore Y1= 0' ' = 1> 2. ••-.«• (8-5-3) + Y1 Xf i X[ X/ Xj This is a set of n nonlinear equations in n unknowns. Stieltjes introduces a polyno- polynomial / whose zeros X;, i = 1,..., n satisfy (8.5.3), and he shows that / satisfies a specific hypergeometric differential equation. Set n f(x) = ]J(x - Xi) (=1 so that the discriminant A of / is given by n A- J] (х,-х,J = l<i<j<n Take the logarithmic derivative to obtain , п. Write (8.5.3) as — H = 0, i = l,..., л, 2 f'(Xi) Xi 1-Х; or x,(l -Xi)f"(Xi) + 2[p -(p + q)xi]f{xi) = 0, i = 1,...,«. Now consider the expression x(l - x)/"(x) + 2[p - (p + q)x]f\x).
8.5 A Problem of Stieltjes and the Discriminant of a Jacobi Polynomial 417 This is a polynomial of degree < n, since / is a polynomial of degree n, and it is zero at xt, i = 1, 2,..., n. Thus the expression is a constant multiple of f(x) and we deduce that у = f(x) satisfies the differential equation x A - x)y" + [2p-2(p + q)x]y' + Xy = 0 (8.5.4) for some constant k. Compare (8.5.4) with the hypergeometric equation x(l - x)y" + [c - (a + b + l)x]/ - aby = 0, which has the two independent solutions 'a,b ;x\ and xl C2FA ;x с ) V 2-е We see that these are also the independent solutions of (8.5.4) with с — 2p, a + b = 2p + 2q — 1, and ab = —A.. We get a polynomial solution of degree n only when a or й is — n. So if b — —n, then a=n + 2/7 + 2<jr — 1 and Л. = -n(n +2p +2q - I) and (-WW + 2;+2^1) (8.5.5) To find k, note that the coefficient of x" in /(x) is 1. So k _ (-D"Bp)n (и + 2p + 29 - 1)„' Except for a constant factor, the hypergeometric polynomial in (8.5.5) is the Jacobi polynomial pQ-p-^-V{\ - 2x). This proves the theorem. ¦ It is actually possible to find the maximum value of H from Selberg's integral. Theorem 8.5.2 „2, , Л <2p H(xu x2,..., *„) = g g ^ + ^ + ^ + . _ 2Jp+2q+n+]_2 where H is defined by (8.5.2). Proof. Recall that if \x is a positive measure on a measure space X and II/IU — ( / 1/1*^Д I < oo, for some к in 0 < A: < oo, \J J then 11/11* -> ll/lloo as^^oo. (8.5.6) In Selberg's integral, take a — 1 = 2/??, p — \ = 2qk, and у = A: and then apply (8.5.6) together with Stirling's formula to obtain the theorem. ¦
418 8 The Selberg Integral and Its Applications Since the expression for Я(хь ..., xn) involves the discriminant we have the next theorem. Theorem 8.5.3 The discriminant of the Jacobi polynomial P^a^\x) is 2„(в-1) TT J4a ¦Ц (a + ¦Ц (a + P + n + j)"+J-2 Proof. We have ;=0 1 = 1 Bp)n (-l)V(O) = (-1)"* = (n + 2p and The Chu-Vandermonde identity (Corollary 2.2.3) was used to sum the 2F\. Com- Combine these results with Theorem 8.5.2 to see that the discriminant of рО-р-^М-Ъ A - 2x) is Bp + 2q+n + j - 2)"+J-2 Theorem 8.5.3 follows after an appropriate change of variables. ¦ Similar theorems can be stated for Laguerre and Hermite polynomials. Theorem 8.5.4 The maximum of n U(xux2,...,xn) = Y[xfe-x> Д |jc,- — JCyl i=i i<i<;'<« is obtained when X\,x2, ... ,xn are the zeros of the Laguerre polynomial L%p-1) Bx). The discriminant of Lan(x) is Theorem 8.5.5 The maximum of П \*-*А
8.6 Siegel's Inequality 419 is attained when x\,... ,xn are the zeros of the Hermite polynomial Hn(x). The discriminant ofHn(x) is 1 " (n-l) 11- 22л(л-1) J The proofs of these theorems require Corollaries 8.2.2 and 8.2.3 and are left to the reader as exercises. Other formulations of Theorems 8.5.4 and 8.5.5 are also possible. These are given in the exercises at the end of the chapter. For references to the work of Stieltjes and to other methods for the calculation of the discriminants of the classical orthogonal polynomials, see Szego [1975, §§ 6.7-6.71]. Remark 8.5.1 Stieltjes's interpretation that the zeros of the Jacobi polynomials are the equilibrium positions of charges placed in the interval [0,1] is useful for guessing theorems about these zeros. For example, if the zeros of P^"^\\ — 2x) are written in increasing order 0 < Xi < X2 < ¦ ¦ ¦ < xn < I, then ^ > 0 and % < 0. Observe that if a is increased the unit charges in @,1) are pushed up toward 1, and if p is increased they are pushed down toward 0. 8.6 Siegel's Inequality Siegel [1945] derived an inequality that refines the arithmetic and geometric mean inequality. He used it to obtain results about traces of algebraic integers, all of whose conjugates are real and positive. The idea of the proof is as follows: Let 5 and p be two positive numbers such that sn > p. Find the maximum of A = Yli<i<j<n(Xi — XjJ considered as a function of X\,..., xn and subject to the conditions that X\ + ¦ ¦ ¦ + xn — ns and X\Xi ¦ ¦ ¦ xn — pn, Xi > 0. It turns out that the maximum lies at the zeros of a certain Laguerre polynomial. In fact, the method by which this is determined is the same as that used in the solution of the Stieltjes problem of the previous section. Siegel's inequality drops out easily after this. Before stating Siegel's inequality we prove the arithmetic and geometric mean inequality. Lemma 8.6.1 LetX\,x-i, ¦. ¦, xn be n positive numbers. Then X\ + X2 + ¦ ¦ ¦ + Xn $X\X2 ¦ ¦ • Х„ < , п where equality holds if and only if X\ = хг = ¦ ¦ ¦ = xn. Proof. Start with the inequality in Exercise 1.6. For a, P, u, and и nonnegative and a + p = 1 write the inequality as uavp <au + Pv. (8.6.1)
420 8 The Selberg Integral and Its Applications Here equality holds if and only if и — v. We prove the lemma by induction. It holds for n = 1. Assume the result true up to n. Then, by (8.6.1) and the inductive hypothesis, (XlX2 ¦ • -*B+lI/(B+1) = (*!/П ¦ ¦¦Х1'П)ПКП+Х)Х^Т1) n /х\Л \-xn\ xn+x ~n+l\ n ) n+1 n + 1 It is clear from the proof that equality holds if and only if all the x,- are equal. This proves the lemma. ¦ To state Siegel's extension, set n-k-l «-1 k=\ and A = where the x; are positive. For n > 2 and A ^ 0, let //, denote the unique positive root of the algebraic equation Pn(pL) = (Xl---xn)n-l&-1. (8.6.4) (The polynomial P (t) has only positive coefficients and P @) — 0, so ц, is uniquely determined.) Siegel's inequality is contained in the next theorem. Theorem 8.6.2 Let n > 2, А ф 0, ant/ //, ?>e the positive root of (8.6.4); then where Q is given by (8.6.3). Remark 8.6.1 Since Qn(fi) > 1 for positive //,, Theorem 8.6.2 is a refinement of the arithmetic and geometric mean inequality.
8.6 Siegel's Inequality 421 We first prove the following lemma. Lemma 8.6.3 Let s and p be two positive numbers such that sn > p. The maxi- maximum of A = Yl(xi ~ xjJ> subject to the conditions x\ H \-xn=ns and X\---xn = p", occurs when X\,... ,xn are the zeros of a certain Laguerre polynomial. Proof. To show the existence of the maximum, observe that for any positive set of values x3, хц,..., xn the equations Xi +X2 = ns - (X3H \-Х„), XYX2 = р(ХЪ ¦ ¦ ¦ ХпУ1 have a unique solution that is positive if and only if x3 H h xn < ns and {ns - (x3 H h xn)}2 > 4p(x3 ¦ ¦ ¦ xn)~l. (8.6.5) Moreover, x\ = x2 if and only if equality holds in (8.6.5). The conditions (8.6.5) define a compact domain D in a (n — 2)-dimensional space whose points have positive coordinates x^,... ,х„. The boundary of D consists of the surface x\ — x2 so that the maximum of Д is inside D. By Lagrange's method of undetermined multipliers, at the maximum point дф — =0, k = l,...,n, дхк where ф(хи...,хп) = -log A - A.(xi H h xn) + Atlog(xi ¦¦¦xn) and X and [i are some constants. In fact, ц, will be seen to be the positive root of (8.6.4) when the x, maximize A. As in the solution of Stieltjes's problem of the previous section, show that /(•«) = n7=i(^ - ^) satisfies ——— -X-i =0, k=l,2,...,n. f'(xk) xk So xf"(x) — (Xx — n)f'(x) is a polynomial of degree n that vanishes at X\,..., xn and hence is a constant multiple of /(x). This constant has to be — Xn. Therefore, / satisfies the differential equation xy" - (Xx - ix)y' + Xny = 0. Set t = Xx to transform this equation to ty" -{t- ц)у' + ny=0.
422 8 The Selberg Integral and Its Applications This equation is satisfied by the Laguerre polynomial L^~l)(t). So we get f{x)=kL^-l\kx) (8.6.6) for some constant k. This proves the lemma. ¦ We make a few observations necessary to complete the proof of Siegel's in- inequality. First note that the discriminate of the polynomial in (8.6.6) is, according to Theorem 8.5.4, n-l ^-„(„-1) Д{(д + jyu + jy + lj (8 6 ?) Also observe that since the constant к in (8.6.6) is (-l)nn!/A.n. So xi-\ \-xn ii + n - 1 5 = = n к and 00» к" This implies s"p~l = П~ =ПA+ + ft_i) =g"(^ (8-6-8) and (using (8.6.7)) p"-iд-i = - TT (^f) = Pn{ii). (8.6.9) n\ ±A- \n — к) Proof of Theorem 8.6.2 Suppose that yt, V2,..., yn are n positive numbers such that З'Н + Уп , =5 and У\---уп = Р- n Let ixo be the solution of equation (8.6.4) with these v in place of the x and let Ao denote the discriminant using the v. Since Pn(t) is an increasing function and snp-x = QnUi) > Qnino). we have ц, < ц,0- However, Qn(t) is a decreasing function and so snp- This proves Siegel's inequality.
8.6 Siegel's Inequality 423 The following corollary is due to Schur [1918]. Corollary 8.6.4 For positive numbers X\,..., х„ Proof. Set R(t) = P{t)Qn-\t) = Д ^ "~_ Use (8.6.4) and the inequality in Theorem 8.6.2 to eliminate p = x\ ¦ ¦ ¦ xn and obtain sn n(n-l) > дЛ(д)_ (8.6.10) Show that R(t) is an increasing function of t > 0 by proving j-t log R(t) > 0. Thus R{ix) > Л@) and sn(n-l) This is Schur's inequality. ¦ Siegel applied his inequality to the derivation of some results on traces of positive algebraic integers with positive conjugates. We include statements of two of these results as they are quite interesting. A proof of one of them is sketched. The details are left to the reader, since they involve an easy application of the Euler-MacLaurin summation formula. Before we state the next theorem, observe that the equation g(v) :=(l+uJlog(l + (l/t/))+logt;-t;-l =0 (8.6.11) has exactly one positive root, say, в. This is true since g@) = — 1, g(oo) = oo, and g'(v) = 2A + v) log(l + A/w)) - 2 > 0 forv > 0. Theorem 8.6.5 Suppose the algebraic equation with integer coefficients xn +a1x"-1 +---+an =0 has all positive roots X\, xi, ¦ ¦ ¦, xn. Let в be the unique positive root of (8.6.11) andletko = еA + A/в)Ув'; thenforanyk < ко, there exists a number N — N(k) such that x\ Л + xn > kn for all n > N.
424 8 The Selberg Integral and Its Applications The proof depends on the following lemma. Lemma 8.6.6 Ift is any positive number satisfying APn(t) > 1, then sn > Qn(t). Proof. Since p = х\Хг ¦ ¦ -xn = {—\)nan is a positive integer, Theorem 8.6.2 and Corollary 8.6.4 imply that We have seen that Q{t) is decreasing and R{t) is increasing for t > 0. It follows for t > 0 that 1 > mm(Qn-l(t), ДЛ@) = Qn~l(t)mm(l, AP(t)). This proves the lemma. ¦ Proof of Theorem 8.6.5 Start with the Euler-Maclaurin summation formula (see Appendix D) Jo Jo (Recall that Bx{x) = x - \.) Take f(x) = (n - x) log(x + vn - 1) in (8.6.12) 2 and show that n-\ k=\ k=2 — -g(v)n2 + O(nlogn). 2 Since lim{l - ulog(l + A/u))} = logAo, it follows that s = (xi + ¦ ¦ ¦ + xn)/n > X, for any X < Xo and all n greater than some suitably large N(k). This proves the theorem. ¦ Siegel computed the value of Xo = 1.7336105 The best possible value of the constant that could replace Xo in the theorem is certainly < 2. This follows from the fact that 4 cos2 -, for odd prime p, has trace equal to 2n — 1 < 2n, where n = (p — l)/2. The following theorem was also proved by Siegel, but because its proof is longer we omit it.
8.7 The Stieltjes Problem on the Unit Circle 425 Theorem 8.6.7 Suppose f is an algebraic integer ф 1 or iC ± V5). Suppose all the conjugates x\,X2, ¦ ¦ ¦, xn of% are positive. Then 3 x\ +x2 H \-xn > -n. 8.7 The Stieltjes Problem on the Unit Circle In Section 8.5 we considered Stieltjes's problem for the unit interval. Dyson and others have looked at the situation where the freely moving charges lie on a thin circular conductor of unit radius. See Mehta [1991] for references. We work out the case in which n unit charges are placed on the unit circle, although the more general case with a charge of size q at в = 0 and one of size p at в = л can also be treated similarly. The potential energy of the system with n unit charges on the unit circle is log\eiek-ew'\. (8.7.1) In the equilibrium position, W is a minimum. Theorem 8.7.1 The minimum value of W is — | log n and is attained when e'0k for к = 1,..., n are roots of the equation x" ± 1 = 0. Proof. Write \eWt - ew> \ as follows: \еЪ - eie' | = 2 sin (9-^-^\ = -(«'* - eie')e-m**>I2 for вк > в,. \ 2 ) i This shows that at a minimum л dW . n-\ ° Wkl\ V As in the Stieltjes problem, define so that f"(eWt) /() eiek _ ei Thus / satisfies the equation xy" - (и - 1)/ = 0 or f(x) = Cx" + D.
426 8 The Selberg Integral and Its Applications Since the coefficient of x" in / is 1 and the roots lie on the unit circle, С — 1 and D = ±1. Thus f(x)=xn±l. The least value of W is — \ log Д, where Д is the discriminant of f(x). Since Д= JJ/'(e1'*) = f[nei(n-m k=\ *=1 this minimum is — | log n. ¦ Remark 8.7.1 The partition function for this charge distribution is given by П ¦«>.¦ This integral is sometimes called Dyson's integral, but it is a special case of Selberg's integral, as can be seen after a suitable transformation. Its value is ГA + (pN/2))/(T(l + P/2))N. Incidentally, one may use the last integral to evaluate the discriminant of xn ± 1. Forrester and Rogers [1986] have consid- considered a different distribution of charges on the unit circle, which leads to Jacobi polynomials. See Exercise 13. 8.8 Constant-Term Identities Take fi = 2k, where к is a positive integer, in Dyson's integral (8.7.2). Its value is equal to the constant term in the Laurent expansion of the product П '-* Zi (8.8.1) To prove this, set Zj = e10', so that 2k \zj -zA2k = (Zj- = (l - | l - Z? Now observe that any power other than 0 of Zj vanishes on integration. From the value of Dyson's integral given in our last remark, we have ст.П (¦-?)'-2^ = ??. (8^) +к)п (k\)n
8.8 Constant-Term Identities 427 where C.T. stands for "constant term of." More generally, Dyson conjectured the result contained in the next theorem, which was first independently proved by Gunson and Wilson. The elegant proof given below is due to Good [1970]. Theorem 8.8.1 Ifa\, аг, ¦ ¦ ¦, an are nonnegative integers, then ГТ ze Proof. Let p(x) = П"=1 (х -Zi). Then X - Zj)p'(Zj) = ' since the left side is a polynomial of degree < n — 1 that is equal to 1 at the n points Zi,i = 1,2,... ,n. Rewrite the identity as =i \<k<n zi Zk Let x = 0 to get E П i-J— = Now put and multiply (8.8.3) by Fn(a\,..., an) to arrive at the recurrence relation Fn(au...,an) = Obviously, C.T. Fn (ai,..., an) must satisfy the same relation: n C.T. Fn{ai an) = Y^ C-T- F(au...,aj-i,aj - \,aj+\,... ,ak). Also C.T. Fn @,0,..., 0) = 1 and if ak = 0, then C.T. Fn(ai,..., an) = C.T. Fn_i (a,,..., ak-U ak+1,..., an). The last relation holds because ak = 0 implies that only nonpositive powers of Zk appear in Fn{a\,...,an). It is easy to check that (a\+a2-\ +an)\/
428 8 The Selberg Integral and Its Applications {a\! a2\ ¦ ¦ ¦ an!) also satisfies the same recurrence relations and initial conditions. This proves Theorem 8.8.1 by induction. ¦ Morris [1984] derived the following constant-term identity from Selberg's in- integral. Theorem 8.8.2 Suppose p, q, and r are nonnegative integers. Then г1 т ТТл xp / i * \ TT ~ j}t (P + U - l)r)\(q + (j - l)r)!r!" The proof of this identity is left as an exercise. 8.9 Nearly Poised 3F2 Identities In Chapters 2 and 3 we gave several derivations of Dixon's sum for a well-poised 3F2. A terminating form of this identity also follows from Selberg's formula. The more general result of Aomoto allows us to sum a few nearly poised 3F2. Recall that a series is nearly poised if all but one of the pairs of upper and lower parameters have the same sum. To see that Selberg's formula gives the terminating Dixon sum, take n = 2 and у = у to be positive integers in Selberg's integral. We get el A / / o Jo 0 Jo \ *i Expand A - x2/xiJy by the binomial theorem and integrate term by term with respect to xi to get 2v / У^ x?+r~'(l — X2) Jo Г(г + 1)ГBу -г + 1) ГBу + а +р -г) _ Д* ГBу + 1)ГBу +а- г)Г(Р)Г(а + г)Т(Р) ~ f^ ГBг + 1)ГBу -г + \)ГBу +а+р- г)Г(а + Р + г) Г(а)Г(РJГ(а + 2у) ( -2у, а, -а - р - 2у + 1 Г(а+Р)Г(а + Р + 2у) \ -а-2у+1,а FJ2y, а,а р 2у + 1 \
8.9 Nearly Poised 3F2 Identities 429 This, together with the value of Selberg's integral for n =2, gives -2у,а,-а-0-2у+1 \ _ ГA + 2у)Г(а + р)Тф + у)Г(а 2+l+9 ' ) ~ + 2у)ГA + у)Г(а + 0 + у)' This is the terminating form of Dixon's identity we mentioned earlier. The next theorem gives the values of some terminating nearly poised series that can be derived from Aomoto's formula and its extension. This theorem was also obtained by Bressoud [1987] in the more general setting of basic hypergeometnc series. Theorem 8.9.1 f -2k, a, -a - p - 2k \ _ (a + l)tQ8)t(lJ* ' 3 4 a2ka+p ' ) -a-2k,a+p ' ) (a г (-2к,а,1-а-р-2к \ _ (а)к(Р + 1)к(\Jк(а + P) 3 2V la2ka + p+l ' ) l-a-2k,a + p+l ' ) (а+Р)к(\)к(аJк(а+р + l,l-a-p-2k \ (a + l)k(P)k(lJk(a 1 1 - a - 2k, a + P -2k,a,-a-p-2k \ (a)k(P + l)k(lJk ;1 \-a-2k,a + p ' У (a + p)k(\)k(aJk' -2k,u + \,\ -a-p-2k l-a-2k,a + p + 2 (a + l)k(P + 1М1)п(а + + P Proof. Consider the integral when к is a nonnegative integer. Expand (jci — x2Jk by the binomial theorem and integrate. The result is I^ Г(а1+2к)Г(р1)Г(а2)Г(р2) (-2k, 1 - ax - px - 2k, a2 р2K 2{ 1 - a{ - 2k, a2 + p2 ' The value of / from Aomoto's formula can be found in the following cases: (a) «i = a2 + 1 (= а + 1), p2 = Pi (= P), (b) «i = a2 (= a), p2 = Pi + 1 (= p + 1), (c) a2 = a, + 1 (= a + 1), px = p2 (= fi), (d) «! = a2 (= а), Д = ft + 1 (= P + I)-
430 8 The Selberg Integral and Its Applications These give the first four cases of the theorem. Theorem 8.3.1, which extends Aomoto's formula, can be applied when (e) a2 = «i + 1 (= a + 1), p2 = Pi + 1 (= P + 1). Thus the theorem is proved. ¦ The reader may also use Theorem 8.3.1 to work out the case cti=a2 + l(=a + l), Pi = p2 + 1 (= 0 + 1)- (8.9.1) There are nonterminating extensions of the sums in Theorem 8.3.1. See Exercise 15 for one example. It is possible to sum similar, almost very well poised 5F4 series. See Bressoud [1987] for some terminating cases, again done there for basic hypergeometric series. 8.10 The Hasse-Davenport Relation The finite-field analogs of the gamma and beta integrals are the Gauss and Jacobi sums respectively. It is worthwhile to look into the analog of Selberg's integral. Anderson discovered his idea for the evaluation of the integrals by studying the finite-field analog. Evans [ 1981 ] conjectured formulas that are analogs of Selberg's integral formula, and Anderson [1990] proved a special case of it. Later Evans [1991] used Anderson's ideas to obtain the complete result. In this section, we prove the Hasse-Davenport relation, which can be viewed as another finite-field analog of the multivariable beta integral of Dirichlet, which was used in Anderson's proof of the Selberg integral. In the next section, we give the statement and proof of the analog of Selberg's formula due to Anderson. Though this is a particular case of the more general known result, it contains the essential ideas that help explain the origin of the concepts used in Section 8.3. In this and the next section some knowledge of finite fields is assumed. Every finite field is a finite-dimensional vector space over the field Z/pZ = Z(p), for some prime p. A finite field F thus has q = pm elements for some integer m > 0 and a prime p. Denote by Fs the finite field with qs elements. For a e Fs define the trace and norm of a from Fs to F as TrFs/F(a) = a + aq +aq2 + ¦¦¦+ aq"' and NFs/F(a) = a ¦ ofi ¦ ¦ ¦ -ofi' . Check that the trace and norm of a belong to F and that NFs/F(a +p) = NFs/F(a) + NFs/F(P)
8.10 The Hasse-Davenport Relation 431 and NFs/F{aP) = NFi/F(a)NFs/F(P). We omit Fs/F in the notation for trace and norm if the context is clear. Fora e F, aq = a. This implies that Tr(a) = sa and N(a) = as when a is viewed as an element of Fs. More generally, Tif,/f(oc) = fTrf(ayf (a) and NFl/F(a) = [NF(a)/F(u)Y/d, where d = [F(a) : F] = the dimension of F(a) over F. Suppose that a is a root of the monic irreducible polynomial f(x) =xd - cxxd~x + c2xd~2 ¦¦¦ + {-\)dcd e F[x]. (8.10.1) Lemma 8.10.1 (a) The trace and norm of a from F(a) to F are given by Tr(a) = c\ and N(a) = cj. (b) If a is viewed as an element of Fs 5 F(a), then Tr (a) = ^ci and N(a) = cjd. Proof. (a) In this case, Tr (a) = a + aq + \-aq and N(a) = aaq ¦ ¦ ¦ aq . Since the automorphism /3 -» /3q of any finite field, which contains the field F with q elements, fixes F, it follows that 0 = (f(a))q = f(aq). Thus if a is a root of /, so is aq. This implies (a). (b) This follows from (a) and the discussion preceding Lemma 8.10.1. ¦ We now extend the idea of a Gauss sum, introduced in Chapter 1 for Ъ{р), to any finite field. For convenience in writing, denote Z(p) by Fp (not to be confused with Fs, the finite field of dimension s over F.) Let \j/ denote the additive character on F defined by f(a) = ^FIF'(a\ where fp = e2ni/p. (8.10.2) A multiplicative character on F is a homomorphism x from F — {0} to the complex numbers, that is, X(«/J) = X(«)X(/?)- (8.10.3) By convention x @) = 0. In what follows, let x denote a nontrivial multiplicative character. Define the Gauss sum g(x) by Х(а)ф(а). (8.10.4) aeF
432 8 The Selberg Integral and Its Applications We are interested in the connection between g(x) and ), (8.10.5) where x' = x о NFs/F and x/r' = f о TrFt/f. (8.10.6) Check that x' and ^' are characters of the appropriate kind on Fs. A relationship between g(x) and g(x') was given by Davenport and Hasse [1934]. We state and prove it after a couple of lemmas. Lemma 8.10.2 Suppose a e Fs is a root of the irreducible polynomial (8.10.1). Then Х'Ш'(а) = [x where x' and \jf' are defined by (8.10.6). Proof. X'(a)f'(a) = x(N(a))f(Tr(a)) (by Lemma 8.10.1) The next lemma is a well-known result on finite fields. ¦ Lemma 8.10.3 The polynomial xq' — x is the product of all monic irreducible polynomials in F[x] of degrees that divide s. (F has q elements.) The proof is left to the reader. The essential idea in the proof of the Hasse-Davenport relation given here is due to Weil [1949]. Weil's proof is much simpler than the original. We follow the account given by Ireland and Rosen [1991, §11.4], which contains a further simplification of Weil's argument by P. Monsky. The Hasse-Davenport relation is given in the next theorem. Theorem 8.10.4 -g(x') = (-g(x)Y- In order to make the proof more manageable, we break it down into two lemmas followed by the completion of the proof. Suppose / is the monic polynomial xn — c\xn~l + сгхп~г + ¦ ¦ ¦ + (—l)ncn, where с, е F. Define a complex-valued function к on the set of monic polynomials with coefficients in F by the equation
8.10 The Hasse-Davenport Relation 433 Lemma 8.10.5 The function X is multiplicative, that is, if f and g are monk polynomials in F[x], then A.(/g) = X(f)X(g). Proof. Suppose f{x) = xn - cxxn~x + c2xn~2 + h {-\)ncn and g{x) = xm - dxxm-1 + ¦¦¦ + {-\)mdm. Then /(*)*(*) = xm+n - (ci + dl)xm+n-1 +¦¦¦ + (-l)m+ncndm. By definition = х(с„Жс1)х(<*«Ж<*1) = Lemma8.10.6 g(x') — X^/(deg/)A.(/)s/deg/, where the sum is over all monic irreducible polynomials in F[x] whose degrees divide s. Proof. Suppose a e Fs satisfies an irreducible polynomial / of degree d. Then X'(a)xfr'(a) = [x{cn)f(ci)Yld = X(fYld by Lemma 8.10.2. This implies that conjugates where the summation is over all the conjugates of a. Since every element of Fs is the root of a monic irreducible polynomial whose degree divides s and conversely (in fact, these polynomials are the only irreducible factors of xq" — x), the result is proved. ¦ Proof of Theorem 8.10.4. Consider the L-function given by the formal power series L(X,t) = J2/monicM/)?deg/> where the summation is over all monic polynomials in F[x]. The following identity is easy to verify: L(x,t)= ^ M/)'deg/ = П (i-M/)'*87), / monic / irred. where the product is over all monic irreducible polynomials. Here it is understood that X(l) = 1. Write the L-function as n=0 \deg/=n The coefficient of t is (x - a) = Y,X(a)f(a) = g(x), aeF aeF and the coefficient of tn for n > 1 is
434 8 The Selberg Integral and Its Applications The factor qn~2 arises because, for a given pair c\, cn e F, there are qn~2 ways of writing the other coefficients of a polynomial. We have shown that L(k, f) = l+ g(X)t = f irred. Take the logarithmic derivative and multiply the equation by t. Then tg(x) or _ _ J2 (deg/)A(/)J/de^ s=i Equate the coefficients of ts on each side and use Lemma 8.10.6 to complete the proof of the theorem. ¦ Remark 8.10.1 In Exercise 1.50, the Dirichlet L-function ПA - x(p)P~Tl was introduced. The prime ideals of the ring of integers p are the ideals generated by prime numbers p. The number of elements in the field Ъ(р) is p. For the ring of polynomials F[x], the prime ideals are generated by irreducible polynomials. If F has q elements, then F[x]/(f(x)), where / is irreducible, has qdesf elements. Thus, when t = q~s, f irred. / irred. is called an L-function. Note that Л is multiplicative on F[x] just as x is multi- multiplicative on Z. 8.11 A Finite-Field Analog of Selberg's Integral We begin by recalling the definition and some elementary properties of the resultant of two polynomials. Suppose К is a field and let f(x) — aox" + a\xn~x H + an, g(x) = boxm + bix1"-1 +•¦¦+*„ be two polynomials in K[x].
8.11 A Finite-Field Analog of Selberg's Integral 435 The resultant of / and g, R(f, g), is defined by the (m +n) x (m +n) determinant по b0 а\ ао Ъх bo а\ Ьх а„ по ьт ап ът ¦ а„ (8.11.1) bo h ... bn It can be shown (an exercise for the reader) that the resultant vanishes if and only if either / and g have a common nonconstant factor or the leading coefficients of / and g are zero. Suppose that / and g can be factored as / = ao(x - xi)(x - x2) ¦ ¦ ¦ (x - xn), g = bo(x - yi)(x -y2)---(x- ym). Then R(f, g) = ¦ * " У & = < П (8.11.2) (8.11.3) It follows that R(f,gh) = R{f,g)R{f,h). It is also clear from the definition that if either g or / is a constant, then R(f,bo) = bno, R(ao, g) = a?. (8.11.4) Now let F be a field with q = p" elements where p is an odd prime. Let x and rjr be the multiplicative and additive characters defined in the previous section. For all positive integers a, define the Gauss sum (8.11.5) (8.11.6) (8.11.7) x€F Extend the definition to all integers a by the requirement g(a+q-l) = g(a), andlet g*(a) = q/g(-a). For all positive integers a, fi, y, n, the Selberg sum is defined by
436 8 The Selberg Integral and Its Applications where the sum is over all monic polynomials / of degree n in F[x], A/ is the discriminant of /, and S — x(9~1)/2- To evaluate this sum we need a property of the following L-function: For a monic polynomial V e F[x], let w where W ranges over the monic polynomials in F[x]. Let G be the product of all distinct monic irreducible polynomials that divide V. Denote the multiplicity of the monic irreducible polynomial / as a factor of V by ord/V. We say that V is primitive if for every factor / of V, q — 1 does not divide ord/V. The next lemma plays a role in the evaluation of Selberg's sum similar to the role of Lemma 8.4.3, which is a consequence of Dirichlet's beta integral formula, in the evaluation of Selberg's integral. The proof of this lemma employs the Hasse- Davenport relation (proved in the last section). Lemma 8.11.1 Suppose V is primitive and of positive degree. Then degL(t, V) <degG-l and the coefficient e(V) o/rdegG^ is e(V) = S(AG)x(R(V, G'V f\G Proof. It follows from (8.11.2) that R(V, W) depends only on the value of W mod G. Suppose m is an integer > deg G. For a given polynomial S of degree < deg G — 1, there are qm~ deg G polynomials Q of deg m — deg G such that G<2+Sisofdegm. Write ( X(R(V,W))\tm. Since V is primitive, x(R(V> $)) Ф 1 for some polynomial 5, and hence deg W=m S Thus L(t, V) is a polynomial of degree < degG — 1. To find e(V), consider the double sum X(R(V,U)), \ I r I и w where W ranges over monic polynomials of deg G — 1 and U ranges over poly- polynomials of degree < deg G and ReSoo denotes the residue at infinity. When U is a
8.11 A Finite-Field Analog of Selberg's Integral 437 constant, say a e F, then „ UW, «•¦*,,• U(l/x)W(l/x) 1 —ReSoo ax = coefficient or 1/x in ¦ —т = а. G G(l/x) x (The factor 1/x2 comes in because the transformation x -> 1/x changes dx to 2.) By (8.11.4) When U is not a constant, then, by an argument similar to the one given before, the sum over all W vanishes. Thus ^2 (8.11.8) a€F To evaluate ц in another way, start with the fact that the sum of the residues of UW/G at finite points plus the residue at inifinity is 0. The finite poles are at the zeros of G. This implies UW ^ UW -Reso G ^ G ' Res i)S Here r\f denotes a root of G that comes from the irreducible factor /. The residue at r]f is given by U(r) f)W(r) f) /G\t] f), where G' denotes the formal derivative of G. The sum of residues at all the roots of / is then (U(.t)f)W(r)f)\ \ G'(r,f) )¦ Use this to write ц, as U W f\G Here N denotes the norm, and we used (8.11.2) to obtain the expression involving N. To simplify /x, observe that when R(V, W) ф 0, otdfV = Ф Tr I ord/V (8.11.9) \G'(rif)JJ It is clear that since V is primitive, the sum over U in ц vanishes when R(V, W) = 0. Thus the terms in the sum indexed by W that have R(V, W) = 0 can be dropped.
438 8 The Selberg Integral and Its Applications This observation, together with (8.11.9), (8.11.2), and (8.11.3), implies that ff (8.11.10) f\G In the above expression the Gauss sum gf is over the field F(r]f), where [F(r)f) : F] — degree of /. By the Hasse-Davenport relation, gf(-ordfV) = (-l Now observe that Y[f^G(—l)aeef~1 is equal to the sign of the permutation of the roots of G effected by the qth power automorphism of the algebraic closure of the field F. Since q is odd, this value is equal to S(AG). Thus д = €(V)S(AG)x(R(V, G'))Y[g(-ordfV)^f. (8.11.11) Compare the two expressions for /x, (8.11.11) and (8.11.8), to prove the lemma. ¦ Theorem 8.11.2 Suppose that the numbers a, a + y,..., a + (n — l)y; p, p + y, ..., p + (n — \)y; y, 2y,... ,ny are not divisible by q — 1. Then the Selberg sum can be written as g4 S( я = П g4a + JY)g4p + iY)g4u i Proof. The proof is by induction. It is sufficient to prove that с/ а ч g*(u)g*(P)g4ny) S(a * Y) Sia + YP + Sn(a> *' Y) g*(a + P + (n For this рифове, consider the double sum )"Q(lfR(P, Q)y), P Q where the sum on P ranges over monic polynomials of degree n — 1 and Q is over monic polynomials of degree n. First note that x(Q@)aQufR(P, QY) = x(Q@)aQ(ifR(QY, P)) So we can take V = Qy or V = xa(x - lfPY in Lemma 8.11.1, since the hypo- hypothesis of the theorem implies that V is primitive. The lemma also implies that the sum over P (respectively Q) is zero if Q (respectively P) is not square-free. Therefore, summing over P with Q square-free, Lemma 8.11.1 implies that g*(ny)
Exercises 439 This follows since, if Q = Q\ ¦ ¦ ¦ Qs is a factorization into irreducible polynomi- polynomials, then deg V = deg Qy = ny and orda V = y. Thus by (8.11.2), S = X(-ir+n{n-l)y/2Sn(a, p, y)g\y)n/g4ny). (8.11.13) Similarly, summation over Q with P square-free and V = xa(x — 1)PPr gives n-l •*(а + р + (п-1)У) Set T = —x(x- 1)P = (x- 1)P +xP +x(x- \)P' dx and observe that x(R(xa(x - \fPY, T)) = x(T@)aT(l)fiR(P, T)y) = X((-l)"P@)aP(lf R(P, x(x - and R(P,x(x - 1)P') = R(P,x)R(P,x- 1)R(P, P') = P@)P(l)R(P,P'). These relations and the fact that S(Ax^-i)P) = 8(AP) give " S = x(-ir+n(n-1)r/2Sn-i(a + У,Р + У, Y) 8(а)?У_?8(У) v (8.П.14) g*(a + p + (n- \)y) A comparison of (8.11.13) and (8.11.14) shows that (8.11.12) is true, which proves the theorem. ¦ Exercises 1. Prove that if / is continuous on [0, 1], then lim a [ ta~ The case in which / is differentiable is easier and can be done by integration by parts. 2. Suppose that / is a complex measurable function on a measure space X with a positive measure ц, and that ||/||oc > 0. Prove that if \\f\\p < oo for some 0 < p < oo, then II У* I ¦ II f> || as p —>• oo.
440 8 The Selberg Integral and Its Applications 3. Work out the details of the proofs of Corollaries 8.2.2 and 8.2.3, that is, prove the formulas /00 /-OO * " ¦••/ IixUx"le'X П \*-хА = П(« + («- j )r) П and J-оо ./-oo \ ^,.1 J l<i<jsn 4. Denote the first integral in Exercise 3 as Gn (k; a,y) and employ the method used in proving Aomoto's formula to show that Gn(k; a, y) = [a + (n- k)y]Gn(k - 1; a, y). Obtain another evaluation of Gn(k; a, y) from this recurrence relation. 5. Here is one proof of Selberg's alternative beta integral formula (Theorem 8.3.3). With R+ = [0, oo), set (ГЫ П iXi-xj^flxf-'e-^dx. \i=1 / 1 <(<;<« i=l (a) Show that A.^+*+na+(n-1)n>'-1e-xG(A.) = G{\)X^le-k. (b) Integrate (a) with respect to Л over [0, oo) and show that /,,„ Г1 i V^n -iP+k+nu+n(n-\)y \.LXi ' JR+ [L+2^1xi\ i = l =1(а + (л-у» " г (а + (л - у (с) Change variables so that / \ -i show that д(уи...,у„) and obtain Selberg's formula. 6. Prove Lemma 8.3.2. 7. Verify Equations (8.3.6) and (8.3.7) in the proof of Theorem 8.3.1.
Exercises 441 8. Prove Theorems 8.5.4 and 8.5.5. The next five problems are similar to Steiltjes's maximum problem and have similar solutions. 9. Let there be a positive mass p at the fixed point x = 0 and unit masses at the variable points x\, x2, ¦ ¦ ¦, xn in [0, oo) such that x\-\ Yxn < nK, where A" is a given positive number. Show that the maximum of j=\ l<i<j<n is attained if and only if the {Xj} are zeros of the Laguerre polynomial L^ (ex), where a = 2p — 1 and с = (n + a)/K. 10. Suppose there are unit charges at the variable points xi,..., xn in the interval (—oo, oo) such that х\л +хгп <nL, for a given positive number L. Show that the maximum of V(X\,...,Xn) = Y[ \xi~xj\ l<i<j<n is attained if and only if the {jc, } are the zeros of the Hermite polynomials Hn(Cx), С = V(n 11. Suppose there are n unit masses, n > 2, at the points x\, хг, ¦ ¦ ¦, xn in [—1, 1]. Find the positions of these points for which fJi <, <, <„ \x% — xj I is a maximum. 12. Suppose that the n unit masses exist at jci ,..., xn in [0, oo) and satisfy the condition X] +x2-{ Yxn < nK, where К is a given positive number. Find the positions of these points for which IIi<,-<;-<n I** - XA is a maximum. 13. Suppose charges are distributed on a unit circle. At в = 0, fix a charge +q and at, в = ж, fix a charge +p. Distribute 2N freely moving unit charges at 6»i,..., 02n so that O<0j:< ж, j = \,...,N and ж <0j <2tt,j = N + 1,...,2N. Show that the potential IN 2N T = ~qY, logl 1 - eWt | - p Y, logl 1 + e1'* I - XI 1оё1е'е' " e"J I /k=l k=\ l<k<j<2N is a miminum when the #,s are zeros of the Jacobi polynomial рй-1/2,г1/2) (Cos^)> о < в < 27Г. See Forrester and Rogers [1986].
442 8 The Selberg Integral and Its Applications 14. Prove that, with appropriate conditions on the parameters for convergence, 00 /-OO " / Г1A+^)~^~2У(Й~1) П \(xi-xj)\24xl...dxn T / / r Г{а + (у - 1)у)Г(р + U - 1)к)ГA + jy) and Bтг)" J-o \(Xi -Xj)\lYdxx...dxn 1 П Г(а _ 0- _ \)у)Г(р - (у - 1)У)ГA + у)' For the second integral, begin with Cauchy's beta integral. 15. Show that 3F2( а~ХЛс ;l) \a+ l-b,a+l -c J (а- 1)Г(а + 1 - b)T(a + 2 - c)T(a/2 + 1)Г(а/2 + 2-b-c) ~ (b - l)(c - 1)Г(а/2 + 1 - Ь)Г(а/2 + 1 - с) Г (а + 1)Г(а + 1 - * - с) Г (а + 1 - Ь)Г(а + 1 - с)Г(а/2 + 1/2)Г(а/2 + 3/2 - Ъ - с) {Ь- \)(с - 1)Г(а/2 + 1/2 - Ъ)Т{а/2 + 1/2 - с)Г(а)Г(а + 1-Ь-с)' Note that, if а = 2л, the second term vanishes and the first can be evaluated by setting a = —In — e and letting e ->¦ 0. When a — —In — 1, the first term vanishes and the second can be evaluated by a similar limit. 16. Evaluate the integral in Remark 8.7.1, that is, One approach is to set jc, = tan@,/2) in the second integral in Exercise 14. 17. Prove Morris's constant-term identity contained in Theorem 8.8.2. 18. Complete the missing steps in the proof of Theorem 8.6.5. 19. Prove the arithmetic and geometric mean inequality as follows: Let x\ < xi < хт, ¦ ¦ ¦ < xn. Show that with s = (xi + • ¦ • + xn)/n, s(x\ + xn — s) > xixn. By the inductive hypothesis the result holds for the n — \ numbers Jt2,..., xn-\, x\ + xn — s. Now obtain the necessary result. 20. Fill in the details in the proof of Theorem 8.9.1. In particular, work out case (8.9.1).
Exercises 443 21. Consider Sn(a,p,y;u) = J IFCO)!"-1 IF(l)^1AF\y-2F(u)dF0dFi... dFn-\, where и is a parameter, (a) Show that [ F(t)€T> i=Q A iz'(fr)r 11 ,-=0 x (This follows from Lemma 8.4.3.) (b) Let ¦F(u)dF0 ¦ ¦ ¦ dFn_2dG0 ¦ ¦ ¦ dGn-i. Show that 1 d Г(у)п In(ct, P, Y\ u) = -—Sn(a, p, у; и) ¦ ——- n du 1 (ny) (с) Now prove by induction that Sn(a,p,y;u) m-\ I ж—, I П \ = Sn(a,p,y) To prove this, let Tn(a, p,y;u) denote the sum inside the braces show that Tn(a, p, y; u) = nTn-i(a + y,P + y,y; u). (d) Use (c) to show that the introduction of the factor x\x2---xm(\ -дгт+1) • • • A -xm+i)\ m + l <n, inside the Selberg integral, multiplies its value by (e) Use the Chu-Vandermonde identity to sum the expression in (d). The following problems on finite fields use the notation and definitions given in Section 8.10. 22. For a e Fs show that (a) 2>F,/F(a) € F, NFs/F(a) e F, (b) Trpjp maps Fs onto F, (c) There exists a /? e F such that = 0-
444 8 The Selberg Integral and Its Applications 23. Define ga{x) = J2t€F Х(?Ж«О- Show that (a) \ga(x)\=q1/\ (b) IfX ^н^,thenga(x)s«(X-1) = X(-l)<^ 24. (a) Prove Lemma 8.10.3, (b) Take F = Ъ/рЪ = Z(p) in (a). Let Nd denote the number of monic irreducible polynomials of degree d in F[x]. Use (a) to show that d\s (с) Prove that 25. Prove that 1 00 = У2 (# of monic polynomials of degree n)tn n=0 = П а-'57)-1 / irreducible where Nd has the same meaning as in the previous problem. Now deduce the result in Exercise 24(b). 26. Verify formulas (8.11.4). 27. Prove that R(f, g) = 0 if and only if either / and g have a common noncon- stant factor or the leading coefficients of / and g are 0. (Note that / and g have a common factor if and only if there exist polynomials f\ and g\ such thatdeg/i < deg/ - 1, deggi < degg - 1, and fgx = fxg.) 28. Prove that if / = ao(x - x\) ¦ ¦ ¦ (x - xn) and g = bo(y -yi)---(y- ym), then
Spherical Harmonics The aim of this chapter is to introduce the basic functions necessary for Fourier analysis in higher dimensions. One way to view cos пв is as a restriction to the unit circle of the homogeneous polynomial [(x + iy)" + (x — iy)"]/2, which is a solution of the two-dimensional Laplace equation. Spherical harmonics are restrictions to the spherexj* +x| H \-x^ = 1 of homogeneous polynomials that are solutions of the и-dimensional Laplace equation. These functions are related to the ultraspherical polynomials studied in Chapter 6. An important result in this chapter is the addition theorem for ultraspherical poly- polynomials, which generalizes the addition formula for the cosine. A useful tool in the proof of this result is a theorem of Funk and Hecke on an integral of a product of a continuous function and a spherical harmonic. Our presentation owes much to Miiller [1966]. We also employ the Funk-Hecke formula to obtain the Fourier transform of a function on R", which is the product of a radial function and a spherical harmonic. The final six sections of the chapter show how spaces of spherical harmon- harmonics of a given degree give irreducible representations of SUB), the group of all 2x2 matrices a —i b a of determinant one. Representation theory provides a very important approach to the study of special functions. Unfortunately, we do not have space for it here. The reader may consult Vilenkin [1968] or Miller [1972]. Here we content ourselves with showing the manner in which Jacobi polynomials appear in representations of SUB) and deriving an addition theorem. 9.1 Harmonic Polynomials Solutions of the Laplace equation Y,^>=0 (9.1.1) 445
446 9 Spherical Harmonics are called harmonic functions. We are particularly interested in polynomial so- solutions. Since any polynomial in the variables x\, x2, • • •, xn is a sum of a finite number of homogeneous polynomials of different degrees, we concentrate on these. Definition 9.1.1 A polynomial Hm(x), which is homogeneous of degree m and satisfies (9.1.1), is called a harmonic polynomial. Interest in these polynomials arises from the fact that they are useful in finding harmonic functions with given boundary values. In particular, one would like to determine such functions in the unit ball YH=ixl < 1- To motivate the study of these polynomials and functions, consider the case n = 2. In this situation we usually write и — u(x, y), where x and у are real. It is clear that z = x + iy can be used to determine harmonic functions. Since and =B(Bi)(, + ly) dyl it follows that z" = (x + iy)" is a harmonic polynomial. Similarly, z" = (x — iy)" is also a harmonic polynomial. There is a technical problem in using (x + iy)" and (x — iy)n to find harmonic functions with a given boundary value on x2 + y2 = 1, because it is hard to see some of the important properties of these polynomials when they are given in terms of (x + iy)" and (x — iy)". This problem can be resolved by using polar coordinates. Setx = r cos в and у = r sin 6» so that z — re'9 and z" = r"e'"9. Then un(x, y) = r" cos пв = [{x + iy)n + (x - iy)"]/2 and vn(x, y) = r" sinne = [(x + iy)" -(x- iy)"]/Bi) are harmonic polynomials of degree n. The Poisson integral r2 _ r2 - 2r cosF> - ф) + r2 then solves the problem of obtaining a harmonic function u{x, y) = v(r, в) with Итг_и- v(r, в) = /(в), 0 < в < 2n. Note that the Poisson kernel X_r2 _ _~ 1 - 2r cos в + r2 Thus the kernel in the above integral is an infinite sum of harmonic polynomials.
9.2 The Laplace Equation in Three Dimensions 447 The polynomials un(x, y) and vn{x, y) can also be expressed as к even and On the circle x2 + y2 = 1, it is simpler to view these polynomials as cosnO and sin пв respectively. Note that, for n = 1, 2, 3,..., there are two independent harmonic polynomials of degree n. For n = 0, there is just one. This is reflected in the coefficients in the expansion of the Poisson kernel. 9.2 The Laplace Equation in Three Dimensions We have seen that it is convenient to use polar coordinates to study the harmonic polynomials on the unit ball in two dimensions. We could have employed polar coordinates right in the beginning by writing the Laplace equation as d2v 1 d2v 1 dv ^2+72W2+-r^ = °- (9'2Л) A set of solutions of the form U(r, в) = R(r)TF) can be obtained by separation of variables. We have r2R"T + rR'T + RT" = 0 or r2R"+rR'_ T" _ R ~ ~~T ~ C' where с is a constant. So r2R" +rR' -cR = 0, which is Euler's equation with a solution of the form R = rl. The exponent A. satisfies A. (A. - 1) + к - с = 0 or
448 9 Spherical Harmonics For polynomial solutions we must have A. = 0, 1, 2,..., and с = n2 for an integer n. Thus T satisfies the equation T" + n2T = 0. Two independent solutions are cos пв and sin пв. The above calculations can be carried over to three dimensions. We write Laplace's equation in spherical coordinates as 2dv\ 1 9 / dv\ 1 d2v _ r Y) + ^eYe \sm0Ye) + ^TeW2~ = —c. The ranges for в and ф are 0 < в < n and 0 < ф < 2л. Separating variables U(r, ф, в) = R(r)FD>)T(e) gives (r2R')' 1 (smOT')' 1 F" _ R + sin6> Г + sin2в ~F ~ Therefore, (r2R'Y R and 1 (sineT'Y 1 F" sin6< Г sin2 в F The first equation can be rewritten as r2R" + 2rR' -cR=0. Again, R = rx gives A. (A. - 1) + 2A -c = 0 or A. (A. + 1) -c = 0. Write с = n{n + 1), so that X = n and A = —и — 1 are the solutions. Again, for polynomial solutions take A = n, a nonnegative integer. Then (sin 6T)' , F" sine- -+n(n + 1) sin2 6>H =0. T F So, F" — dF = 0 for a constant, say, d = —m2. Then T satisfies the equation 1 d ( dT\ { m2 Л sin6>— ) +<n(n+\) 5— \T = 0. sinddey de) \ sin2 6> / Set x = cos в and Т(в) = у(х) so that this equation becomes 4 dx
9.3 Dimension of the Space of Harmonic Polynomials of Degree к 449 This differential equation was mentioned in Chapter 3 as a second-order equation with regular singularities at — 1, 1, and oo and no other singularities. In Riemann's notation, its set of solutions was given by -1 oo 1 \ m/2 n + 1 m/2 x >. C.9.5) —m/2 —n —m/2 ) Notice that the indices associated with 1 and — 1 are the same. From the discussion in Chapter 2, and in particular from Equation B.3.6), it follows that the set of solutions can also be written as (l-x2)m/2P{ 0 n+m+1 0 x \ s A -x2)m/2v. By Theorem 2.3.1, v satisfies the equation A -x2)y" - 2A + m)xy' + (n- m)(n This equation has polynomial solutions when n — m > 0 is an integer. In fact, the polynomial solution is the ultraspherical polynomial C™_m2(x). The reader may verify this by comparing the equation with the differential equation for Jacobi polynomials in Chapter 6. See Exercise 6.25. We have shown that the Laplace equation is satisfied by rn A — x2)m^2Cn_^ (x) cos тф and r"(l — x2)m//2C™_m2(x)sinm0, where x = cos# and 0 < m < n. These are 2n + 1 independent solutions. We will show that these span harmonic polynomials of degree n in three dimensions. Observe that only one solution, namely rnCxJ2{x), does not depend on ф. 9.3 Dimension of the Space of Harmonic Polynomials of Degree к Let Vjt denote the vector space of homogeneous polynomials of degree к in n variables. Each polynomial p e Vk has the form p(x) = \a\=k wherea = (ab a2, • • •, an),x = (xux2, ¦¦-, xn),ca =ca,,a2i. ..,„„, xa =x"' •••*"", and |a| = YH=i ai< with a' nonnegative integers. The dimension of Vk is the num- number of и-tuples (аь аг, ¦ ¦ ¦, а„) with YH=i a' = ^- Represent a given и-tuple (ai, аг, - - -, а„) by a sequence of к dots and n — 1 vertical lines as in the example given below:
450 9 Spherical Harmonics There are ai dots before the first line, then a2 dots between the first and the second line, and so on. Clearly, there is a one-to-one correspondence between such sequences and the и-tuples. The total number of dots and lines is n + к — 1 and so the number of different arrangements of dots and lines is A second way of seeing this is that dk<n is the coefficient of xk in the expansion of A — x)~n = A — x)~l • • • A — x)~x, for this coefficient is the number of solutions of Еаг = к with a,- nonnegative. Yet another argument in a special case is given by Anno and Mori [1986]. It is evident that not all the homogeneous polynomials are harmonic. For instance, in the case of two variables, the dimension of the space of harmonic polynomials of degree к > 0 is 2 but dk,2 = к + I. To find the number of independent harmonic polynomials of degree к in n variables, write the homogeneous polynomial к P(x) = ^2xJnAk-j(xi,..., xn-\), j=o where А*_у- is homogeneous of degree к — j in x\,..., xn-\. Apply the operator ^ 92 Э2 ii илп to p(x) to get к Ap(x) = ^2j(j j=2 k-2 If p(x) is a harmonic polynomial, then we must have So once Ak and Ak-\ are given, the remaining A, are determined. Therefore, the number of linearly independent harmonic polynomials of degree к in n variables is Ck,n = <4,«-l + dk-l,n-l k-l J fc!(n-2)!
9.4 Orthogonality of Harmonic Polynomials 451 Observe that ckt2 = 2 for к > 0 and ск^ = 2к + 1, which reconfirm the statements at the end of the first two sections. 9.4 Orthogonality of Harmonic Polynomials The harmonic polynomials of different degrees in two variables are orthogonal when integration is over the unit circle; the same is true in three variables over the unit sphere. This is clear by looking at the polynomials given in the earlier sections that formed a basis for the space of harmonic polynomials in two and three dimensions. This result continues to hold in higher dimensions. Before proving this generalization, note that polar coordinates in n dimensions, (r,0\,..., вп-г, ф) can be defined by the equations x\ = r cos 0i, X2 = a-sin 0i cos 02, x3=A-sin0i sin 02 cos 03, xn-\ = л sin 0i • • • sin0ra_2cos0, xn = r sin 0i • • • sin0n_2sin0, where 0 < 0, < 7Г and 0 < ф < 2т:. Let Hk(x) and Hj (x) be homogeneous harmonic polynomials in n variables of degrees к and j respectively with j ф к. By Green's theorem, 0= / [Hj(x)AHk(x) - Hk(x)AHj(x)]dV J\x\<\ = f •/№1=1 r=\ r=l (9.4.2) where ? = x/|x|, |x| = r, and da>(%) is the invariant measure on the surface of the sphere. We used the fact that the normal derivative on the sphere is in the radial direction. The homogeneity of Hk{x) gives 9 9 э7' r=l Substitute this in (9.4.2) to arrive at (* - j) f HjG)HkG)da>G) = 0. (9.4.3) The functions Hk{%), which are restrictions of homogeneous harmonic polyno- polynomials to the surface of the sphere in Rn, are called spherical harmonics. Some- Sometimes they are also called surface spherical harmonics and the Hk(x) are called
452 9 Spherical Harmonics solid spherical harmonics. A notation sometimes used when expressing spherical harmonics in polar coordinates is where в = (въ в2,..., <9„_2)- Let F be the space of real-valued continuous functions on the sphere |?|2 = ?f + f | H + %l = 1. An inner product on this space can be defined by {f,g)= I /(?)*(?)<M?), farf.geF. (9.4.4) J\l-\ = l The conjugate of g is taken in the integral, if complex-valued functions are used. The result contained in (9.4.3) is stated in the next theorem. Theorem 9.4.1 Spherical harmonics of different degrees are orthogonal with respect to the inner product (9.4.4). Spherical harmonics of the same degree may or may not be orthogonal. For example, cos кв and sin кв are two independent spherical harmonics in two di- dimensions and are orthogonal, but cos кв + sin кв and cos кв are not orthogonal. Using Gram-Schmidt orthogonalization, it is possible to choose an orthonormal basis of ckin = Bk + n - 2)(k + n- 3)!/(fc!(n - 2)!) spherical harmonics of degree к in n variables. Denote the members of this basis by Theorem 9.4.1 and the definition of Skj(i~) imply (SkJ, Sk'J'} = Skk'Sjj'- 9.5 Action of an Orthogonal Matrix Recall the definition of an orthogonal matrix. Definition 9.5.1 Ann xn matrix О is called orthogonal ifOO' = /, where O' is the transpose of O. It is clear that if О О' = I then О' О = /, that is, O' is also orthogonal. Write a vector x € RN as /xt\ X = X2 \Xn/
9.5 Action of an Orthogonal Matrix 453 and write the inner product of two vectors x and у in R" as (x,y) -x'y =x\y\ +х2у2~\ Vxnyn. If О is an orthogonal matrix, then {Ox, Oy) =x'O'Oy=x'y = (x,y). (9.5.1) So, if we write (x,x) = ||x||2, then ||Ox|| = ||x||. In particular, if ||?|| = l,then We now show that the Laplace equation remains invariant under the action of an orthogonal matrix. Write (f)-(f f)'- \OX J \OXl OXn J Then the Laplace operator is given ЪуD-)'D-). For a change of variables x' = Ox, This proves our claim. Consider the action of the orthogonal matrix О on a spherical harmonic Hk{%) defined by the mapping #*(?) -> Hk(O%). This action transforms an orthonor- mal basis Skj(^), j — 1,2,..., с*,„ to another orthonormal basis Skj(Ol-), j — 1, 2,..., ck n for the space of spherical harmonics of degree к inn variables, since Moreover, ~ ki,Sk,tG) (9.5.3) e=i and the coefficients (Ак}1) form an orthogonal matrix. This follows from the relation }1 Ct.n e=i * = / SkJ(OS)SkJ.(OS)d(oG) = Sjr.
454 9 Spherical Harmonics 9.6 The Addition Theorem The spherical harmonic of degree к in two variables, cos k6, has the addition formula coskO coskOo + sinkd sink90 = cosk@ — во). (9.6.1) We note two interesting properties of this spherical harmonic. The only orthogo- orthogonal transformation that fixes the point (cos во, sin#o) on the circle is the reflection about the line в = во; the unique independent spherical harmonic of degree k, which is invariant under this transformation is cosk(e — во). Moreover, since sinke = cos(ke — |), we can view coskd as the basic spherical harmonic in two variables. This uniqueness continues to hold in higher dimensions but with a different basic function. The goal of this section is to obtain these generalizations. There is no nonconstant harmonic polynomial that is invariant under all orthog- orthogonal transformations. But there is exactly one, modulo a constant factor, invariant under transformations that leave one point fixed. To prove this, start with the fol- following preliminary lemma. Lemma 9.6.1 Up to a constant factor, there exists at most one harmonic polyno- polynomial of degree к that is invariant under all those orthogonal transformations that leave one point r\ of the unit sphere fixed. Proof. Since the inner product (Ox, rj) = (x, 77) for all orthogonal transforma- transformations О that leave /? fixed, it is sufficient to prove that there is, at most, one (up to a constant factor) harmonic polynomial Hk(x) that depends only on r and (x, 77). From the homogeneity of Нк(х), we have Hk(x) = co(x, r,)k + ar(x, г?)* + c2r2(x, r))k-2 + ¦¦¦ . It is an easy calculation that A[rl(x, rj)m] = m(m - l)(rj, r])rl(x, j])m~2 + l(? + 2m+n- 2)гг~2(х, rj)m. Then AHt = 0 implies that the coefficients ck satisfy the relations (к - m)(k -m- \)cm + (m + 2)Bk -m-2 + n- 2)cm+2 = 0 for m = 0, 1, 2,... and c\ = 0. This shows that c0 determines Hk(x), and the lemma is proved. ¦ We now show the existence of the harmonic polynomial of Lemma 9.6.1. Let Skj($), j = 1,2,..., Ck,n, be the orthonormal basis of the space of harmonic polynomials of degree к in n variables. The form of (9.6.1) and the remarks after that suggest the consideration of the function F(l=,rf) defined below. First define
9.6 The Addition Theorem 455 the vector function and the orthogonality of the matrix С = (Akjt), we have thensetF(?, 77) = S^'SCt?), where ? and 77 are points on the sphere. From(9.5.3) kjt) , Or,) = SiOtf = S^yCCSir,) = F(?, 77), when О is an orthogonal transformation. It is clear that F, as a function of ?, is the restriction to the unit sphere of a harmonic polynomial of degree k. Moreover, if О is any orthogonal transformation that fixes 77, then F(Ot;, 77) = F(?, 77). This proves the existence of the function we were looking for. Its uniqueness was proved in Lemma 9.6.1. It is clear that the function F(^,r]) depends only on the inner product (?,17). Denote it bkPk((%, /?)) and normalize by taking Pk((n, rf)) = Pk{\) = 1. Definition 9.6.2 The function F(?, 77) = bkPk{(%, 77)) is called the zonal har- harmonic of degree к with pole rj. To determine bk, take ? = 77 to get Ct.n bk = Y^(SkJ(ri)J; 7=1 then integration over the sphere with respect to da>(rj) gives bkcon = ск,п or bk = ckttl/con, where con = 2(я)"/2 Г (и/2) is the surface area of the unit sphere x\ + x\ + ¦ • • + x2n — 1. Now multiply the equation 7?)) = ?5t y=i by itself and integrate with respect to 77. The result is "I У|Ч|=1 ri))]2da>(Ti)
456 9 Spherical Harmonics Therefore, f J\r (? i)jG, r,))dw(r,) = —Skj. (9.6.2) \r,\=l ck,n This orthogonality relation for Pk will help us identify the function. Rotate ? to ej = A,0,..., 0) and take ц = te\ + Vl — t2r]', where |/?'| = 1 and the first component of rj' is zero. A change to polar coordinates gives (9.6.3) The Jacobian is given by dx\dx2 ¦ ¦ ¦ dxn = rn~x sin"~2 0\ ¦ ¦ ¦ sin2 0„_з sin6n-2drd6\ ¦ ¦ ¦ dQn-2d4>, which implies that, on the sphere, dcon = sinn~2 eideidcon-i = A — t2)^~ dtdo)n^\. (9.6.4) The orthogonality relation (9.6.2) can be written as Pk(t)Pj(t)(l - t2)^dt = Шп Skj. (9.6.5) <»n-\Ck,n Thus Pk(t) = AC(k~2)l2 (t) is an ultraspherical polynomial. Since Pk(\) = 1, the constant is given by A = 1/cf ~2)/2(l). We have, therefore, proved the following addition theorem. Theorem 9.6.3 LetSk ,_/(?)» 7 = 1,2,..., ckn, be an orthonormal set of spherical harmonics of degree k. Then (9.6.6) Remark 9.6.1 This result contains (9.6.1) as a limiting case. Recall that C(n~2)/2(t) lim —7—тгт^—=cos^0, where t = cosd. п^г cf-2)/2(l) Remark 9.6.2 Since the integral (9.6.5) can be directly evaluated from the prop- properties of ultraspherical polynomials, the value of ck<n, the dimension of the space of spherical harmonics of degree k, can be computed from (9.6.5). To see what the addition formula (9.6.6) looks like for n = 3, consider the set of independent spherical harmonics in three variables listed at the end of Section 9.2. Rewrite them in terms of the associated Legendre function defined by
9.6 The Addition Theorem 457 It is easily verified that so that an orthonormal set of spherical harmonics of degree к for n = 3 is given by /2Jt + l V Ал where Pk(x), Атсо$тфР™(х), АтиттфР™(х), т = 1,..., **m — (k - m)\Bk + 1) Now take (к+т)\2л ' = (cos a, sin a cos 0i, sin a sin ф\) and ц = (cos/в, sin (i cos 02, sin $ sin fa) so that (?, 77) = cos a cos /8 + sin a sin /в cos ф when ф = ф\ — фг- Since ck3 2k+1 л n — = — and Pk(l) = 1 = q/2(l), co3 Ал (9.6.6) gives Pk (cos a cos /8 + sin a sin /в cos ф) = Pk(cosa)Pk(cosP) + 2^ m=\ (k-m)\ (k+m)\ Pf(cos a)P™(cos /?) cos тф, (9.6.8) where ф = ф\ — фг- The addition formula (9.6.6) shows that the ultraspherical function Ck ((?, t])) is the basic spherical harmonic in n dimensions, analogous to coskd in two dimensions. Observe that it is possible to find ckn points щ, rj2,..., r)Ct n on the sphere x\ + x\ + ¦ • ¦ + x% = 1 such that the matrix (9.6.9) Sk,2(r,CkJ Sk,CkJr,Ctn) is invertible. Now consider the system of скуП linear equations by choosing 77 = rji, rj2, ¦. ¦, rjCtn in (9.6.6). This system of equations can be solved uniquely for SkJ (|) in terms of cf ~2)/2((?, щ)). We have proved the following:
458 9 Spherical Harmonics Theorem 9.6.4 It is possible to choose points щ, т/2, ¦.., r\4n such that every spherical harmonic can be expressed in the form. 9.7 The Funk-Hecke Formula In this section we prove the Funk-Hecke formula, which will be useful in finding a basis for the space of spherical harmonics of degree k. This leads to the addition formula for ultraspherical polynomials. In the following material we write the inner product of two vectors a and p in R" as a ¦ p instead of (a, P). Observe that any continuous function / on the interval [—1,1] extends to a continuous function of two variables g(a, P) on the sphere defined by g(a, P) = f(a ¦ fi). If either a or fi is kept fixed, then we have a function on the sphere. Now consider the integral F(a, p) = f f(a ¦ r,)Cin'2)/2(p ¦ r,)d(o{r,). (9.7.1) J\r,\=\ For an orthogonal transformation O, F(Oa, Op) = / f(Oa ¦ т»С?~2)/2(Ofi ¦ ^dcotq) J\r,\=l = I /{а-О*т,)С?-2I2{р-О1т,)й<о{т,). By the invariance of measure under orthogonal transformations, it follows that F(Oa, OfJ) = F(a, /?). (9.7.2) As a function of p, F is a spherical harmonic and depends only on a ¦ p. The argument of the previous section implies that F(a, P) is a constant multiple of Cf ~2)/2(a ¦ P). Therefore, J\n\=l f(a ¦ г^СГ^'ЧР ¦ r,)dw(r,) = \кСГ"'\и ¦ p). (9.7.3) To find kk, take a = p = e\ = A,0,..., 0) and set r\ = te\ + Vl — t2r]' where the first component of r( is zero. A calculation similar to the one used to derive (9.6.5) gives = f Г J\f)'\=l J-l = (On-y J f(t)C(kn-2)ll(t)(\ - t2)(n-3^2dt, (9.7.4) -l where fj' is obtained from 77' by removing the first component.
9.8 The Addition Theorem for Ultraspherical Polynomials 459 The Funk-Hecke formula is contained in the next theorem. It was first published by Funk [1916] and a little later by Hecke [1918]. Theorem 9.7.1 Let f(t) be continuous on [—1, 1] and Sk(%) be any surface harmonic of degree k. Then for a unit vector a, / f(a ¦ r,)Sk(r,)dco(r]) = XkSk(a), (9.7.5) J\r,\=l where kk is given by (9.7.4). Proof 1. By (9.7.3), the result is true when Sk(rj) is replaced by C(k~2)l2(P ¦ rj). Theorem 9.6.4 says that any Sk(r]) is a linear combination of C(k~2)/2{fit ¦ /?). Thus (9.7.5) follows. ¦ Proof 2. An integrated form of the addition formula (9.6.6) is Ь.№ = (п-2)/2 / СГ2)/2(? ' r,)SkiJ(r,)dco(r,). (9.7.6) ШпС\ " A) J\t)\=l Now SkJ (?) can be replaced by any spherical harmonic Sk(t;), because the Skj (?) form a basis for the space of such functions. Multiply (9.7.3) across by Sk(P)dco(f3) and integrate with respect to /8. The theorem follows after an ap- application of (9.7.6). ¦ Remark 9.7.1 Formula (9.7.6) suggests another way of arriving at the zonal harmonic function. The map ф : SkJ -» Skj(%) is a linear functional on the finite-dimensional space of spherical harmonics of degree k. So ф is given by an inner product, that is, there exists a function g% such that Written out in full, we have SkJG)= I SkJ(r,)g^r,)dr,. (9.7.7) One can then prove 9.8 The Addition Theorem for Ultraspherical Polynomials To motivate the technique used to find a basis for the space of spherical harmon- harmonics, we start with an integral formula for ultraspherical polynomials obtained in Theorem 6.7.3. The formula can be written in the form -,(n-2)/2 /i гм„ •-» i i . . (9.8.1
460 9 Spherical Harmonics To obtain a proof different from the one given in Chapter 6, consider the integral g(x) У|«»_1|=1 [x r]n-i]kdo}(fjn-i), where x = (xb . ..,*„), ei = A,0,..., 0), 17 = te\ + Vl -t2rjn_u t = 77 • eb гпА?\п_1 is obtained from /;n_i by removing the first entry, which is zero. In what follows we write цп~\ instead of f]n-\. We first show that g is a harmonic function. Observe that |4 = *(* - 1) / and for j = 2,..., и, I t(t 1 where ey = @,..., 1,..., 0) with 1 in the y'th position. So лк-г 7=2 n_l) =0 since j=2 Thus g(x) is a solid spherical harmonic of degree k. It is clear that g is also invariant under all orthogonal transformations that fix e\. If we take x — eu then the value of g(x) is &>„_!, the volume of the unit sphere in n — 1 dimensions. Set I = x/\x\ and then ? = t€\ + Vl — /2?n-i- From the previous remarks, is a multiple of CJ,"' ^ (t). The normalization at t = 1 gives cf-2)/2(o -=Ы.м (9.8.2) Now apply the procedure used in the derivation of (9.6.5) to the above integral to get (9.8.1). The next step in the derivation of the addition theorem for ultraspherical poly- polynomials is to take any spherical harmonic of degree к in n variables and express it in terms of a spherical harmonic of a different degree in n — 1 variables. For this purpose consider the integral (9.8.3) i»-il=i
9.8 The Addition Theorem for Ultraspherical Polynomials 461 It is immediately obvious from the argument used above that g is a spherical harmonic of degree к in n variables. Moreover, by the Funk-Hecke formula, the integral can be written as «(?) = / [t + iy/l ~t4n-i ¦ ijn-i]kSj(ijn.i)da)(iin.i) ¦/141=1 = S,(?,_,)flfc-2 f(t + iV^2s)kC^)l2(s)(\ - Rodrigues's formula for ultraspherical polynomials, A x) Lk(x)- V *) implies that the term A - s2)(n~4)/2Cf~3)/2(s) in the integrand is a constant times (l2)J Integration by parts j times then gives - t2y'2 J where AT is a constant. By (9.8.1) the last integral is proportional to CJk*f~2)/2(t). We have, therefore, proved that 5Д|л_0A - t2)i/2CJk±f~2)/2(t) is a spherical harmonic of degree к in n variables. Recall that t and ?„_! are related by te\ + Vl — t2^n^i = ?, a point on the и-dimensional unit sphere. With the notation of (9.4.5), Sjfi(i-n-\), ? = 1,2,..., Cjttl-\, form a basis for the surface harmonics of degree j in n — 1 variables. Thus the set of functions ЗД„_1)A - t2y/2Ciy-2)/2(t), j = 0,... Д, I = 1,..., <:;,„_,, (9.8.4) forms an orthogonal basis for the vector space of spherical harmonics of degree к in n variables. The orthogonality is easily verified, and since ck<n = Ylkj=ocj.n-u there are the correct number of vectors to form a basis. Observe that, since j = f [C]ktf-2)l\t)}\\-t2y+(n-3)'2dt J — 1 -(п-2)/2)Пк-])\(к + (п-2)/2У the functions in (9.8.4) form an orthonormal basis when multiplied by 1/y/A] = Bj. If this basis is used in (9.6.6) with ?='<?! +
462 9 Spherical Harmonics and T] = then we have (n - 2)шп By (9.6.6) the inner sum is 2/+»-3 (n-3)/2 ; ^7^; (?n-i • Щп- <un_i(n-3) J This gives the next theorem due to Gegenbauer [1875]. Theorem 9.8.1 For an integer n > 3, cf-2)/V j=Q '<"k-j (*)<¦'j V9n-1 • Лп-l), (У. w/геге Г (и - 3J2^(^ - у)![ГG + (и - 2)/2)]2By + и - 3) u Remark 9.8.1 The result is true for и = 2 and и = 3 but a limit has to be taken. The case и = 2 is the addition formula for the cosine function and и = 3 gives the addition theorem for Legendre polynomials, (9.6.8). Formula (9.8.5) is often written as Cf~ )/2 (cos a cos p + sin a sin fi cos ф) к = Y, «.(sin «У C^("-2)/2(cos «)(sin pyc?f-2)/2 (cos p)Cf )/2(cos ф). j=o (9.8.5') The addition theorem can be extended to any spherical polynomial C?(x), X > 0 by analytic continuation, since both sides of (9.8.5') are rational functions of
9.9 The Poisson Kernel and Dirichlet Problem 463 к = (n — 2)/2. This identity holds for complex Л as long as no poles occur. It is also worth noting that (9.8.5') can be obtained from (9.6.8) by differentiating with respect to ф. Remark 9.8.2 The addition formulas for trigonometric functions have general- generalizations to elliptic functions that are different from those given by (9.8.5). Elliptic functions satisfy an addition formula of the type f(u + v) = A(f(u), /(«)), (9.8.6) where A(x, y) is an algebraic function. Weierstrass proved that the only solutions of (9.8.6) are algebraic functions, algebraic functions of ecm for some constant c, or algebraic functions of elliptic functions. Apparently, Weierstrass never published this result though he mentioned it in his lectures. See Copson [1935, p. 363]. 9.9 The Poisson Kernel and Dirichlet Problem The solution of the Dirichlet problem for the unit disk is given by the integral I r2n - 2r cosF> -ф)+г2 The Poisson kernel, \-r2 (9.9.1) 1 - 2r cosF> - ф) + r2 is the sum of all the two-dimensional zonal harmonics with poles at ф. In n dimen- dimensions, the zonal harmonic of degree к with pole at r\ is, according to (9.6.6), given by 2k + n-2 (n_2)/2 (n - 2)шп In Chapter 6, we saw that 00 л, , ^ (I ¦ I?)- f-? (n - 2)an K The generalization of (9.9.1) is contained in the next theorem. Theorem 9.9.1 Suppose f is a continuous function on the n-dimensional sphere. Let a be a point on the sphere and 0 < r < 1. Then If \-r2 u(ra) = — / /^Tf is harmonic inside the sphere and u(a) = f(a) on the sphere.
464 9 Spherical Harmonics This theorem is proved using the same method as we used in the two-dimensional case. 9.10 Fourier Transforms In Chapter 4, we considered the Fourier transform of functions of two variables. If the point (x, y) is identified with the complex number x + iy = z = re'e', then we can write the Fourier series of an integrable function f(re'e) as (9.10.1) The Fourier transform of (9.10.1) was seen to be expressible in terms of Bessel functions. This continues to be true in higher dimensions as well. First, we need a definition. Definition 9.10.1 A function f : R" -> R is called radial if there is a function /o(m) on 0 < и < oo such that f(x) = /o(M). We generalize the functions fk(r)e'ke in (9.10.1) to functions expressible as the product of a radial function and a harmonic polynomial in higher dimensions. The main result of this section concerns the Fourier transform of such functions. The presentation is based on Bochner [1955], though the proofs of the basic results are different. Lemma 9.10.2 For any spherical harmonic &(?) of degree к in n variables, (9.10.2) If l=i '" ""' Proof. By the Funk-Hecke formula, integral (9.10.2) equals /-1 Now apply Gegenbauer's formula: The result follows. ¦ Let / e L\{Rn) and let Tf be its Fourier transform: Tf{y) = [ e-2ni{yx)f{x)dx. (9.10.3) JR"
9.10 Fourier Transforms 465 Theorem 9.10.3 Suppose f e Li(Rn) is of the form f(x) = /0(|*|M*(?). Then Tf(y) = F0(\y\)Sk(rj), where /•OO F0(t)=2jriktl-n<2 / Ms)Jk^+n/2B7rst)sn^ds (9.10.4) Jo and у = \y\r]. Proof. It is easy to see that with x — (x\,..., xn) and Xj = s^j, we have Tf(y) = Г F0(s)s"-1 ( f e-2 Jo \^l?l=i The result now follows from Lemma 9.10.2. An interesting consequence is the next result, which is obtained by combining Theorem 9.10.3 with the fact that for Re(/t + v) > 0, lim /q _ ,VK.,. -. rA+(y_ To obtain (9.10.5), apply Pfaff's transformation (Theorem 2.2.5) to the 2F\ in D.11.4) and then take the limit. Corollary 9.10.4 For a spherical harmonic lim (9.10.6) Г((к-а)/2) when \y\ ф 0 and у = \y\r]. Proof. Take F0(t) = (V2~nt)ae^et in Theorem 9.10.3 and then use (9.10.5). ¦ A particular case of (9.10.6) is worthy of note. Take a = —n/2. We have (9.10.7) \У\2 This implies that |jc|~"/2^(^) is an eigenfunction of the Fourier transform with eigenvalue i". Observe that this transform is actually the Abel mean of the Fourier transform. One may restate Theorem 9.10.3 for the case where / e L\{R") is of the form f(x) — fo(\x |)Sjfc(jc) and Sk(x) is a homogeneous harmonic polynomial of degree к. It is sufficient to remark that Sk(x) — \x\"Sk(%).
466 9 Spherical Harmonics Theorem 9.10.5 Supposing f is integrable and of the form fo(\x\)Sic(x); Tfiy) = F0(\y\)Sk(y), where poo F0(t) = 2nikrk^n+l / Ms)Jk+in_lB7rst)si2n+kds. Jo It is well known that the Fourier transform of е~ж^ is е~л^ . A similar result holds when these exponential functions are multiplied by spherical harmonics. Theorem 9.10.6 For any homogeneous harmonic polynomial Sk{x) of degree k, f e-2*i(yx)e-*M2Sk(x)dx = ^e-^Skiy). Thus е"~лМ Sk(x) is an eigenfunction of the Fourier transform with eigenvalue ik. Proof This can be derived from Theorem 9.10.5 and the following formula of Sonine: f°° / Л Jo (st)e''2tv+ldt = ^-e"s2/4. (9.10.8) 9.11 Finite-Dimensional Representations of Compact Groups Representation theory provides an important and powerful approach to special functions. Unfortunately, we can do no more than devote a few sections to this topic. After giving basic definitions, we show how Jacobi polynomials appear in representations of 5GB). We also explain how spaces of spherical harmonics give irreducible representations of 5GB). Suppose G is a group and V a finite-dimensional vector space. Let GL{V) denote the group of linear transformations from V onto V. Definition 9.11.1 A finite-dimensional representation of G in V is a homomor- phismfrom G to GL(V). If G is a topological group, we assume that the homo- morphism is continuous. Thus if U : G i-> GL(V) is a representation, then U(gig2) = U{g\)U{g2)- The linear mappings U{g\), U{g2), and U{g\gi) can be represented by matrices if we choose a basis for V.
9.11 Finite-Dimensional Representations of Compact Groups 467 Suppose dim V = n and {x\, x2, ...,*„} is a basis for У. It is clear that the matrix entries Ujj(gig2), 1 < i, j < n, satisfy the relation n Uijigigi) = Yl Uik{gi)Ukj(g2), 1 < i, j < n. (9.11.1) k=\ If a different basis [x[, x'2,..., x'n] is chosen, then there exists a matrix P such that U(g) = PU'(g)P~l for all g e G. Here U'(g) is the matrix representation of U corresponding to the new basis. Suppose U\ and Ui are two representations of G in the vector spaces V! and V2. The two representations are called isomorphic, or more briefly, Vi and V2 are isomorphic, if there is a linear isomorphism Г : Vi -> V2 such that T о Ui(g) = U2(g) oT for all g € G. Now suppose that the vector space У has an inner product (jc, y) defined on it. Let [xi,X2,...,xn] be an orthonormal basis of V with respect to this inner product. Then j ,xi). (9.11.2) Definition 9.11.2 A representation U : G \->- GL{V) is called unitary if (U(g)x,U(g)y) = {x,y) for all x, у е У and g e G. In this section, we study the representations of the compact group 5GB). For compact groups, it is always possible to define an inner product on У such that a representation U of G in У is unitary. This is easy to prove, if the existence of an invariant measure on G is assumed. In fact, there exists a unique measure dg on G such that, for a continuous function /, (a) / f{g)dg = f f(gh)dg, Jg Jg where h and g e G, and (b) I dg=\. Jg
468 9 Spherical Harmonics The first condition gives invariance under right translation. It can be shown that the measure dg is also invariant under left translation. This invariant measure is often called the Haar measure. See Halmos [1950]. The only group we study in detail here is SUB). For this group the invariant measure is easy to construct. 5GB) is the group of all matrices with complex entries of the form a b -b a with determinant one, that is, \a\2 + |b|2 = 1. Suppose a0 bo' h= • и - —bo ao is also in 5GB). It is easily checked that gh is constructed by the parameters a\ and b\ where a\ = aoa — bob, b\ = boa + aob. Thus dg = da л da л db л db (9.11.3) is an invariant measure on 5GB). A simple calculation shows that dgh = da\ л da\ л db\ л db\ = (Mol2 + \bo\2Jda AdaAdbAdb = dg. (9.11.4) Suppose that ( )i is an inner product on the vector space V. For a given repre- representation U of G in V, define a new inner product by (x,y)= f (U(g)x,U(g)y)idg. (9.11.5) Jg That U is unitary with respect to this inner product follows from property (a) of the Haar measure. Note that (U(h)x, U(h)y) = f (U(g)U(h)x, U{g)U{h)y)xdg Jg = f (U(g)x,U(g)y)idg Jg
9.12 The Group 51/B) 469 Let U: G -> GL(V) be a representation of G, and let W be a subspace of V. We say that W is invariant under the action of G if G(g) maps W onto itself. This gives another representation of G, namely Uw'-G —> GL(W). Note that ?V(g) = ^(g)lve- Uw is called a subrepresentation of G. If f/ has no nontrivial subrepresentations, then U is called irreducible. Suppose G is a compact group; then there is an inner product on V with respect to which U is unitary. If W is an invariant subspace of V, then the orthogonal complement of W is also an invariant subspace. This is easily verified. In this situation, the matrix for U(g) has the form fUw(g) 0 " V 0 Uw±( By continuing this process, we see that v = Wi e w2e---e wk, (9.11.6) where W\,... ,Wk are irreducible. It can be shown that this decomposition is unique up to isomorphism. Thus, if v = w[ e w!, e • • • e wt, then к = I and, after renumbering if necessary, W, ~ W[. 9.12 The Group SUB) Recall that 5GB) is defined as the group of matrices of the form \ h), |fl|2 + |fc|2=l. (9.12.1) -b a) Thus, the group is defined by three parameters, which one may choose to be \a \, arg a,andargb. Whenab ф 0, one can uniquely choose another set of three parameters ф, ty, and в called Euler's angles. These are obtained from the relations a = е1(ф+ф)/2 cos -в, b = гУ w>~l/')/2 sin -в, (9.12.2) where 0 < ф < 2тг, 0 < в < л, and -2тг < rj/ < 2л. When ab = 0, the correspondence between a, b and в, ф, \jr is not one to one. Another way of writing the relation (9.12.2) in terms of matrices is a b -b a (9.12.3)
470 9 Spherical Harmonics Set with й and b as in (9.12.2). Then (9.12.3) is equivalent to g(A>, 0, f) = 8(ф, 0, Q)g(p, 0, 0)g@, 0, ir). (9.12.4) It also follows from (9.12.2) that 2|fl|2-l, e» = -??-, and ** = ??[. (9.12.5) |a||fc| fc \a\ A question that arises here is the following: If g@, 0, ф) = gi@i, 0i, \js\) 8г(Фг, 02, V^), then what is the relation of 0, 0, ty to ф\, 9\, 1^1 and (/>2, O2, ^2? This general case follows from the consideration of the particular case when ф\ = ilr1 = ij/2 = 0. Then cos i^i г sin \в\ \ ( cos \e2ei<k/2 i sin 10i cos \в\ ) \ i sin \вге~1фг11 cos i So a = cos 6i cos в2е sin ^0i sin 22 2 2 a = cos ^6>i cos -в2е1фг'2 - sin \в\ sin -в2е~1ф2/2, (9.12.6) b = г ( cos -0i sin -в2е1ф2'2 + cos -02 sin -91e~i<h From (9.12.5) and (9.12.6), we get cos 0 = cos 0i cos 02 — sin 0i sin 02 cos 02, 1ф sin 0i cos 02 + cos 0i sin 02 cos 02 + i sin 02 sin 02 sin0 ... ..,, cos 1в\ cos i02e"*2/'2 — sin i0i sin 1в2е~1ф2^2 COS \d This gives the formulas for the product g @, в\, 0)g @2, 02, 0). To obtain the general case the following remarks are sufficient. Observe that g@i, 0,0)g@, 0, V0 = #@i ~Ь0, 0, V0, #@, &¦> 1^)^@» 0, ф2) = ^@, 0, ^"Ь^2), ^@, 0, O)g(iff, 0,0) ^ g@, 0, ^r), and ^@, 0, Vi)g@2,0,0) = g(fi + 02,0, 0). Apply these relations to , 0, 0)?@, 0i, 0)g@,0, ^i)?@2, 0, 0)?@, 02, 0)?@,0,
9.13 Representations of SUB) All The general case follows immediately. Remark The invariant measure dg for G — SUB) defined by (9.11.3) when written in terms of Euler angles is dg = - sinfi'ded<pd\/f. Usually the measure is normalized to dg = г s 16tt2 so that ' dg=l. L Observe that dg is half the product of the normalized measure on the sphere, sin ededcp/4rt, and the normalized measure on the circle, d\js/2n. 9.13 Representations of SVB) Let VN+\ denote the (N + l)-dimensional vector space consisting of homogeneous polynomials of degree N in two complex variables with complex coefficients. If P e VN+i, then N P(x1,x2) = '22rkxklx?-k, (9.13.1) where x\ and x2 are complex variables and ru ,k = 0,..., N, are complex con- constants. We also write P(x) = P(x\,x2) with (x\ X = \X2 A representation of SL2(C) in VN+\ can be defined by U(g)P(x) = P(g'x), (9.13.2) where g=( ,), ad - be — \,a,b,c,d e C. \c d) Note that P(g'x) — P(ax\ + cx2, bx\ + dx2). It is easy to check that U(gig2) = U(gl)U(g2). (9.13.3) It can be shown that U gives an irreducible representation of SL2(C). In fact, all the finite-dimensional irreducible representations of SL2(C) are of this form. The restrictions to the compact subgroup 5GB) of these representations give all the finite-dimensional irreducible representations of 5GB).
472 9 Spherical Harmonics It is standard practice to write the polynomial in (9.13.1) in a slightly different form. Let N = 21, so that ? is an integer multiple of 1/2. Write i P(xl,x2)='^2rnxtl~nx%+n. (9.13.4) n=-l Here I + n takes integer values from OtoN = 21. Associate with P a nonhomogeneous polynomial Q given by P(x, 1) = Q(x). Thus, i Q(x) = Y, г"Хг~п, (9.13.5) n=-t and the homogeneous polynomial corresponding to Q is obtained from P(xux2)=xleQ(xl/x2). Denote the space of all nonhomogeneous polynomials of the form (9.13.5) by Ht and the representation of 5GB) corresponding to U in the space Ht by Tt. This implies that Tt{g)Q{x) = (bx+dJlQ((ax+c)/(bx+d)). (9.13.6) Since 5GB) is compact, the inner product (9.11.5) shows that it is possible to choose an inner product in Ht such that Tt is unitary. With respect to this inner product, the basis {l,x,..., x2t} of Ht is orthogonal. In fact, we have the lemma given below. Since the above inner product is defined up to a constant factor, assume that A,1) = B?)!. Lemma 9.13.1 (хе-т,хе~п) = (? - n)\(l + n)\Smn, -I < m, n < I Proof. Let еи'2 О О е-»/2 From (9.13.6), it follows that Tt(g)xe'k = е~а'хе~к. Since Tt(g) is unitary, (re~m xe"n) — lTAQ~\xl~m ТЛдЛх1~п\ This implies that (xe~m, xe~") — 0 for m ф п. The case m = n requires a little more work. Take /cos^r —sin^A О r^ I I \ sin I? cos|f J
9.14 Jacobi Polynomials as Matrix Entries 473 and observe that, by the first part of the theorem, 0 = (xe-n,xe-n+l) - (Tt(g)xe-n, Te(g)xe-n+l} = (ut+nvt-n,ut+n--ivt-n+l), (9.13.7) where f ¦ l \ l л ( l \ ¦ l и = I sin -t \x + cos -t and v = I cos -t \x — sin -t. Take the derivative of (9.13.7) with respect to t and set t = 0. The result is (i + n){xl-n+\xl-n+l) -(l-n + \){xl~n,xl~n) = 0. The theorem follows when the condition A, 1) = B?)! is used. ¦ 9.14 Jacobi Polynomials as Matrix Entries Choose the functions 1/л„(г) = — —, _?<„<?, (9.14.1) V(^-n)!(?+«)!' " - as an orthonormal basis for the space Ht. Then, for g e SL2(C), {ax+c)l~n{bx+d)l+n J4 „ Td8)fn(X) = V(? -h)!(/ + hI =ЕСШшЫ. (9.14.2) Now use Taylor's formula to see that the coefficient t^n(g) is given by m)\ dl~ Set у + 1 = a(bx + d). Since ad — be — 1, we have ax + с = y/b and (9.14.3) If g e 5GB), then the use of Euler's angles provides a simpler formula for n(g). Consider the decomposition Ъ(8(ф, в, f)) = Tt(g((t>, 0, 0)Oi($@, 0, 0))ГИ«(О, О, f)). (9.14.4) It follows from (9.12.3) and (9.14.2) that Tt{gD>, 0, 0М„ = e-'"V», -?<«<?. (9.14.5)
474 9 Spherical Harmonics This means that Т^(ф, О, О)) is a diagonal matrix given by / eW 0 \ The matrix for Te(g@, 0, i/r)) is similar. Write tlmn{g{0, в, О)) Equation (9.14.4) gives (9.14.6) ); then Denote tlmn(Q), 0 < в < n, by P^n(co&0), so that (9.14.7) Since ,0,0) = cos % i sin f 2 2 sin ^ cos x с dr the quantity fee in (9.14.3) equals — sin2 | = (cos в — l)/2. Thus, replacing у with (г - 1)/2 in (9.14.3), we obtain {t-n)\{t + пЩ1-т)Г Al-m [A )/-"A )* (i г)[A г)A + z)]. (9.i4.8) This shows that Р^„ (г) can be written in terms of a Jacobi polynomial. It is a constant multiple of ч( ) 9.15 An Addition Theorem It follows from (9.12.7) that when ?(<?, 0, V) = S@,6»ь OXgOfe, ^2, 0) we have COS0 = COS0] COS02 — Sin0] sin02COS02- (9.15.1) Use this in (9.11.1), (9.14.4), and (9.14.7) to get an addition formula: е-{(тф+пф)Р1п(соь9) = ? е-'к**Р*к(са&е1)Р?„(со*в2), (9.15.2)
9.15 An Addition Theorem 475 where в, ф, and \fr are given by (9.12.7). When ф2 = 0 and 0 = 9\ + 62 < n, we have 0 = ^ = 0. In this case (9.15.3) The addition formula (9.15.2) is the analog of Graf's addition formula D.10.6) forBessel functions. Formula (9.8.5) is an analog of Gegenbauer's addition formula for Bessel functions. An analog of this for Jacobi polynomials was found when fi = 0 by Sapiro [1968] and in the general case by Koornwinder [1972, 1975]. Koornwinder used the addition formula to derive the Laplace-type integral for Jacobi polynomials given in Exercise 6.45 and an integral formula for a product of Jacobi polynomials. Koornwinder [1974] observed that it is possible to obtain the product formula from the Laplace-type integral by using a result of Bateman [1932, pp. 392-393]. Bateman's result is P^\(l+st)/(s + t)) => bk,n{s + t) 7^-jr > (9.15.4) where bkn is denned by (9.15.4) when t = 1, Э 1 г +(Ba + l)coth7? + B^ + l)tanh7?)— \F = 0, (9.15.6) Эту2 Эту] r" (-1-) *=o Bateman proved (9.15.4) by showing that both sides of the equation are solutions of the partial differential equation ^ + (Ba + l)cot§ - B0 + Dtanf)^ Э2 Э + (B + l)h B^ l)h) when Now recall the formula in Exercise 6.45: , x _ (I _ X\U2 -I" h i'v 1 - x2 hcos# dma^(u,9), (9.15.7) where a > ft > —1/2 and
476 9 Spherical Harmonics This implies (x + y)n — -IT Jo Jo 2 "I П dmaj(u,6). J When this formula is combined with (9.15.4) and (9.15.5) the result is the product formula for Jacobi polynomials given by 1 Кф) y) + A - x)(l - y)}/2 -x2)(l - y2)u cos в - l]dma,p(u, в). (9.15.8) 9.16 Relation of SVB) to the Rotation Group SOC) The description of elements of SUB) in terms of Euler angles suggests a con- connection of SUB) with the group of rotations in three dimensions. The explicit relationship is given below. An interesting consequence of this connection is that the spaces of spherical harmonics in three variables are seen as the irreducible representation spaces of SUB). The rotation group SOC) consists of 3 x 3 matrices g with real entries and de- determinant 1 such that the transpose of g is also its inverse. These are the orientation- preserving linear mappings g from /?3 to /?3 such that for x € /?3, |x|2 = |gx|2. To define a homomorphism ф from SUB) to SOC), first identify the points x = (xi, x2, хз) € /?3 with 2x2 Hermitian matrices of trace 0: / —хз Xi - ix7 \X]+IX2 X3 Note that detMx = -|x|2. For g e SUB), define gMx^ forx€#3. It is clear that <p{g)x is a Hermitian matrix with trace zero and det[0 (g)x] = —|x|2. Moreover, <p(g) is an orientation-preserving linear mapping. Thus <p(g) can be identified with an element of SO C), and we have a homomorphism ф from SUB) to SOC) whose kernel is easily seen to be {±/}. It can also be shown, though we do not do so here, that ф is an onto mapping. Thus ф gives an isomorphism
9.16 Relation of SUB) to the Rotation Group SOC) 477 of SUB)/{±I] onto SOC). Recall that the Euler angle ^ in (9.12.2) ranges over [—2л-, 2л) whereas the range for \jr in a rotation is [0,2л-). This is related to the fact that ±g € SU{2) are associated with the same rotation in SOC). It is easy to check that ш\ (t), o>2(t), л>з(?) е SUB), defined by cos( isin(\ /cost - sint A) wi0) = , • • , , , , • i sin t cos t ) V sin t cos ? e" 0 О е~" correspond to rotations (by an angle of 2t) about the x\, x2, and X3 axes respectively. Thus by (9.12.3) and (9.12.4) we can view any rotation with Euler angles ф,в,ф- as a product of a rotation by the angle \j/ about хз, a rotation by the angle в about x\, and a rotation by the angle ф about хз. The object of the remainder of this section is to show that Hk(x), the space of harmonic polynomials of degree к in three variables provides an irreducible representation of SOC) and hence of SUB) as well. Let ф be the homomorphism from SUB) onto SOC). We have seen that 0(g!) = 0(g2) if and only if g\ = ±g2. It is clear that if U is a (irreducible) representation of SOC) in a vector space V, then ф ¦ U is a (irreducible) rep- representation of SU{2) in V. Conversely, if Г is a (irreducible) representation of SUB) in У such that Г (-identity) = identity, then T gives rise to a (irreducible) representation of SOC). Recall that in the notation of Section 9.13, the proof of Lemma 9.13.1 shows that the functions xl~k e Ht (not to be confused with Hk(x)) for к = —?,...,? are eigenvectors of Tt(g) corresponding to the eigenvalues e2lk'(k = —?,...,?) when g = л>з(—t). This fact may be used to prove that if U is a representation of SUB) in V with dim V = 21 + 1 and e2Ut occurs as an eigenvalue of ?/(w3@), then (?/, V) is isomorphic to Gi, #^). We have у = #tl e я*2 е ¦ • ¦ e нкр for some integers k\,... ,kp. If p = 1 and &i = •?, then we are done. If p ф 1, then &, < € and the eigenvalues of ?/(л>з(О) are of the form e2""' with \m\ < €. Thus V ~ #f and the result is proved. Now define a representation t/ of 5OC) in Hk(x) by U(g)p(x) = p(g~[x) for x e /?3, p e Hk(x), and g e ?ОC). This in turn gives a representation t/ • ф of SUB). Now observe that (jcj + гх2)* е Hk(x) and that w3 (r) maps to a rotation by an angle 2t about хз. Thus U(ф (л>з(?))) has (xi + гх2)А as an eigenvector with eigenvalue e2lkt. Thus by the result of the previous paragraph, Hk(x), the space of harmonic polynomials of degree k, is an irreducible representation of SUB).
478 9 Spherical Harmonics Exercises 1. Verify that the only polynomial solution of the differential equation A - x2)y" - 2A + m)xy' + (л - m)(n + m + l)y = 0, where т,я eZ,n-m>0,is the polynomial ^С™_Ш2 (х). 2. Let Vic denote the vector space of all homogeneous polynomials of degree к in n variables. For a = (а\,...,а„), let xa = x x ¦ • ¦ x"n and Da = да>+а2+-+а»/дх^дх22 ¦ ¦ ¦ дх«\ For a polynomial P in n variables, let P(D) denote the differential operator obtained by replacing xa by Da in P(x). For P, Q e Vk, define (P,Q) = P(D)Q (or P(D)Q if complex coefficients are used). Prove that ( , ) is an inner product on Vk- 3. Let Д be the Laplace operator and к > 2. Show that Д : Vjt ->¦ Vjt_2 is mapping by proving that there is no nonzero vector in Vjt_2 that is orthogonal (with respect to the inner product in Exercise 2) to the range of A. 4. Let Hk С Vk denote the subspace of harmonic polynomials. Suppose Lk = {P€Vk\ P(x) = \x\2Q(x), Q e Vk-2}- Prove that Vk = Hk ® Lk. 5. Use the result of Exercise 4 to prove that if P e Vk, then P{x) = P0(x) + \x\2Pdx) + ¦¦¦ + \x\2ePe(x), where Pj is a homogeneous harmonic polynomial of degree к — 2j, j = 0,1, ...,?. Deduce that any polynomial in n variables and restricted to the unit sphere is a sum of spherical harmonics. 6. Use the results of the previous problems to show that с к ,„ = dimension of the space of spherical harmonics of degree к in n variables = dim Vj — dim Vk-2 + k- 1\ _ /n+k-3' к ) \ Jfc-2 7. Prove that it is possible to choose points щ, ту2, •.., ЦСкп such that the matrix (9.6.9) is invertible. 8. Show that the function g^(ту) in (9.7.7) is given by ?SkJ (l-)Skj(rf). 9. Derive the addition formula (9.8.5) from (9.6.8) by differentiation. 10. Prove Theorem 9.9.1. 11. Use (9.8.5') to derive: (a) Gegenbauer's product formula sin#sin0cos Jo
Exercises 479 where A. > 0 and :' = / (^ (b) The integral formula (9.8.1) for ultrasphencal polynomials. 12. Prove that when one lets X -*¦ 0 in formulas (a) and (b) of Exercise 11, one obtains the well-known formulas cosn#cosn0 = -[cosn(# + ф) +cosn(# — ф)], cos пв = -[еш + е-ш]. 13. 1л1х,у е R",x = Щ,у = гг), |f| = \ц\ = l,and# > r, and note that (a) Let J2"=i У] эГ = r(rl' v)- Use Taylor's theorem to prove that (-D* i. i. ¦, X — (b) Deduce Maxwell's formula that 14. Deduce the following formulas from (9.15.2): 1 Г (a) P^(cos6>i)P/n(cos6>2) = — / е'(кф2~тф' mk кпУ In J_n 1 /¦" (b) Pi (cos 0i)Pf (cos0г) = — / Pi(cos6\ cos 02 — sin0i sin02co: 2л- J-n where ф, г/г, and 0 are as defined in the text. In (b) t is an integer. 15. This problem gives a generating function for spherical harmonics in three dimensions. Let x = (xi, x2, X3) and un = (—2?, 1 — t2, i + it2). Then define Hk(x) by n (u ¦ x)" = [x2 + гх3 - 2xit - (x2 - ixj,)t2f = t" ^ Hk(x)tk. k=-n (a) Show that Я* = (-\)kH~k.
480 9 Spherical Harmonics (b) Prove that H*(x) is a homogeneous harmonic polynomial by showing that V2(m • x)" = 0. (c) Let v — {—2s, 1 — s2, i + is2) and define the polynomial ф(и, v) by -'111=1 Use the technique of Lemma9.6.2 to show that </>(«> v) is a constant times (u ¦ v)n. [Note that и ¦ и — 0 and v ¦ v — 0.] (d) Define S*(f) = |х|-"Я*(х), where % = x/\x\. Show that 1) / 2« ч ,p Л Г(п+C/2)) (e) Also prove that 16. Show that the associated Legendre functions P™ (x) defined by (9.6.7) satisfy the recurrence form = 0, 1,2,.... 17. Derive the formulas Г(к+т + 1) Г i— t (a) Pk(x) = — / (x + Vx2-\ cos x/rf cosmx/rdx/r, л l (к + 1) Уо m+ 1) Bsin6>)" / Jo where Rex > 0, 0 < в < л, and m is a nonnegative integer. 18. Show that 19. Show that ]2 (—IVя f°° =-f—^— / uke~ucose Jm(u sin 6)du. (k - my. Jo [Ptm(cos0)]2 22>G-m)!G+m)!G!J(?-7)! 20. Prove the relation (9.10.5), that is,
10 Introduction to ^-Series When one is counting, one may use a generating function to keep track of the number of objects being counted. This is yet another way in which hypergeometric series arise. For example, the finite binomial theorem is usually written as (x + у)" = 4=0 The coefficient Q) counts the number of ways n — к х and к у can be arranged. The usual argument is to observe that the first у can be put in any of the n places, the second in n - 1 places, and so on, until the n{n- 1) ••¦(«- k + l) ways are obtained. However, note that the first у could have been in any of the к spots, the second in any of the remaining к — 1 spots, through the k\h, so that k\ of those arrangements are the same. Thus we can represent the number of combinations as n(n -!)•••(« -k + 1) n! k\ k\(n-k)\ The last expression makes the symmetry in к and n — к as clear as it is by counting the x first rather than the y. Observe that since x = у = 1 gives the binomial coefficients provide a refinement to the cruder result that 2" is the total number of arrangements of x and у in n places. A further refinement is possible. One nice way to illustrate this is to consider lattice paths on the first quadrant, starting at @, 0) and moving to (n — к, к) by n steps, each either one unit to the right or one up. Consider the case of two moves 481
482 10 Introduction to «^-Series @,0) @,0) il.l) Figure 10.1 shown in Figure 10.1. In the first case there is no area under the path; in the second the area is one. We will split the ("k) paths according to the area under the curve. Since (x + yJ = xx + xy + yx + yy we can keep track of the area by rewriting each term with the xs first and adding a unit area whenever yx is changed to xy. We will use a parameter q to do the counting, requiring that yx = qxy. Since we wish to collect the ^s together, also assume that УЯ = ЧУ, xq = qx. As an exercise, work out (x + yL: The coefficient of x2y2 comes from six pictures, and the generating function for these six pictures collected by areas under the graph is l+q+2q2 + q3+q4. A little reflection shows that this is demonstrating that there is some structure to this coefficient. It is possible to rewrite this in a form that suggests a general form for the coefficients, but there is a more elegant way to derive this formula. Recall Pascal's triangle property for binomial coefficients: A0.0.1) This can be explained by a combinatorial argument, since the (и +1 )-th spot could contain an x in Q) ways or а у in (k1l) ways.
10 Introduction to «^-Series 483 However A0.0.1) also comes from We use this method for finding the ^-binomial coefficients \nk\ that are defined by П г -i (x + y)" = Y, III x"~kyk A0-°-2) *=o \-ки when yx = qxy, xq = qx, yq = qy. First (x+y)n+1 =(x+yT(x + y) gives 4=0 L J9 4=0 Since ykx = qkxyk we have [T] =[t]9*+[*-i] • A0-°-3) This is a ^-extension of A0.0.1). In the case when q = 1 there is only one Pascal triangle relation. In the q-case there is a second. Using the same argument, (x + y)n+l = (x + y)(x + y)n gives Relations A0.0.3) and A0.0.4) can be combined to give Гл1 {\-qn+l-k)\ n k\q Iteration leads to [л 1 _ {\-qn+l-k)---{\-qn)\n~\ UL (l-qk)---(l-q) [o\:
484 10 Introduction to q-Senes But [oj, and so Гл1 (l-g)---(l-g")(q; q)n [k\q (\-q)---(\-qk){\-q)---(\-q"-k) (q; q)k{q; ?)„_*' A0.0.5) where ]J-qi)- (Ю.0.6) Another way to write A0.0.5) is n\a where л!, = A + ?) • • • A + q + ¦ ¦ ¦ + qn'X) = (q; q)n{\ - qT". A0.0.8) The question naturally arises whether there is a commutative extension of the binomial theorem that uses ^-binomial coefficients. To derive this result, replace у with xy in A0.0.2). This is possible since y(xy) = (yx)y — q(xy)y. Then Observe that (xy)k = xyxy ¦ ¦ • xy — xkykqk<-k'1^2. Also, (x + xy)(x +xy)---(x+ xy) = x(\ + y) ¦ ¦ ¦ x(l + y)x(\ + y) = X(l +y)...x(l + y)x2(\ + qy)(\ + y) Therefore, A +y)(l+qy)---(l+ qn~ly) = J2 qk(k~l)/2 \ " | /. A0.0.9) k=0 Replace у with y/x to obtain П г (x + y)(x + qy)---(x+ q"~ly) = X] ? ' qm-\)/ixn-kyk_ (Ю.0.10) k=0 L
10.1 The q -Integral 485 This is a ^-extension of the binomial theorem. The noncommutative binomial the- theorem is due to Schutzenberger [1953]. The ^-binomial theorem was independently known to several mathematicians of the nineteenth century. The interpretation of the g-binomial coefficient in terms of areas under lattice paths is due to Polya [1984, Vol. 4, p. 444]. The infinite q -binomial theorem can be seen as an analog of the formula for the beta integral on @, 1) in terms of gamma functions. To show this, we introduce a ^-integral. This was explicitly done by Thomae and Jackson, but the essential idea was discovered by Fermat. We also develop the g-extensions of the gamma and beta functions. A generalization of the q-binomial theorem is the i i^i formula of Ramanujan. This can be considered a ^-extension of the beta integral on @, oo). Ramanujan's formula and one of its consequences, the Jacobi triple product identity, are very important in number theory. We show how they imply results on representations of numbers as sums of squares. The remainder of the chapter is devoted to the study of a few other important q-btta integrals and to developing the elementary theory of basic hypergeometric (or q-hypergeometric) series. We also give a very short exposition of the theory of g-ultraspherical polynomials. We note that some of these infinite series and products are also modular functions. 10.1 The ^-Integral Even before the systematic development of calculus by Leibniz and Newton in the latter half of the seventeenth century, mathematicians from many parts of the world attempted to evaluate the integral / xadx. Jo For example, Archimedes computed the case a = 2. He did this in two different ways, one using the value of I2 +22 + ¦ ¦ • +n2, which was familiar to the Baby- Babylonians in 1700 B.C., and the other using the sum of a finite geometric series. In the early seventeenth century this integral was computed for other small values of a (up to nine according to some accounts). The difficulty experienced by those mathematicians was the problem of treating the sums 1* + 2k + ¦ ¦ ¦ + nk in a gen- general way. In the 1650s, Fermat, Pascal, and others found a method for this. Fermat also gave an easier way of computing the integral, using a geometric series. In his studies of the Greeks, Fermat must have noted that Archimedes also used a geometric series in the quadrature of a parabola. For more history and references, see A. Edwards [1987] or C. Edwards [1979]. Decompose the interval [0, a] into subintervals using a geometric dissection,
486 10 Introduction to ^-Series that is, subintervals with endpoints {xn}^ where xn = aqn, 0 < q < 1. In this case the sum approximating the integral is OO 00 n=0 n=0 oo = aa+\\-c A0.1.1) Fermat considered the case a = l/m, where ? and m are positive integers. Set t = #1/m and write A0.1.1) as J fin+n \ -L I -(-••• -)- (ГП+n — l -» ^ a(e+m)/m asr^l. A0.1.2) Thus, Fermat evaluated the integral when a is rational. Thomae [1869] and later Jackson [1910] introduced the ^-integral defined by ™ oo / f(x)dqx = 'S2f(aqn)(aqn - aqn+1). A0.1.3) Jo «=o We call dqx the Fermat measure. Jackson also defined an integral on @, oo) by /¦OO °° / f(x)dqx = (\-q) У] f(q")qn. A0.1.4) Jo „ГГсо Notice that roc lim / f(x)dqx - I f(x)dqx. N—> oo Jq Jq The idea here is that on A, oo) the division points are at q x,q 2,q 3,...when 0 < q < 1. It is easy to see that, when f(x) is continuous on @, a), /a ra f(x)dqx= / f(x)dx. A0.1.5) Jo It is clear that we can write the ^-integral for any continuous function, but it is important to keep in mind that the resulting sum should be interesting and manageable. After all, Fermat used the #-integral because a geometric series can be summed.
10.2 The ^-Binomial Theorem 487 Suppose that f(x) = xa~~l A — x)^; we are interested in obtaining an analog of the beta integral. From A0.1.3), the sum corresponding to this f(x) is n=0 We are unable to sum this series because of the term A — q")P l. So we look for a function fq(x) such that fq{x) ^ xa~\\ - х)р~1 asq^T, and for which the ^-integral /J fq (x)dqx can be evaluated in an appropriate form. It seems very likely that xa~x should be retained as it is. Then we might deal with A — x)$~x by expressing it as a power series in x and then deciding what to do with the coefficients. By the binomial theorem, for | kl We now need the ^-analogs of &! and more generally of (a)^ and finally an analog of the binomial theorem itself. 10.2 The ^-Binomial Theorem To define к \q, the #-analog of к!, note that A0.1.2) indicates that we should replace an integer m with \ + q-\ (- qm~l = A - qm)/{\ - q). Thus, _ A-<?)(!-<?2) ¦¦¦(!-<?*) and we can replace the shifted factorial (a)k with Now write (a; q)k = A - a)(l - aq) ¦ ¦ ¦ A - aqk~l). A0.2.1) We see that the series corresponding to Y^o (a)kXk/'k\ is ^^хк. A0.2.2) q;q) This series can be summed and its evaluation in terms of infinite products is given in the next theorem, called the #-binomial theorem. To see how to sum it, consider
488 10 Introduction to ^-Series the following proof of the binomial theorem. Let k=0 First differentiate *=i v ' To remove ga+i(x), consider -(« + 2^ k=l J} X = ~Xga+i(x) *=i Eliminate ga+\ (x) from the two equations to get = ga(x) 1-х' which implies ga(x) = A - xTa. Application of this idea to the evaluation of A0.2.2) requires the ^-difference operator. This operator is defined by fix) — fiqx) fix) — fiqx) j v j \ч j v j v<f ) (Ю.2.3) X — Q'X A — ^)X We now state and prove the q -binomial theorem. Theorem 10.2.1 For\x\ < \,\q\ < 1, (n- (a; q)k k _ iax; k=0 iq; q)k (*; q)c where (a; q)x = ЦТ=оA ~ ас1кУ First Proof. Let k=0
10.2 The ^-Binomial Theorem 489 Apply the q-difference operator Aq to both sides. Then a;q)k _ Krt-i ч *¦ 4 /X or Л0О - /a(?Jc) = A - a)xfaq{x). Now consider or fa(x) — A -ax)faq(x). Eliminate faq(x) from the two equations to get 1 — ax , fa(x) = ~ faiqX). 1-х Iterate this relation n times and let n —> oo to arrive at (ax; q)n (ax; q)^ (ax; q)x fa(x) = ~ —fa(q X) = fa@) = ~ " ¦ (x; q)n (x\ q)oo (x\ q)x This proves the theorem. ¦ Second Proof. The infinite product (ax; q)oo/(x;q)oc is uniformly and abso- absolutely convergent for fixed a and q in \x\ < 1-е and so represents an analytic function in \x\ < 1. Consider its Taylor expansion in \x\ < 1, F(x) = q)°° = ]T Anxn. Clearly, — ax) F(x) = A -x)
490 This implies 10 Introduction to ^-Series *хП = о -a n=0 n=0 Equate the coefficients of xn on both sides. Then (a; q)n {q\q)n' This completes the second proof. Remark 10.2.1 The infinite product in the g-binomial theorem also arises natu- naturally when we look for the analog of A — x)~a. To see this, suppose a is a positive integer n. A possible <?-analog of A — x)~" is 1 (l-q"x)(l-qn+lx) (l-x)(l-qx)---(\-q"-1x) A-*)A-?*)••• (л;?)»' The last expression is meaningful even if и is not an integer, so more generally we consider (ax; q)oo/(x; q)oo- There are many interesting special cases of Theorem 10.2.1. Corollary 10.2.2 (a) (b) 00 n i Г-^- = -——, M<i,M<i. ±L (q; q)n (x; q)oo / l)"/ = (x;q)oo, \q\<l- «=o = (*: ?)лг = A - A - (Euler) (Euler) (Rothe) OO where the q -binomial coefficient is In = (q;q)n/(q;q)k(q;q)n-k- Proof. (a) Set й = 0 in Theorem 10.2.1. (b) Replace a with I/a, and x with их and then set a = 0.
10.2 The ^-Binomial Theorem 491 (c) Seta =q~N. (d) Seta =qN. ¦ Remark 10.2.2 The g-binomial theorem was apparently discovered indepen- independently by several mathematicians including Gauss [1866], Cauchy [1843], and Heine [1847]. It seems that the first statement of the g-binomial theorem in ap- approximately the form given in Corollary 10.2.2(c) was published by Rothe [1811]. (He stated this as A0.0.10), but with a misprint.) For the references to Euler in Corollary 10.2.2, see Andrews [1976, p. 30]. In most cases we do not give detailed proofs to justify limiting processes. It is, however, interesting and important to know what is involved here. Hence we end this section with Koomwinder's [1990] proof of the fact that the ordinary binomial theorem is obtained from the ^-binomial theorem as q ->• 1~. The proof of the theorem depends on the following lemma; we omit the proof. Lemma 10.2.3 Suppose ц, к, к are real; 0 < ц — к < к; /л + k> 1; and „-lit _ e-(k+k)t ° Then f'(t) <Oift > 0. Theorem 10.2.4 Suppose к and /i are real. Then lim uniformly on {x e С : \x\ < 1}, if ц, > к, ц, + к > 1, and uniformly on compact subsets of {x, e С : \x\ < \,x ф 1} for other choices of'k and ц. Proof. First observe that since (q x; q)oo (q x; q)i (q + x; grHo = A0.2.4) (q^x; q)^ (q^x; q)m (gr/x+mx; q)oo we can choose ? and m appropriately so that fi + m > к + ? and fi + к + I + m > 1. Moreover, the first quotient on the right-hand side of A0.2.4) tends to A — x)l~m uniformly on compact subsets of [x eC: \x\ < \,x ф l}as^ ->• 1". Consequently, we need only consider the case where fi > к and /x + к > 1. By the ^-binomial theorem, the left side of A0.2.4) is A0.2.5)
492 10 Introduction to (/-Series It is easy to check that (q11 — qk+k)/(l — qk+l) increases with q for к + к > \i and so < lim к-ц+к Let m be the largest integer such that к + m — 1 < \i. Then for \x\ < 1 \-q l~qm 1 -q" , + m A. — и + n — 1 \-q m + 1 n Thus the series A0.2.5) from the rath term onward is majorized by the convergent series sup 0<o<l \-q l-q" (k - fi + т)„_„ (m + 1)„_ш This implies that we can take the termwise limit in A0.2.5) to get lim This proves the theorem. n=\ n\ A0.2.6) In a similar way, if we replace x with A — q)x in Corollary 10.2.2(a) and let q ->• 1", the series converges to ex: lim 1 (A -q)x\q)c A0.2.7) and we have a ^-analog of the exponential function. The sum in Corollary 10.2.2(b) also gives a series that converges to the exponential series, but the infinite product form is equivalent to A0.2.7). Similarly, lim (-A -q)x;q)oo = ex. A0.2.8) The functions used in A0.2.7) and A0.2.8) are occasionally given other names: 1 eq(x) := (A -q)x;q)oo' Eq(x) := (-A -q)x;q)oo. A0.2.9) A0.2.10)
10.3 The g-Gamma Function 493 10.3 The 4-Gamnia Function We return to the problem of finding an analog of the beta integral over @,1). By Theorem 10.2.4 it is reasonable to replace X \ 1 л j with Xa~l(qx;q)oo/(qfix;q)oo. We write the <?-binomial theorem as y. («¦+';?)„*¦ = (ах;д)ж(д;д)ж Л) * J (ft ft ft • ft\ (Y' ft \ (ft' /7 | Replace x with qa and a with q& in A0.3.1) to get ,_, (qx\ q)c j> -dqX = ^—^—' J;^v-"^. (Ю.3.2) x) (qa; q) oc (qp;q) oc This is the q -extension of ^"'(l -xf~xdx = /о To write A0.3.2) in this form we need the ^-version of the gamma function. The existence of a useful n \q indicates the possibility of a convenient analog of the gamma function. We follow Euler's procedure and look for an interpolation formula by using infinite products. Now (q;q)n (\-q)n The last expression does not require и to be a positive integer, so we set n\a = = ; , 0 < q < 1. 4 (\ „\n (\ n\n(nn+\- -^ ^ Tq(x):= ^;q)°° A-<?)'"* when \q\ < 1. A0.3.3) Here we take the principal values of qx and A — q)l"x. Then Tq(x) is a mero- morphic function with obvious poles at x = — n ± 2nik/ \ogq, where к and n are nonnegative integers. It is not difficult to see that the residue at x = — n is A - q)n+l (q-";q)n\ogq-1' Because Tq(x) has no zeros, its reciprocal is an entire function. It is left to the reader to check these facts. These properties of Yq (x) are similar to those of Г (x). We can now write A0.3.2), which is just another form of the g-binomial theorem, as follows:
494 10 Introduction to ^-Series Theorem 10.3.1 (qpx\q)eo q Tq(a+p) Remark 10.3.1 At first sight it might appear that we could have replaced A — л:)^1 with (ql~px; q)oo/(x; q)oo, but this is not as useful. The function A -л:/ is positive in the interval of integration @, 1) and 1-х vanishes at 1. In the q- integral the set of points over which the summation is carried out is q", n = 0, 1, 2,..., that is, a discrete set of points in [0, 1]. The first point to the right of the interval where we want our function to vanish is q~l. To get such a function replace xq~P by x in both infinite products. The result is (qx; q)<x/(qPx; q)^. The function (q1~^x; q)oo/(x, q)oo could have been used, but the ^-integral would beon[0,</]. Theorem 10.3.1 gives us one reason to accept Tq{x) as the natural (/-analog of Г (л:). Another reason is the following: The Bohr-Mollerap theorem states that Г (л:) is the unique function satisfying the functional equation f(x + l)=xf(x), /A) = 1, and is also logarithmically convex. It can be shown that Tq (x) is the only function that satisfies the functional equation and is also logarithmically convex. The proof of the latter result is identical with that of the Bohr-Mollerap theorem and was included in Chapter 1. See Exercise 10. From the definition of Tq{x) and the Bohr-Mollerap theorem, one may suspect that lim^i- Fq(x) — Г(х), and this is indeed true. We derive it as a consequence of the next theorem, the proof of which requires the following lemma. Lemma 10.3.2 Iff(l) = /B) = g(l) = gB) = 0and0< f"{x) < g"(x)for x>0, then f(x) < g(x) in [0, 1] U [2, oo) and f(x) > g(x) in [1, 2]. Proof. For x e [1, 2], it is easy to see that /(*) = J h{x,t)f"{t)dt with AC*, *) = {?_ 2)(jc - 1), 1 < x < t < 2. Since h(x,t) is negative for x and t in [1, 2], it follows that f(x) > g(x) in the interval [1,2]. We may assume without loss of generality that f(x) = 0. We have
10.3 The g-Gamma Function 495 just shown that, in that case, g(x) < 0 in [1,2] and g = 0 at the endpoints. By the mean-value theorem, g' is zero somewhere in A,2), and since g' is increasing we must have g'{x) < OforO < x < 1 and #'(¦*) > Oforx > 2. So g is decreasing in @,1) and increasing in B, oo). This implies the result. ¦ Theorem 10.3.3 ForO < r < q < 1, we have Гг(х) < Fq(x) < Г(х), 0<д:<1 or x>2, and Г(х)<Гч(х)<Гг(х), 1<д:<2. Proof Observe that d2 _^_ „x+n j _ q*+nJ We show that each term of the series is increasing in @,1). Set a — n +x; then aqa-\\ogq){\+qa) To prove that h' > 0, it is sufficient to demonstrate that the expression within the square brackets, which we denote by g(q), is negative. A simple calculation gives g'(q) = (l~q \2 > 0, q > 0. q(\+qaJ Since g(l) = 0, we have g(q) < 0 in @,1]. Thus h(q) is increasing and so ^2 log Г?(;с) is increasing for 0 < q < 1. This means that d2 , , iv/6i qyx), 0 < r < q < 1, лг>0. ил- dxl Moreover, logrV(l) = logr^(l) = logFrB) = logF9B) = 0. The theorem is now a consequence of Lemma 10.3.2. ¦ Corollary 10.3.4 lim^i- Vq(x) = Г(х). Proof. Theorem 10.3.3 implies that lim?^.i- Tq{x) = X(x) exists. Moreover, X(x) satisfies the conditions of the Bohr-Mollerap theorem. Thus X(x) = F(x)and
496 10 Introduction to ^-Series the corollary is proved for 0 < x < oo. For real nonintegral values, the functional equations then give the same result. For complex x, the Stieltjes-Vitali theorem completes the proof. See Hille [ 1962, p. 251] for this theorem. (For Gosper's proof of Corollary 10.3.4, see Andrews [1986, p. 109].) ¦ There are several results about Tq (x) that are analogs of corresponding state- statements about Г(х). The analogs of the Legendre duplication formula and the Gauss multiplication theorem are given in the theorem below. Theorem 10.3.5 (a) Г,Bдс)Г,2A/2) = A +<?J*-1Г>(х)Г>(х + 1/2). (b) Letr =q". Then Tq{nx)Tr{\/n)Tr{2/n) ¦ ¦ ¦ Гг((п - l)/n) = A +? + •••+ ?"-1Г-1Гг(х)Гг(х + 1/n) • • • Гг(х + (n - l)/n). The proofs are straightforward and left to the reader as exercises. The next two formulas can be regarded as asymptotic formulas for Tq (x) for large x, but they are of a different nature from Stirling's formula. They follow from Corollary 10.2.2: {q\q)o Re x > 0, A0.3.4) Rex>0. A0.3.5) Now recall that 1/ Г(х) has zeros at x =0,-1, -2,... whereas 1/ ГA - x) has zeros at x = 1, 2, 3,.... Thus their product has a zero at each integer. This fact is among the properties we see reflected in Euler's formula Г(х)ГA -х) =jr/sin;rx. Similarly, \/Tq{x) has zeros at x = — n due to the factor (qx; q)^. (In fact, 1/Г^(х) has zeros at x = —n ± 2л i k/ log q where к and n are nonnegative integers.) Hence the function with the full range of integer points as zeros may be taken as {qx\ qHO(q1~x; q)oo = (qx, q)oo(q/qx; q)oo- Replace qx with у to write the function as (y; q)oo(q/y, q)<x- We expect this function to have interesting properties. In fact, it is one of the theta functions discovered by Gauss and Jacobi and is the topic of the next section. 10.4 The Triple Product Identity The triple product identity expresses (x; q)oo(q/x; q)oo(q\ q)oo as a Laurent se- series in 0 < |jr| < oo. One proof of this identity follows from the terminating
10.4 The Triple Product Identity 497 ^-binomial theorem due to Rome given in Corollary 10.2.2. This proof was known to Gauss [1866a] and Cauchy [1843a]. At the end of the previous section we re- remarked that the infinite product (x;q)oc(q/x;q)oo arises naturally when we look for a <jr-analog of the Euler reflection formula. We shall see that the Laurent-series side of the identity also appears naturally from a particular Riemann sum approxi- approximation of the normal integral. This raises the question of whether the triple product identity could be approached from the series side. The answer is yes and this leads to another proof of the identity. We end the section with a number of identities that are important but simple consequences of the triple product identity. In later sections we give applications to number theory and combinatorics. Theorem 10.4.1 For \q\ < 1 andx e С - {0}, oo (*;<?)ooO?/.*;<?)oo(<7;<?)oo = ^ (-i)V'2)**- k=-oo Proof. Take N = In in Corollary 10.2.2(c) to obtain <y\k+n Ak+n)(k+n-\)/2xk+n I П k=—n Then replace x by xq~" and rewrite (xq~"; qJn as (xq-n; q)n(x; q)n = {-\)nxnq-n2+n(n~l)l2{q/x; q)n(x; q)n. The above identity then becomes {q/x;q)n{x;q)n = > - i~n (я\ q)n+k(q\ q)n-k When n -> oo, this gives (x;qHO(q/x;qHO= This limiting process can be justified by Tannery's theorem. The result in Theorem 10.4.1 is called the triple product identity. ¦ Remark 10.4.1 Replace q with e~2' and x with —e~'e'e in the identity. The result is oo oo e-nheine = д (j E fc=-oo n=0 The left side is a solution of the heat equation d2u du 1& = It'
498 10 Introduction to ^-Series The right side is positive for t > 0 since l+2rcos6>+r2 > l-2r+r2 = (l-rJ>0, when 0 < r < 1. The positivity is not evident from the left side. Thus the two sides give different properties of the function. Clearly, the right side also gives the zeros of the function. In view of the importance of the triple product identity, we look at it from another point of view. First replace q with q2 and x with — qx to get oo (~qx;q2HO(-q/x;q2HO(q2;q2HO= J2 ^x"¦ (МАЛ) n=—oo The sum on the right of A0.4.1) comes from an important integral. Consider the normal integral e~x2dx. Shift* by a/2 to get Replace this integral with the approximation formed by summing over a discrete one-dimensional lattice with space size 8: 00 г V^ -S2n2-aSn n=—oo It is natural to ask here how close the sum is to the integral when S is small. To answer this, consider the formula in Exercise 2.26: oo . oo V e-m(n+aI = -= У е-"/l'e2nina'. A0.4.2) n=—oo ' n=—oo When t is small the terms on the left are close to one for small values of n but all the terms on the right with one exception are very small. So the expression on the right works very well for numerical calculation of the series for small 8. Later we shall see that there is a deeper reason for the importance of the transformation formula A0.4.2). It shows that X^oo Ч" is a modular form. Let us now return to the sum in Theorem 10.4.1 when x = 1. The sum is ^ — Y^oo^^)n 4() -Apply the two changes in variables и -> — nandn ->¦ n+ 1 to get OO OO A = У (-1)V<«+1>/2 = У (-1)"+У<"+1>/2 = -А.
10.4 The Triple Product Identity 499 Thus A is zero. Write H(x) = n=—oo Then 1 -°°. л(л+П 1 ti(qx) = — > (—1) о 2 дг = — л: ^—' л: n=—oo or H(x) = -xH(qx). A0.4.3) This equation implies that if x is a root of H(x) = 0, then so are qx and ;c/<?. Since we know that x = 1 is a root, it follows that qn for every integer n is a root. Thus H{x) has (л:; q)oo(q/x; q)^ = T(x) as a factor. Without knowledge of Theorem 10.4.1 we cannot be sure that H(x) has no other zeros. However, it can be shown by a simple calculation that T(x) = ~xT(qx). That is, T(x) satisfies the same functional equation as H. It follows that the Laurent-series expansion of A0.4.1) is uniquely determined in a deleted neighbor- neighborhood of x = 0 up to a constant factor. Consequently, T(x) = C0(q)H(x). Replace q with q2 and x with — qx to get oo (-qx; q2)oo(-q/x\ q2)^ = C0(q2) ^ q"'xn. A0.4.4) n—~ oo There are a number of ways of finding Co (q2). We apply a device due to Gauss and Jacobi. Another method, also due to Gauss [ 1866b], using the arithmetic-geometric mean is given in Exercise 13. There also exist some combinatorial methods but we shall not describe them here. One such method is given in the next chapter. Let x = i in A0.4.4). The result is ОС E .«= — 00 (-1)" ЭС <E n=—oo (_1)«?B«+1J Since the left side is real for q real, we get
500 10 Introduction to g-Series The left side is identical to (—q2; qA)oo- Now set x = — 1 and replace q with q4 in A0.4.4) to get rt=—OO The last two identities imply C0(q2) (-q C0(q*) (q^q^l (q2; q' This gives Note that (q; q)x and J2T=~ooqk x* are continuous functions of q in \q\ < 1. Thus CoO?2) is also continuous, which implies that Co(O) = 1. Let n —> oo to get C0(q2)(q2;q2)oo = 1. This gives C0(^2) = l/(?2; ?2)oo and we have another proof of the triple product identity. Corollary 10.4.2 00 -?2")A - q2"-1J, (Gauss) A0.4.6) (-l)"qni3n+1)/2 = J](l - q"), (Euler) A0.4.7) OO OO л=0 л=1 OO -l)"Bn + l)^f"(n+1)/2 = JJA - 9"K. (Jacobi) A0.4.9) n=0 n=l Proo/ The identities in A0.4.5) and A0.4.6) are obvious from A0.4.4). Note that A0.4.6) also follows from A0.4.5) by changing q to -q. To get A0.4.7) replace q with q3/2 and set x = —^/q. For A0.4.8) and A0.4.9), write the identity in
10.5 Ramanujan's Summation Formula 501 Theorem 10.4.1, with x replaced by —qx, as OO 00 \ qn{-n >i (x" + x ) = A + 1/jc) 11A — q )(l+<? /x)(\ + q x). n=0 n=\ A0.4.10) Setx = l.Then n=0 This is A0.4.8). Now divide A0.4.10) by x + 1 and let x -* -1 to get A0.4.9). The corollary is proved. ¦ The sequences of numbers {n2}, {n(n + l)/2}, {п{Ъп ± l)/2} are the square, triangular, and pentagonal numbers, respectively. These are of number theoretic interest and their appearance as powers of q in the series make the above identities useful in combinatorial number theory. Remark 10.4.2 Gauss and Jacobi independently discovered the triple product identity. Jacobi's results including A0.4.9) are contained in his famous book, Fundamenta Nova (Jacobi [1829]). See Gauss [1866] for A0.4.1), A0.4.5), and A0.4.6). Identity A0.4.8) was published in Gauss [1808], a paper noted for con- containing the first evaluation of the quadratic Gauss sum. See Exercises 5 and 6. Euler's result A0.4.7) is the famous pentagonal number theorem. A discussion of Euler's proof and references are given in Weil [1983, p. 281]. See also Euler [1748]. 10.5 Ramanujan's Summation Formula The ^-binomial theorem evaluated a <jr-analog of the beta integral over @, 1). We have seen that the ^-integral over @, oo) is a bilateral series. Therefore, a ^-analog of the beta integral over @, oo), that is, /oo ra~^ should be a bilateral series, but similar to the series in the q -binomial theorem. The correct generalization was found by Ramanujan. He considered the bilateral sum —oo<*9>» A0.5.1)
502 10 Introduction to ^-Series which, as we shall see later, is a q -integral analog of B(a, 1 4° Jo dx. c01 Jo A + cx)a+P We should first clarify the meaning of (a; q)n in A0.5.1) for negative n. Since (a; q)n = (a; q)oo/(aq"; q)oo A0.5.2) for n > 0, and the right side is meaningful for negative n as well, we take A0.5.2) as the definition of (a;q)n for all n. If n = —m, then (a;q)-m = Ramanujan's evaluation of A0.5.1) in terms of infinite products is given next. It contains the <jr-binomial theorem and the triple product identity as special cases. Theorem 10.5.1 For \q\ < 1 and \ba~l\ < \x\ < 1, yv (g;q)n n _ (ax;qHO(q/ax;qHO(q;qHO(b/a;qHO n^0O (b\ q)n (x;q)oo(b/ax;qH0(b;qH0(q/a;qH0 First Proof. Use A0.5.2) to write the series as + ж Xlq;q)nfb\" The first series converges for |jc| < 1 and the second for \b/ax\ < 1. So the bilateral series converges when \ba~l\ < \x\ < 1. Observe that (a\q)n „ (a;q)oo ^ (bqn;qHO „ This proof of Ramanujan's identity depends on a functional relation satisfied by f(b). To find it, note that
10.5 Ramanujan's Summation Formula 503 This gives the desired functional equation, namely .... A -bid) fib) = ——f(bq). A — b/ax) Now f(b) is an analytic function of b for \b\ sufficiently small. Iteration gives fib) = |^%/@). (Ю.5.4) (b/axiq) It is not easy to sum /@), but f(q) can be obtained from the <jr-binomial theorem. Note that 0?; q)oo v^ («; <?)« „ _ 0?; < ^ ) ( From A0.5.4) /@) = _ (g/ад:; q)ooiq; q)ooiax; q)^ iq/a\q)ooia\q)ooix\q)oo Use this in A0.5.4) to get f(b) and then («;<?)„ „ _ jax; q)oo(q/ax; g)oo(q; (b; q)n (x; qHO(b/ax; q)^; q)ooiq/a; q)oo This argument assumed that we could take b = q. To prove the general case apply analytic continuation on b and x. Ш Second Proof (Venkatachaliengar). As in the case of the <?-binomial theorem, we can start with the infinite product and get the Laurent series. Suppose that F(x) = (**= *>*>(*/"'«>« = f Л.Х-. A0.5.5) ix;q)oo(b/ax;q)x nf^ The Laurent series is defined in \x\ < 1 and \b/ax\ < 1, that is, \b/a\ < \x\ < 1. We consider the Laurent expansion of F(qx), which we want to exist for \b/aq | < x\ < 1; so assume for the present that \b/aq\ < \x\ < 1. Both F(x) and F(qx) are defined in \b/aq \ < \x\ < 1. Thus it is possible to look for a functional relation
504 10 Introduction to ^-Series between F(x) and F{qx) in this region. Now oo (ax;q)(X(q/ax;q)oo(\-\/ax)(\ - x) ,, /1Г>с^ F(^x) = 7 Г~^ ГП 771—^ ; = 2^ A»«x- (Ю.5.6) (x; q)oc(b/ax; q)oc(l — ax)(I — b/aqx) f^ Therefore, oo oo q(l-x) J2 Anxn = (b-aqx) Y, Anq"xn. «= — 00 «=—00 Equate the coefficients of x" to get q(An — An_i) = bAnqn — aAn_\qn, or (?; q)n This implies that (ax;q)oo(q/ax;q)oo _ (x;qHo(b/ax;qHo Multiply both sides by A — x) and letx —> 1~ to get (a;q)cc(q/a;q)x = (^jOoo Aq A0.5.8) (q\q)oc(b/a;q)oc (b\q)oc Substitute this value of Ao in A0.5.7). Ramanujan's identity has the restriction on b that \b/aq\ < \x\ < 1. This can be removed by analytic continuation. In the derivation of A0.5.8), Abel's continuity theorem was used in the last step. This theorem states that if Нт„_>ос о,п= a then 00 lim A — x) yjanx" = a. ¦ x^l~ «=o See Appendix В for a discussion of this and related matters. Third Proof (Ismail [1977]). Rewrite the g-binomial theorem as follows (ax;^)oo ^(a;q)n °° '"¦-^ - (a;g)N N y^ (aqN;q)n „ (q;q)N „_>,(qN+1;q)n
10.5 Ramanujan's Summation Formula 505 Replace a with aq~N to obtain (a; q)n n _ (aq~Nx;q)oox-N(q;q)N i(qN+l;q)n (aq~N;q)N(x;q)oc (aq~Nx;q)N _N (q; q)oo{ax;q)Xl (q/ax;q)N{q;q)<Xl{ax;q)oo -x (aq~N;q)N (qN+1\ q)oo(x; q)oo (q/a\ q)N(qN+l; q)oo(x; q)oc (ax; q)oc(q/ax; q)oc(g; q)n(qN+l/a; д)ж (x\ q)oc(qN+1/ax; q)oc(qN+l\ q)oc(q/a\ q)x ' lfqN+l is replaced by b, both sides of this equality are analytic in b for b close to 0, and they agree when b = qN+l. Zero is the limit point of this sequence, and two functions analytic in an open set around b = 0 and that are equal at infinitely many points in this set must be identically equal there. Analytic continuation gives Theorem 10.5.1 for \b/a\ < \x\ < 1. ¦ Theorem 10.5.1 is called Ramanujan's i^i formula because bilateral <?-series are denoted by \j/ and there is one upper and one lower parameter. Remark 10.5.1 If the argument in the second proof is generalized to (ax; g)oo(b/x; д)х (cx;q)O0(d/x;q)O0' a second-order difference equation arises. This reduces to a first-order equation for d = q/c; the coefficients can then be determined. Unfortunately, the resulting series is incorrect. This problem occurs because the change from F(x) to F(qx) moves one region of the analyticity of F to an adjacent one, since the poles of F(x) are at x = c~lqn, n = 0, ±1, Details are given in Askey [1987]. Remark 10.5.2 As remarked earlier, Ramanujan's formula is a ^-analog of a beta integral. Rewrite the formula as (bgn; <?)oq n_ (ax;qHc(qjax;qHc(q;q)oo(b/a\q)oo _(.aqn;q)oo (a; q)oo(q/a; q)oo(x; q)oo(b/ax; q)x ' Set x = qa, a = -c, and b = -cqa+fi to get ^;<7)co „_, _ (-cqa; q)oc(-c-[ql-a; q)xrq(a)rq(P) Finally, by Theorem 10.2.4, ton C,q 'q_°° , C_.g_. 'q °° = A+сГ"A + \/c)a = с'01.
506 10 Introduction to ^-Series 10.6 Representations of Numbers as Sums of Squares Ramanujan's formula for Y^L-oc(a'< <?)«x"/Ф'< Ч)п is useful in obtaining simple and direct derivations of some results on the number of representations of a number as a sum of squares. Three cases are worked out here: two squares (x2 + y2), three squares where two are equal (x2 + 2y2), and four squares (x2 + y2 + u2 + v2). For other results of this type consult Fine [1988]. At the heart of the proofs is the following special case of Ramanujan's identity. Take b = aq in Theorem 10.5.1 and divide by 1 — a to get у x = — „tLl-aq" (x; (ax; q)oo(q/ax; q)oo(q; q)\ \q)oo(q/x\qH0(a;qH0(q/a;qH0' A0.6.1) Now observe that (OO \ * 00 Yl «" j =J2r^n)cin' A0-6-2) «=-oo / «=0 where rs (и) is the number of ways n can be written as the sum ofs squares. Observe also that rs (n) gives the number of points with integer coordinates on the sphere in.?-dimensional space. Here the counting distinguishes between the order, so that 25 = 32 + 42 = 42 = 42 + 32 are counted, and negative integers are also used; hence, for example, (—3J + 42 is counted. We consider s = 2 and s = 4. Now note that q2 =^s(n)qn, A0.6.3) n=—oo m=—oo n=0 where s(n) is the number of ways of writing n as x2 + 2;y2. The strategy for finding simple expressions for rs (n) and.? (n) is to use the triple product identity to express the left sides of A0.6.2) and A0.6.3) as products and then use A0.6.1) to express these products as a different sum. Let djj(n) denote the number of divisors of и congruent to i mod j. Theorem 10.6.1 (a) r2(n) = 4[dlA(n) - d3A(n)l (b) s(n) = 2[dU8(n) (c) Proof. (a) By A0.4.5) and A0.6.2), n=0
10.6 Representations of Numbers as Sums of Squares 507 To apply A0.6.1) rewrite the product as <g2;ffi(-«:g;;«. (ю.6.4) Replace q with q2 in A0.6.1) and set x = q and a = — 1. Then This implies oo n=0 2 (n)q - 00 =¦ = 1 This proves (a), (b) As in (a), one can show oo «= —ОС 00 !E< n=—oo J2m qn 00 n=—oo 00 +4E 00 +4E n=l that on + q> 00 E< m=0 (^1,4< (-q;q2)lc(q2>q (q;q2)U-q2;q oo 2)oo /00 oo / I 1 m=0 -3,, A This time replace q with q4 in A0.6.1) and then set a = — 1 and * = g to get n=—oo Y 00 00 n=l m=0 00 ОС Г_(8т + 1)л , _(8m+3)n ^,(8т+5)л ^,(8m+7)n] [q + q — q ~ q j • This proves (b).
508 10 Introduction to ^-Series (c) Rewrite A0.6.1) as follows: 1 \-aqn I-a 1 I xn(l-aqn)+aqnxn aqn -z^Y^V- Let a = — 1 and combine with A0.6.1) to get (~qx; q)oo(-q/x; 9H0(9; {xq\q)oo{q/x\q)oo(-q;q Then x -> -1 gives = 1 + 2A - x) y, (q/x)n[\ - x2n] n=\ 1 +9" A0.6.6) It follows from A0.4.6) that n=-oo v "' :9H n=—00 Change q to -g in A0.6.6) to arrive at 00 00 „ nq" 9)c = 1 Enq" . 1 - <7" ,„ Ln=i 00 CO 2" + 9 2" E"9 V^ 4'" n=l J n=l ,4n A0.6.7) The result follows as before. The identities A0.6.5) and A0.6.7) and their interpretations were first discovered by Jacobi [ 1829]. The number theoretic result in (b) was stated and proved by Gauss for n prime. 10.7 Elliptic and Theta Functions The theory of elliptic functions with its ramifications has been studied for two centuries. Its founders are Euler, Gauss, Abel, and Jacobi. In this century the theory of elliptic functions has been largely incorporated within the theory of
10.7 Elliptic and Theta Functions 509 elliptic curves, which was recently used by Wiles to prove Fermat's last theorem. In this section we will simply give some definitions and show how Ramanujan obtained the Fourier-series expansions of Jacobi elliptic functions from A0.6.1). There are four theta functions of Jacobi. They are all really the same function, just as sin z and cos z are the same, but it is useful to consider all four: sinBn 00 (n+l/2feBn+l)iz = 0i(z + jt/2; q), A0.7.2) 00 02(z, q) = 2^^(n+1/2J cosBn + \)z n=0 00 n=-oo 00 00 : 1 + 2 ]P g cos 2nz = ^2 qe2niz, A0.7.3) n=l n=—oo 00 04(z, q) = 1 + 2 J^(-1)"^ cos 2nz B=l / ТГ \ A0.7.4) Here q = еЛ!Т with Imr > 0 so that \q\ < 1 for a given т. The value of qx for some Л is determined from ел'Хт. The following relations, satisfied by the theta functions, follow immediately from the definitions: e3(z + n,q) =9A(z,q)\ and 0! (z + jrr, 9) = -«-'«"^^(z, 9), 7ГТ, ?) = «-^-^^(z, ?), A0./.6) ) = ?-1е-2'г03(г,^), + 7ГТ, q) = q'le-2ize4(z, q).
510 10 Introduction to g-Series The following formulas are obtained from the triple product identity: Z, q) = V/4sinz(?2, q2)oo(q2e2iz; q2)cc(q2e-2iz; q1)^ q2; q2)oo(q2e2iz; q2)ac(e-2iz; q2)oo, q)oo(q;q)oo(q;q)cc ? = ql/4eiz(q2,q2)oc(-q2e2iz;q2H0(-e'2iz;q2)oc, Z, q) = (q2; q2)oc(-qe2iz; q2)oo{-qe-'lil\ q2)oo, 94(z, q) = (q2; q2)cc(qe2iz\ q^ooiqe-**; q2)oc For z = 0, these reduce to в2 = 02@) = 02@, q) = 2qll\q2; q2)oo(~q2\ q2J^ 03 = 03(O) = 0з(О, q) = (q2; q2)oo(-q\ q2J^ A0.7.8) 04 = ^4@H4@, q) = (q2, q2)oo(q; q2fx, and Proposition 10.7.1 в[ = Proof. The right side equals The factor (—q; q )oo(—q , q )oo(q', q ) equals one because it can be written as , ч , 2ч (-q;q)oo(q,q2)(q2;q2)oo i-q\ q)oc{q\ q ) = —г—г- _ {-q\q)oc(q\q)oc _ (q2; q2)™ _ , This proves the proposition, which, it should be noted, is equivalent to A0.4.9). ¦ A meromorphic function is called an elliptic function if it has two periods whose ratio is not a real number. To define the Jacobi elliptic functions, set A0.7.9) and let 030i(m/U2) <10J10)
10.7 Elliptic and Theta Functions 511 It follows from A0.7.5) and A0.7.6) that sn(u, k) is periodic in и with periods 2л-Q\ and лтв2, and so it is an elliptic function. Other Jacobi functions are сп(мД) = ft and 04 dn(«,)fc) = — ft 04 (и/032) There are also functions that correspond to esc z, tanz, and so on: —, пс(иД) = —-——, nd(u,k) = sn(M.ifc) , cn(u,k) , сп(мД) „ ^,, sc(ii,jfe) = —^-(, cs(H,ifc)= cd(H,*)= ; / A0.7.11) cn(u,k) sn(u,k) dn(u,k) sn(u,k) dn(M.ifc) dn(u,k) sd(u,k)= ) '/ ds(ii*) ; ' ' dc(MA:) У dn(K) сп(м, к) Exercise 2.10 required us to show that '' dt = = K. " Jo ^(l-t2)(l-k2t2) SetM =2Kx/n.Then в3 0i (x) — • , A0.7.12) 02 0 W and similar relations hold for the other functions. We omit the modulus к in sn(w ,k) etc., and simply write sn и etc. It is easy to see that _ - 2q2n~l cos 2x + q4 A0.7.13) en и = 2ql/4ka/2k~l/2cosx TT < —; -.—, >, 1J-1 1-2a2'1-1cos2x + ?4'1 I and Here k' > 0 is denned by ka = I — k2. Again, by the exercise mentioned above, klXl2 = 64/63. There we took 0 < к < 1, but the resultsJiold more generally. One can also deduce from Exercise 2.11 that sn и has periods AK and UK'.
512 10 Introduction to ^-Series The Fourier expansion of the Jacobian functions is given in the next theorem. Theorem 10.7.2 With и = 2Кх/л, we have 7Г 2л х^-ч qn cos 2nx * ^ (Ш7Л5) n=0 Proof. We prove A0.7.14) and leave the other two parts as exercises. In Ramanujan's formula A0.6.1), first replace q with q1 and then set x = qe'2lx and a — \/q. Then ч2 _fl2n-l (np2ix- a2) (af-2ix- a2) (c- ~2\i ' /oo Change « to n + 1 in the second sum; then combine with the first to get " sinBn + l)x о n Now multiply both sides by —ielxql^27t/Kk and show that the product reduces to A calculation almost identical with the one in Proposition 10.7.1 is needed. This proves the theorem. ¦ There are similar series for cd, sd, and nd. The series for the other Jacobi functions such as ns и are somewhat different. For example, 7Г 2ТГ ^ ^2« smBn + 1)X + —V- , , • A0.7.17) n=0 ч This can be proved directly or by first showing that sn(M + iK') = k~ldc(u -K) = k~lns u. See Whittaker and Watson [1940, p. 511].
10.8 я-Beta Integrals 513 10.8 tf-Beta Integrals We have seen that beta integrals and their extensions and analogs are very useful and important. There are several more extensions and it is worthwhile to look at a few here. One is quite important as it is associated with a set of orthogonal polynomials that is beginning to find applications in several areas of mathematics. We noted earlier that Ramanujan's x t^i could be viewed as the integral of xa~l{—ax; q)oc/(—x; q)^ with respect to the Fermat measure. Ramanujan also integrated this function using the usual measure. He used a general interpolation- type theorem to evaluate it. An account of this is given later. For now, we give an evaluation using a functional relation since that fits in with the methods used earlier. Let 0 < q < I, Rea > 0, and \aq~a\ < 1. Assume for the moment that 0 < a < 1. Define f(a) = Г Х" {~UX'q)codx. Jo (-*; <?)oo Then f(a) = о (-x;q)oo = (l-a)f{aq)+aq'af(a). This iterates to 1 — aq a \и.ц , c^oo As in the evaluation of Ramanujan's i i/fi, it is hard to find /@). But [°° x01-1 ж f(q) = / -r-—dx = . Jo 1 + x sin not Thus (aq-a; q)oo(q\ q)oo smita when 0 < a < 1. The general case follows by analytic continuation in a, and by continuity when a is a positive integer. The limiting case when a = к is a positive integer is Jo (-«*; <?)cqdx = (-l)k+l(q/a)k(q;q)k-iuogq) (-x;q)oo (a~lq;q)k
514 10 Introduction to ^-Series To see that A0.8.1) extends I dx = /1 I \(Y-t-n 0 take a = qa+fi in A0.8.1) to get This formula lacks symmetry in a and /3. Symmetry is restored by the following integral: The details of the evaluation are left to the reader. First show that f(a, b) = f(aq, b) + aq-cf(a, bq) and then by a similar procedure that f(a,b) = f(a,bq) + bqcf(aq,b). The two equations together give 1 — ab (ab: q)oo f(a, b) = f(a, bq) = *, q'°° f(a, 0). A - bqc) (bqc; q)^ The value of /(a, 1) is given by A0.8.1), and the final result is (ab; q)oo(qc\ q)oo(qX~c, q)oo7t f(a,b) = (bqc; qHO(aq'c; q)oo{q\ q)oo sin^c When a = qa+c, b = q^c\ then Г Jo (-¦*; q)oo(-q/x; q)oo Г(с)ГA-с) Гч{а)Г„ф) Г, (с) Г, A-е) TJa To obtain other <?-beta integrals, let us again consider Ramanujan's i ^i formula (see A0.5.3)). When \b/a\ < x < 1 and \q\ < 1, (a; q)nxn (ax; qHo(q/ax; q)x(b/q; q)oo(q; q)oo у „fr^o (b><l)n (x;qHO(b/ax;qHO(q/a; q)oo(b; q)^
10.8 q-Beta Integrals 515 Set ax = ql/2ew andx = aew, \a\ < 1. Then f (q e '' ; 9H oo ¦7Г °° 'r,q)n 1^-2tz. A0.8.2) 9H0 To obtain a positive kernel in A0.8.2), take b = aq1?2. Then the integrand is an even function of в, and becomes ,, n=0 . __-, ,-а2?2" (a2; 9H0(9; 9)oo To get a perspective on these integrals, replace a with qa~xl2 and b with ^ in A0.8.1). The result is {q«-W;q)O0{ql>-We-»;qH0™-'ai Tq{a)Tq{fi) ' and the limit when q ->¦ 1 is /V. */ —7Г A0.8.5) In a similar fashion, A0.8.3) gives the special case a = /3 of A0.8.5), that is, "(Z-lc^ey-'de = 2"-' f{\-x)«-\l-x2)-dx = ^ ~ 1}. 0 7-i Г (а) Г (a) One simple extension of A0.8.2) can be seen when we multiply by e~'ke before integrating. A more important extension is obtained when we reconsider the argu- argument given in A0.8.2) and realize that the numerator could be replaced by other products. The product is one obvious choice. Two others are (^;9)оо(е-1Й;9)оо and {-ew; qUi-e40; ?)«,. Each of these has an extra factor from the theta product in the numerator of the i \j/\ sum, and there is no problem in evaluating a similar integral. One could get greedy
516 10 Introduction to ^-Series and ask whether it is possible to evaluate the integral when all of these factors are present simultaneously. The integral is I(a,b,c,d) ; q)oc(e~'e; q)oo(q1/2e10; q)oo(ql/2e~'9', q)oc(—e'e; q)oo(—e~'e; q)oo Г Jo (aeie; q)oo(ae-i»; q)oo(bei(>; q)^18; q)oo(ceie; q)^^^; qH h(x, \)h(x,ql'2)h{x, -l)h(x, -ql'2)dx /_i h(x, a)h{x, b)h{x, c)h(x, d)J\ - x2 where 00 h(x, a) = J|(l - 2axqn + a2q2n). n=0 The condition on the parameters is that max(|a|, \b\, \c\, \d\, \q\) < 1. Instead of directly evaluating the integral, let us first try to guess what its value should be. We would like to discover some functional relations. Observe that b a b(l - 2ax + a2) - a(l - 2bx + b2) h(x,aq)h(x,b) h(x, a)h(x,bq) h(x,a)h(x,b) _ (b-a)(\ -ab) ~~ h(x,a)h(x,b) The reason we put the factors b and a on the left is to remove the x in the numerator. This leads to bl(aq, b, c, d) - al(a, bq, c, d) = A - ab)(b - a)I(a, b, c, d). A0.8.6) This suggests that I {a, b, c, d) is a function ofab and the symmetric products. So we try I(a, b, c, d) = f(ab)f(ac)f(ad)f(bc)f(bd)f(cd). A0.8.7) Apply this to A0.8.6) with с = d = 0 to get (b - fl)(l - ab)f{ab) = {b- a)f(abq). This gives f(abq) f(abq") /@) fiab) = 1 - ab {ab; q)n (ab; qH
10.8 q-Beta Integrals 517 To see if this is correct, use it in A0.8.7) and then put this conjectured value of the integral in A0.8.6). The result is equivalent to (abq)O0(acq)oo{adq)oo(bc)O0{bd)O0(cd)O0 a (abqHO(acHO(adHO(bcqHO(bdqHO(cdHO _ b-a (abq)oo(ac)oo(ad)oo(bc)oo(bd)oo(cd)oo' where we have written (х)х instead of (x; q)^ for convenience. This equation is equivalent to b(\ - ac)(\ - ad) - a(l - bc)(\ - bd) = b - a. This is not true because the left side is (b — a){\ — abed). However, it suggests that the error is a function of abed. So we write h(abed) I(a,b,c,d) = (ab)oo(ac)oo(ad)oo(bc)oo(bd)oo(cd)o Now use this in A0.8.6) to get bh{abcdq) (abq)oo(acq)oo(adq)oo(bc)oo(bd)(x>(cd)oo ah(abcdq) (abqHO(acHO(adHO(bcq)O0(bdqHO(cdHO (b — a){\ — ab)h(abcd) (ab)oo(ac)oo(ad)oo(bc)oo(bd)oo(cd)oo Simplify to get h(abcd) = A -abcd)h(abcdq). Iterate this to arrive at h(abcd) = (abed; q)ooh@) = (abed; q)xM(q). A0.8.8) To identify M(q) we need to find specific values of a, b, c, and d where we can evaluate both the integral and the infinite product in A0.8.8). The obvious values (a, b, c, d) = A, ^/q, — 1, —^/q) work for the integral as well as the product. The integral is Г d0 = л,
518 10 Introduction to ^-Series and (-ql/2; q)oo(-q\ q)oo(q1/2q)o This implies that M(q) = 2n/(q; q)^. The conjectured value of I {a, b, c, d) is in fact correct and we have the theorem: Theorem 10.8.1 For max(|a|, \b\, \c\, \d\, \q\) < 1, ' h{x,\)h(x,Jq)h{x,-l)h{x,-Jq) dx h(x,a)h(x,b)h(x,c)h(x,d) Vl - x2 2л(abed; q)x (q> q)oc(ab; q)oo(ac; q)oo(ad; q)oo(bc; ^H0(Ы; ^)го(с^; q)oo ' when 00 h(x, q) = ЦA - 2axqn + a2q2n). n=0 Proof. We have already seen that both sides of this relation satisfy the func- functional equation A0.8.6). It is also clear that both sides are analytic functions of a,b,c, and d when max(|a|, \b\, \c\, \d\) < 1. By the uniqueness of ana- analytic functions, it will be sufficient to show that they are equal for (a, b, c, d) = (qJ, qk+1/2, -qe, -qm+1/2), j, к, I, m = 0, 1, 2,.... We first show that they are equal for a = 1, с = — 1, d = ^fq, and all b, \b\ < 1. In this case the integral is simply h(x; ^/q) dx and this was evaluated in A0.8.3). Thus we have to show that 2?r(V/2; 2; q)c But the right side is ; q)oo(bq1/2; q)M n{bq1'2; q) 2(q; q)oo(b2; q2)oo(b2q; q2)oo(-q; q)oo(q; q2)oo Ф2; q)x(qi q)oc ' This gives us a starting value, and the functional equation A0.8.6) then shows that if the integral can be computed for some (a,b,c,d), then it can be computed when
10.8 q-Beta Integrals 519 one of the parameters is multiplied by q. This proves the formula for (a, b, c, d) = (qj,b, -qe, -qm+1/2), j, I, m = 0, 1, 2, ..., and all b, \b\ < 1, which completes the proof. ¦ If the restriction to max(|a|, \b\, \c\, \d\) < 1 is removed, then some discrete mass points appear. This case was first treated directly via Cauchy's theorem by Askey and Wilson [1985]. A direct extension from the case in Theorem 10.8.1 was provided by Gasper and Rahman [1990, §6.6]. An extension of Theorem 10.8.1 is given below. Theorem 10.8.2 Formaxi<i<5(|ai|, |o|) < 1, h{x,l)h{x,Jq)h(x,-\)h(x,-Jq)h(x,Yl\ai) dx /, П1 *(*,*.-) VI- 27r(fl1fl2«3«4)oo(ai«2«3«5)oo («102H0 («1«з)оо(«1«4)оо(«1«5)оо(я2«з)оо (a2«3«4«5)oo(«l«2«4«5)oo(«l«3«4«5)oo («2«4)oo(«2«5)oo(«3«4)oo(«3«5)oo(«4«5)oo0?)oo' where (x)oo = (x; q)x- This is proved in the same way we proved Theorem 10.8.1, except that a more elementary initial result can be used. A more general integral was evaluated by Nassrallah and Rahman [1985]. This special case was noted by Rahman [1986], and the proof outlined here was given by Askey [1988]. Theorem 10.8.1 has an important limiting case, as we saw earlier. Write and in the integral and let q —*¦ 1 . The result is the de Branges-Wilson integral C.6.1): 00 Г (a + ix)T{b + ix)r(c + ix)T{d + ix) 2 J dx Г(Их) Г (a + b)T(a + c)T(a + d)T{b + с)Гф + d)r(c + d) ~ T(a + b + c + d) ' Re(a, b, c, d) > 0, A0.8.9) when either all the parameters are real or any complex ones appear in conjugate pairs. When a is complex, | Г (a + ix) \2 should be replaced by Г (a + ix)T(a — ix).
520 10 Introduction to ^-Series We saw that the integrand of A0.8.9) is the weight function for the orthogonality of the Wilson polynomials — n, n+a+b + c + d— l,a — ix,a + ix ; 1 a -\- b, a -\- с, a + d There is a #-extension of these polynomials for which the integrand in Theo- Theorem 10.8.1 plays the same role. See Gasper and Rahman [1990]. 10.9 Basic Hypergeometric Series The series in the #-binomial theorem takes the form Y2 cn with cn+\/cn a rational function of qn. In this case C/i+i _ 1 -aqn <-« i q and cn = 7 Г"-* i CQ = 1. (?; ?)« Earlier, we started with the <?-binomial theorem and then proceeded to consider the triple product and its generalization, the i ^i • This introduced us to theta functions and elliptic functions, among other things. The above remarks on the ^-binomial series suggest another direction. As a generalization of this series we consider a series V cn for which = cn This can be solved as (a; q)n(b; q)n n (c;q)n(q;q)n The function Yl^Lo c« 's denoted by а<Ь ;q,x and is an example of a basic hypergeometric series. If we set a = qa, b — q@, and с = qY and let q -> 1~, we have the ordinary hypergeometric function 2Fi(a,(i; y;x). The 2Ф1 series was studied by Heine [1847], who proved the following theorem, which is an analog of Euler's integral formula for the hyper- hypergeometric function:
10.9 Basic Hypergeometric Series 521 Theorem 10.9.1 For \q\ < 1, |x| < 1, and \b\ < 1, a,b \ (b;q)oo(ax;q)oo (c/b,x \ \q,x = 201 ,q,b]. с ) (c;q)oo(x;q)oo V ax ) Proof. As one might suspect from the proof of Euler's integral formula, a proof of the above transformation requires the <y-binomial theorem and a change of the order of summation. Once again we write (а)„ instead of (a; q)n for convenience. Then ,'a,b 201 f (9). ^0 (9)- c/b,x ; which completes the proof. To help understand this transformation, Thomae [1869] observed that it is a ^-integral analog of Euler's integral representation of the 2^1 hypergeometric function. Set a = qa, b = q&, and с = qv in Theorem 10.9.1 to get qr-P- q)m{x- q)mqP>n х ^ {q)n{qy)n X (qy\q)oo(x;q)oo ^0 iq\ q)m(qax; q)m {qP\ q^jq^; g)oo y^ (gm+1i q)oo{xqa+m\ ^ -l. q)oo{xqm. q)x ' A0.9.1) Use the notation (a\q)a (aqa;q) to write A0.9.1) as the ^-integral formula Tq{Y) (xt;q)a ^ Euler's integral representation is one way of evaluating a 2 F\ at x = 1. Similarly, we have the ^-analog of Gauss's sum due to Heine.
522 10 Introduction to ^-Series Corollary 10.9.2 For \c/ab\ < 1, a,b \ (c/a; q)co(c/b; ;q,c/ab = с J (c; q)oo(c/ab; q)x Proof. Takex = c/abin Theorem 10.10.1, assuming that |6| < Iand|c/a6| < 1. Then [( ' ;q,c/ab) = —— V с J (c;q {сjab; q)mb" (q)m (c; q)oo(c/ab; q)^ (b; q)^ The last step follows from the ^-binomial theorem. This proves the corollary except for the assumption on b that \b\ < 1. This can be removed by analytic continuation. ¦ Bailey [1941] found an analog of Kummer's theorem that evaluates 2F\{a,b\ a+\ -b;-l). Theorem 10.9.3 If\q\< min(l, \b\), then л. ( a>b iu\ (aq>q)°c(q;q)oc(aq/b;q)oc 2<pi ; q, —q/o = . \aq/b J (aq/b;q)oo(-q/b;q)oo Proof. First assume that \a\ < 1. Interchange a and b in Heine's transformation to get / a,b \ = l\aq/b' ' / (a; (a; (aqjb (aq/b q)oc ;q)a q)oo q)l (-q\ (-?; Л-q, Л-q, q)oo /b; q)z q)x /b;q)c /b;q)z 00 m=0 (a X) (q/ь (q (q2/i (q2 (a;q ;q)m ; q)m \q2\ 0 (-q/b; (-q;q) >m m m q)m m (aq/b; q)oo(-q/b; q)x Now remove the condition on a and the result is proved. Corollary 10.9.4 00 an(n-l)xn i , (Cauchy), A0.9.2) = {aq; q2)oo(-q; q)oo. A0.9.3)
10.10 Basic Hypergeometric Identities 523 Proof. For A0.9.2) set a = I/A, b = 1/B, and с - x in Heine's for- formula (Corollary 10.9.2) and then let A -+ 0, В ->• 0. To obtain A0.9.3), set b = l/В in Bailey's formula (Theorem 10.9.3) and let В -> 0. This proves the corollary. ¦ The case x = q in A0.9.2) gives This particular case of A0.9.2) was known to Euler. Similarly, the special case a = q in A0.9.3) gives Gauss's formula °° °° 1 Jim 1 -I- X^an(n+l)/2 = TT ~q 2-^<4 11 2 — q2m-\ ¦ We obtained this earlier from the triple product identity. It should be clear by now that many results on hypergeometric series have extensions to basic hypergeometric series. We develop this theme a little more in the next section, where we derive some basic hypergeometric identities. For much more on this subject, see Gasper and Rahman [1990]. Identity A0.9.2) is in Cauchy [1843]. We terminate this section with a generalization of the 2ф\ series that will be needed in the next section and later. An r<ps basic hypergeometric series is defined by ai,a2, n ПО 9 4) where q ф 0, when r > s + 1. An г+\Фг series is called ^-balanced if x = q and b\b2---br — qka\a2 ¦ ¦ ¦ ar+\. A0.9.5) When к = 1, the series is called balanced. The series A0.9.4) is called well poised if s = r — 1 and qa\=b\ai = ••• = br-\ar. A0.9.6) 10.10 Basic Hypergeometric Identities An iteration of Heine's transformation in Theorem 10.9.1 gives 'a,b \ (c/b;q)oo(bx;q)oo fabxlc,b \ ' •--> w 'HJOoy 'HJ°° + I ' ;q,c/b), A0.10.1) с J (c; q)oo(x', <?)oo V bx and a second iteration gives
524 10 Introduction to g-Series Theorem 10.10.1 'а,Ъ с This is a ^-analog of Euler's transformation formula 'a,b \ „ „-„-h „ fc-a,c-b > \ (abx c;q)x (с а, с b \ ПП1П~ ;q,x) = — 20i ;q,abxc). A0.10.2) In A0.10.2), expand the infinite product on the right by the ^-binomial theorem and equate the coefficients of x". This gives (a; q)n(b; q)n A (c/a;q)k(c/b;q)k k(ab/c;q)n-k (c;q)n(q;q)n f^ (c;q)k(q;q)k (q\q)n-k (ab/c; q)n -A (q~n; q)k(c/a\ q)k(c/b; q)kqk (q;q)n frZ (ql-nc/ab;q)k(c;q)k(q;q)k After renaming parameters, we get ~",a,b, \ (c/a;q)n(c/b;q)n (c;q)n(c/ab;q)n ' This extends the Pfaff-Saalschiitz identity for the balanced 3F2. The 302 in A0.10.3) is balanced because the product of the numerator parameters q~"ab times the power-series variable q equals the product of the denominator parame- parameters. Recall that the q used in the definition of balanced appears as the power-series variable. For Heine's 2^1 sum in Corollary 10.9.2, the variable is c/ab, and the same type of condition holds, that is, the product of the numerator parameters times the power-series variable is the denominator parameter (a ¦ b ¦ (c/ab) = c). The following is a more general result due to Sears [1951]. It gives the transfor- transformation between two terminating balanced 403 series and extends Whipple's 4F3 transformation. Theorem 10.10.2 For a positive integer n, (q~n,a,b,c, \ n(e I a\ q)n(fI a; q)n 403 ; q.q ] = a 1 J ~ r 4] (e;q)n(f;q)n / n~n n И/h die \ A0.10.4) i,aq~n+l/e, aq~n when def = abcql~n. Proof. Take the transformation A0.10.2) twice each time with different param- parameters; then take their product such that the function on the left is multiplied by the
10.10 Basic Hypergeometric Identities 525 function on the right in the other identity. This yields a,b \ (dex/f; д)ж (f/d, f/e dex\ ;q,x\— 20i , ;q,—r) с ) (x;q)oo \ f f J d,e \{abx/c;q)oo , (c/a,c/b abx\ ;q,x]— 201 \q, • Reduce this to the product of two series by taking ab de The equality of the coefficients of x" gives /e\ g)nk (dc/f) A (a; q)k(b; q)k (f/d; q)n-k{f/e\ g)n-k (dc/f)n-k „м) ^с; ^к^' q^>k (f' i)n-k(q; q)n-k _ ^ (d; q)k(e\ q)k (c/a; q)n-k(c/b; q)n_k ~^ (/; q)k(q; q)k (c; q)n-k(q\ q)n-k After some rearrangement and renaming of parameters, it can be seen that this is equivalent to A0.10.4). ¦ It should be observed that both sides of A0.10.4) are balanced series. There are many interesting limiting cases of this transformation given in the next theorem. Before we state it, note that if с = d in A0.10.4), we get A0.10.3). Theorem 10.10.3 'q-\a,b_ q-",a,b \ (e/a;q)n(f/a;q)n „ ( q~n,a,abql~n/ef A0.10.6) (a,b,c \ (e/a;qH0(de/bc;q)oo , (a,d/b,d/c . \ 302 ; q, de/abc = -—302 , , ,, ; q, e/a , V d,e J {e;q)o0{de/abc;qH0 \ d,de/bc J A0.10.7) fa,b,c , , , \ (a;q)oo(de/ab;q)oo(de/ac;q)o 302 , ;q,de/abc = d,e J (d;qH0{e\q)x{de/abc\qH0 'd/a,e/a, de/abc , ,4,» i- A0.10.8) de/ab, de/ac '
526 10 Introduction to g-Series Proof. To prove A0.10.5), let / and с tend to zero in A0.10.4), keeping f/c fixed. Similarly, for A0.10.6), let d and с tend to zero with d/c = q~"+1ab/ef. Finally, to prove A0.10.7), set f — ql and let n -»¦ oo with n + к fixed so that qn+x _ abcq/dg. Now A0.10.7) is a generalization of Kummer's transformation a,b,c \ r(e)r(d + e-a-b-c) / a,d-b,d-c fa,b,c \ _ r(e)r(d + eabc) / a,db,dc 3 2\ d,e ' У ~ r(e-a)r(d + e-b-cK 2\d,d + e-b-c' A0.10.9) When A0.10.7) is applied to itself, the resultis A0.10.8), witha and с interchanged. Remark 10.10.1 It is worth mentioning that if we set x = c/e and let с -> oo, Kummer's transformation becomes "•''-* " ¦ A0.10.10) In contrast, if x = a/e and a —>• oo it becomes ¦^i-o-xj-W-V-^i. We have already seen a g-extension of the last formula. Let us determine a q- extension of A0.10.10). In formula A0.10.7), let de/abc = x and с and e tend to zero. The result is a, .qx\ _ Уах'У>°° y^ (a'q>n( ' 'q)"q [——фх)п. d ) {x\q)oo ^ (d; q)n{ax; q)n{q; q)n A0.10.12) When a, b, and с are replaced by qa, qb, and ^e respectively and we let q —> 1~, this reduces to A0.10.10). This is an instance where the ^-extension is nicer than the q = 1 case in the following sense: The left side of A0.10.10) is analytic for |x| < 1, while the right side has two factors, the first analytic forx in С — [1, oo) and the second analytic for Rex < 1/2. In A0.10.12), the 2<pi on the left is analytic for |x| < 1, while on the right, l/(x; q)^ has poles when x — 1, q~l, q~2, The other factors are entire functions since {ax; q)oo/(ax; q)n = (axq"; q)^ is entire while the series converges uniformly for |x| < A for each A.
10.11 g-Ultraspherical Polynomials 527 10.11 4-Ultraspherical Polynomials In Section 6.11, we introduced the (continuous) g-ultraspherical polynomials de- defined by Cn(x; p | q) = У^ —:—-—'-—— cos(n — 2k)9 f^ {q;q)k(q;q)n-k ^'i)n JM , , , ,r o-^-iw i A011 Л) (<?; q)n (Because in this section we discuss only the continuous g-ultraspherical polyno- polynomials, the word continuous will be dropped.) When the generating function found in Chapter 6 is combined with the g-binomial theorem, the result is It is also easy to conclude from F.11.5) and F.11.6) that the polynomials Cn(x; /5 | q) satisfy the recurrence relation 2A - Pq")xCn(x; /5 | q) = A - qn+l)Cn+l(x; /3 | q) + (l-p2qn-l)Cn-i(x;P\q). A0.11.3) An implication of Theorem 6.6.2 is that [Cn(x; ft \ q)}, x — cos0 is an orthog- orthogonal polynomial sequence with respect to the distribution 2 cop(cosd)dd — lw;q)o dd. A0.11.4) (Pe2if>; q Since we did not prove Theorem 6.6.2, we verify this fact directly here. Theorem 10.11.1 When \q\ < 1 and |/8| < 1, / Cn(cos6>; p | <?)Cm(cos6>; p \ q)a>e(cos0)d9 Jo ^2n(\-p) (p2; g)n (Р;д)х(Рд;д)оо5 A - pq") ' (q; q)n ' (p2; ?)«,(?; q)K mn' where cop(cosd) is given by A0.11.4). Proof. Take p and q real. Since the integrand in A0.11.5) is unchanged by the transformation в ->¦ —в and Cn(cos#; p \ q) has an expansion in terms of ei(n-ik)B^ start wjtll tlie integral i-n („2i0. _\ („-Ив. „\ = em %,' Ч)°°(е 'q)°° dB. J-n (Pe^-q^ipe-^-q)^
528 10 Introduction to q-Senes For | f} | < 1, the g-binomial theorem gives When k is odd, the integral is zero. When к = 21, г-ж " / •"" J —11 n=0 (q;q)n(q\q)e+n к ¦ V qe+l Apply A0.10.1) to the above 2Ф1 to get This 20i has only two terms, and so we have nee . ал 2лР(Г; g)<d + ge) e (°(COS9) Le A second way to evaluate this integral is to use Ramanujan's 1^1 sum A0.5.3). Let m < n. Now consider the integral / cos(m - 2к)в Cn(cos6>; /8 | q)со^(cos в)dd Jo = I Г em-W у (Р-'ЯШ'ЯЪ-'^-трв (cose)de_ 2 j-п j^ (<?; q)dq\ q\-t Since m -Vn — 2k — 21 has to be even for the contribution of the integral to be nonzero, let m — n — 2k = — 2s orm—2k = n—2s. Then the integral is у (Р;яМР;я)»-1 Г ш-i-,» (cosQ)dQ J P a}fi(cos0)d0 ~-°(\+qs-' 3s(l+qs q; q)e(q; q)n- (Pq;q)oo(P;q)oo (Р;я)п(Р~1;я), A (g-"; яНя"Р~и, яМР; яМ-я1-'; q)d ' j^o (q; q)t(ql-"p-l\ q)dqx~sP\ q)d-q~s\ q)e '
10.11 g-Ultraspherical Polynomials 529 The second equation in the above calculation is obtained by changing t to n — I and в to —в. The last sum, which is a balanced 4^3, when transformed by Sears's formula (Theorem 10.10.2) becomes Pn(ql-s\q)n(qX-n/P2;q)n (q~\ P, ~P, q~[. V ?"' Ч'~п P2 (qt-'P; q)n{qx-nIP; ?)« V -?"', Ч'~п, P2 This 40з has only two nonzero terms and the above expression becomes, after a simple calculation, (P2;q)n(ql-s\q)n-iu-qn-2s) (P;q)n(ql-'P;q)n(l+q-s) The factor (qx~s; q)n-i is zero for s = 1, 2,..., и — 1. This gives / cos(« -2sN>Cn(cos6>; fi \q)cop(cosd)de Jo {0 for.s = 1,2, ...,n- 1, (P;q)oo(Pq;q)oo(P2;q)n . „ n— fors = 0 or n. (P2;q)oc(q;q)oc(Pq;q)n Jo By A0.11.1) and the above relation we obtain the orthogonality Cm(cos6>; p I <?)C,,(cos6>; p | ^)«/j(cos6>)u?6> (<8; q)n (<82; g)n(<8; g)oo(j6g: q)"° j (q;q)n (Pq;q)n(P2\q)oo(q;q)oo A-/8) (I2; <?)„ (P;g)oc(Pq;q)oos n(\-Pq") (q;q)n {q; q)x(P2\ q) The theorem is proved. ¦ The polynomials Cn(cos#; p\q) satisfy a difference equation. To state it, we need the #-difference operator Dq defined by P with Sgg(ew) = g(q1'2ew)-g(q-^2ew), x= bqx A0.11.6) It can be shown from the generating function for Cn(cos#; p\q) that DqCn{x; p\q)= ^ J ^'""^Сп-К*; Pq\q). A0.11.7) The ^-difference equation is A - q2)Dq[(oN(x)Dqy(x)] + V~n(l - qn)(\ - p2qn)coft(x)y(x) = 0, A0.11.8)
530 10 Introduction to g-Series with y(x) = Cn(x; p | q). As a first step to the proof of A0.11.8), one can show that Dq(wp(x)Ca(x;p\q)) 2q-n/1(l - qn+l)(l - p2qn~l) One can use A0.11.7) to give a proof of the connection coefficient formula: Cn(x; у | <?) = > ^— — —Сп-2к(х; P I q). f^ (q;q)k(Pq;q)n-k 1-/8 A0.11.9) The proof follows the same steps as that of Theorem 7.1.4'. Formula A0.11.9) was first given by Rogers [1895]. Rogers also found the following linearization formula: Cm(x;p \q)Cn(x;P \ q) min(m, = E mm(m'n) (<?; g)m+n-2k(P; g)m-k(P; q)n-k(P; q)k(P2\ q)m+n-k 0 2' q)m+n-2k(q; q)m-kiq; q)n-k(q; q)kiPq\ q)m+n-k - pqm+n-2k) -Ст+п„2к(х; р \q). A0.11.10) A - Р) Proving this by induction is easy. It is likely that Rogers first computed the formula for a few small values of m and then guessed the general result. The simplest direct evaluation of the formula may be the one similar to the proof of Theorem 6.8.2, which uses the ^-analog of Whipple's transformation given in Chapter 12. For details see Gasper [1985]. The proofs of A0.11.7) to A0.11.10) are left to the reader. When we set p = 0 in Cn (x; p \ q) we get the (continuous) g-Hermite polyno- polynomials. They are defined by Н„(х | q) = (q; q)nCn(x; 0 | q). A0.11.11) The following properties of the g-Hermite polynomials are now immediate: #n(cos6> \q) = ir (Jtlh cos(n _ 2к)в trt(q;q)k(q;q)n-k They satisfy the orthogonality relation Hn(cose | q)Hn(cose | q)\(e2w; q^fde = / m" ¦ A0.11.13) /" Jo (qn+u,q)o
10.11 g-Ultraspherical Polynomials 531 Since the weight function for the Hermite polynomials is e~x and we may regard the integral f71 f J-a /0 as an extension of the normal integral /¦00 e'^dx = -00 The generating function for Hn(x \ q) is given by J2 n(x\1>rn = —_ __ ) x = cos6>; A0.11.15) and the three-term recurrence relation is 2xHn(x | q) - Hn+i(x \q) + (l- qn)Hn-x(x \ q). A0.11.16) By A0.11.10), the linearization formula is easily seen to be Hm(x | q)Hn{x | q) _ sr-^ Hm+n-2k(x | q) (q;q)m(q;q)n ~^ {q\ q)k{q\ q)n-k(q\ q)m-k A direct proof is indicated in Exercise 41. From A0.11.17) and the ^-binomial theorem, it is possible to derive a formula for the Poisson kernel of the ^-Hermite polynomials: 2;<?)oo A0.11.18) where r is real, —1 < r < 1. The derivation is left to the reader as an exercise. Finally we note that the integral I (a, b, c, d) of Section 10.8 can be written in terms of #-Hermite polynomials as v^ akbecmdJ I(a,b,c,d)= } A0.11.19) i >o ^' tiM* 4)kiq\ q)e(q\ q) • Г Hk(x | q)Ht(x | q)Hm(x \ q)Hj(x | q)\(elw;, Jo where x = cos#. This follows easily from the generating function A0.11.15). By means of the linearization formula A0.11.17), another evaluation of the integral is
532 10 Introduction to g-Series obtained. This observation is due to Ismail and Stanton [1988]. They also pointed out that A0.11.17) and A0.11.18) are equivalent. Al-Salam and Ismail [ 1988] used the connection coefficients and linearization for (continuous) g-ultraspherical and Hermite polynomials to prove Theorem 10.8.2. 10.12 Mellin Transforms The integral of xa~l(—ax; q)oo/(-x, <?)oo, evaluated in Section 10.8 by a func- functional equation, is also a particular case of an interesting Mellin transformation formula. This formula, given by Ramanujan, has many important applications, some of which will be presented in the exercises. The Mellin transform connects up in an important way the transformations of some ^-series with functional equa- equations satisfied by certain Dirichlet series. Earlier we mentioned a transformation for n=\ that was useful in computing a Riemann sum approximation of the normal integral f°° e~c'2dt. In Exercise 2.28, the reader was asked to use this transformation to obtain the functional equation for the Riemann zeta function, ?(s). This relation- relationship between ^-series, which arise from elliptic functions, and Dirichlet series is particularly important in number theory. We discuss a few examples, especially those involving ^-series, as considered in the previous sections. Ramanujan stated his formula as / _ - хф{\) + х2фB) }dx = sin sn In this form, one must put some strong restrictions on 0C$). Hardy [1940, pp. 189-190] gave fairly general conditions for the validity of this formula. See also Berndt [1985, p. 299]. Hardy's theorem is stated below. Let H(S) denote the half plane и = a + it, a > -8, 0 < S < 1. Suppose A < n; let К (A, P, S) denote the set of all functions ф, holomorphic on H (<5), that satisfy \ф(и)\ = O{ePa+m). A0.12.1) Take 0 < с < S, and define 1 fc+i°° n Ф(х) = — / ф{-и)х~ийи. A0.12.2) 2ni Jc-iao &mnu The integrand is
10.12 Mellin Transforms 533 thus that the integral converges uniformly in any interval 0 < xo < x < X. There- Therefore, the integral represents an analytic function Ф (x) for x > 0. An application of Cauchy's theorem gives 1 f n 2ni y_Ar_i_,-00 smnu = 0@) - x0(l) + • • • + (-lfxN(p(N). A0.12.3) If we take 0 < x < e~p, then the series 5^°(-1)п0(и).хп converges and the integral in A0.12.3) goes to 0 as N -> oo. So, for 0 < x < e p, Ф(х) = 0@) - 0(l)x + 0B)x2 .... A0.12.4) Theorem 10.12.1 LetO<Res<S.IfФ(x) is given by A0.12.2), then / xs~x^{x)dx — —'¦ Jo sir Sin 57Г Proof. Choose c\ and C2 so that 0 < C\ <Res=or <сг < S. Compute the absolutely convergent double integral i A rci+ioo — r- / / in two different ways. One way it equals o— / ^"M / <H-u)x-udu)dx = / х^1 2л-1 Jo \Jc-ioo Sin^M / Jo The other way it is Xs ldx)du = -—l -. du. ) 2J ,_I00 sin;rM \J0 ) 2niJCl_i00 &innus-u Therefore, Jo 2ni JCl_100 sin^M s — и Similarly, use the double integral 1 Г ri+io° л- . . . ..... /OO ЛС2+1 Л-2-ioo to get Г Г .-u/ ч^ ! Г2+'°° ^ ^z"L / x Ф(х)с?х = / du. J\ 2ni Jc i00 smnu s-u
534 10 Introduction to ^-Series An application of Cauchy's residue theorem gives fC2+i00 N д. /o ZTll \Jci—ioo Jc2—ioo П smns Corollary 10.12.2 If0<q<l,s>0,and0<a< qs, then /¦00 Гxs-i(-ax' Jo (x;q ;^)oo sinsn (q;q)x(aq s;q)oo Proof. Take ш = (a;q)^,q)Xj and check that ф satisfies the conditions of Theorem 10.12.1. ¦ Another corollary is Carlson's theorem, which we proved and used in previous chapters for ф bounded. Corollary 10.12.3 Suppose ф(и) is analytic in a half plane H(8) = [и \ Rew > -8], 0 < 8 < 1 satisfying A0.12.1), and ф(п) = Ofor n — 0, 1, 2,.... Then ф = 0. This corollary shows that Ramanujan's formula is actually an interpolation for- formula. In fact, Newton's interpolation formula k(-s) = A@) + ^ where ДЛ@) = Ml) - МО), Д2М0) = ЯB) - 2ЯA) + МО), can be derived from Theorem 10.12.1, under conditions on X. For this and other applications the reader should see Exercises 34-36. When we discussed Mellin transforms earlier, it was mentioned that Mellin transforms are just Fourier transforms with a change of variables. Thus theorems about Fourier transforms have analogs for Mellin transforms. Therefore, there is a uniqueness theorem for the Mellin transform. Suppose xa~lf(x) is integrable on @, oo) for a < о < p. Then /¦OO F(s)= xs~lf(x)dx A0.12.5) Jo exists and is analytic for a < Re s < p.
10.12 Mellin Transforms 535 Lemma 10.12.4 IfF(cro + it) = Ofor all t e (—oo, oo) and fixed go in («, P), then f(x) = 0 almost everywhere. Proof. Let x = e" in A0.12.5). Then /¦OO F(flr0+i0= / eituea«uf(e")du J — 00 is the Fourier transform of ea°uf(e") = 0 almost everywhere. Since ea°u never vanishes, the lemma is proved. ¦ Note that if, in the lemma, / is continuous at a point, then / must be zero at that point. This implies that if the Mellin transforms of two continuous functions are the same, then the two functions are equal. Here is an application of this fact: Theorem 10.12.5 For и > О I e-(utwlAt)^t = Proof It is sufficient to prove this for и = 1. Denote the integral by f(x). For Re s > 1, the double integral /°° f°° г dt Jo ft is absolutely convergent. Thus it is equal to 5 dt 1 f°° —p \ax — I x j(x)ax. wt \ Jo Jo [Jo Now change the order of integration to get /•00 — t Г /-00  /-00 e-~\ xs-le-x2'4tdx\dx=2s-lr(s/2) t Jo -ft \_Jo J Л The last step follows from Legendre's duplication formula. We have /•OO / xs-lf(x)dx = f^T(s). Jo This means that f(x) and fjre~x have the same Mellin transform and hence they are equal. This concludes the proof of the theorem. ¦ This proof follows Bellman [ 1961, p. 30], which contains an interesting discus- discussion of theta functions.
536 10 Introduction to ^-Series There is another related integral transform that is important, the Laplace trans- transform, defined by /*oo F(s) = / e-stf(t)dt. Jo If / is integrable and satisfies \f(t)\ = O(ebt) as t -> oo, then F(s) is analytic for Re s > b. There are uniqueness theorems for Laplace transforms as well. We can use this uniqueness to prove the transformation formula for theta functions mentioned earlier. Rex > 0. (A more general result can be proved by the Poisson summation formula, but this result shows that the theta function defined by the series is a modular form. See Remarks 10.12.1 and 10.12.2 at the end of the section.) Proof. For Re s > 0, -oo °° 1 °° 1 Theorem 10.12.6 oo и=—oo frc CO n=—oo / Jo e n=\ By Theorem 10.12.5, when Re s > 0 /¦0 / Jo n=l n=l ж 2тг е'1п^ л- + According to A.2.5), we have л cot лх = —1-2 X n=\ After proper identification, this shows that л 1 + < ^/c 1 _ p~2n^s c. /L^i „2 _]_ г and the lemma is proved by the uniqueness of the Laplace transform. This proof is due to Hamburger [ 1922]. ¦ When we take the Mellin transform of У^„ е~п x, the Riemann zeta function is obtained. This gives a connection between ^-series and Dirichlet series. An
10.12 Mellin Transforms 537 exercise in Chapter 2 asks the reader to use the result of Theorem 10.12.6 to obtain the functional equation of t;(s). We give the details here because the result and proof are important. Theorem 10.12.7 The expressionn~s/2T{sj2)C,{s) is invariant under s -> \—s. The following proof goes back to Riemann [1859]. Pmof. For Re s > 1, and ф(x) = J^Li e~"lnx- /00 °° /¦1 roo = / x(s/2)~l\j/(x)dx + / x(-sl2)-lf{x)dx. Jo J\ Change x to l/x in the first integral to get I X-il/2)-lf(l/x)dx. A0.12.6) By Theorem 10.12.6 +2ф(х) = ~( Thus A0.12.6) becomes — 1 1 Г°° = —- --+ x-^ l-s s Jx We can conclude that 1 + s l-s Ji A0.12.7) Because the integral is an entire function of s, the expression on the left is defined for all s. In particular, ?(.?) is defined for all s except s = 1 where it has a pole of order 1. The integral is invariant under s -> 1 — s. This proves the theorem. ¦ We have shown that the functional equation for the zeta function, the transfor- transformation formula for the theta function, and the partial fraction expansion of the cotangent function are equivalent results. Of course, we still need to show that the theta transformation is a consequence of the functional equation. This can be done by Mellin inversion. To illustrate the technique, we apply it to a different though
538 10 Introduction to ^-Series related function, that is, 00 tl(T)=ql'24Y[(l-q"), \q\<l, A0.12.8) n=l where q = elnix. This is called the Dedekind ^-function. Before we state the theorem, note that the integral in A0.12.7) is bounded in vertical strips, a < Re s < p. So the same is true for n-s'2T(s/2)l;(s) + - s 1 — s Theorem 10.12.8 The Dedekind function r)(x) satisfies the relation T](-l/T) = ^ TJ(T). Proof. We follow Weil [1968]. Since t]{t) has no zeros for Im r > 0, we prove that logi?(-l/T)=logij(T) + ilog(T/i). A0.12.9) From A0.12.8) Let ю = E m,n>l 00 Jlnimnx m m,n=l The Dirichlet series corresponding to / can be found by taking its Mellin transform as before: — Inmnx /o ^ m Now m=\ n=l It follows from Theorem 10.12.7 that A(s) - Bff)-T(j)C remains invariant under s -> —s. It is clear that A(s) has simple poles at s — ±1 with residues ±7г/12 respectively, since f (—1) = —1/12. This last fact can be
10.12 Mellin Transforms 539 verified from the functional equation for ? (s). At s = 0, however, Л (s) has a double pole. These are the only poles of A(s). Since ?'@) = —1/2, we conclude that / Л (Ю1210) 12(s-l) 12E+1) ' 2s2 x ' is entire and bounded on every vertical strip. The last observation follows from the remarks made before the theorem. By Cauchy's theorem, the Mellin inverse of r(,s) is «Г?, that is, 2 /-c+ioo e~y = r— / T{s)y-Sds, с > 1. From this we get i /¦c+ioo fdy) = T~ / Ms)y~'ds, с > 1. A0.12.11) Observe that t;(s)t;(s + 1) is bounded on the line Re s = с > 1, because the series for ?(s) converges absolutely. By Stirling's formula r(.s) ~ V2Jr\t\a-{1/2)e-*W2, s=a + it, \t\ -> oo, in any vertical strip a <a < /3. These two facts imply that for any ц, > О \A(s)\ = O(\t\-'*), \t\ -> oo, Res = ol. A0.12.12) Now choose c\ so that —c\ > 1. Thus, for any д > О, \A(s)\ = \A(-s)\ = Since the expression in A0.12.10) is bounded in every vertical strip, it follows from the Phragmen-Lindelof theorem that A0.12.12) holds in (ci,c) for large t. All this implies that we can move the line of integration in A0.12.11) to — с < — 1, while picking up the residues at s = ±1 and s =0. The expansion for y~s is 1 - s log у Н , so the residue at s = 0 is (log v)/2. At s = ±1, we have residues n/\2y and -тгу/12. The result is r-c+ioa ^: / 2ni J-c-ico 1 r— c+ioo = — / A{-s)y-sds + tt/12v + (log v)/2 - ttv/12 2ТП J-c-ioo 2 /¦c+ioo = — / A(s)y'ds + ir/12y + (logy)/2 - тгу/12 2тгг Jc_i00 (\ogy)/2 - nyj\2.
540 10 Introduction to ^-Series We may therefore conclude that the relation A0.12.9) holds on the imaginary axis. Since the functions involved are analytic, the result is proved. ¦ Remark 10.12.1 Note that g(r) = r?8(r) has the following two properties: g(-l/T)l/T4 = g(T). The interest in the two transformations r -> r + 1, r -> -1/r stems from the fact that they generate the modular group G, which consists of all the fractional linear transformations (at + b)j(cx + d), where a,b,c, and d are integers with ad — be = 1. A function / on the upper half plane is called a modular form if for some integer к f(Ax)A'{x)k = /(г) for all A e G. This holds for g(x) = t]8(x) with к = 2. Subgroups of G of finite index also play a very important role. Consider the theta function where q = enix. The function h{x) = 6>4(r) satisfies A(r+2)=A(r), й(-1/гI/г4 = The transformations r -> r +2 and r -> —1/r generate a subgroup of the modular group. A problem of interest to number theorists is to find properties of the Fourier coefficients of modular forms. For example, the coefficient of q" in the expansion of h(t) is the number of representations of и as a sum of four squares, which was studied in Section 10.6 by nonmodular methods. For a proof using modular forms and Hecke operators, see Koblitz [1984, p. 174]. Remark 10.12.2 Theorem 10.12.8 has a simple proof that considers the integral of cotzcot(z/r) over a suitable contour in the complex plane; it was given by Siegel [1954]. Rademacher [1955] obtained a more general result by a similar method. Let h, ti, and к be integers and let
10.12 Mellin Transforms 541 with gcd(h, k) = 1, к > 0, Re z > 0, and hti =e -l(mod k). Then Rademacher's result states that i°8 tf \ и i —о ¦< i i. ' ' 2 where the Dedekind sum s(h,k) is given by k-l It is, however, worth noting here that the transformation formula for tj(t) is implied by a similar formula for theta functions, a particular case of which is contained in Theorem 10.12.6. We have OO 00 J2 e-s(n+xI = s~l/2 J2 ennl/se2ltinx A0.12.13) n=—oo n=—oo (Appendix D, Equation (D.4.2)). Set q = enix and p = е~л'/г. By the definition of #з in A0.7.3), the above formula implies l, p) = */VlW/t03Grz,q). A0.12.14) Now the infinite product for #3 contained in A0.7.7) gives Я-+1 1 +g-^' J-1V Ч У V У n=l and f n=\ An easy calculation shows that A0.12.14), A0.12.15), and A0.12.16) imply -.3 тК-1/гK = - W- i V г or where с is a cube root of unity. Set r = i to see that с = 1. Before closing this chapter, we also note that quadratic Gauss sums are related to theta functions. Cauchy evaluated these sums from the formula in Theorem 10.12.6 by taking x = e + 2ni/N and letting e -> 0, where N is an odd integer. A more general reciprocity relation for Gaussian sums can also be obtained by this method.
542 10 Introduction to ^-Series Bellman [ 1961, p. 40] attributed this proof of the reciprocity relation to Landsberg, but it was published earlier by Henry Smith in 1859. Smith's [1859-1866] report contained the reference to Cauchy. For the connection of the reciprocity of Gaussian sums to Tauberian theory, see Bochner [1952]. The reciprocity formula is given in Exercise 43. Exercises 1. Let р„ (к) denote the number of permutations of 1, 2,..., n with к inversions. Prove that n\q = Y^Pn(k)qk- (Rodrigues) /t>o Then find the total number ofinversions for all the permutations of 1,2,..., n. (Stern) The problem on the total number of inversions was posed by Stern and a solution was given by Rodrigues. An interesting history of results on inversions is given in W. Johnson's unpublished notes, some old and new results on inversions. 2. Give a proof of A0.0.10) similar to the proof of the noncommutative binomial theorem given in the first part of the chapter. 3. Let p(m,n,k) = number of partitions of к into < n parts, each part < m. Show that i,n,k)q ¦ k>0 Deduce that p(m, n, k) = p(n, m, k). 4. Prove the following identities: m The last identity is a ^-analog of the Chu-Vandermonde identity. 5. Let \\ , /(9,0) = 1. Prove that f(q,m) = (l-qm~l)f(q,m-2). Deduce that ,( ч j(l-q)(l-q3)--(l-qm-1), m even, f(q,m) = < (Gauss) 0, m odd.
Exercises 543 6. Let n be an odd integer and x a primitive nth root of unity, (a) Use the result in Exercise 5 to show that k=l (b) Deduce that n-1 /fc=O = (-1)"г (x2 - x-2)(x4 -x~4) ¦ ¦ ¦ (xn~l - x-n+l). (c) Show that (±л/п torn = 4k + I, 1 ±iy/n for n = 4k + 3. (d) Set x = e2niln in (b) to obtain „ ,-.,s=i . 2л- . 6л- . 2(и-2)л- G = Bi) 2 sm — sm — • • • sm . n n n Note that when n = 4k +1, there are /t negative factors in the sine product. Conclude that Do a similar analysis for n = 4k + 3 to show that, in this case, G = i-s/n. (Gauss) 7. Define inv(mi, mi, ¦ ¦ ¦, mr; n) to be the number of permutations x\x2,..., хШ]+Ш2+...+ш„ of {lmi2m2,..., rm'\ in which there are exactly n inversions. Show that (a) inv(mi,...,mr;n) = ]?"=oinv(mH hmr_bmr; (b) Use induction and (a) to show that, for r > 1, \mi+m2-\ Ymr „>0 L mum2,...,mr Jq (q\ q)mt ¦¦•(#; q)mr This is a result of MacMahon. See Andrews [1976, p. 41]. 8. Prove that the Gaussian polynomial G{m,n; q) = is reciprocal, that is, G(m, n\ q) = qmnG(m, n\ 1/q). Deduce that p(m, n, k) = p(m, n, mn — k).
544 10 Introduction to <?-Series 9. Prove the following version of the q -binomial theorem: bkia;q)kib;q)n-k- State and prove a multinomial extension of this formula. 10. (a) Prove the following analog of the Bohr-Mollerap theorem: For 0 < q < \,Tq (x) is the unique logarithmically convex function that satisfies the functional equation fq(x + 1) = -~-fq{x) with fq{\) = 1. (b) Let g(x) be defined for x > 0, and let lim «W = 0. *->-oo x Prove that any two convex solutions of f(x + 1) - f{x) = gix) differ at most by a constant. Use this to derive the result in (a). (See John [1938].) 11. Prove the ^-analogs of Legendre's duplication formula and Gauss's multipli- multiplication formula contained in Theorem 10.3.5. 12. Forq > 1, define Show that (a) Tqix) satisfies the functional equation in Exercise 10. (b) Иш^1+Г,(х) = Г(х). (c) The residue of Tq (x) at x = — n is iq - l)"+iqn(n+i)/2 iq;q)n\ogq (d) If q > 1 and qx - 1 d3 /(x+l) = ^ -fix), /A) = 1, and --^ q — 1 for x > 0, then Я*) = Г,(дО forx>0. (e) If q > 1 and ^ =1, and J-j for x > 0, then fix) = Tqix) forx>0. (See Moak [1980].)
Exercises 545 13. Compute C0(q2) in A0.4.4) as follows: (a) Note that n=l and n=l (b) Show that n=\ (Hint: Use 2 (c) Show that Deduce that 00 Co(q2rl = Y[(l-q2n). (Gauss) 14. Prove the quintuple product identity: 00 tf (x) = Д(! - «")(! ~ V)d " q"~l/x)(l ~ Л2""')A - q2"~l/x2) П = — 00 Deduce that 00 OO n=l n=—oo One method is to take H(x) — JZ^l-oo c(«)x". To find c@), compute Щ^- and ^У? to determine c(n) in terms of c@). Then specialize x. 15. Prove that the quintuple product identity in the previous problem is equivalent to the identity For Exercises 16-24 and related results and references, see Gasper and Rahman [1990].
546 10 Introduction to ^-Series 16. Prove the following q -analog of Dougall's formula for a 2-balanced, very well poised 7^6 : / a,q*Ja,—q*Ja,b,c,d,e,q~N V л/а, —л/а, aq/b, aq/c, aq/d, aq/e, aqN+1' (aq; q)N(aq/cd;q)N(aq/bd;q)N(aq/bc; q)N (aq/b;q)N(aq/c;q)N(aq/d; q)N(aq/bcd;q)N ' when bcde = a2qN+l and TV is a positive integer. One proof goes as follows: (a) Write / instead of q~N and express the formula as / a,q*Ja,—q*Ja,b,c,d,e,f \-Ja, -л/а, aq/b, aq/c, aq/d, aq/e, aq/f' (aq)oa(aq /cd)oo(aq /bd)oa(aq /bc)o (aqlb)oo{aqlc)oc,(aq/d)OCl(aqlf)oo(aqlbcd)oo when a2q = bcdef. (b) Suppose the formula is true for / = 1, q~l, q~2,..., q~N+1 and take / = q~N. By symmetry the result is true if с or d = a2q/bcef is equal tol,*-1,...,*-^1. (c) Observe that if the original formula is multiplied by (aq/c; q)iy and (aq/bcd; q)iy, then the formula gives the identity of two polynomials in с of degree 2N. From (b) the two sides are equal for 2N values of с Now set с = aqN and verify the equality in this case. (Jackson) 17. Use the formula in Exercise 16 and that method of proof to show that . -,,„-., -ly~,-,d,e,f,g,h,j *' л/а, —л/а, aq/c, aq/d, aq/e, aq/f,aq/g,aq/h,aq/j' k, q*/k, -q'/k, kc/a, kd/a, ke/a, f, g, h, j лД, -лД, aq/c, aq/d, aq/e, kq/f, kq/g, kq/h, kq/j 4'4 when к = a2q/cde and cdefghj = a3q2, and /, g, h, or j is of the form q~N where TV is a nonnegative integer. (Bailey)
Exercises 547 18. Derive the following formula from Bailey's formula given in the previous exercise: / a,q^/a,q^/a,c,d,e,f,g 2 2 \<Ja,-*/a,aq/c,aq/d,aq/e,aq/f,aq/gy ' Ч (aq)oa(aq /fg)oo(aq / ge)oo(aq /ef)oo {aq I e)oo(aq I f)oo(aq I g)oa(aq I efg)o aq/cd,e,f,g •403 , . , ,,;?-?, \efg/a,aq/c,aq/d ) when e, f, or g is of the form q~N. This is a ^-analog of Whipple's transfor- transformation. Deduce Sears's transformation formula in Theorem 10.11.2. (Watson) 19. Let c, d, e, f, and g = q~N tend to oo in Watson's formula given in Exercise 18 to get у (-1) WE")/2O - aq2n){aq'' q)n~l Simplify when a = 1 and a = q. 20. Derive the following identities from Jackson's formula in Exercise 16: (a) 6051 r~ /-' ,, ; q,aq/bcd) _ (aq; q)oa(aq/bc; qHO(aq/bd; q)oo(aq/cd; q)^ (aq/b; q)oo(aq/c\ q)oo(aq/d; q)oa(aq/bcd; q)^ provided \aq/bcd\ < 1. (b) The g-Dixon formula / a,-qy/a,b,c _ \ 403 r- ,, . ;q,qVa be) \-<Ja,aq/b,aq/c * ) __ (aq;q)oa(aq/bc;q) (aq/b;q)oo(aq/c;q provided \q*Ja/bc\ < 1. 21. Prove the following identities and also deduce (b) from (a): . ч . fq-",b,c \ (de/bc;q)n (bc\n fq-",d/b,d/c (a) 302 , ; q, q I = —:—-— -r 302 , , ., ; \ d,e ) (e;q)n \d) \ d,de/bc
548 10 Introduction to ^-Series (Ь) 2<i /\ л(ч~П>а>Ь de(in\ (Ф^дЬ, fg'na,d/b . (c) 302 , , q, —j— = —,—-—3021 , , , ; g, g • V d, e ab ) (e\q)n \d,aql-n/e > 22. Prove Watson's g-Barnes integral formula fa,b 20i \q,z J-ioo (aqs;q)oo(bqs\q)ocismns \z\ < 1, |arg(-z)| < 7Г. 23. Let Re с > 0, Re d > 0, and Re(x + y) > 1. Prove the following analog of Cauchy's beta integral: J_ f°° (-csq^qUidsq^q)^^ 2xi J_ioo {-cs; q)oo(ds; q)^ Fq(x + y-l) (-cqx/d;q)oo(-dqy/c;q)oo rq(x)rq(y) i when 0 < q < 1. (Wilson) 24. Verify the following analogs of Barnes's first and second lemmas. (a) _I_ /"'°° (з'~С+*: g)oo(g'"^; <7)oq nq'ds 2ni Jioo (qa+s; q)oo(qb+s; q)oo sin^(c - s) smn(d - s) qc (q\q)o<. d)" (qa+c;q)oo(qa+d;q)oc(qb+c;q)oc(qb+d;q)oo ' (Watson) _L Г(qi+s' ^°°tid+s< q)™^' 1H0 *qsds 2ni J_ioo (qa+°; q)x(qb+s; q)oo(qc+s\ q)<*> sinns sinn(d + s) (g; gU{qd\ gHO(g'"'; gHO(ge~a; gHO(g'-''; g)oo(ge"c; g)<x, f', q)ooiqb', q)oo(q1', q)co(q]+a d;q)oo(q1+b d;q)<x(q1+c~d;q)<x when d + e=\+a+b + c, 0 < q < 1. (Agarwal) 25. Prove the theta relations A0.7.5), A0.7.6), and A0.7.7). 26. Prove the product formulas for the Jacobi elliptic functions in A0.7.13). 27. Prove that when и = 2Кх/п, ж 2л x-^ qn cos 2nx (a) dn и = 1— n=\
(b) 28. Show that for к Also prove that and Note that (- I en и ¦¦ Exercises 2n -^-\ qn+ll2 cosB« + l)x К ^ 1 +q2n~1 ж 2тг ^ q2n+l sinBn + l)x «=o /0,±l, sn(M, du —sn(M, к) = сп(м, А:)ёп(м, к), du sn2(u,k)+en2 (u,k) = 1 2 / 29. Use the ideas of Section 10.8 to prove that poo h _c_! (-ax; q)oo(-qb/x; q)oo d , (-x;q)oo(-q/x;q)oo (ab; q)oo(qc; q)oo(q^c; q)ooTC 549 (bqc\ q)oo(aq~c; q)oo(q; q) 30. Prove Theorem 10.9.2, that for max (|a,-|, |o|) < 1, i<;<,5 h(x, l)h(x, y/q)h(x, -l)h(x, -Jq)h(x, A) dx J-i Х\\Цх,щ) VI -x2 where A = i 31. Prove the formulas (where 0 < q < 1 and Re a > 0 and Re b > 0): -xqb;qHO(—qa+l/x;qHOdqx _ Yq{a)Yq(b) о (-x;q)oc(-x/q;q)oc x Vq(a+b)' -xqb; q)oo(—qa+l/x; q)oc dx logq Tq(a)Tq(b) (-x;q)oo(-q/x;q)x x \-q Tq(a+b)' (a) (b) / Jo
550 10 Introduction to g-Series (c) Extend the formula in (a) to 700 , dqX о (-x;q)oo(-qlx;q)oc _ (-qc; q)oo(-ql~c;q)oo Tq(a + c)Tq(b -. (-i;q)<x>(-q;q)oo rq(a+b) where Re(a + c) > 0 and Re(b — c) > 0 and 0 < q < 1. 32. Prove that when max(|ai |, \a2\, \щ\, \a4\) < 1, 33. Use Theorem 10.12.1 to prove the following results: (a) If 0 < s < min(a, fc), then Jo s_i P, , л, Г(с) r{s)T(a-s)T(b-s) Г(а)Гф) T(c-s) (b) IfO< j < l.then 00 7Г Sini'TT 34. Show the formal equivalence of the formulas f°° -i 2 л- (a) / x @@) — хф(\) + x d>(z) — ¦ ¦ -}dx = — Jo /•oo v v.2 (b) / xs- Jo 35. Suppose s > 0, and «=o converges for all x. Show the equivalence of Newton's difference formula s X(-s) = X@) Ak@) + where AX(n) = X(n) — X(n + 1), to formula (b) in the previous problem. 36. Suppose a > 0, b > 0, and iy(xey. T^o nl Use formula (b) of Exercise 34 to prove that X(n) = a(a + nb)"~l. The results in Exercises 34-36 are due to Ramanujan. See Hardy [1940, Chap- Chapter 11].
Exercises 551 37. Use Exercise 5 and the q -binomial to prove that ^ 2, q2)n t, 38. (a) Use Heine's transformation in Theorem 10.9.1 to show that fb2,b2/c:q2,cq/b2' 201 (b) Take b = q " and let с -> оо to obtain « /„2. „2\ „m ^0(g2;g2Ug2;g2)«-™ v ^'" The last identity was obtained by Gauss [1808, §9] by a different method. 39. From Exercise 38(b) and the ^-binomial theorem derive the following identi- identities: 2 2 = {-aq \q )oo 40. Show that the recurrence relation A0.11.3) follows from the generating func- function A0.11.2), and conversely. 41. (a) Prove the linearization formula A0.11.17) by first observing that for x=cos#, n^hmCn fW Hk(x\q)Hm(x\q)Hn(x\q)\(e2ie;q)\2de / 27Г (q; q)oo(ab; q)oo(ac; q)oo(bc; q)c 2n ^ ar+sbr+'cs+t (g; g)oo rf^0 (g; g)r(g; g)s(g; g)» (b) Complete the evaluation of the integral in A0.11.19). 42. Prove the formula A0.11.18). 43. Take x = e — nim/ln in Theorem 10.12.6 and let e -> 0 to show that 1 2n — 1 ^ . . m —1 1 nik2m/2n _ l ~T ' V^ ~2nik2n/m
552 10 Introduction to ^-Series Deduce that for odd q, 1=0 ' 44. Consider a more general divided difference operator ~y\(x) If D takes polynomials of degree n to polynomials of degree n—l, show that 3>i (x) and уг(х) satisfy a quadratic equation Ay1 + Bxy + Cx2 + Dy + Ex + F = 0, and show that two solutions of this equation can be used to define D so that it takes polynomials of degree n to polynomials of degree n — l. Specific choices of уг(х) and y\ (x) and limits of them give the operators used in this book, and the operators used in this book are essentially the standard forms to which y\{x) and уг(х) can be reduced. See Magnus [1988]. 45. Show that 46. If / Jo and / is smooth then 00 / (cos в) = У^ at cos кв. к=\ If /(cos в) is a polynomial in cos в or the coefficients ak decrease sufficiently rapidly, then Define Iqgix) = f(x). Show that 1 Г Iqg(x)=— g(cos<p)K(e +<p) sin <pd<p, Л J-n where See Brown and Ismail [1995].
11 Partitions The theory of partitions is a subject that, on the one hand, fits naturally into the subject of ^-series, and on the other, is highly combinatorial in its methods. This provides for a variety of treatments of this subject. P. A. MacMahon, one of the pioneers in the study of partitions, titled his seminal two-volume work, Combi- natory Analysis. It is clear that he saw a major role for analysis in the study of partitions. We shall follow his lead and examine partitions by means of the an- analytical technique he developed: partition analysis. This method is used to find the generating functions of various kinds of interesting partition functions. Exam- Examples of a few other ways of developing the theory of partitions will be given in passing. 11.1 Background on Partitions The theory of partitions concerns representing integers as sums of positive integers. Thus there are five partitions of4, namely 4, 3 + 1,2+2, 2+1 + 1, and 1 + 1 + 1 + 1. Note that the order of the summands (or parts) is not considered: 1 + 2 + 1 is the same partition of 4 as 2 + 1 + 1. One object of study is p(n), the number of partitions of n. Other examples of interesting partition functions are pm{n), the numbers of partitions of n into < m parts and p$\n), the number of partitions of n into distinct parts. Thus pD) = 5, ргD) = 3, and p^D) = 2. An explanation for the last notation is given below. The theory of partitions dates back to Euler. The generating function of a given partition function has turned out to be one of the most fundamental objects in the study of partitions. Euler's basic observation lay in the introduction of the geometric series in treating generating functions. Suppose A is some set of positive integers. A partition of n into elements of A is a representation of n as a sum of elements of A (where the order of summands can be disregarded.) Thus n = a\ + п2 + • • • + ar, 553
554 11 Partitions a, e A, and to make the representation unique we may require ai > a^ > ¦ ¦ ¦ > ar. Let Pa (n) denote the number of partitions ofn into elements of A. The generating function for pA (и) is given by 00 ?) PA(n)q" = JJA +qa+q2a+q3a + -- •)• A1-1.1) n=0 aeA The equality in A1.1.1) becomes clear once we multiply the terms together and see that the general exponent on q that arises is /lfll + /2«2 H h fjuj Л . This last expression is an arbitrary partition into elements of A, where a\ appears /i times, аг appears fi times, and so on. Consequently, n=0 aeA If we were to require that each part appear < .? times and to define p^ (n) to be the number of these partitions of n, then we would see as before n=0 aeA aeA Note that when s = I, p^ (n) is the number of partitions of n into distinct elements of A and n=0 aeA aeA These observations on generating functions allow us to prove one of Euler's striking, albeit elementary, theorems. Theorem 11.1.1 The number of partitions ofn into elements of в (the set of odd numbers) equals the number of partitions ofn into distinct parts (i.e., the parts taken from N, the set of all positive integers.) More succinctly,
11.2 Partition Analysis 555 Proof. This is now easily seen because 1 л=1 Ч i=0 Here A1.1.2) was used in the first step and A1.1.4) in the last step. The theorem is proved. ¦ At this point the reader may try to prove the following result by the method of Theorem 11.1.1: The number of partitions ofn into summands not divisible by 3 is equal to the number of partitions ofn where no summand occurs more than twice. One of the difficulties encountered in studying the theory of partitions lies in the fact that each new result seems to require some new trick. Although this fact may seem charming to insiders it is somewhat discouraging for noncombinatorial outsiders. In this chapter we hope to present a systematic derivation of a variety of elementary results. Our focus will be MacMahon's partition analysis for obtaining generating functions for a number of interesting partition functions. 11.2 Partition Analysis To illustrate this method, we start with the following problem: What is a closed form for the generating function YlT=o Pm(n)qnr> Recall that pm(n) is the number of partitions ofn into < m parts. It is not difficult to see that we can write this generating function as a multidi- multidimensional sum. Thus 1=0 П1>П2>--->Пт>0 The requirement ni > П2 > ¦ ¦ ¦ > nm > 0 comes from the fact that order is disregarded in a partition. Thus each partition can be written uniquely as a sum of a nonincreasing sequence of numbers. MacMahon's idea is to introduce new variables Х\,Хг, ¦¦¦, A.m-i that handle the inequalities satisfied by n, while the nj themselves become free. Consider the sum E-B1+B2H Hm 1  ~ т. 12-13 I, 1m-l -1m q kx A2 •¦•Am_i 11,12,...,1m>0
556 11 Partitions If we select only terms in this sum with nonnegative exponents on the X, then the corresponding exponent will be a partition of n into < m parts. For example, when m = 2 and n = 4 the exponents that result are 4 + 0,3 + 1, and 2 + 2. The method of partition analysis applies a linear operator ?2> to such multiple Laurent series in Л],. •., A.m-i ¦ The operator annihilates terms with any negative exponents and in the remaining terms sets each X,¦ = 1. Hence oo Рт\П)Я — " /_^ У 1 Am-1 n=0 ~~ ni,...,nm>0 ~ «i>0 12>O 1m>0 = (l лоA The next step is to produce an algorithm to evaluate the effect of ?2>. To this end we state and prove the next lemma. Lemma 11.2.1 Proof. The left side equals x"ym- ~ n,m>0 n>m>0 Set к = n — m so that the last sum becomes *,m>0 *>0 m>0 U Д ^^ and the lemma is proved. ¦ Repeated application of Lemma 11.2.1 gives the closed form for the generating function of pm(n). Theorem 11.2.2 00 ^2pm(n)qn = 1/A - <?)A - Ч2) ¦ ¦ ¦ A - <Л- n=0 Proof. One application of Lemma 11.2.1 and A1.2.1) gives „=0
11.3 A Library for the Partition Analysis Algorithm 557 A second application gives 1 It is now clear that the result of the theorem follows after applying the lemma to each of Ai, X2,..., Am_i. The theorem is proved. ¦ Before developing partition analysis further, we consider a simple example to further illustrate the power of this method. We need an extension of Lemma 11.2.1, which will be useful for other purposes as well. Lemma 11.2.3 If a is a nonnegative integer, X-" x" ?2- The proof of Lemma 11.2.2 will work here as well. The reader may check this. The example is the following: Let A(n) denote the number of noncongruent triangles of perimeter n with positive integer sides. What is X^o Д(и)<7"? Suppose n\,n2,n3 are the sides of the triangle in decreasing order. We must have n2+щ > n i +1. MacMahon's partition analysis gives us the answer automatically: 00 П=0 П[уП2,Пз>0 ~ 4 4 A - qki Дз)A - ЧХ2Хг/Хх){\ - q k^ ~ A /X)(l - q2X2)(l - qX3/X2) Аз 3 Therefore, A(n) = the number of partitions of л into twos, threes, and fours with at least 1 three. 11.3 A Library for the Partition Analysis Algorithm The examples of the preceding section illuminate the technique of partition analy- analysis. From a variety of simple and simply proved evaluations of the operator ?2> (such as Lemma 11.2.3), it is possible to apply the operator to numerous rational functions in several A,. We list a few more results similar to Lemma 11.2.3. These and others were originally given by MacMahon.
558 11 Partitions Proposition 11.3.1 1 (a) > A-Лх)A-л A)(i-j2A) ¦¦•(!- 1 (b) Q7r l l~xyz - ky)(l - z/k) A - x)(l - j)(l - xz)(l - yz) 1 1 + xyz — x2yz — xy2z Proof. Each such result (and countless others) can be proved in several ways. The method of partial fractions will reduce most to applications of Lemma 11.2.3. The proofs of (a) and (b) are given below and (c) is left to the reader. Note that for j = 1, the result in (a) is just Lemma 11.2.1. Using induction, suppose the result is true up to j — 1. Observe that 1 1 ( У 1-х yj A - yj.i/Л)A - yj/k) yj-i -yj\l- yj-i/k 1 - yj/k Then the expression in (a) can be written as 1 J j-i ~ yj* Ld -^)d - y\A) • yj A - ЛХ)A - J, A) • • • A - yj- 1 -x)(l ~xyi)---(l - - xyx) •• 1 A - x)(l - хУ1) ¦ ¦ ¦ A - xyj) The proof of (b) is very similar. We have x =?A-xA)A-jA)A-zA) х-у1>\\-хХ l-ykj l-z/k x у (x - j)(l - x)(l - xz) (x - j)(l - j)(l - jz) 1 — xyz This proves the proposition.
11.4 Generating Functions 559 11.4 Generating Functions In this section we apply partition analysis to find the generating functions of some important partition functions. Theorem 11.4.1 Let Qm (n) denote the number of partitions ofn into exactly m distinct parts. Then 00 qm{m+\)/2 Proof. If n = n\ + n2 + ¦ ¦ ¦ + nm, then we require n\ > n2 + 1, n% > n3 + 1, ¦ ¦ ¦. nm > 1, because the parts are distinct and exactly m in number. So 00 (JmWq — S2 2^ ^ Л1 Л2 •¦•Лш-1 Лш п=0 ~ п\,...,пт>0 • • • A - X Apply Lemma 11.2.3 to Ль А.2,..., Я„ with a = 1 and obtain q ¦ q-q2---qm which proves the theorem. ¦ In exactly the same way, we may consider Q^'e) (n), the number of partitions of n into m parts where each part differs from the next by at least к and the smallest part is >?. The closed form for the generating function is given in the next theorem. Theorem 11.4.2 alm+km(m-l)/2 Proof Reasoning as before, we can see that \-k\-k Л1 Л2 и>0 ni,.-.,nm>0
560 11 Partitions The only change is that in the first m — 1 applications of ?2>, a —k, and in the final (trivial) application, a — t, у = 0 in Lemma 11.2.3. This proves the theorem. ¦ We can also introduce further variables that keep track of other facts about the partitions. For example, we have the following theorem. Theorem 11.4.3 Suppose pm(j, n) (respectively Qm(j, n)) denotes the number of partitions ofn into <m parts (respectively exactly m distinct parts) with largest part j. Then Y^ A»0\ «)ZV = 1/d - Zq)d - zq1) • ¦ • A - zqm) j,n>0 and J2 Q*V- »)zV = Zmqm(m+i)/2/d - zq)(l - zq2) ¦ ¦ ¦ A - zqm). j,n>0 Proof. From the definition of pm(j, n) it is clear that Pm\J,n)Z q — Ы 2_^ Z Я Al A2 A j,n>0 ~ ii),...,nm>0 Пт-,-Пт m-1 1 - zq)(l - Zq2X2)(l- 1 1 For the other generating function, the argument is similar. Thus, j,n>0 = П ~ ni,...,nm>0 К Л2 A2 ' ' 'Am-1 zm m(m+l)/2 The theorem is proved. ¦ In what follows we use the notation [zJ] YlT=oanZn — aj, that is, the operator [V] applied to a power series gives the coefficient of z' ¦ An observation needed
11.4 Generating Functions 561 in the proof of the next theorem is that N N oo Л=0 Л=0 л=0 = [zN][(\ + z + z2 + ¦ ¦ -)(ao + an + a2z2 + •••)] [N]^° " A1.4.1) L M ', [z] ¦ 1 -z Theorem 11.4.4 Suppose p(N, M, n) (respectively Q(N, M, n)) denotes the number of partitions ofn into <M parts (respectively exactly M parts), each <N. Then and л=0 Here \nm ]q is the q-binomial coefficient defined by A0.0.5) Proof. By Theorem 11.4.3 and the observation A1.4.1), oo N ^T p(N, M, n)qn = ^[z*] Y n=0 h=Q j,n>0 N Note that Corollary 10.2.2(d) was used to get the second-to-last equation. For the second part, we have oo N У^ Q(N, M, n)qn = y\zh] 2_. QmU> n)zJ'<?" n=0 h-0 j,n>0 = \7N~[7MaM{M+Y)l2 Y^ I ^ ~*~ ^\ zk = OM<M+1)/2 I ^ f^0 [ м \q [m This proves the theorem. ¦
562 11 Partitions Corollary 11.4.5 Q(N> M> n)zMqn = A + Zq) ¦ ¦ • A + ZqN). n,M>0 Proof. By Theorem 11.4.4 and Corollary 10.2.2(c) п,М>0 М>0 \ л>0 (-D/2 | М and the corollary is proved. ¦ The next theorem gives limiting cases and other consequences of some previous results. Theorem 11.4.6 Let p{n) denote the total number of partitions of n and let p(m, n) denote the number of partitions ofn into exactly m parts. Then 00 (a) Y n=0 n,m>0 (c) Y, m,n>0 (e) Proof. 00 ? n=0 n,m>0 n,m>0 By Theorem 11.2.2, 00 p(n)qn = lim Y^Pm(n)t m—юо ^—^ n=0 m=0 00 m=0 j" = lim 1/[A — m—>oo = \/{q\ q)oo- -<Л]
11.5 Some Results on Partitions 563 To prove (b), first observe that p(m,ri) = pm(n) — pm_i(n). Then ? P(m, n)zmq" = J2 z>n Hfo»(«) - Prn-x (n))qn n,m>0 m>0 n>0 1 1 = у z (q;q)m-i z>nim where the last step follows from Corollary 10.10.4. Formula (c) is obtained from Corollary 11.4.5 by letting ./V —> oo. To derive (d), observe that, by Theorem 11.4.2, ? Q%l)(n)zmq" = E^ E Q^ n,n>0 m>0 n>0 The final formula is obtained in a similar way. The theorem is proved. ¦ The series on the right side of (d) and (e) occur in the Rogers-Ramanujan formulas, which will be stated and proved in the next chapter. 11.5 Some Results on Partitions In Section 11.1, we showed that the number of partitions of n into odd parts equals the number of partitions into distinct parts by showing that their generating functions are equal. This is a very powerful way of obtaining results on partitions. In this section, we give some applications of the theorems derived in the previous section. For example, a simple consequence of Theorem 11.4.4 is the next result. Theorem 11.5.1 The number of partitions ofn into <M parts each <N equals the number of partitions ofn into <N parts each <M, that is, p{N,M,n) =p(M,N,n). Proof. By Theorem 11.4.4, the generating function for p(N, M, n) is ~N + M~ M which is clearly the generating function for p(M, N, n) as well. This proves the theorem. ¦
564 11 Partitions An immediate consequence of this theorem is the following: Corollary 11.5.2 The number of partitions ofn into <m parts equals the number of partitions ofn in which each part <m. A direct proof of this corollary can also be given by showing that the generating function of b(m, n), the number of partitions where the parts are <m, is also i/(q\q)m- Let Ik denote the number of times к occurs in some partition of n. Then л=о и em>o In, 1 (l-q)a-q2)---(l-qm)' Here Ik is the number of times к occurs in some partition ofn. The results of the previous section can also be used to give a different deriva- derivation of some identities obtained in Chapter 10 from the ^-binomial theorem. For example, it is intuitively clear that A + zq + Z2qi+1 + ¦ ¦ 0A + zq2 + Z2q2+2 + ¦ ¦ -)A + zq3 + Z2q3+3 + ¦••)¦•¦ p(m,n)zmq", m,«>0 where p(m, n) is the number of partitions ofn with exactly m parts. However, by partition analysis (Theorem 11.2.2) (Pm(n) ~ pm-l(n))q" n>0 «>0 1 1 qm (q;q)m (q\q)m-i (q;q)n Thus 1 _ ^ g^z* (zq;q)oo ^ (q\q)m or V z l ^ (q\q)oo (z;q)x' a result of Euler (Corollary 10.2.2).
11.6 Graphical Methods 565 In a similar way, (~zq; <7)oc = A + zq)d + zq2)(l + zq3) ¦ ¦ ¦ „m(m+l)/2 m m,n>0 m=0 ^>^m where the last equality follows from partition analysis (Theorem 11.4.1). This implies the other result of Euler in Corollary 10.2.2, namely We end this section with the statement of the Rogers-Ramanujan identities and their partition theoretic interpretation. A more complete discussion of these identities and their proofs is given in the next chapter. The identities are m=oKq,q)m n=0 and >. A1.5.2) m=0 4"J' ~2"" n=0 The right-hand side of A1.5.1) is obviously the generating function for the number of partitions in which the parts are identical to 1 or 4 mod 5. The left-hand side, by Theorem 11.4.6(d), is the number of partitions where the parts differ by at least 2. A similar interpretation holds for A1.5.2). Theorem 11.5.3 (a) The number of partitions ofn in which the difference between any two parts is at least 2 equals the number of partitions ofn into parts = 1 or 4 (mod 5). (b) The number of partitions ofn in which the least part > 2 and the difference between any two parts is at least 2 equals the number of partitions ofn into parts = 2 or 3 (mod 5). This interpretation of A1.5.1) and A1.5.2) is due to MacMahon [1917-1918]. 11.6 Graphical Methods A powerful way of studying partitions is by representing them graphically. This method was discovered by N. M. Ferrers in the 1850s. It has since been used exten- extensively in partition theory, unlike MacMahon's partition analysis, which has begun to gain some prominence only recently. This may partly be due to MacMahon's admission that partition analysis failed to give significant results in plane partitions
566 11 Partitions Figure 11.1 for which he had initially developed it. As a contrast, Sylvester, who published Fer- rers's graphical method, gave this method much positive publicity. In a short paper titled, "Note on the graphical method in partitions," published in 1883, Sylvester wrote, The discovery of this process is due to Dr. Ferrers, who informs me that he himself never published it but left it to me to do so in his name in the London and Edinburgh Philosophical Magazine for 1853.1 may mention that I have never missed an opportunity of expressing my sense of the great importance of the discovery and bringing it under the notice of my pupils Ferrers's graphical representation of a partition is a collection of lattice points where each row of points (or dots) corresponds to a part of the partition. For example, the graphical representation of 5 + 2 + 2+1 is shown in Figure 11.1. The conjugate of this partition is another partition obtained by reading the columns of the above partition. In this case the conjugate is 4 + 3 + 1 + 1 + 1. According to Sylvester, the result that Ferrers proved by this method is the following: The number of partitions of n into exactly m parts equals the number of partitions of и with maximum part m. This is intuitively obvious from Ferrers's graph. Each partition with exactly m parts has a conjugate with largest part m and conversely. In a footnote Sylvester [1853] makes the following interesting remark: "I learn from Mr. Ferrers that this theorem was brought under his cognizance through a Cambridge examination paper set by Mr. Adams of Neptune notability." The reader should check that Theorem 11.5.1 and its corollary also follow immediately from an application of Ferrers's method. In this section, we consider a few applications of this method to see some of its scope and power. Consider the result contained in Theorem 11.4.1: where Qm(n) is the number of partitions of n into exactly m distinct parts. We have seen that the generating function for the number of partitions with <m parts is I/O?; <?)m- To understand the factor m(m +1)/2, take the partition 7 + 6+4 + 2+1 where m = 5. Graphically this is depicted in Figure 11.2. There are |E)F) dots
11.6 Graphical Methods 567 Figure 11.2 Figure 11.3 inside the triangle. The remaining nodes form the partition of some number with <5 parts. Now consider Euler's identity contained in Corollary 10.10.4: -qJ(l-q2J-"(l-qnJ n=\ For each partition n we find the largest square of points (starting at the upper left-hand corner) contained in the Ferrers graph. Such a square is called a Durfee square, named after a student of Sylvester who used this idea. Suppose n is given by 6+4+4+2+1 + 1; then its Ferrers graph and the 3 x 3 Durfee square are shown in Figure 11.3. In general each partition n of m has a Durfee square of side и, for some n, and we can write n = n2 + 7t\ + тхг, where п\ is the partition made by the points below the square and m is the conjugate partition of the points to the right of the square. Since the partitions with parts <n are generated by l/(q; q)n, it follows that the set of all partitions with Durfee square of side n is generated by 1 1 4 (q;q)n (q-q)n A - qJ(l - q2J ¦ ¦ • A - q»J " Now the generating function for p(n) is \/{q\ q)oo, and thus E n=0 1 (q;q)n(q\q)n n=\
568 Figure 11.4 As an exercise the reader should prove the following identity: 00 7ПП У Li = f^0 (q; q)n(zq; q)n using the Durfee square. As a final example, we take another look at the triple product identity, 1 (q;q)c к" = (-xq;q)oo(-x l;q)oo. We have seen that the real difficulty in some proofs is to show that the term independent of x on the left-hand side is in fact I/O?; <?)oo- One way to show this fairly easily is to use the Frobenius symbol for a partition. To describe this idea, consider the partition 6 + 4 + 3 + 3+1 whose Ferrers graph consists of the points in the graph shown in figure 11.4. Associate the points as indicated in the Figure and read off the Frobenius symbol for n: '5 2 04 2 The top row represents the horizontal lines to the right of the diagonal and the bottom row the vertical lines below the diagonal. Clearly the sum of the numbers in the Frobenius symbol plus the number of columns gives the number being par- partitioned. More generally, every partition n of n can be represented by a Frobenius symbol a\ a2... ar b2...br where a\ > a2 > ¦ ¦ ¦ > ar > 0, b\ > ¦ ¦ Now let us prove that the constant term in > br > 0, and n = r + xqn) f[A + дт~1/х) Y,bt. n=\ m=\
11.7 Congruence Properties of Partitions 569 is l/(q;q)oo, which will imply the result we want. Observe that a contribution to the constant term is obtained whenever r terms of the xqn are multiplied with r terms of the form qm~1/x, that is, _oi+O2+-+ar _ „Ь1+Ь2+-+Ьг where the as are positive and distinct and the bs, are nonnegative and distinct. This is a partition represented by the Frobenius symbol i — 1 u2 — 1 ... ar — b\ b2 ... br Hence the constant term is the generating function for p(n), which is l/(o; q)oo- The result is proved. 11.7 Congruence Properties of Partitions Congraence properties of p(n), the number of partitions of и, were first discovered by Ramanujan on studying the table of values of p(n) constructed by MacMahon from n = 1 to 200. Ramanujan gave simple proofs of the theorems pEn+4) =0(mod5), A1.7.1) p(ln + 5) = 0(mod7). A1.7.2) He also found expressions for the generating functions of pEn +4) and p(ln + 5) as products (or a sum of two products). These are given by the formulas below: pD) + p(9)q + pA4)q2 + • ¦ ¦ = 5 - q)(\ - дг)(\ - q^) ¦ ¦-f ' A1.7.3) /?E) + pA2)q + p(\9)q2 + • • • = 7' + 49q- {A - q)(l - q2)(l ~ q3) ¦ ¦ ¦}* ' A1.7.4) Ramanujan [1927, paper 25] sketched a proof of A1.7.3) promising to give details in a later paper. A year later he died and the promised paper never appeared. How- However, he gave the necessary details in an unpublished manuscript. See Ramanujan [1988, p. 238]. We reproduce this proof here. It has also appeared in an interesting unpublished book manuscript by Thiravenkatachar and Venkatachaliengar. Observe that the congruences A1.7.1) and A1.7.2) follow immediately from A1.7.3) and A1.7.4) respectively. These generating functions can also be used to
570 11 Partitions prove congruences modulo 52 and 72. First observe that for any prime p, A -q)p = \-qp(mod p) or ^sHmodp). A1.7.5) Thus A1.7.3) and A1.7.5) with p = 5 imply that pD)q + p(9)q2 + • ¦ ¦ q A - q5)(l - q10) ¦ ¦ ¦ _^10)...}4 A (mod 5). A1.7.6) (\-q)(\-q2)--- Since the coefficient of q5m is divisible by 5. The coefficient of q5m on the left of A1.7.6) is pB5m — 1) and hence pB5m -l) = 0(mod25). A1.7.7) Similarly A1.7.4) implies pD9m - 2) = 0(mod49). A1.7.8) To prove A1.7.3), start with Euler's pentagonal number theorem (Corollary 10.4.2(c)): Partition the series into five parts according to n sO,±l, ±2(mod 5). For example, the subseries of terms with n = 5m — 1 is 00 /_1уПдEт-1)A5т-2)/10 _ _ 1/5 V^ t-\\m„5-mCm-l)/2 =—oo m=—oo 00
11.7 Congruence Properties of Partitions 571 Thus 00 Па-* n=\ «/5) 00 m=—oo _\yn mEm+l)/2 _|_ 00 ^ (—I) q m=-oo 00 E m=—oo i-2)E»j+ / D/2 ^m Cm-l)Em-2)/2 00 _ V^ (_l)m^m(l5m+7)/2 m~—oo Divide by the infinite product Щ1] A - q5n) to get л=1 where f and |i are power series in q. Our claim is that %%\ = 1. We recall that by Corollary 10.4.2(e) and A1.7.9), Since the power of q, given by n(n +1), is either 0, 2, or 6 (mod 10), it follows that no power of q is of the form 2/5 + an integer. So the term 3^2/5^ — 3fj2f q2/5 = 3<?2/5fi(l - f?i) on the right side of A1.7.10) must be zero. This implies that §i =§~1. Therefore, i =: 1 The denominator on the right-hand side is Consider the expression AT1 — A. — 1, where к — %qllbco, and со is a fifth root of unity. Observe that if A. — A. = 1; then a simple calculation shows that A.-5+A.5 = 11. Thus Г5 - И? -*V = П^"' -?1/V -^2"<o2k), (П.7.12) where со = е2я'/5. It is now easy to check by long division that A1.7.11) can be
572 11 Partitions written as 1 - q5" Пт? A1.7.13) (In fact, Ramanujan [1927, paper 25] starts the sketch of his proof of A1.7.3) by observing that A1.7.13) can be shown to be true.) Now multiply across by ql/5 and replace g1/5 withql/5e2jnk/5, where A: = 1,2, 3,4. Add the five identities after using the fact that n=\ n=\ to obtain = —; T-i- (П.7.14) t — 1 \Q — Q Ё Replace q with qe2ltik, where к = ±1, ±2 in A1.7.11) and multiply the five equations. Note that By A1.7.12), we now get Combine this with A1.7.14) to get PEn + 4),- = 5{ About this identity, Hardy wrote that if he were to select one formula from Ra- Ramanujan 's work for supreme beauty, he would agree with MacMahon in selecting this one. Remark 11.7.1 Ramanujan remarked that й=оA-^+1)A_^5„+4)-
Exercises 573 Note the connection of this function with the Rogers-Ramanujan identities, which Ramanujan used to express f-1 as a continued fraction. Formula A1.7.15) can be proved by first observing that A1.7.11) and the pentagonal number theorem, stated immediately after A1.7.8), imply that 00 1 П/1 -<?5") = S (_i)Y»/io>C»+i)p n=l n=0,3 (mod 5) Now apply the quintuple product identity (Exercise 10.14) to the last sum and rearrange the resulting product to get A1.7.15). The continued fraction for i-~l can be found in Hardy [1940, p. 99]. Remark 11.7.2 Several proofs of A1.7.3) and A1.7.4) have been given. For a relatively simple one, which is similar to Ramanujan's proof in some respects, see Kolberg [1957]. For a proof that uses the machinery of modular functions, see Knopp [1971, §8.3]. 1. Verify the (a) (c) (d) (e) ft (f) О following п > > о >A-А formulas: 1 1 Exercises -у л2) A-А2х)A-уД) 1 A-A.xXl 1 1 1 -Ху)A- 1 — хуш 1 1 + 1 -у/А/) A-х)A 1 +хуA -у' V/X) 1 - Z/X) Z/X) A 1 xy -xy2)' -x'y)' 1 + xy + xz + xyz 1+xz )A -Xz)(l -со/Х) — xzco — yzco + xyzco -xy2)(l-xz2)' - хуг - хуг2 y)(l-yz)(l-xz2)' + xyzco2 - yu>){\ - zco) 2. Prove that the number of partitions of n into parts not divisible by 3 equals the number of partitions of n where no part occurs more than twice. 3. Show that the number of partitions of n in which only odd parts may be repeated equals the number of partitions of n in which no part appears more than three times.
574 11 Partitions 4. Generalize Exercise 3 by showing that the number of partitions of и in which only parts ^ 0(mod 2m) may be repeated equals the number of partitions of и in which no part appears more than 2m+1 — 1 times. 5. Prove that the number of partitions of n in which each part appears 2,3, or 5 times equals the number of partitions of n into parts congruent to 2, 3, 6, 9, or 10 modulo 12. (This result is due to Subbarao; see Andrews [1976, p. 15].) The next three exercises are from Ramanujan [1927, paper 25]. 6. Prove that pEm + 4) = 0(mod 5) as follows: (a) Show that -. OO OO _ _ ?=—oom=—oo (b) Show that if the exponent 1 + ?(? + l)/2 + т(Ът + 1)/2 is a multiple of 5, then the coefficient 2? + 1 is also a multiple of 5. (c) Show that ^^ = ^( (d) Use (c) to observe that П°° (\ 9 n=l (e) Deduce that the coefficient of ^5m+5 in f(q) is a multiple of 5. (f) Conclude that pEm + 4), the coefficient of q5m+5 in 9/П^=1 C1 - #").is a multiple of 5. 7. Show that pGm + 5) = 0 (mod 7). Use the identity n=l (=—oom=—oo and the method employed in Exercise 6. 8. Use A1.7.4) to prove that pD9m - 2) = 0(mod 49). 9. Prove A1.7.4). 10. Show that the number of partitions of и that are self-conjugates, that is, iden- identical with their conjugates, equals the number of partitions of n with distinct odd parts. 11. Let Mi (n) denote the number of partitions of n into parts, each greater than 1, such that consecutive integers do not appear as parts. Let M2(n) denote the
Exercises 575 number of partitions of n in which no part appears exactly once. Show that The result in Exercise 10 is due to Sylvester and that in Exercise 11 to MacMahon. See Andrews [1976, p. 14]. 12. Use Euler's pentagonal number theorem, namely n~l m=oo to show that "^ ifn = mCm±l)/2, •- 0 otherwise. Here Ре\п) (respectively Р^1)(п)) is the number of partitions of n into an even (respectively odd) number of distinct parts. 13. Prove the following relation that gives an efficient algorithm for computing p(n): p(n) - p(n - 1) - p(n - 2) + p(n - 5) + pin — 7) + ¦ • • + (— \)m(p(n — mCm — l)/2) + (-l)mp(n - mCm + l)/2) H =0, where p(M) = 0 when M is negative. [Hint: (q; q)oo, Л = 1-1 14. Use the Durfee square discussed in Section 11.6 to prove mat OO n П2 — = (xq; q)^. n (q\ q)n{xq; q)n 15. Show that ~ nyn n2 °o X1n n 2n2 <» X2n+1 n+1 (n+l)Bn+l) > — = > V +> — • ^ (xq;q)n(yq;q)n ^0 {xq; q)n(yq; q)^ ^ (xq; q)n(yq; qJn+\ For the left-hand side, use the idea of Exercise 14 to show that the coefficient of ymxrqn is the number of partitions of n into m parts with the largest part equal to r. Do the same for the right side, where instead of a Durfee square consider the largest rectangles that can be of size n x 2n or (n + 1) x Bn +1). Note that the largest rectangles of these dimensions cover all the possibilities. 16. Use the fact that n2 = 1 + 3 + 5 -I 1- 2n - 1 to see that ?^0 q/(q; q)n is the generating function for the number of partitions in which the difference between the parts is at least 2. The next six results are the partition theoretic interpretations of the six identities of Rogers given in Exercise 12.6. Prove them. For references see Andrews and Askey [1977]. 17. The number of partitions b\ + b2 + ¦ ¦ •+br of n where b\ > b2 > &з > • • • and each bi is odd or = ±4 (mod 20) equals the number of partitions c\+c2-\ of и where c\ > c2 > c3 > c4 > c% > • • •.
576 11 Partitions Hint: Note that ra2 = 0+l + l+2H \-(n-l)+n and that l/(q; qJn is the generating function for partitions in which there are at most 2n parts. (Gordon) 18. The number of partitions i>i + b2 + ¦ ¦ ¦ of и where b\ > bt > ?>з... and each bi is odd or = ±8 (mod 20) equals the number of partitions c\ + c2 + ¦ ¦ ¦ + c2k+\ of n into an odd number of parts where c\ > c2 > cj, > с*, > cs > с в > cj > Hint: n2 + 2n-l + l+2 + 2-\ <r (n - 1) + (n - I)+n +n + n. (Connor) 19. The number of partitions b\ + b2 + ¦ ¦ ¦ + br of n where b\ > b2 > b-i > • ¦ • and each fe,- is ^ ±1, ±8, ±9, 10 (mod 20) equals the number of partitions c\ + c2 + ¦ ¦ ¦ + C2k of n into an even number of parts where c\ > c2 > cj, > Q > C5 > • • ¦. (Connor) 20. The number of partitions b\ + b2 + ¦ ¦ ¦ + br of n where b\ > b2 > Ъг > • ¦ • and each b, ^ ±3, ±4, ±7,10(mod20) equals the number of partitions c\ + c2 + ¦ ¦ ¦ + q of n where c\ > c2 > cj, > c,\ > c$ > eg > • • •. (Connor) 21. The number of partitions of и with distinct parts and with each even part larger than twice the number of odd parts equals the number of partitions of n into parts = 1 or 4 (mod 5). Hint: The left-hand side of Exericise 12.6 (e) can be written as „1+3+5+-+2л-1 q 22. The number of partitions of n into distinct parts each larger than 1 in which each even part is larger than twice the number of odd parts equals the number of partitions of n into parts = 2 or 3 (mod 5). Hint: 3 + 5 + ¦ • • + Bи + 1) = n2 + 2n.
12 Bailey Chains L. J. Rogers is the pioneer of the work leading to the Rogers-Ramanujan identities and beyond. His idea, published in Rogers [1917], provides the starting point for the work of this chapter. We shall recount his seminal idea in Section 12.1. In the 1940s, W. N. Bailey began a systematic study of identities of the Rogers-Ramanujan type. See Bailey [1949]. He saw great generality in the methods introduced by Rogers. This greater level of generality provides for a wide variety of applications well beyond those considered by Rogers. Motivation for the techniques presented here is scant. As you will see, Rogers's original idea seems almost magical in its construction. Since the advent of computer algebra we can better see how to make sense of Rogers's fortuitous discoveries. However, it is still not evident why one would initially expect that this method would bear fruit. A systematic account of Bailey's ideas leading up to Bailey's lemma is given in Section 12.2. As an application of these ideas, the important 8</>7 transforma- transformation formula of Watson is derived in the next section. A few consequences of this formula are also included. The last section makes passing mention of other applications of the ideas in Section 12.2. 12.1 Rogers's Second Proof of the Rogers-Ramanujan Identities The Rogers-Ramanujan identities were first discovered by Rogers [1894]. Rogers made considerable contributions to several areas of mathematics but surprisingly his work went largely unnoticed and did not have the influence it should have had. Part of the surprise comes from the fact that Rogers's early work in invariant theory was noticed by Sylvester and given a prominent place in his "Lectures on the The- Theory of Reciprocants" [1886]. The long neglect of invariant theory after Hilbert's discoveries may have played a role here. Rogers did not receive credit for his dis- discovery of the Holder inequality, and his papers containing the Rogers-Ramanujan identities, among the most beautiful formulas in mathematics, went unheralded. 577
578 12 Bailey Chains These identities were later rediscovered by Ramanujan. Ramanujan's first letter to Hardy in 1913 contains some continued-fraction formulas that are consequences of these identities. Ramanujan did not have a proof of the identities and he posed them as a problem in 1914 in the Journal of the Indian Mathematical Society. MacMahon stated them without proof in the second volume of his Combinatory Analysis. He also noted the connection with partitions. What happened next is best described in the words of Hardy [1940, p. 91]: The mystery was solved, trebly, in 1917. In that year Ramanujan, looking through old volumes of the Proceedings of the London Mathematical Society, came accidentally across Rogers's paper. I can remember very well his surprise, and the admiration he expressed for Rogers's work. A correspondence followed in the course of which Rogers was led to a considerable simplification of the original proof. About the same time I. Schur, who was then cut off from England by war, rediscovered the identities again. Schur published two proofs, one of which is "combinatorial" and quite unlike any other proof known. In this section we discuss Rogers's second proof published in Rogers [1917]. As noted before, the Rogers-Ramanujan identities are CO „2 ? _*!_ = ___L__ = —'- °° ' ч °°—- A2.1.1) and y> <?"(n+1) i (<?; <?5)co(<?4; 2 ¦ \ ¦ ¦ ) n(q\q)n (q2;q5)oo(q3;q5)<x> (q\q)<x> The products on the right side can be transformed by the triple product identity (Theorem 10.4.1), (x; q) ]? k=-oo Replace q with q5 and x with q2 and then with q to get k=-oo and (q; q5)oo(q4; q5)oo(q5; q5)oo = ? { ?=-00
12.1 Rogers's Second Proof of the Rogers-Ramanujan Identities 579 Thus it is sufficient to prove that and ' - (n- n\ S<*«>« A2.1.3) A2.1.4) Rogers's second proof of the Rogers-Ramanujan identities depends on the fol- following lemma. Lemma 12.1.1 Let 1 (_1)t k=-e ~k k=-i (q)e+i-k(q)e+k Then S2e = S2e+i — ——, where (q)k = (q; q)k. (q)e Proof. The proof consists in rearranging the terms in the sums. In S^, combine the terms corresponding to the indices — к and к + 1. (Note that when к = I, the term corresponding to I + 1 is 0.) We get (q)t+k+iiq)e-k~\ 1- It is readily seen that this is also the expression obtained in combining the terms corresponding to indices — к and к + 1 in the sum S^+i- Thus •%< = SWi- We now prove that S2(+i = (I - qt+l)S2e+2, A2.1.5) which together with S2i = S2i+\ will prove the lemma by induction. Consider the sum of the terms in ^2^+1 corresponding to +k and —k. For к ф О, 1 + = A - ql
580 12 Bailey Chains This expression is A — qe+]) times the sum of the terms in S2e+2 corresponding to the indices ±k, when к ф 0. When к = 0, the corresponding terms in S21+1 and S2t+2 are {q)t+\iq)i+\ This proves A2.1.5) and the lemma. and The idea of Rogers's second proof of A2.1.1) and A2.1.2) is to expand the function (-^/qe'e; q)oo(—->/q~e~'e\ #)co as a Fourier series in two different ways. One way is to use the triple product identity and the other is to apply the q- binomial identity to each of the products. Since the Fourier expansion is unique, the corresponding Fourier coefficients in the two expansions are identical. Hence the exponentials е'пв can be replaced in the two expansions by any numbers as long as the series are convergent. Rogers shows that the Rogers-Ramanujan identities are obtained if this replacement is done appropriately. The details follow. By Euler's result in Corollary 10.2.2 (b), we have and So the product is 7[n(n+l)+m(m+l)]/2 m+n J(n-m)l {-xewq;qHO{-xe-ieq;q)O0= ]T 2 _eiBn-p)e Break up the last sum into two parts for p even (p = 21) and p odd (p = 2? + 1) and set n = I + к. Then f „к2-к „(lk~\)ie q
12.1 Rogers's Second Proof of the Rogers-Ramanujan Identities 581 When qx = q"l/2, this relation becomes „к1 „2Ш °° e+l „k2-koBk-l)ie + /я yV(<+1) V ^ (q; q)e+k(q;q)i+i-k qk2e2kW V е=к (<?; ?)*+*(?; ?)*-* This gives one Fourier expansion. To get the other, start with the triple product identity OO (q, q)oo(x\ q)oo(q/x\ q)oo = Yl (-1)*?*(*-1)/2х* k=—00 and replace x with —^/qe'e to get k2/2 Jk6 v^oo 2k2 2ki6 (9; 9H0 (q;q)°c A2.1.7) Thus the right sides of A2.1.6) and A2.1.7) are equal to each other. Now replace e2kw with (-1)V№~1)/2 andeB*-1)ie withO. Apply Lemma 12.1.1 to get A2.1.3), which is equivalent to the first Rogers-Ramanujan identity. The second identity is obtained by taking e2kw = 0 and e^'^6 = (-l/^+D/2 and applying Lemma 12.1.1. Remark 12.1.1 Rogers's argument provides oneofthesimplestproofsofA2.1.3) and A2.1.4). It appears to depend on formulas from the theory of theta functions. These formulas, however, are used only for motivation and are not necessary for the proof. Observe that, in the above argument, the equality of the coefficients of e2kie in A2.1.6) and A2.1.7) are a consequence of the uniqueness of Fourier series. Thus we have > = — . A2.1.8) ?-? (?; q)i+kiq\ q)i-k (q\ q)x But this follows directly from Cauchy's formula in Corollary 10.9.4, which can be written V — = . A2.1.9) ^ (q; q)n(xq; q)n (xq; q)x In fact, we can give a very simple graphical proof of A2.1.9), using Durfee squares, where the exponent of x counts the number of parts of a partition. Then A2.1.8)
582 12 Bailey Chains follows by taking x = q2k. Next, A2.1.3) is obtained by multiplying both sides of A2.1.8) by (—\)kqk(k~l)l2, summing over all k, and applying Lemma 12.1.1. In the next section, we shall see that the scope of this simple argument can be greatly extended. Remark 12.1.2 Note that we get Euler's pentagonal number theorem k=—oo when we let I ->¦ oo in Lemma 12.1.1. 12.2 Bailey's Lemma In a series of two papers in the 1940s, W. N. Bailey elucidated the underlying stracture of Rogers's proof. He began with Rogers's replacement of 2 cos пв — ешв +е~1пв in certain Fourier-series expansions and quickly observed the following simplified version of the relevant expansion. Lemma 12.2.1 (weak Bailey lemma) Suppose an and fin are two sequences related by n Pn = У" ; A2.2.1) f-? (q; q)n-r(aq; q)n+r then, subject to convergence conditions (which in most applications boil down to \q\ < U oo , X ^ и2 A. ^ (ftq; ?)«, Proof. The proof of this is quite short: A2-2.2) 00 00 „ „2 a" q" <xr = ?? ?; q)n(aq; q)n+ir OO OO n(n4-2r) _f where the last step follows from Corollary 10.9.4.
12.2 Bailey's Lemma 583 The proof given by Rogers can be subsumed by this result. For example, to get the first identity, take a = 1 and 1, n = 0, a" ~ \ (-1) v*3""'^! + qn), n > 0. By Lemma 12.1.1, ff (q\ q)n-riq\ q)n+r ^ (q;q)n-r(q\q)n+r (q,q)n' so that Bailey's result implies y> q ; q)n (q; q) (q\ q)oo The reader should work out the details for the second identity. Seeing this, it is clear that the method may be greatly extended by using more gen- general summations than the very special limiting case of the ^-analog of Gauss's sum. Bailey observed in his second paper that one could indeed invoke the full force of the <7-Pfaff-Saalschiitz summation. He carefully described the proof of such a generalization. However, seeing that it would look quite complicated he chose not to write it down. This omission caused him to miss the full power of what we now call Bailey's Lemma. We start with the statement of "Bailey's transform." Lemma 12.2.2 Subject to suitable convergence conditions, if r=0 and Yn = then л=0 л=0
584 12 Bailey Chains Proof. 2_^anYn = 2.^1^ a"°rUr~n Vr+" n=0 n=0 r=n 00 T r=0 n=0 r=0 and the lemma is proved. ¦ The above argument is purely formal and the suitable convergence conditions are those necessary to make all the infinite series converge and to validate the change of order of summation. We now state and prove Bailey's Lemma. Theorem 12.2.3 If for n > 0 И ar r=0 yi>i;n~r(aq; q)n+r then " a' # = V , A2.2.3) f^ (q\ q)n-r(aq\ q)n+r where , _ (Рь q)r(P2', q)r(aq/PiPiYoir r (aqIp\; q)T{aqIрг\ q)r and a' _ y^ JP^'DjiPi, q)j{aqlP\Pi\ q)n-j(aq/piP2)}Pj A2 2 4) j^0 {q\q)n-j{aq/puq)n{aq/P2\q)n Proof. Take Un = l/(q; q)n and Vn = l/(aq; q)n and _ (Pu q)n{pi\ q)n{q~N\ q\qn (PiPiq~N/a; q)n in Lemma 12.2.2. To compute yn, we need the g-Pfaff-Saalschiitz identity A0.11.3), which we restate here for the reader's convenience: _ (c/a; q)n(c/b; q)n (c;q)n(c/ab;q)n '
12.2 Bailey's Lemma 585 Then 00 Yn = У~]8гиг-„Уг+п -,-N \ к/1 w? t /a; q\(q; g)r-n(ag; q)r+n — (Puq)r+n(P2, q)r+n(q~N; q)r+nqr+n = T ^ (PiPig /a\q)r+n(q;q)r(aq;q)r+2n = (Pug)n(P2-,g)n(g-N;g)ngn^ fpig", m", g-(N~n); g, g (piP2q~N/a;q)n(aq;qJn \ PiPiqn~N/a, aq2n+1 _ (P\\q)n{p2\ q)n(q~N'; q)nq"(aqn+l /P\\ q)u-n(aqn+l I Pi\ q)N-n (P\Piq~N/a; q)n(aq; q)in(aq2n+u, q)N-n(aq/'р\рг\ q)N-n \q)N (-\)n(puq)n(p2\q)n(q~N;q)n (aq; q)ы{aq/p\pi\ q)N (aq/рп q)n(aq/p2; q)n(aqN+l; q)n We now turn to the proof of A2.2.3). Note that r=o v"^"' i/i\-ii r2\ q)r(q\ q)N-r(aq; q)N+r r=o (ag/pi>g">r(aq/P2;q)r(aq\q)N+r (aq/'р\рг\ q)N . , (aq/pi;q)N(aq/p2; q)N jzt (aq/pip2;q)N v~- a t /u т п-пч fir8r (by Lemma 12.2.2) (aq/pu q)N(aq/P2\ q)N ~ (дд/Р1р2\д)ы y-y (p\\q)r(pi\q)r(q~N\q)rqrpr (aq/pu q)N(aq/pi\ q)N f^> iPiPiq~N/a; q) where the last equation follows after an algebraic simplification that produces the expression given in A2.2.4). The theorem is proved. ¦
586 12 Bailey Chains Remark 12.2.1 The Weak Bailey Lemma (Lemma 12.2.1) follows from the full Bailey Lemma by letting n, pi, рг -> oo. We call (an, pn) a Bailey pair if they are related as in Bailey's lemma. The power of the full Bailey Lemma is that, given a Bailey pair (а„, Р„) a new Bailey pair is produced. Consequently, from one such pair one can construct an infinite sequence (а„, р„) ->¦ (а'„, P'n) -*¦ {a'n\ P'n') ->¦ • ¦ ¦ of Bailey pairs by successive application of Bailey's Lemma. This sequence is called a Bailey chain. The simplest conceivable chain starts with „ fl ifn = 0, ,,„,> Pn = <5„,о = < A2.2.5) l-O if n > 0 . It is an simple exercise to show that the corresponding an is {l-aq2n){a;q)n{-lfqii) a« = tj г; : • A2.2.6) A -a){q;q)n Remark 12.2.2 The fact that (an, pn) is a Bailey pair follows immediately from a formula of Agarwal [1953], A ; q)j(a; q)jqn> _ (aq; q)MqnM{д'~п\ q)u 2 ^ 1;9)y(9;«)y (q;q)M(aq"+i;q)M ' a result that can be proved directly by induction on M. Another way is to use the "inversion" formula: If pn is given by A2.2.1), then a. = (.^)?(^)-+'-'(-''"-'«'':')ft A2.2.8) 12.3 Watson's Transformation Formula In this section we use the Bailey chain to derive some important formulas. In particular, we obtain a <?-analog of Whipple's transformation due to Watson [1929]. This transforms a terminating very well poised %ф1 to a terminating balanced 4</>3. The phrase "very well poised" is defined below. We start with a simpler result. Recall that (а„, рп) in A2.2.5) and A2.2.6) form a Bailey pair. By Theorem 12.2.3, the next pair in the chain, namely (a'n, p'n), is given by a, {aq/piP2\q)n Pn = QL = (q; q)n(aq/pi; q)n{aqlpi\ q)n ' (pi; q)n(p2; q)n(aq/p1p2)na - aq2n)(a; q)n{- ; q)n(aq/p2\ q)n0- ~ a)(q\ q)n
12.3 Watson's Transformation Formula 587 And the relation П i when written out in full, is (aq/p\p2)n (q)n{aqlp\)n{aq/pi)n ^ (q)n-r(aq)n+r(aq/p1)r(aq/p2)r(l-a)(q)r where (x)r = (x; q)r. Note that 1 - aq2r = A - V^?r)(l + ^qr) _ (Vflg; q)r(-^q; q\ and {aq)n+r = (aq)n(aqn+1)r. Thus A2.3.1) is equivalent to the formula u p2,q~" aqn+1\ (aq)n(aqIp\pi)n (ЩI P\)n(aq I pi)n A2.3.2) The 6</>5 is well poised because the product of a numerator parameter with the corresponding denominator parameter is aq. Additionally, the presence of the factor A — aqlr)/(\ — a) now makes it a very well poised 6</>5. Watson's transformation is obtained by moving up the Bailey chain to (a^, P'J). The relation v— a" f^ (q)n-r(aq)n+r is seen to be equivalent to (aq/Xq)r(aq/k2)r(aq/Pi)r(aq/P2)r(l - a)(q)r(q)n-r(aq)n+r j(k2)j(aq/k\k2)n _ v^ (A.Qj(k2 y^(q)n-j(a q/*.\)j(aq/X2)j(q)J(aq/pl)j(aq/p2)J' 3 34
588 12 Bailey Chains By the formulas used to obtain A2.3.2), it is easy to show that A2.3.3) can be written as follows: / a,q^/a,q^/a,kuk2,pi,p2,q~n . a2qn+1 \ 1 \/a, --Ja,aq/ki,aq/k2,aq/p1,aq/p2,aqn+1' 'k1k2p1p2j (aq)n(aq/p1p2)n ( aq/kik2, Pu P2,q \ „,,,.. *3 __„ __„ _„, \q,4 • A2.3.4) n{aq/P2)n \aq/ku aq/k2, p\piq n/a This is Watson's formula. When aq/k\k2 = p\Piq~n/a, the 40з becomes a bal- balanced 302, which can be summed by the g-Pfaff-Saalschutz formula A0.11.3). The result in this case is the ^-analog of Dougall's formula due to Jackson [1921]: \^/a,-^/a,aq/k1,aq/k2,aq/p1,aq/p2,aqn+r (aq)n(aq/pip2)n(aq/kipi)n(aq/klk2)n 2 = , when a q = k{k2p\p2q . (aq/k1)n(aq/k2)n(aq/p1)n(aq/k1k2pl)n A2.3.5) Let n ->¦ oo in A2.3.5). This gives / a,q*ja,-qja,ki,k2, pi aq \ 605 I y- y- ., ., . ;^'7~; I \Va' -Va'a(l/^uaq/b2,aq/pi kxk2px ) (aq)oc(aq/pip2)oc(aq/kipi)oo(aq/kxk2)oc = . (Iz.3.6) (aq /ki)oc(aq /k^^aq / pi)oo(aq /ki^pi)^ This is a nonterminating form of A2.3.2). Now observe that the 807 in A2.3.4) is symmetric in Ль к2, pi, p2 and hence (aq/p1p2)n ( aq/kik2,pup2,q-" 403 I ! ^! i (aq/pi)n(aq/p2)n \aq/kuaq/k2, pip2q-nja (aqjk{k2)n ( aq/p1p2,ki,k2,q~'n . -4*3 __,. _.,. ,,._„,_; 9.9 • A2-3.7) (aq/k1)n(aq/k2)n \aq/pu aq/p2, k\k2q n/a A ^-analog of Dixon's formula for a well-poised ?,F2 is obtained by setting p\ = Vain A2.3.6): / a, -q^/a,kuk2 403 I r ,, ., ; q, \Jaq/kaq/k -Ja,aq/ki,aq/k2 kxk2 J (aq)oo(aq/kik2) (aq/ki)oc(aq/k2)oo(q^/a)oo(q^/a/k1k2)c
12.4 Other Applications 589 The two Rogers-Ramanujan formulas can also be derived from Watson's trans- transformation A2.3.4). Let A.i, A.2. Pl P2 ->• °° to get y^(fl;g)j(l- h (q;q) Now let n —>- oo and apply the dominated convergence theorem. After simplifica- simplification, the result is l + E i For a = 1, this is the first Rogers-Ramanujan identity and for a = q it is the second one. If one wants to generalize a <?-series identity that has quadratic powers in it, it is natural to try to replace each #B> with (a; q)k times something else that will give q( 2 'using ]im(a;q)k/{-a)k=qti). a—>-oo Since there are two Rogers-Ramanujan identities and five factors q(?\ the mini- minimum extension of the Rogers-Ramanujan identities of this type should have six free parameters. Formula A2.3.4) is this extension. 12.4 Other Applications The summation formula for a very well poised 6</>s and Watson's transformation were fairly straightforward consequences of Bailey's Lemma. Further interesting results are possible by starting with the Bailey pair A2.2.5) and A2.2.6) and moving further along the chain. For example, v- q" ' -> (n\ (n\ П n#0,±(fc+l)(mod 2/1+3) This identity reduces to the first Rogers-Ramanujan identity when к = 1. As noted before, it can be proved by examining (a*+1), Д^+1)) in the Bailey chain starting with the pair A2.2.5) and A2.2.6).
590 12 Bailey Chains It should be stressed that this is not the only Bailey chain of interest. In a totally different context, another of Ramanujan's series, ("+1) „=0 <¦ q'q)n n=0 was studied. The relevant Bailey pair is P" ~ (q2; q2)n and From r=0 (q; q)n-r(q\ q)n+r we can deduce S(q) = ^ (-l)n+Jqn^n+1)/2-j\l - q2n+1). и>0 From this result it can be shown that almost all sn equal zero and that for any integer M (positive, negative, or zero), there exist infinitely many n so that sn = M. For details, see Andrews, Dyson, and Hickerson [1988]. This paper also contains references to the work of Slater and Bressoud in Exercises 3, 4, and 5. Exercises 1. Obtain the second Rogers-Ramanujan identity A2.1.2) from the weak Bailey Lemma 12.2.1. 2. Prove Agarwal's formula A2.2.7), namely A A - agV)(q-n; q)j(a; q)mqnj = (flg; gbg"M(g'""; q)u l;q)j(q;q)j (q; q)M(aq"+1; q)M 3. Show that the (а„, fin) denned below is a Bailey pair with a = 1: ' -qf«2-^+\ m = Ъп - 1 > 0, q6n2~n+q6n2+n, т = Зи>0, .1, m =0, (Slater)
Exercises 591 4. Suppose Urn = < 6n42n -q 1, 6n2+2n m = 3n-l>0, m = 3n>0, т = Зи + 1>0, m = 0, and Prove that (an, fin) is a Bailey pair with a = 1. 5. Prove that (Slater) и = odd, is a Bailey pair with а = 1. (Bressoud) 6. Apply A2.3.8), Exercises 10.37 and 10.39, and Jacobi's triple product formula to obtain the following six identities of Rogers: (a) 6; q2° ; q2°)ool + (C) (d) (e) (f) 33 A n^±l,±8,±9,10(mod20) oo П (] n=l n^±3,±4,±7,10(mod20) ?__п I + 7^п = П A _ ^„ Whipple's 7^6 transformation was obtained in Chapter 7 as a byproduct of the solution of the connection coefficient problem for Jacobi polynomials. There are several ^-analogs of the Jacobi polynomials. The next set of four
592 12 Bailey Chains problems shows how the little <?-Jacobi polynomials can be used to derive Watson's 8</>7 transformation. See Andrews and Askey [1977]. 7. Define the little <?-Jacobi polynomials by pn(x;a,P :q) = 2</>i I ^ ;q,qx (a) Show that lim pn(x; q\ /:<?) = P^\\ - (b) Derive the orthogonality relation Ealqi(ql+1;q)oo ¦ i ,„ ,-+1. . pn(q';a,P :q)pm{ql\u,P : q) 0 ifm/и, anqn(q; q)oc((xPqn+1; q)oo(q; q)n Wqn+u,q)oc{uq\q)oc{aq;q)n{\-aPq2n+l) [Hint: First obtain the following identity: ifm=n (=0 0 if 0 < т <п, l)oo(q\q)n .f n ifm=n.] 8. Suppose that n pn(x; y,8 : q) = ^У~\акпрк{х; a, fi : q). k=0 Show that ,q)k(q n;g)k(oiq;q)k (aPqk+u,q)k(q;q)k(yq;q)k q~n+k,y8qn+k+\aqk+i 9. Note that the identity in Exercise 8 is a polynomial identity in x. Deduce that with the same akn, we have (q-n,y8qn+\ax,...,ar г+гФг+i \q,qx \ yq,bu...,br акпг+2Фг k=0 (q k,aPqk+1,au...,ar aq,bi, .. .br 10. Take r = 2, ft = 8, a\ = aq, x = 1, and b2 = q2a8a2/b1 in the previous exercise. Observe that the з</>2 in akn becomes balanced and may be computed
Exercises 593 by A0.10.3). Deduce that q-n,y8qn+l,aq,a2 \ anqn(8q; q)n(y/a; q)n yq,bi,q2a8a2/bl ' ' / (a8q2; q)n(yq; q)n ~n,q^a&q, -q^/aSq,aSq,aq,aSq2/bu ySqn+l,b\/a2- The standard form of Watson's transformation is obtained from this formula after proper identification. 11. The big g-Jacobi polynomials are defined by {q; q)n{-q; q)n /q-n,qn+a+P+1,xqa+ld/c ъФг „,, „,, ,. ,q,<l Prove that the orthogonality relation is Г y_rf « x,c, . m . q О т ф n, (-qH+lc/d; q)n(-qa+ld/c; q)n{\ - t ¦ , m=n, (-q\ q)n(-q\ q)n where M _ cd{\ - q)(q; q)oc(qa+li+2; q)co{-d/c; q)oo(-c/d; q)oc (c + d)(qa+1; q)oc(qP+1', q)oo(—qa+ld/c; q)oo(—qP+lc/d; q)oo Note that when с = d = 1, the weight function in the <?-integral tends to A -х)аA+х)Р asq -+ Г. 12. A set of g-Laguerre polynomials is defined by - q)k(qn+a+lx)k (a) Replace x with -A - q)q~{P+1)x and let /J ->¦ -oo in the little ^-Jacobi polynomial of degree n. Show that the result is (q«+l;q)n " *4" (b) Prove that lim^,- L(na)(x; q) = Lan(x).
594 12 Bailey Chains (с) Prove the discrete orthogonality relation ka+k q 0, (Use Ramanujan's i^i sum.) (d) Prove the continuous orthogonality relation dx x;q) (-A -q)x\ q) О, тфп, m — n. Yq(-a){q\q)nq« (Moak) 13. A multiplicative shift of the variable x in Exercise 12 gives a set of g-Laguerre polynomials defined by Lan(x;q) = (a) Show that q)x (~q x , qH m = n, (-x;q)oc(-qx~l;q)oc g-n{q\q)n{q\q)oo Г (a + 1 - 0)Г@ - а) = I (qa+];я)п(qa+]; q\ r,(a + 1 -p)T4(p-a)' О, тфп. (b) Set a - p = с (fixed c) and let a ->• oo in L"(jc; #). Show that the result is the Stieltjes-Wigert polynomial /i=u ()n-k (c) Prove that a weight function for the Stieltjes-Wigert polynomials is xc w(x) - . (-x;q)oo(~q/x;q)x For Exercises 12 and 13, see Gasper and Rahman [1990, Chapter 7].
Infinite Products A.I Infinite Products For readers unfamiliar with infinite products, a brief introduction is given here. Definition A. 1.1 Let pn^ = Yili=k^~^am)- Vthere is akforwhich рп^ converges to a nonzero value p as n ->• oo, then we say that the infinite product ]^=1(\+ап) converges. We write it as p = ГС^ЦA + an)- The reason for not taking к = 1 is to allow an finite number of zero factors. The convergence of the product is said to be absolute if fl^li A + \an\) converges. The following basic theorem reduces the problem of convergence of a product to that of a series. For simplicity, assume that Re an > -1, n — 1, 2, If not, start the product after this holds. Theorem A.1.2 The product fl^iO + an) converges if and only if the series J2T=\ loiA + a») converges. Proof. Suppose that Sn = J2"m=i l°g(l + am) converges to S. Since exp is a continuous function, n oc Yl A + am) = ехрF"и) converges to es = JJA + an). m = l и=1 To prove the converse, let 1 + am = AmeWm and П^Д1 + am) = В„е'ф». The convergence of ПA + а„) implies am ->• 0 so that 9m ->• 0 and фп can be chosen so that, say, фп ->• ф. We also have ву+в2 + \-6n =ф„ + 2пкп, where kn is an integer. Therefore, 0n+i = </>n 595
596 A Infinite Products Since kn+i — kn is an integer, вп+х ->• 0, and since фп+\ — фп -> 0, we must have kn = k, a constant for sufficiently large n. Thus, for sufficiently large n, n +27rfei. Let n —>• oo to get = log p + 2n ki. This proves the theorem. For absolute convergence the condition is simpler and is contained in the next theorem. ¦ Theorem A.1.3 The product Y\^=1(l + an) converges absolutely if and only if ^2T=i a" converges absolutely. Proof. Convergence of the series or the product implies that an -> 0. For suffi- sufficiently large n we must have \an \ < 1/2. Suppose an ф 0 and n large. Then 1 - an < -[|tf«l + KI 1 la, I 21-|ая| So or 1 ^ log(l+an) an -K|<|log(l+an)|<-K|. These inequalities together with Theorem A. 1.2 imply the result and also the fact that absolute convergence of a product implies its convergence. ¦ Definition A.1.4 The infinite product n=l where x is a real or complex variable in a domain, is uniformly convergent if n Pn(x)= ]J(l+am(x)) m=k converges uniformly in that domain, for each k.
Exercises 597 Theorem A.1.5 If the series Y2T=i \an(x)\ converges uniformly in some region, then the product Yl™=l(l + an(x)) also converges uniformly in that region. The proof of this result is left to the reader. Corollary A.1.6 If an(x) is analytic in some region of the complex plane and П A +an (jc)) converges uniformly in that region, then the infinite product represents an analytic function in that region. Exercises 1. Prove directly from the definition that the product {\ - \){\ + \){\ - \) ¦ ¦ ¦ \s convergent. 2. Prove that an absolutely convergence product is convergent. 3. Show that if Еа„, Ha^,..., Sa*, ?|а„|* are all convergent, then ПA +ап) is convergent. 4. Discuss the convergence of the product
в Summability and Fractional Integration B.I Abel and Cesaro Means The following theorem, the first part of which was proved by Abel, is often en- encountered in a first course in real analysis. Theorem B.I.I Suppose that the series Y2T=o ^« converges to B. Then Y2T=o b"x" converges uniformly on [0, 1], and in particular lim 2^bnxn = В. (В.1.1) X^ л=0 Moreover, if Bn = YH=o bk> then lim — l- — = B. (B.I.2) и^ос n + 1 The example of the series 1 — 1 + 1 — ¦ • ¦ shows that the limits in (B .1.1) and (B.I .2) may exist even when the series does not converge. Thus (B. 1.1) and (B. 1.2) may be used to assign a sum to divergent series. The series YlT=o bn is Abel summable or summable A to В if (B. 1.1) holds; and it is Cesaro summable or summable (C, 1) to В if (B.I .2) holds. We also speak of (B. 1.1) as the Abel mean and (B.I.2) as the Cesaro mean of the series J2T=o ^n¦ The Abel and Cesaro means of the series 1 — 1 + 1 — • ¦ • are both 1/2. The next theorem shows that Cesaro summability is a stronger requirement than Abel summability. Theorem B.I.2 If the series YlT=o^n is (C, 1) summable to B, then it is A summable to B. 599
600 В Summability and Fractional Integration Proof. Use summation by parts to get 00 ОС *„*" - В = A - *) ? Bnxn - В where Bn= л=0 л=0 V k=0 ОС — X ) > (on Т* ' * ' Т* Вп)Х — & ос = A - хJ^2{(В0 + ¦ ¦ ¦ + Вп) - (п + })В}хп. (В.1.3) я=0 Now (В. 1.2) implies that, for e > О, there exists an integer N such that |(Bo + ¦ ¦ ¦ + Bn) — (n + l)B\ <e(n + l) forn > N. Use this in (B. 1.3) to arrive at the necessary result. ¦ The example of the series 1 — 2 + 3 — • • ¦ shows that the Abel mean may exist (in this case it is 1/4) but the Cesaro mean may not. The Abel and Cesaro means, together with other summability methods, are very useful in analysis and analytic number theory. As an elementary example, consider the following theorem of Abel on the product of two series: Theorem B.1.3 Suppose J2T=o an and J2T=o ^n are convergent series. Let cn = 121=0 аФп-к and suppose that J2T=o c" *s convergent. Then л=0 и=0 и=0 Proof. Since cn = k=0 ос ос к=0 j=0 k=0 and Theorem В. 1.1 completes the proof. Ш The Cesaro and Abel means play a very significant role in the theory of Fourier series and transforms. We take a very brief look at how they appear in Fourier series since these ideas will play a role elsewhere in the book. Suppose f(x) is an integrable function of period 2л. Let J2°°oc а"е'"х be its Fourier series. An important question is: When does the Fourier series of a function converge to the function? Once again it is easier to deal with the Cesaro mean or
В. 1 Abel and Cesaro Means 601 the Abel mean. The Abel sum is given by Since an = — / f(t)e~(n'dt, 2я Jo the sum in (B.1.5) is equal to 1 Г2л 2л- Jo The sum inside the integral is called the Poisson kernel and it is a straightforward calculation to show that it is equal to 1-r2 = Pr(x-t). 1 - 2r cos(jt - t) + r2 The following properties of Pr(x) are worth noting: (i) Pr(x) > 0, (ii) ±fin Pr(x)dx = I, md (iii) for 8 > 0, max Pr(x) -> 0 as r -> 1". S<x<2n-S These properties can be used to give a proof of the following theorem: Theorem B.1.4 If f is periodic and integrable on @, 2л) then the Abel means of the Fourier series converge to \{f(xo+) + f(x0—)} at every point x0 where the right and left limits /(xo±) exist. A similar result exists for Cesaro means. The nth partial sum of the Fourier series is given by sn(x) = 2^ a*« = - k=-n " "'" i=-n The sum inside the integral is A sin(n + 1/2)(jc - t) 1+2 > cos/t(jc —f) = . (B.1.7) *-^ sin((jc - 0/2) k=0 This expression is called the Dirichlet kernel. One drawback of the Dirichlet kernel is that it is not always positive. However, if we take the (C, 1) mean of the Fourier
602 В Summability and Fractional Integration series, then we get so(x) + si(x) H \-sn(x) an(x) = n+l ^ sin((x - 0/2) The sum in the integral denoted by Kn(x — t) is called the Fejer kernel. The sum is equal to {, ч 2 svnUn + l)(x - t) 2. i > > 0. (B.I.8) The Fejer kernel has the three properties of the Poisson kernel mentioned above. One may then deduce the following theorem: Theorem B.1.5 For f as in Theorem B.I.4, the Cesaro means of the Fourier series of f converge to \{f(xo+) + f(xo—)}at every point xq where the right and left limits /(*o±) exist. If we assume / to be continuous on [0, 2л], then the Abel and Cesaro means converge to /. An important consequence of Theorem B.1.5 is the following corollary: Corollary B.I.6 A continuous function on [0,2л] can be uniformly approximated by trigonometric polynomials, that is, polynomials of the form Yl"-n ake'kx. Since eikx is uniformly approximated on [0, 2л] by partial sums of its Taylor series, we get another proof (see Exercise 1.40) of the Weierstrass approximation theorem. B.2 Cesaro Means (C, ex) In the previous section, we saw that the (C, 1) means do not assign a value to the sum 1-2 + 3—4H .To handle this and some other situations, we define Cesaro means of higher order. For the series У"Ч?° bn, set where k=0
B.2 Cesaro Means (С, а) 603 The limit of the sequence {^tj} gives the (C, 1) mean. Now, set ВР = В + В and The limit of the sequence {^5-} is the (C, 2) mean of the series Y^ bn. Similarly, в<р = в™ + в™ + --- + вр and jtP) 1 , * l j (" + DC» + 2) (и+1)(и + 2)(и + 3) Thus, the (C, 3) mean is given by Define B^*' and ?^ inductively by Bf = and ?« = ?() + ?(« + ... + ?»»). It is possible to express B* explicitly in terms of Ь„ as follows. Note that n=0 n=0 and n=0 So ?=0 and ±!>1. (В.2.3)
604 В Summability and Fractional Integration We define the (СД)теап of Ym bn as the limit of the B^/E^a&n -» oo.Note that these quotients are meaningful even when к is not a positive integer. We take к to be real and greater than — 1. Definition B.2.1 The series J^° bn is (C, a) summable to В (for a > —1) if ^r^^^B. (B.2.4) Remark B.2.1 Note that the limit in (B.2.4) is equal to lim ?^±22вИ B.3 Fractional Integrals It is possible to extend the summability definitions to integrals. Suppose f{t) is integrable on finite intervals @, x). We say, for example, that / is Abel integrable, that is, /0°° f{t)dt exists in the Abel sense, if lim / lim / e~x'f(t)dt exists. A— n' We also write this as /•00 /.00 / /(r)c?f = lim / e'ktf{t)dt (Abel). To define Cesaro integrability (C, ?), where ? is an integer, we first have the integral analog of Blk) (see (B.2.2)) as /(*>(*) = /o Ak-i)(Odt = Jox(x - t)f(k-2)(t)dt = • •. = 1 /^(дс - (B.3.1) Following the remark after (B.2.4) we say that /0°° f{t)dt is (C, fc) integrable if ГГук+11 Л* / f \ * lim v 7 /*(*) = lim / I 1 - - I /@^ exists. Observe that the final expression for /*(jc) in (B.3.1) is meaningful for all real к > — 1. Thus we have a definition of (C, a) integrability for a > — 1. Note also that the formula for /да in (B.3.1) expresses /№) as a (jfc + l)-fold integral of /. Thus, we define Iaf, the a-fold integral of / for Re a > 0, by the formula f\x - t)a-x Jo » fix) = 7^- f\x - t)a-xfit)dt. (B.3.2) Г (a) J
В.4 Historical Remarks 605 It is easy to check that the operator Ia satisfies the relation The operator Ia is called a fractional integral of order a. Euler's integral expression for the 2.F1 hypergeometric series can now be interpreted as a fractional integral. Write the formula as r(b)r(c- b)J0 where fit) = tb-l(l -t)~a. Remark B.3.1 Abel and Cesaro integrability can be used to study Fourier inte- integrals just as the corresponding summability is used to study Fourier series. B.4 Historical Remarks Euler had used Abel means and other methods for associating a sum to a divergent series. In particular, he obtained the functional equation of the zeta function in the form - 2 5 1 _ 2- + 3-» B'-i When Re s > 1, the series in the denominator converges but the series in the nu- numerator is interpreted in the Abelian sense. It can be proved that oo lim ^(-ly-V-1*" = f(l -s)(T - 1); thus Euler had the functional equation correct. He verified it only for integer values of s and for s = j and |. Abel's name is attached to this method of summation since he proved formula (B.I.I). Leibniz used (C, 1) means to evaluate 1 — 1 + 1 — • • • and somewhat later D. Bernoulli employed the same method to consider the more general series J2T=o^n, where bn+p = bn for all n and Ylo~l bk = 0. Neither Leibniz nor Bernoulli explicitly stated that he was giving a new definition of convergence. This was done by Cesaro. For more on the history of Abel and Cesaro means the reader should consult the very interesting treatment given in Hardy [1949]. Abel was apparently the first mathematician to use fractional calculus, though the concept had been considered by others before him. Abel used fractional calculus in his solution to the problem of finding the tautochrone - the curve down which
606 В Summability and Fractional Integration a particle slides freely under gravity with zero initial velocity and reaches the bottom in the same amount of time regardless of the starting point, provided that the starting and lowest points are distinct. This problem had already been solved by Huygens; the motivation was the construction of a clock in which the period of oscillation does not depend on the amplitude of the pendulum. Suppose that the particle slides down from the point (a, b) and that the bottom of the curve is the origin of coordinates. If s is the length measured from the origin, then conservation of energy implies that velocity = v = ds/dt — ^/2g(b — y) at a point on the curve with ordinate y. Set ds/dy — f(y) to see that the time taken to reach the bottom is given by , Гь T(b) Abel observed that T{b) = J^( Since T{b) is independent of b, use (B.3.2) to get /о Jb-y ж Now take the derivative of both sides to obtain Since we get a differential equation that can be solved. The tautochrone curve turns out to be a cycloid. An English translation of Abel's [1826] paper on this topic is also available. The above calculations suggest that the concept of a fractional derivative would also be useful. For smooth enough functions, in particular analytic functions, it is possible to define fractional derivatives as follows: Daf(x) = — /„_«/(*), (ВАЛ) where 0 < Re a < n, and n is an integer. Observe that we took the integral first and then the derivative. If the calculations are done in the opposite order, a different
Exercises 607 function arises since the first n — 1 terms are removed by the nth derivative. For an analytic function oo 0 the fractional integral has the form 00 k\ as is easily verified. This shows that the value of n in (B.4.1) may be arbitrarily chosen as long as n > Re a. In fact, DkIa — DnIa+n-k. However, the inclusion of fractional derivatives with fractional integrals does not give a group, for although DaDfi = Da+fi when a, fi and a + fi are not integers, Dl/2D ф D3/2 for the reason mentioned above. Exercises 1. Prove Theorem B. 1.1. 2. With the notation of Theorem В. 1.3, prove that if J^° an converges absolutely and Y^ju bn converges, then Ya^ cn converges. 3. Prove that if the series Y^ bn is (C, a) summable to B, and fi > a, then the series is (C, fi) summable to B. 4. Prove thefollowing extension ofTheoremB. 1.2: If the series Y™ bn is (C, a) summable to B, then it is Abel summable to B. 5. Show that 00 Deduce that 1 - 22k + 32k = ¦ ¦ ¦ = 0, 1 _ 22* + 32* - • • • = (-1)* ——Bk 2k
608 В Summability and Fractional Integration in the sense of Abel summability. This result is due to Euler. See the historical introduction in Hardy [1949]. 6. A sequence Sn converges to S in the sense of Borel if OO о Show that if Sn converges to S then Sn -» S in the Borel sense. Also prove that if Sn (x) = 1 + x н \-xn, then Sn (x) -* 1/A - x) in the Borel sense for Rex < 1. 7. Show that if fi > a > — 1 and the (C, fi) means of Еа„ are nonnegative, then the (C, a) means of Hanrn are nonnegative for 0 < r < (a + \)/ф + 1). Hint: Observe that (l-fflJ-'-'EvV = (l-a>r"-lu-rw)f>+l(l-rw)-f>-li:anrnaf and show that the power series coefficients of A — a))~a~x(l — rw)P+1 and those of A — rvS)~P~xYjanrnaf are nonnegative. This proof is due to Bustoz [1974]. 8. A series Y^,T an is summable (L) (Lambert summable) to S if oo „ .. „ , v^ na"x с lim A - x) 2_^ = S. Prove that if Т,а„ = S(C, a), for some a, then Еа„ = S(L). 9. Prove Dirichlet's theorem that if x is a quadratic Dirichlet character, then L(l, x) = YZ=\ X(n)/n ф 0 as follows: (a) Show that J2d (b) Show that (c) Let f<x\ _ x ' ' лч"' XKn)x Show that b\ > Ьг > ¦ ¦ ¦ > К > bn+i > ¦ ¦¦¦ (d) Note that | J2n=i X(n)\ < M, where M is a constant that does not depend on m. Use summation by parts to prove that f(x) < ^~ = M. (e) Deduce that L(l, /) ф 0. This proof is taken from Monsky [1994]. 10. Prove the following theorem of Hardy and Littlewood: If an > 0 and oo lim A -x)Y^anxn = 1, x~* л=0
Exercises 609 then hm — y^ Щ = 1. Karamata's proof is sketched below. (a) Show that oo lim A — x) 2^onxng(x") - wheng(f) (i) is a polynomial, (ii) is a continuous function, or (iii) has a discontinuity of the first kind. (b) Take g(t) = 0 @ < t < l/e) = \/t(\/e < t < 1) and complete the proof of the theorem.
Asymptotic Expansions C.I Asymptotic Expansion Let x be a real or complex variable in an unbounded region D and let Y^ anx~n be a formal power series that may be convergent or divergent. Definition C.I.I The series Y^ anx~n is an asymptotic expansion of a function /(*) if n-\ iX'k + Rn(x), (C.I.I) where Rn(x) = 0(x~") as x ->¦ oo in D. Usually {C.I.I) is written as f(x) ~ ao + a\X~x -\-aix~1 + ¦ ¦ ¦ as x-^ oo in D. (C.I.2) This definition is due to Poincare [1886]. As a simple example we have the following asymptotic expansion of A + x): 1 111 1 + X X X2 X3 as x —>¦ oo. (C.1.3) In this case the series is convergent. The more interesting situation occurs when the series is divergent. Consider the complementary error function of a real variable x >0, Successive integration by parts gives 2 \e-*2 [°°e-r erfc x = —= — / —— y/n\2x Jx It2 2 [°° , erfc x = —= / e dt. dt (C.I.4) k=\ 611 *.¦ г* г--:^
612 С Asymptotic Expansions where Rn(x) = (-1У+1 It is easy to see that 1 • 3 • • ¦ Bи + 1) 2" -xe Jx t < ¦ 3 • ¦ • (In Bx2)«+i (C.I.5) Hence 1 ¦ 3 ¦ ¦ ¦ Bk - 1) Bx2)k is an asymptotic expansion of the complementary error function. It is clear that the series diverges for all x > 0. However, if a fixed number of terms is taken, then for large enough x a good approximation of erfc x is obtained. On the other hand, for a given x, taking more and more terms of the series does not improve the approximation, since the series diverges. C.2 Properties of Asymptotic Expansions The following theorem follows almost immediately from the definition. Theorem C.2.1 A function f(x) has an asymptotic expansion Т,а„х~" if and only if for each n, л-1 fix)- --* k=0 as x —>¦ oo in D, (C.2.1) uniformly with respect to arg x when x is complex. Proof. It is easy to see that (C.2.1) implies (C. 1.2). To reason in the other direc- direction, observe that x"Rn(x) = xn[anx and the theorem is proved. ¦ Rn+i(x)]-+an asx-*oo, A consequence of this theorem is that a function has at most one asymptotic expansion in D. In a different unbounded region the asymptotic expansion may be different. However, two different functions may have the same asymptotic expansion in some region. For example, the function e~* in | arg x\ <\n—&< \ж satisfies x"e~x -» Oasx -» oo. By (C.2.1), n 0 0 '0 + - + — + ¦ x x2 |argx| <-n- 00. So the zero function and e x have the same expansion in this region. The next theorem gives some algebraic properties of asymptotic expansions.
C.2 Properties of Asymptotic Expansions 613 Theorem C.2.2 Suppose that f(x) ~ T,anx~" in D\ andg(x) ~ ~Ebnx~" in D2. Then: (i) For constants A. and д Xf(x) + iig(x) ~ T,(Xan + iibn)x~n in Dx П D2. (ii) f(x)g(x) ~ T,cnx~n in Dx П D2, w/геге си = ao^o + ai^/i-i H h anbo- (Hi) Ifa0 ф 0, then 1 "' л/ = У^-4 + O(x~") asx -> ooinD], f(x) f^Q x where a.Q+ldk is a polynomial in uq, a\,..., ak. The dk can be obtained from the relations aod() = 1, aQdk = —(a\dk-\ + a2dk_2 + • • • + akdo), к = 1, 2,.... The proof of this theorem is left to the reader. Asymptotic series can be integrated over the interval л: < t < oo,ifao = a\ = 0. In this case XL XJ and Г0° tl u «2 ЙЗ Integration is possible here because f(t) = a2t~2 + O(t~2) for t large. If a0 and a\ are not zero, then — an — a\t ]dt ~ 1 + • • •, x —> oo. x 2x3 Differentiation of an asymptotic expansion may not always be valid. A standard example where differentiation fails is /(*) = e~Jtsine* where x > 0. The derivative f'(x) = cose* — e~* sine* oscillates as x —*¦ oo and hence does not have an asymptotic expansion by Theorem C.2.1. But 0 0 f(x) ~0H 1—- + ¦¦¦, лг-4-оо. If f'{x) is continuous and has an asymptotic expansion, then it can be obtained from term-by-term differentiation of the expansion for f(x). This follows from the result on integration and uniqueness of the asymptotic expansion.
614 С Asymptotic Expansions C.3 Watson's Lemma In this section, we discuss a method of obtaining the asymptotic expansion in some region of a function expressible as a Laplace integral. Section 1 has an example of an integral whose asymptotic expansion was obtained using integration by parts. The failure of this method in one case was discussed in Chapter 2. Also see Wong [1989, p. 18]. The next theorem, called Watson's lemma, gives the asymptotic expansion of f™e~x'f(t)dt. See Watson [1918]. Theorem C.3.1 Let f(t) be analytic in \t\ < a + S, where a > 0, S > 0, except possibly for a branch point at 0; and let m=\ when \t\ < a and r > 0. Suppose also that \f(t)\ < кеы, where к and b are positive numbers independent oft, when t > a. Then /•О / as x -» oo (C.3.2) for |argx| < \n - S < \n. Proof It is clear that for any fixed integer M, a constant С can be found such that for t > 0 M-l f(t) - У* amt{m/r)- Hence e-x'f(t)dt = V / e-^flmr(m/r)-^r + RM »=i ^ M-l = \J атГ(т/г)х~т/г + Rm, where /•OO I^mI < / \e~xt\Ct(Mlr)-lebtdt Jo = СГ(М/г)/[Кех-Ь]м/г, provided Rex > b. Since |argx| < \ж — 8, it follows that Rex > b if |x| > b esc S. Thus, This proves the theorem.
C.4 The Ratio of Two Gamma Functions 615 Watson's lemma can be generalized to the case in which the integral in (C.3.2) is a contour integral, /^+> e~xtf(t)dt and (C.3.1) is replaced by an asymptotic expansion for f(t). See Olver [1974, pp. 112-115] and Wong [1989, p. 22]. C.4 The Ratio of Two Gamma Functions We give an application of Watson's lemma to obtain the asymptotic expansion of a ratio of gamma functions. This expansion is due to Tricomi and Erdelyi [1951]. Observe that for Re(x + a) > 0 and Re(b — a) > 0, we have Г(х 1 f°° -!— / e-xte-at{\-e-<) ' - a) Jo Г(х + Ь) Гф Define the generalized Bernoulli polynomials B° (u) by taeut _y> a tn ^ n=0 Then for a = a + 1 - b, jT^Fr = ? ^TB« (a)r""CT' |r| <2;r- Watson's lemma now implies that T{x + a) _ y, (-1)" Гф-а+п) 1 n! "v F(ft-a) x*-a+»' ^-«, |x|->oo. (C.4.3) If instead of the integral representation (C.4.1), one were to use Г(* +a) _ ГA +a - b) [^ ,x+a)u, .^ ¦ I e<-x+a^t(et — J —ooe'a l)b-a'ldt, (C.4.4) T(x+b) 2ni J_oo.e,, Re[(x + a)eia] > 0, |ot| < я/2, and for small \t\, a - тс < arg(e' — 1) < a + ж; then one could extend (C.4.3) to |arg x\ < я — 8. There is an improvement of (C.4.3) from the computational standpoint due to Fields [1964]. Note that if и is replaced by a - и in (C.4.2), then B%{o - u) = {-\)kB%(u). This implies that В%к+1(а/2) = О. Now write the integrand in
616 С Asymptotic Expansions (C.4.1)as л! «=о °° DJ / _ -(x+a-a/2)t ST^ a2n\a ?—< ПпЛ This implies „=о W- - a + 2n) Bи)! Г(й-а) (x + a - CT/ for |arg(jc + a)| < \n — S, \x\ ->¦ oo. Exercises 1. Let Re x > 0. Show that as x -> 0, OO -( ^ dt~Y{-\)nn\xn by two methods: (a) Expand 1/A + xt) as a series, (b) Integrate by parts repeatedly. 2. Suppose that F{x) = f e~xtf{t)dt Jo converges for some x = xo and / has continuous derivatives of all orders in 0 < t < a. Show that F(x) ~ Y,f{n)(O)x-n-1 uniformly in arg x, as x ->¦ oo in |arg jc| < я/2 — e for e > 0. 3. Show that Л 4. Suppose 6»„ is defined by Show that and hence л 1! On = M »~i + л^ (I 4 j 8 J -7 + '- Л" 1 (yp \ ^ as, 1 n ) ufx n —>¦ oo. For a discussion of this result of Ramanujan, see Bemdt [1989, pp. 181-184]. 5. Prove Theorem C.2.2.
D Euler-Maclaurin Summation Formula D.I Introduction Some consequences of the close connection between series and integrals are brought out even in a first course in calculus. The integral test, for example, states that for a decreasing continuous function on [1, oo) the series YT /(n) and integral /j°° f(x)dx converge or diverge together. In the derivation of Stir- Stirling's approximation for F(jc) given in Chapter 1, we saw that for finite sums the function / need not be decreasing for the integral to provide a good ap- approximation. The Euler-Mclaurin summation formula makes the connection be- between the sum and the integral explicit for sufficiently smooth functions. In this appendix we give a statement and proof of the formula and a few applica- applications. Start with a differentiable function defined on a set that contains the interval [m, n], where m and n are integers. Let m < j < n, where j is an integer. As a first approximation we consider f(x)dx « l-(f{j) + f(j + 1)). (D.I.I) It is possible to express the error in this approximation as an integral. Observe that /o+d)= [J+14-\(*-j-l -j--)f'(x)dx. (D.1.2) j Ji Set j = m, m + 1,..., n — 1 to get \(f(m) + f(m + 1)) = f + f(x)dx +[ + (х-т-\ )f'{x)dx, 617
618 D Euler-Maclaurin Summation Formula i rm+2 pm+2/ i \ -(f(m+l) + f(m+2))= f(x)dx+ [x - (m + 1) - - )f'(x)dx, 1 Jm+l Jm+l \ ?J Ufin - 1) + /(«)) = f f(x)dx + [ (x - (л - 1) - ^]f'(x)dx. ? Jn-l Jn-l\ l) The second integrals on the right can be added together if we note that the integrand can be written as (x — [x] — \)f'{x). Addition gives f (*) + \ (/(m> + /С"» = / fWdx + [ (x - W - I) f'(x)dx. (D.1.3) This is a particular case of the Euler-Maclaurin formula. To put it in a slightly different form, set B\(x) = x — i, so that B\{x — [x]) = x — [x] — \. Also let Bx = Вхф) - -1/2. Now write (D.1.3) as J2 /(*) + Bi{f(n) - f{m)} = / /(jc)djc + /" Bi(jc - W)/'(jc)djc. m+l ^m m (D.1.4) Recall that Bi (x) is the first Bernoulli polynomial and B^ the first Bernoulli number. As an application take f(x) = logx and m = 1 in (D. 1.4) to get n+-)logn+n = 1+ / — —-dx. (D.1.5) Since B\(x — [x]) is periodic of period 1 and /•l «+1 Bl(x-[x])dx= B1(x-[x])dx = 0 (D.1.6) Jo Л for any t > 0, we see that the limit as n —> oo in (D.1.5) exists and is equal to This is exactly Stirling's approximation, if we can show that the expression (D. 1.7) equals \ log 2л-. We prove this later by a method different from the one given in Chapter 1, which uses Wallis's formula. Formula (D.1.4) was obtained by one integration by parts. With Stirling's for- formula in mind, it makes sense to repeat the process. This would give an ex- expression with higher derivatives of f(x). If f(x) = log*, then f'(x) — 1/x, f"(x) = —1/x2, and so on. These quantities get small for large x and so an extension of (D. 1.4) would be useful.
D.2 The Euler-Maclaurin Formula 619 D.2 The Euler-Maclaurin Formula Let B2(x) be a primitive (antiderivative) of Bi(x - [x]) = Bi(x). It follows from (D.1.6) that B2 is periodic with period one. In particular, this implies that B2@) = B2{\) = • • • = B2(j) = B2(j + 1) = • • •. Now, integration by parts gives f •I j [J B2(x)f"(x)dx. j We assume that / has continuous derivatives of as high an order as necessary. Sum from j = m to j — n to get Г B1(x)f'(x)dx = B2(O)(/'(n) - f'{m)) - Г B2{x)f"{x)dx. (D.2.1) J m J m Note that it was the periodicity of B2(x) that gave us the simple expression on the right side of Equation (D.2.1). This suggests that we should choose the constant of integration in B2{x) such that /¦l rt+i / B2(x)dx = / B2(x)dx = 0. (D.2.2) Jo Jt Let \B2(x) = B2(x) for 0 < x < 1, so that B2(x) = \B2{x - [x]). From the definition of B2(x), it follows that B2(x)=x2 -x + -, о which is the second Bernoulli polynomial. We state the general Euler-Maclaurin formula in the next theorem: Theorem D.2.1 Suppose f has continuous derivatives up to order s. Then J2 fix) = Г m+\ Jm f f + m+\ J / Jm where Bs{x) are the Bernoulli polynomials and Bi = ^@), the Bernoulli num- numbers. Proof. As in the derivation of (D.2.1), apply integration by parts successively to obtain a sequence of periodic functions Bn(x) such that B'n(x) = Bn-\(x) @ < x < 1, n > 1) and/0 Bn{x)dx = 0, n > 1. With respect to these functions,
620 D Euler-Maclaurin Summation Formula we obtain the formula ?/(*) = Г f(x)dx+J2(-VeBd0){f(t-l)(n) - fl-l\m)} m+\ ^т l=\ + (-l)'-1 f Bs(x)f(s)(x)dx. Jm To show the relation of Bt{x) to Bernoulli polynomials, consider the generating function oo G(x, t) := J2 Bn(x)tn, where B0(x) = 1, (D.2.3) о of the sequence {Bi(x)}. Observe that for 0 < x < 1, „ „ OO OO 00 — = Y, Kw = Yl Ёп-лхУ = t J2 E^x">tn =tG- „=1 n=l n=0 We then have G(x,t) = A(t)ext. (D.2.4) Use /0' Bn(x)dx = 0 in (D.2.3) and (D.2.4) to obtain ё -\ t 1 = A(t) or A(t) = e' - 1 Thus and G(x, t) = = V Bn(x)-, (D.2.5) e' — 1 *-^ n\ Bn(x) = forO < x < 1. By the periodicity of Bn (jc) we arrive at Bn(x) = -Bn(x-[x]) for all x. л! Although this formal argument has not been justified, it can be done easily since the generating function (D.2.5) is analytic in the disk \t\ < 2л-. This proves the theorem. ¦
D.3 Applications 621 D.3 Applications Take f{x) = x~s logx, m = 1, and n = N in (D.1.3) to get A Nl~slogN 1 Nl~s -5 (S -IJ E-1J n Let Re s > 1 and N -> oo to obtain f /0 (s ~ IJ The integral on the right side of the equality converges for Re 5 > — 1 by Abel's test. So -' - Г л By Corollary 1.3.3 we have f'@) = -ilog27r. Thus the constant (D.1.7) in Stirling's formula is \ Iog2^. As another application, we prove the following useful theorem on the order of t,{s) and t,'(s) for large Im s. Theorem D.3.1 Let s = a +it. For 1 — ^r- < a < 2, where A is any positive constant, and \t\ large enough, = o{\Og\t\) and );l{s) = O{\og1\t\), Proof, In (D.1.3), set f(x) = x~s, m = N, and let n -> oo to arrive at ¦A 1 iV1^ 1 , f00 x - [x] - \ f^n- s - 1 2 JN xs+l The integral is 0{\t\/oNa). Note that where the last equality holds for n < \t\. Thus, take ^V = [|г|] to get N П\ Ы = O(log|f|).
622 D Euler-Maclaurin Summation Formula To derive the result for f '(s), take f(x) = x s logx and do a similar calculation. See Rademacher [1973, p. 100], if necessary. ¦ We complete this section with a proof of the following theorem stated in Chapter 1. Theorem D.3.2 Let z e С - (-oo, 0]. Then 2 2m Jo B2m(x - [x]) dx. (x + zJm Proof Start with the following expression of the gamma function: z—1 r^=}™U l=\ z + I - 1 Then logT(z) = lim z + l-\ i=\ (D.3.2) where the principal branch of the log function in С — (—oo, 0] is chosen. In Theorem D.2.1, take to get x + z — 1 f{x) = log = log(* + г - 1) - logx -l i=\ = logz + / [log(x + z-D- j^ 2jBj - 1) [(n + z - lJJ~l n2; + -[log(n +z - 1) - logn - logz] 1 Г 2m Jo B2m(x-[x]) 1 (z+x-lJm x 1 ~2^ dx. Here we have used the fact that Bi = -1/2 and B2j+i = 0 for j > 1. Compute the first of the above integrals and observe that after some cancellation the terms
D.4 The Poisson Summation Formula 623 that involve n are (n + z - 1) log(n + z — 1) — n logn + - log 2 D n +y B2j Г i l 2jBj - Subtract this from (z — l)log(n + l) and let и -> oo to compute the limit in (D.3.2). The result is log ГЙ - (z - I) .o8Z - г + . + ± JJ^- l r°° г l il - — / B2m(x - [x]) — -=- - -5- \dx. (D.3.3) 2ш У! [(x + z-lJm x2mj From (D.1.5) and (D.3.1) we know that lim [log Г(г) - (z - 1/2) log z + z] = - log 2л-. г—>оо 2 So let г -> oo in (D.3.3) to see that This result combined with (D.3.3) gives the formula in Theorem D.3.2. ¦ D.4 The Poisson Summation Formula In this section we state and prove an important and useful formula from the theory of Fourier series. This is the Poisson summation formula. It has numerous applications though we mention only a few. One consequence of Poisson's formula is the Euler- Maclaurin summation formula. Start with a result on Fourier series attributed to Jordan. Recall that a function f(x) is said to be of bounded variation on an interval [a, b] (which may be infinite), if there is a constant С > 0 such that for any set of points xo < x\ < ¦ ¦ ¦ < xn in the interval k=\ If the interval is the whole real line, then we say that / is of bounded total variation. An important though easily proved result is that if / is of bounded variation on [a,b], then / = g — h, where g and h are increasing functions on [a,b].
624 D Euler-Maclaurin Summation Formula Jordan's theorem is the following: Suppose / is integrable on [0, 1] and periodic with period one. If / is of bounded variation on [0, 1], then f(x+)+f(x-) f(t)e~27lintdt )e2lTinx. 2 Poisson's summation formula is contained in the next theorem. Theorem D.4.1 Suppose g is integrable on (—oo, oo) and of bounded total vari- variation. Suppose h is an even, positive, integrable function that is decreasing on [0, oo) and such that \g(x)\ < h(x). Then Г J- (D.4.1) Proof. The inequality N-l У] \g(x+n)\<\g(x)\+ / h(x+t)dt n=-M implies that Yl°?<x 8(x +л) is an absolutely convergent series and defines a periodic function f(x) with period one. Moreover, since g is of bounded total variation, / is of bounded variation on [0, 1]. By Jordan's theorem f(x+) + f(x-) = lim V I' Now -1 oo / f(t)e-27lir"dt= f Jo Jo = Yl I S(m + t)e-2nimdt 00 rm+l = J2 / g{u)e-2*inudu -oo Jm = Г J-c The summation in the second line can be taken outside the integral by the dominated convergence theorem. ¦
D.4 The Poisson Summation Formula 625 Remark D.4.1 If the series on the right-hand side in Theorem D.4.1 is absolutely convergent, then we can write An interesting example of Poisson's formula connects the Poisson kernel for the upper half plane given by g(x) — y/(x2 + у2), у > 0, with the kernel for the unit disk given by A — r2)/(l — 2r cosx + r2). Observe that with g as defined here, -oo X* + у1 The reader may verify this by contour integration or by other means. Substitution of this in Theorem D.4.1 gives У V4 -2ny\n\ 2ninx _ 1-е y2 + (x + лJ ^ 1 - 2е~2жУ cos 2тгх + е~ЛлУ ' n=—oo J v ' n=—oo The partial fraction expansions for the trigonometric functions can also be de- derived from Theorem D.4.1. Take s > 0 and set je~s', t>0, 8it) = [O, t< 0. The left side of Poisson's formula becomes e-s(n+x) + f 1/2, x integer, I 0, otherwise. — x The right-hand side is given by E oo e2ninx s + 2nin n=—oo When x = 0, we get es + \ 1 . ^ ( 2 + 1 es — 1 5 ^ \s + 2nin s — 2nin By analytic continuation, we can take s = 2niy. The result is 1^/1 1 Trcot ny = —(- у 1- У ^ы^у + п y~n
626 D Euler-Maclaurin Summation Formula When x = 1/2, we obtain es - 1 s ^-^ \s +2nin n=l ч This time, s = 2niy gives 7Г CSC TCy = - + У^(- 1)" ( 1 У ^Г[ \У+п y- An important transformation formula for theta functions follows from the Poisson summation formula by taking g(x) = e~S71*2. The formula is oo V^ e-nn2/se2ninx (D 4 2) The Euler-Maclaurin formula follows from the Poisson summation formula by taking \ 0, otherwise. Here a and b are assumed to be nonnegative integers and f(x) is continuously differentiable in [a, b]. This implies that / is of bounded variation on [a, b] and hence g{x) is of bounded total variation. Use this g(x) in (D.4.1). The result is f a /b oo rb f(t)dt +2^ f({)c°s2nntdt. (D.4.3) n=l ^ k=a+l ^^ \n\sN b oo rb Integration by parts gives fb fb , sin2rmt / f(t)cos2nntdt = - fit)— dt. Ja Ja 2Л7Г Also, oo ¦ „ v-^ sin 2mcx 2^Bl(x[x]), n=l except when x is an integer (see Exercise 44 in Chapter 1). Thus, by the dominated convergence theorem, we can write (D.4.2) as Y, /(«)= / f(t)dt-Bl(f(b)-f(a))+ f f'(x)Bl(x-[x])dx. a<n<b Ja Ja
Exercises 627 If we assume that f(m) (x) is continuously differentiable, then the application of dBn{x) — =nBn-i(x), 0<x<\, ax and successive integration by parts gives Theorem D.2.1. For other applications and extensions of the summation formulas studied here, see Berndt [1975]. Exercises 1. Prove that with the notation and conditions of Theorem D.3.1 $'(s) = 0(log2 |f |). 2. Define A/@) = /A) — /@). Show that if f(x) is a polynomial of degree m, then /"' m В /@) = / f(x)dx + Y] -Z д/У-^О). Jo j=1 J 3. Use the Euler-Maclaurin formula to obtain the relation where 0 < в < 1 and у is Euler's constant. 4. Show that there is a constant С such that O( V —!— =loglogx t^« log n loglogx + C + O(}. 2t^,« log n \x\ogxj 5. Use the Poisson summation formula to prove that oo . oo Ee-s(n+xJ _ V^ c-7in2/sc2ninx Js ^ —oo —oo 6. Use Poisson summation to obtain the following theorem of Lipschitz: Suppose Re x > 0, 0 < a < 1, and Re s > 1. Then °° O2nina (n+a)= у ? (For a different proof where or = 1, see Exercise 37 in Chapter 2.) 7. Suppose a, b, and с are integers such that ac+b is even. Use Poisson summation to prove the reciprocity for Gaussian sums H-l I |a|-l eni(an2+bn)/c _ / ? E V a n=0 n=0
E Lagrange Inversion Formula E.I Reversion of Series There are situations in analysis in which one knows a series expansion for y(x) but would like to obtain the series for x in terms of 3;. Newton, for example, encountered such a problem when he had the series for sin~' x by integrating the expansion of A - x2)~1/2 term by term and he wanted the series for sinx. Some results of Newton on reversion of series can be found in Newton [1960, p. 147]. Suppose a series for x in powers of у is required when x = уф (х). Assume that ф is analytic in a neighborhood of x = 0 with ф @) ф 0. Then = ^апх\ ахф0. (E.I.I) «=1 We shall see below that Lagrange's inversion formula gives n=\ where 1 dn~l This means that nbn is the coefficient of x"~l in the expansion of ф"(х) or the coefficient of x~l in the expansion of 1/y". More generally, suppose that (E. 1.1) holds and that f(x) is an analytic function in a neighborhood of x = 0. Then Lagrange's formula is fix) = /@) + > /— n=\ d n~l (E.I.2) 629
630 E Lagrange Inversion Formula Б.2 A Basic Lemma The proof of Lagrange's formula (E. 1.2) depends on a simple lemma given below. The treatment here follows that of Gessel and Stanton [1982], which is based on the work of Jacobi [1830]. See also Bromwich [1926]. An extension of Lagrange inversion to several variables is in Good [1960] and ^-analogs are discussed in Stanton [1988]. A history of this formula can be found in W. Johnson's unpublished manuscript, Notes on the Lagrange inversion formula. Define the residue of a formal Laurent series f{x) = YfjL-m aixJ ^У Res[/(x)] = a-\. The next lemma states that the residue does not change un- under a certain change of variables. Lemma E.2.1 Let G(x) = ?°L_m ajXJ and h(x) = ]P~i Ьх'> where bi Ф °- Then Res[G(h(x))h'(x)] = Res[G(x)]. Proof. Since both sides of the equation are linear in G, it is sufficient to prove it for G(x) = xm. Since Res[g'(x)] = 0 for any Laurent series g(x), it follows that, form ф — 1, Res[hm(x)h'(x)] = —^—Res[{hm+1(x)}'] = 0. m + 1 For m = -\, let h(x) = bxxf(x). Then Res f^l = Res \- + ^1 = 1 + Res[{log f(x)}'] = 1. [h(x)j [x f(x)\ The lemma is proved. ¦ To prove (E.1.2) note that у = axx + агх1 + ¦ ¦ ¦ (x a complex variable) is conformal in a neighborhood of 0, because a\ ф 0, and so x can be expanded in powers of y. Since f{x) is analytic, it can be expanded in powers of y. Suppose that n=\ Then fix) = LetG(x) = T,T=incnX"~]- By Lemma E.2.1, \Г(х)фг(х)] \f(x)] \G(y)dy/dx] \G(x)] Res = Res = Res = Res = rcr. L xr J L У J L У ] L ^r J
E.3 Lambert's Identity 631 But the Taylor series of /'(х)фг(х) is and so Thus which proves Lagrange's formula (E.I.2). A variant of (E.I.2) is the formula x=0 where 3; is defined by (E. 1.1). To prove (E.2.1), note that since x — уф(х) = 0, we have (E.2.2) or dy = 1 - уф'(х) dx ф(х) Take the derivative of (E. 1.2) with respect to x and use (E.2.2) to get Set F(x) = /'(х)ф(х) to obtain (E.2.1). E.3 Lambert's Identity Lagrange's inversion formula can be used to obtain a number of interesting series expansions for specific functions. A few examples are given in the exercises. Here we derive an identity of Lambert. This is given by (ЕЗЛ) where 3; = x{\ +x)~P. Take f(x) = A + x)a апйф(х) = A + xf in (E.I.2). Then v" Г dn~x n=l x=0
632 E Lagrange Inversion Formula The expression in square brackets equals a(nP + a - 1) ¦ ¦ ¦ (и/3 + a - n + 1) = a(n - 1)!, n — 1 and this gives (E.3.1). The identity corresponding to (E.2.1) is An interesting binomial identity comes from (E.3.1) and (E.3.2). Equate the co- coefficient of y" on both sides of ( The result is ^ у {у+рк\{а + Р(пк)\ fa + y+Pn\ A ){ ) = { n У (R33) E.4 Whipple's Transformation Gessel and Stanton showed how to derive Whipple's transformation, which we obtained in different ways in Chapters 3 and 7, from Lemma E.2.1. Theorem E.4.1 Suppose A(x), B{x), C(x), and D(x) are power series whose coefficients ofxk are At, Дь С*, and D^ respectively. Suppose = B(x), - x)-yC(x/(l - x)P+l) = D(x). (E.4.2) Ifn(P + l)= 1 -a-y, then k=0 k=0 Proof. First note that Res , = coefficient of x" in B(x)D(x) = V^ BkDn^k. L x J L x J k=o Now write h(x) = x(l - x)~^1 so that h'(x) = A + /?*)A - x)"^ and B(x)D(x) A +/?лс)A - jc)- 1 A(h(x))C(h(x)) A{h{x))C{h{x))h'{x)
E.4 Whipple's Transformation 633 By Lemma E.2.1, -B(x)D(x)] \A(x)C(x) Res — — Res' Kn+x j у xn+i By the remark at the beginning of the proof, this implies that *=0 k=0 This proves the theorem. ¦ To derive Whipple's transformation for a jF^, recall Whipple's quadratic trans- transformation noted in Chapter 2, namely, 'a/2,(l+a)/2,l+a-b-c 4x (I — X) з/-2 l+a-b,l+a-c ' A-хJ, h r \ ;x). (E.4.3) l +a — b, \ +a — с J This corresponds to (E.4.1) with a = a and ft = 1. The transformation corres- corresponding to (E.4.3) is a result of Bailey [1929]: + l-b-c4x a, I + a/2, b, с ~4 3[a/2,l+a-b,l+a-c'X l+a-b,l+a-c A-хJ (E.4.4) This can be obtained from (E.4.3) by differentiation or directly by equating the coefficients of x". To apply Theorem E.4.1, note that from (E.4.3), Ak = (fl/2)t((l + fl)/2)t(l +a-b- с)к(-4)к/[к\A +а- b)k(l +a- c)k], Bk = (а)к(Ь)к(с)к/[к\(\ + a - b)k(l + a - c)k], and from (E.4.4) after renaming the parameters, Ck = (A + d)/2)k(B + d)/2)k(l +d-e- /M-4)V [k\(l+d-e)k(l+d-f)k], Dk = (d)k(l + d/2)k(e)k(f)k/[k\ (d/2)k(l +d- e)k(\ +d- f)k]. So +d - e - •E: n\(l + d - e)n(l + D - f)n (a/2)k((\ + a)/2)k(l +a-b- c)k(-n)k(e -d- n)k(f - d - n)k k\(\ +a- b)k(\ + a ~ c)*((l - d)/2 - n)k(-n - d/2)k(e + f - n - d\
634 E Lagrange Inversion Formula Theorem E.4.1 is applicable when 2n = — a - d. This simplifies the expression in the sum. For example, — n — d/2 = a/2, and the terms involving these expressions cancel. After simplification the sum becomes / — n, 1 + a — b — c, e + a + n, f + a + n 43V l+a-b, l+a-c, e +f + a+n ' The sum Y^k=o ^k^n-k leads to a iF^ and the equality n n fc=0 k=0 results in Whipple's theorem / a, l + a/2, b, c, e + a + n, f + a+n, -n 1 6\a/2,l+a-b, l+a-c, 1-е- n, I- f-n, 1+a+n' _ (fl+ \)n(a + с + f + n)n /l+a-b-c,e + a + n,f + a+n,-n\ X4 3\ l+a-b, l+a-c, e +f + a + n ' )' Exercises 1. Show that n=0 and that 00 „л-1 n=\ 2. Show formally that n Give sufficient conditions for this to be correct. 3. Show that {хкЪ) k\ k=0 4. Show that if f~l(x) is the inverse of /(x) and /@) = 0, then assuming the necessary analyticity of the functions,
Exercises 635 5. Show that when у = x(l + x)~P. 6. Suppose that xm+1 + ax - b = 0. Show that 2m + 2 fr2m+1 Cwi + 2№m + 3) b3m+l X ~ a am+2 + 2! a2m+3 3! a3m+4 + ' Use this formula to find a solution of x5 + 4x + 2 = 0 to four decimal places of accuracy. When m = 0, this series reduces to the geometric series. Write this sum as a hypergeometric series. 7. Use the Lagrange inversion formula to derive the generating function for Laguerre polynomials.
Series Solutions of Differential Equations F.I Ordinary Points For readers not familiar with series solutions of differential equations, we give a few basic definitions so that the discussions of the hypergeometric equation in Chapter 2 and of the confluent and Bessel's equations in Chapter 4 are intelligible. Consider the differential equation ^ + c(x)yO (F.I.I) dx with a(x), b{x), and c(x) analytic in the neighborhood of x = x0. We take x to be a complex variable in this discussion. The simplest situation occurs when a (x0) ф 0. In this case, x0 is called an ordinary point of the Equation (F.I.I). It is not difficult to show that if xo is an ordinary point of (F.I.I), then (F.I.I) has a unique solution /(x) analytic in a neighborhood of x0 with prescribed values /(x0) = /o and /'(xo) = f\. This implies that there are exactly two linearly inde- independent solutions in a neighborhood of x0. To prove this result, it is convenient to divide (F.I.I) by a(x) and rewrite the equation as y" = B{x)y' + C(x)y. (F.I.2) Again for convenience, take x0 = 0. Then oo and С(х) = ^с„х". л=0 п=0 Suppose that (F.1.2) has an analytic solution /(x) with Then л=0 637
638 F Series Solutions of Differential Equations Substitute this series and the series for B(x) and C(x) in (F.I.2); equate the coeffi- coefficients of x". The result is (и + 2) (и + 1)/я+2 = 5> + 1 - *)/»+1-*** + ? ^«-*С*' (RL4) <fc=0 *=0 и = 0, 1, 2, This shows that /0 and /i uniquely determine /„ for n > 2. It is now enough to prove that the series (F.1.3) has a positive radius of convergence. Since the series for B(x) and C(x) have a positive radius of convergence, there exist constants M and R such that |&Л|<М/Л" and \cn\<M/Rn. We show by induction that there exist suitable positive numbers M\ and r such that \fn\<Ml/rn. (F.1.5) This implies the needed result. Take M\ such that \f\\ < Mi and choose r such that r < /?, |/i| < Mi/r, and M(r/2 + r2) < 1. Suppose n > 2 and assume (F.1.5) true up to n — 1. By (F.1.4), the inequality (F.1.5) follows after a small calculation. F.2 Singular Points Whena(xo) = 0 and b(x0) and/or c(x0) is not zero, then x = x0 is called a singular point of (F.I.1). Divide (F.I.I) by a(x) and write it as y" + d(x)y' + e(x)y = 0. (F.2.1) Consider first the simplest case where xo is a singular point and a(x) has a simple zero at xo. Then d(x) and e(x) have at most simple poles. Take xo = 0. Then oo oo d(x) = J2 dnx" and e(x) = ^ enx". (F.2.2) л=-1 л=-1 Substitute the series (F. 1.3) in (F.2.1) and equate coefficients as before. The reader may check that in this case we get only one solution, since the value of /0 determines /„ for n > 1. Moreover, if d-\ is a nonnegative integer, then this method may fail to produce a solution. Now consider the case where d{x) has at most a simple pole and e(x) at most a double pole. The simplest special case of (F.I. 1) that leads to this is when a(x) = a2(x - x0J, b{x) = bi(x - x0), c(x) = c0. (F.2.3)
F.3 Regular Singular Points 639 Two linearly independent solutions exist, at least one of the form у = (x — Хо)д. To determine д, substitute this expression for у in (F.I.I) to get а21лAл-1) + Ь11л + с0 = 0. (F.2.4) If this quadratic has two unequal roots, these roots give two independent solutions of (F.I.I). If the two roots equal Д] (say), then we have the solution (x — хо)Д|. To find the other independent solution, set у = (x — xq)^w. The differential equation for w has log(x — x0) as a solution. Thus the second independent solution is (x - х0У log(x - x0). F.3 Regular Singular Points A point x = xo is called a regular singular point of (F.I. 1) if (x - xo)b(x) (x - xoJc(x) lim and hm x^x0 a(x) x-*xo a(x) both exist. If one of these limits does not exist, the singular point is irregular. To see the difference between a regular and an irregular singular point, the easiest case to consider is the first-order analog of (F. 1.1). A regular singular point occurs when b(x)y' + c(x)y = 0 (F.3.1) with b(x) and c(x) analytic in a neighborhood of x = x0, and lim (x - xo)c(x)/b(x) x^x0 exists. When b(x) = (x — x0J and c(x) = c0, then y(x) = Ae-^*-^ is the solution, and it has an essential singularity at x = xo. The simple case considered in the previous section, where a (x), b (x), с (х) are as in (F.2.3), shows that, at a regular singular point, solutions may involve noninteger powers of x —x0. Moreover, the second solution may have a logarithmic singularity. Now suppose that Xo is a regular singular point of (F.I. 1) such that a(x) has a double zero at x0. Then п{х-х^)п, and с(х) л=2 л=1 л=0
640 F Series Solutions of Differential Equations Suppose у = (x — хо)д/(х) is a solution of (F.I.1). The equation for /(x) can be seen to be a(x)f" + V n=l oo " - Xo)" + H(JI ~ 1) n=0 Now if [L is chosen so that ц(ц - \)a2 + fj-bi + c0 = 0, (F.3.3) then (F.3.1) is an equation of the same type as the one considered at the beginning of Section F.2. In that case a solution of the form /(x) = YlT=o fnix ~ xo)" is always possible provided 2д + Ь\/а2 is not a nonnegative integer. Note that the quadratic (F.3.2) has two solutions Д] and д2- Thus it may be possible to obtain two solutions of the form (x — хо)д/(х). It follows from (F.3.2) that Д! -Д2 = 2/Л -(Д! +Д2) = 2m-(l-bi/a2) The remarks in the previous paragraph now imply that if the difference of the two roots of (F.3.2) is not an integer, then (F. 1.1) has two independent solutions of the form Y^=o fn(x — -Хо)"+Д- However, if /л\ — \i2 > 0 is an integer, then a solution of this form with \i = \i\ exists. The other solution may involve a logarithmic singularity. Equation (F.3.2) is called the indicial equation. In practice it is obtained by substituting the series Е/„(х — хо)"+д in (F.I. 1) and equating the coefficients of the lowest power of x - x0. For the hypergeometric equation B.3.5), there are three regular singular points 0, 1, and infinity. For the confluent equation D.1.3) and the Bessel equation D.5.1), 0 is a regular singular point, and infinity is an irregular singular point.
Bibliography Abel, N. H. [1826]. Auflosung einer mechanischen Aufgabe, J. Reine Ang. Math., 1, 153-157. English translation in Source Book in Mathematics, D. E. Smith, ed., McGraw-Hill, New York, 656-662. Adams, J. С [1877]. On the expression of the product of any two Legendre coefficients by means of a series of Legendre's coefficients, Proc. R. Soc. London, 27, 63-71. Agarwal, R. P. [1953]. On the partial sums of series of hypergeometric type, Proc. Cambridge Phil. Soc, 49, 441^45. Ahern, P. and Rudin, W. [1996]. Geometric properties of the gamma function, Am. Math. Monthly, 103,678-681. Al-Salam, W. A. and Ismail, M. E. H. [1988]. g-Beta integrals and the g-Hermite polynomials, Рас. J. Math., 209-221. Amend, B. [1996]. Fox Trot, Comic strip, June 2. Anastassiadis, J. [1964]. Definition des Functions Euliriennes par des Equations Functionnelles, Gauthier-Villars, Paris. Anderson, G. W. [1990]. The evaluation of Selberg sums, Сотр. Rend. Acad. Sci., Paris, 311, Serie 1, 469^72. Anderson, G. W. [1991]. A short proof of Selberg's generalized beta formula, Forum Math., 3,415-417. Andrews, G. E. [1976]. The Theory of Partitions, Addison-Wesley, Reading, MA. Andrews, G. E. [1986]. q-Series: Their Development and Application in Analysis, Number Theory, Combinatorics, Physics, and Computer Algebra, Amer. Math. Soc, Providence, RI. Andrews, G. E. [1994]. The death of proof? Semi-rigorous mathematics? You've got to be kidding!, Math. Intelligencer, 16 D), 16-18. Andrews, G. E. and Askey, R. [1977]. Enumeration of partitions: The role of Eulerian series and ^-orthogonal polynomials, in Higher Combinatorics, M. Aigner, ed., Reidel, Dordrecht, 3-26. Andrews, G. E. and Burge, W. H. [1993]. Determinant identities, Рас. J. Math., 158, 1-14. Andrews, G. E., Dyson, F. J., and Hickerson, D. [1988]. Partitions and indefinite quadratic forms, Invent. Math., 91, 391^07. Anno, M. and Mori, T. [1986]. Socrates and the Three Little Pigs, Philomel Books, New York. Aomoto, K. [1987]. Jacobi polynomials associated with Selberg integrals, SIAMJ. Math. Anal, 18,545-549. Artin, E. [1964]. The Gamma Function, Holt, Rinehart and Winston, New York. 641
642 Bibliography Askey, R. [1970]. Orthogonal polynomials and positivity, Studies in Applied Mathematics 6, in Wave Propagation and Special Functions, D. Ludwig and F. W. J. Olver, eds., SIAM, Philadelphia, 68-85. Askey, R. [1975]. Orthogonal Polynomials and Special Functions, SIAM, Philadelphia. Askey, R. [1987]. Ramanujan's i i^i and formal Laurent series, Indian J. Math., 19, 101-105. Askey, R. [1988]. Beta integrals and ^-extensions, Papers of the Ramanujan Centennial International Conference, Ramanujan Math. Soc, 1987, 85-102. Askey, R. [1994]. A look at the Bateman project, Contemporary Math., 169, 29-43. Askey, R. and Gasper, G. [1971]. Jacobi polynomial expansions of Jacobi polynomials with non-negative coefficients, Proc. Cambridge Phil. Soc, 70, 243-255. Askey, R. and Gasper, G. [1977]. Convolution structures for Laguerre polynomials, J. Analyse Math., 31, 48-68. Askey, R., Gasper, G., and Ismail, M. [1975]. A positive sum from summability theory, J. Approx. Theory, 13, 413^120. Askey, R., Koornwinder, Т., and Rahman, M. [1986]. An integral of products of ultraspherical functions and a g-extension, J. London Math. Soc, 33 B), 133-148. Askey, R. and Steinig, J. [1974]. Some positive trigonometric sums, Trans. Am. Math. Soc, 187, 295-307. Askey, R. and Wilson, J. A. [1984]. A recurrence relation generalizing those of Apery, J. Austral. Math. Soc. (Series A) 36, 267-278. Askey, R. and Wilson, J. A. [1985]. Some Basic Hypergeometric Orthogonal Polynomials that Generalize Jacobi Polynomials, Am. Math. Soc., Providence, RI. Auslander, L. and Tolimieri, R. [1979]. Is computing with finite Fourier transform pure or applied mathematics?, Bull. Am. Math. Soc. (New Series), 11, 847-897. Azor, R., Gillis, J., and Victor, J. D. [1982]. Combinatorial applications of Hermite polynomials, SIAM J. Math. Anal, 13, 879-890. Baernstein II, A., Drasin, D., Duren, P., and Marden, A. [1986]. The Bieberbach Conjecture, Amer. Math. Soc. Surveys and Monographs No. 21, Providence, RI. Bailey, W. N. [1929]. Some identities involving generalized hypergeometric series, Proc. London Math. Soc, 26 B), 503-516. Bailey, W. N. [1931]. The partial sum of the coefficients of the hypergeometric series, J. London Math. Soc, 6, 40-41. Bailey, W. N. [1932]. On one of Ramanujan's theorems, J. London Math. Soc, 7, 34-36. Bailey, W. N. [1933]. On the product of two Legendre polynomials, Proc. Cambridge Phil. Soc, 29, 173-177. Bailey, W. N. [1935]. Generalized Hypergeometric Series, Cambridge University Press, Reprinted by Hafner Pub. Co., New York, 1972. Bailey, W. N. [1938]. The generating function of Jacobi polynomials, J. London Math. Soc, 13, 8-11. Bailey, W. N. [1941]. A note on certain ^-identities, Quart. J. Math. (Oxford), 18, 157-166. Bailey, W. N. [1949]. Identities of the Rogers-Ramanujan type, Proc. London Math. Soc, 50B), 1-10. Bailey, W. N. [1954]. Contiguous hypergeometric functions of the type 3^A), Proc. Glasgow Math. Assoc, 2, 62-65. Bak, J. and Newman, D. J. [1982]. Complex Analysis, Springer-Verlag, New York. Barnes, E. W. [1908]. A new development of the theory of the hypergeometric functions, Proc. London Math. Soc, 6 B), 141-177. Barnes, E. W. [1910]. A transformation of generalized hypergeometric series, Quart. J. Math., 41, 136-140. Bateman, H. [ 1909]. The solution of linear differential equations by means of definite integrals, Trans. Cambridge Phil. Soc, 21, 171-196.
Bibliography 643 Bateman, H. [1932]. Partial Differential Equations of Mathematical Physics, Cambridge University Press, Cambridge, UK. Bellman, R. [1961]. A Brief Introduction to Theta Functions, Holt, Rinehart, and Winston, Inc., New York. Berggren, L., Borwein, J., and Borwein, P. [1997]. Pi: A Source Book, Springer-Verlag, New York. Berndt, B. [1975]. Periodic Bernoulli numbers, summation formulas and applications, in Theory and Application of Special Functions, R. Askey, ed., Academic Press, New York. Berndt, B. [1985]. Ramanujan'sNotebooks, Parti, Springer-Verlag, New York. Berndt, B. [1989]. Ramanujan's Notebooks, Part II, Springer-Verlag, New York. Beukers, F. [1979]. A note on the irrationality of f B) and f C), Bull. London Math. Soc, 11, 2268-2272. Boas, R. P. [1954]. Entire Functions, Academic Press, New York. Bochner, S. [1952]. Remarks on Gaussian sums and Tauberian theorems, J. Indian Math. Soc, 15, 99-106. Bochner, S. [1955]. Harmonic Analysis and the Theory of Probability, University of California Press, Berkeley. Bochner, S. [1979]. Review of Gesammelte Schriften by Gustav Herglotz, Bull. Am. Math. Soc. (New Series), 1, 1020-1022. Bohr, H. and Mollerup, J. [1922]. Laerebog i Matematisk Analyse, Vol. Ill, J. Gjellerup, Kopenhagen. Borwein, J. and Borwein, P. [1987]. Pi and the AGM, Wiley, New York. Boyarsky, M. [1980]. p-adic gamma functions and Dwork cohomology, Trans. Am. Math. Soc, 257, 359-369. Brafman, F. [1951]. Generating functions of Jacobi and related polynomials, Proc Am. Math. Soc, 2, 924-949. Brent, R. P. [1976]. Fast multiple-precision evaluation of elementary functions, J. Assoc. Сотр. Mech., 23, 242-251. Bressoud, D. M. [1987]. Almost poised basic hypergeometric series, Proc. Indian Acad. Sci. (Math./Sci.), 97, 61-66. Bromwich, T. J. ГА. [1926]. An Introduction to the Theory of Infinite Series, 2nd ed., Macmillan, London. Brown, G. and Hewitt, E. [1984]. A class of positive trigonometric sums, Math. Ann., 268, 91-122. Brown, M. and Ismail, M. E. H. [1995]. A right inverse of the Askey-Wilson operator, Proc Amer. Math. Soc, 123, ^071-2079. Brualdi, R. A. and Ryser, H. J. [1991]. Combinatorial Matrix Theory, Cambridge University Press, New York. Bustoz, J. [1974]. Note on "Positive Cesaro means of numerical series," Proc. Amer. Math. Soc. ,45, 69. Caratheodory, C. [1954]. Funktionentheorie, English translation, Theory of Functions, Vols. I and II, [1958, 1960], Chelsea, New York. Cartier, P. and Foata, D. [1969]. Problimes Combinatoires de Commutation et Rearrangements, Lecture Notes in Math., No. 85, Springer-Verlag, Berlin. Cauchy, A.-L. [1843]. Memoire surles fonctions dont plusieurs valeurs..., C.R. Acad. Sci. Paris, 17, 523, reprinted in Oeuvres de Cauchy, [1893], Ser. 1, 8, 42-50. Cauchy, A.-L. [1843a]. Deuxieme Memoir sur les fonctions dont plusieurs valeurs..., C.R. Acad. Sci. Paris, reprinted in Oeuvres de Cauchy, [1893], 8, 50-55. Chebyshev, P. L. [1870]. Sur les fonctions analogues a celles de Legendre, in Oeuvres de P. L Tchebychef, Vol. 2, A. Markoff and N. Sonin, eds., St. Petersburg [1907], 61-68, reprinted Chelsea, New York [1961].
644 Bibliography Chihara, J. S. [1978]. An Introduction to Orthogonal Polynomials, Gordon & Breach, New York. Clausen, T. [1828]. Ueber die Falle, wenn die Reihe von der Form у = 1 + f • &x + 2^±i ¦^ц\х2+ etc. ein quadrat von der Form z=\ + j'^p'7x + " °2+' ' ^v+T ¦jr^x2+ etc. hat, J. Reine Ang. Math., 3, 89-91. Cooley, J. W. and Tukey, J. W. [1965]. An algorithm for the machine calculation of complex Fourier series, Math. Сотр., 19, 297-301. Copson, E. T. [1935]. Theory of Functions of a Complex Variable, Oxford Univ. Press, London. Cox, D. [1984]. The arithmetic-geometric mean of Gauss, Enseign. Math., 30, 270-330. Davenport, H. and Hasse, H. [1934]. Die Nullstellen der Kongruenzzetafunktion in gewissen zyklischen Fallen, J. Reine Ang. Math., 172, 151-182. de Boor, С [1980]. FFT as nested multiplication, with a twist, SIAM J. Sci. Stat. Сотр., 1, 173-178. de Branges, L. [1972]. Gauss spaces of entire functions, J. Math. Anal. Appl., 37, 1—41. de Branges, L. [1972a]. Tensor product spaces, J. Math. Anal. App., 38, 109-148. de Branges, L. [1985]. A proof of the Bieberbach conjecture, Ada Math., 154, 137-152. de Bruijn, N. G. [1967]. Uncertainty principles in Fourier analysis, in Inequalities, O. Shisha, ed., Academic Press, New York, de Bruijn, N. G., Saff, E. В., and Varga, R. S. [1981]. On the zeros of generalized Bessel polynomials, Proc. K. Ned. Akad. Wet., Series A, 84 A), 1-25. de Moivre, A. [1730]. Miscellanea Analytica de Seriebus et Quadraturis, Tonson and Watts, London. Dedekind, R. [1853]. Uber ein Eulersches Integral, J. Reine Ang. Math. DeSainte-Catherine, M. and Viennot, G. [1983]. Combinatorial interpretation of integrals of products of Hermite, Laguerre and Tchebycheff polynomials, Polynomes Orthogonaux et Applications, С Brezinski et al., eds., Lecture Notes in Math. 1171, Springer-Verlag, 120-128. Din, A. M. [1981]. Lett. Math. Phys., 5, 207. Dixon, A. C. [1903]. Summation of a certain series, Proc. London Math. Soc, 35 A), 285-289. Dougall, J. [1907]. On Vandermonde's theorem and some more general expansions, Proc. Edinburgh Math. Soc, 25, 114-132. Dougall, J. [1919]. A theorem of Sonine in Bessel functions, with two extensions to spherical harmonics, Proc. Edinburgh Math. Soc, 37, 33^7. Edwards, С. Н. [1979]. The Historical Development of Calculus, Springer-Verlag, New York. Edwards, A. W F. [1987]. Pascal's Arithmetical Triangle, Charles Griffin and Co., London. Eisenstein, G. [1847]. Genaue Untersuchung der unendlichen Doppelproducte, J. Reine Ang. Math., 35, 153-274. Elliot, E. B. [1904]. A formula including Legendre's EK' + KE' - KK' = \it, Messenger of Math., 33, 31^Ю. Erdelyi, A. [1937]. Der Zusammenhang zwischen verschiedenen Integraldarstellungen hypergeometrischer Funktionen, Quart. J. Math. (Oxford), 8, 200-213. Erdelyi, A. [1938]. Note on the transformation of Eulerian hypergeometric functions, Quart. J. Math. (Oxford), 9, 129-134. Erdelyi, A. [1939]. Transformation of hypergeometric integrals by means of fractional integration by parts, Quart. J. Math. (Oxford), 9, 176-189. Erdelyi, A. [1953]. Higher Transcendental Functions, Vols. I, II, III, A. Erdelyi, ed., McGraw-Hill. Reprinted by Krieger Publishing Co., Malabar, FL [1981]. Erdelyi, A. [1956]. Asymptotic Expansions, Dover, New York.
Bibliography 645 Erdelyi, A. [1960]. Asymptotic solutions of differential equations with transition points or singularities, J. Mathematical Phys., 1, 16-26. Euler, L. [1730]. De progressionibus transcendentibus sen quaroum termini generales algebrare dari nequeunt, Comm. Acad. Sci. Petropolitanae, 5, 36-57. Euler, L. [1739]. De productis ex infinitis factoribus ortis, Comm. Acad. Sci. Petropolitanae, 11, 3-31. Reprinted in Opera Omnia, 14 [1924], 260-290. Euler, L. [1748]. Introductio in Analysin Infinitorum, Marcum-Michaelem Bousquet, Lausannae. English translation published by Springer-Verlag, [1988]. Euler, L. [1769]. Institutiones Calculi Integralis, II, Opera Omnia, Ser. 1, Vols. 11-13. Euler, L. [1794]. Specimen transformationis singularis serierum, Nova Ada Acad. Sci. Petropolitanae, 12, 58-70. Reprinted in Opera Omnia, Ser. 1, Vol. 16, Part 2, 41-55. Evans, R. [1981]. Identities for Gauss sums over finite fields, Enseign. Math., 27 B), 197-209. Evans, R. [1991]. The evaluation of Selberg character sums, Enseign. Math., 37 B), 235-248. Fejer, L. [1925]. Abschatzungen fur die Legendreschen und verwandte Polynome, Math. Zeit., 24, 267-284. Feldheim, E. [1941]. Contribution a la theorie des polynomes de Jacobi (Hungarian, French summary), Mat. Fiz. Lapik, 48, 453-504. Feldheim, E. [1941a]. Sur les polynomes generalises de Legendre, Izv. Akad. Nauk. SSSR Ser. Math., 5,241-248. Feldheim, E. [1943]. Contributi alia teoria della funzioni ipergeometriche di piu variabili, Annali della Scuola Norm. Super, di Pisa, Ser. II, 12, 17-60. Feldheim, E. [1963]. Appendix by G. Szego. On the positivity of certain sums of ultraspherical polynomials, J. Anal. Math., 11, 275-284. Ferrers, N. M. [1877]. An Elementary Treatise on Spherical Harmonics and Subjects Connected with Them, Macmillan, London. Fields, J. L. [1964]. A note on the asymptotic expansion of a ratio of gamma functions, Proc. Edinburgh Math. Soc, 15, 43^45. Fine, N. J. [1988]. Basic Hypergeometric Series and Applications, Amer. Math. Soc, Providence, RI. Foata, D. [1965]. Etude algebrique de certains problemes d'analyse combinatoire et du calcul des probabilites, Publ. Inst. Stat. Univ. Paris, 14, 81-241. Forrester, P. J. and Rogers, J. B. [ 1986]. Electrostatics and the zeros of the classical polynomials, SIAM J. Math. Anal, 17, 461^68. Frobenius, F. G. [1871]. Uber die Entwicklung analytischer Functionen in Reihen, die nach gegebenen Functionen fortschreiten, J. Reine Ang. Math., 73, 1-30. Reprinted in Gesammelte Abhandlungen, Band I, Springer-Verlag, [1968], 35-64. Funk, P. [1916]. Beitrage zur Theorie der Kugelfunktionen, Math. Ann., 77, 136-152. Fuss, N. [1843]. Correspondance Mathematique et Physique de Quelques Cilebres Geometres du XVIIF™ siecle, 1, Saint-Petersbourg. Gasper, G. [1975]. Positive integrals of Bessel functions, SIAMJ. Math. Anal, 6, 868-881. Gasper, G. [1977]. Positive sums of the classical orthogonal polynomials, SIAMJ. Math. Anal, (8), 423^47. Gasper, G. [1985]. Rogers' linearization formula for the continuous g-ultraspherical polynomials and quadratic transformation formulas, SIAMJ. Math. Anal, 16, 1061-1071. Gasper, G. and Rahman, M. [1990]. Basic Hypergeometric Series, Cambridge University Press, Cambridge, UK. Gauss, C. F. [1808]. Summatio quarumdam serierum singularium, Commen. Soc. Reg. Sci. Getting. Rec, vol. I. German translation in Arithmetische Untersuchungen, Chelsea, New York [1981].
646 Bibliography Gauss, С. F. [1812]. Disquisitiones generates circa seriem infinitam, Comm. Soc. Reg Gott. II, Werke, 3, 123-162. Gauss, С F. [1866]. Zur Theorie der neuen Transscendenten, II, Werke, 3, 436-445. Gauss, C. F. [1866a]. Hundert Theoreme tiber die neuen Transscendenten, Werke, 3, 461^69. Gauss, С F. [1866b]. Arithmetisch Geometrisches Mittel, Werke, vol. 3, 361^03. Gautschi, W. [1967]. Computational aspects of three-term recurrence relations, SIAM Review, 9, 24-82. Gegenbauer, L. [1875]. Ueber einige Bestimmte Integrale, Sitz- Math. Natur. Klasse Akad. Wiss. Wien, 70, 433-443. Gessel, I. and Stanton, D. [1982]. Strange evaluations of hypergeometric series, SIAM J. Math. Anal, 11, 295-308. Godsil, С D. [1981]. Hermite polynomials and a duality relation for the matching polynomial, Combinatorica, 1, 257-262. Godsil, С D. [1993]. Algebraic Combinatorics, Chapman and Hall, New York. Good, I. J. [1960]. Generalization to severable variables of Lagrange's expansion, with applications to stochastic processes, Proc. Cambridge Phil. Soc, 56, 367-380. Good, I. J. [1962]. A short proof of MacMahon's "master theorem," Proc. Cambridge Phil. Soc, 58, 160. Good, I. J. [1970]. Short proof of a conjecture of Dyson, J. Math. Phys., 11, 1884. Gosper, R. W., Jr. [1978]. Decision procedure for indefinite hypergeometric summation, Proc. Natl. Acad. Sci. USA, 75, 40-42. Gould, H. W. and Hsu, L. С [1973]. Some new inverse series relations, Duke Math. J., 40, 885-891. Gray, J. [1986]. Linear Differential Equations and Group Theory from Riemann to Poincare, Birkhauser, Boston. Gronwall, T. H. [1912]. Ueber die Gibbssche Erscheinung und die trigonometrischen Summen sinx + 1 /2 sin 2x + 1 /3 sin 3x-\ h 1 /n sinnx, Math. Ann., 72, 228-243. Gross, B. and Koblitz, N. [1979]. Gauss sums and the />-adic gamma function, Ann. Mart., 109, 569-581. Halmos, P. [1950]. Measure Theory, Van Nostrand, Princeton. Hamburger, H. [1922]. Ueber einige Beziehungen, die mit der Funktionalgleichung der Riemannschen f -Funktion aequivalent sind, Math. Ann., 85, 129-140. Hankel, H. [1875]. Bestimmte Integrale mit Cylinderfunctionen, Math. Ann., 8, 453-470. Hardy, G. H. [1922]. A new proof of the functional equation for the zeta function, Mat. Tidsskrift,B,l\-73. Hardy, G. H. [1933]. A theorem concerning Fourier transforms, J. London Math. Soc, 8, 227-231. Hardy, G. H. [1940]. Ramanujan, Cambridge University Press, Cambridge, UK. Hardy, G. H. [1949]. Divergent Series, Cambridge University Press, Cambridge, UK. Hecke, E. [1918]. Ueber orthogonal-invariante Integralgleichungen, Math. Ann., 78, 398^04. Heckman, G. and Schlicktkrull, H. [1994]. Harmonic Analysis and Special Functions on Symmetric Spaces, Academic Press, San Diego. Heine, E. [1847]. Untersuchungen iiber die Reihe..., J. Reine Ang. Math., 34, 285-328. Helversen-Pasotto, A. [1978]. L'identite de Barnes pour les corps finis, C. R. Acad. Sci. Paris, Ser. A-B, 286, A297-A300. Helversen-Pasotto, A. and Sole, P. [1993]. Barnes' first lemma and its finite analogue, Canadian Math. Bull, 36, 273-282. Hermite, С [1890]. Sur les Polynomes de Legendre, Rend. Circ. Mat. Palermo, IV, 146-152. Reprinted in Oeuvres, 4, 314-320.
Bibliography 647 Hewitt, E. [1954]. Remark on orthonormal sets in Ьг(а, b), Amer. Math. Monthly, 61, 249-250. Hill, M. J. M. [1908]. On a formula for the sum of a finite number of terms of the hypergeometric series when the fourth element is equal to unity, Proc. London Math. Soc, B), 6, 339-348. Hille, E. [1962]. Analytic Function Theory, Vol. II, Ginn and Co., Republished, Chelsea, New York [1977]. Holder, O. [1889]. Uber einen Mittelwertsatz, Goettinger Nach., 38^7. Hsu, H.-Y. [1938]. Certain integrals and infinite series involving ultraspherical polynomials and Bessel functions, Duke Math. J., 4, 374-383. Hylleraas, E. [1962]. Linearization of products of Jacobi polynomials, Math. Scand., 10, 189-200. Ingham, A. E. [1932]. The Distribution of Prime Numbers, Cambridge University Press, London. Ireland, K. and Rosen, M. [1991]. A Classical Introduction to Modern Number Theory, 2nd Ed., Springer-Verlag, New York. Ismail, M. E. H. [1977]. A simple proof of Ramanujan's 1i/f1 sum, Proc. Amer. Math. Soc, 63, 185-186. Ismail, M. E. H. and Stanton, D. [1988], On the Askey-Wilson and Rogers polynomials, Canadian J. Math., 40, 1025-1045. Ismail, M. E. H. and Tamhankar, M. V. [1979]. A combinatorial approach to some positivity problems, SIAM J. Math. Anal, 10, 478^85. Jackson, D. [1911]. Ueber eine trigonometrische Summe, Rend. Circ. Mat. Palermo, 32, 257-262. Jackson, F. H. [1910]. On ^-definite integrals, Quart. J. Pure Appl. Math., 41, 193-203. Jackson, F. H. [1921]. Summation of a g-hypergeometric series, Messenger of Math., 50, 101-112. Jacobi, С G. [1826]. Uber Gauss' neue Methode, die Werthe der Integrate naherungsweise zu finden, J. Reine Ang. Math., 1, 301-308. Jacobi, С G. [1829]. Fundamenta Nova, Regiomontis, fratrum Borntraeger. Reprinted in Werke, Vol. 1, Chelsea, New York [1969], 49-239. Jacobi, С G. [1830]. De resolutione aequationum per series infinitam, J. Reine Ang. Math., 6, 257-286. Jacobi, С G. [1834]. Demonstratio Formulae..., /. Reine Ang. Math., 11, 307. Jacobi, C. G. [1859]. Untersuchung iiber die Differentialgleichung der hypergeometrischen Reihe, / Reine Ang. Math., 56, 149-165. Jensen, J. L. W. V. [1915-1916]. An elementary exposition of the theory of the gamma function, Ann. Math., 17, 124—166. John, F. [1938]. Special solutions of certain difference equations, Ada Math., 71, 175-189. Karlin, S. and McGregor, J. L. [1957]. The differential equations of birth-and-death processes, Trans. Amer. Math. Soc, 85, 489-546. Kirillov, A. N. [1994]. Dilogarithm Identities, Preprint Series, Dept. of Math. Sci., University of Tokyo. Klein, F. [1894]. Vorlesungen iiber die Hypergeometrische Funktion, Gottingen. Knopp, M. [1971]. Modular Forms and Analytic Number Theory, Markham, New York. Koblitz, N. [1977]. P-Adic Numbers, p-adic Analysis, andZeta-Functions, Springer-Verlag, New York. Koblitz, N. [1984]. Introduction to Elliptic Curves and Modular Forms, Springer-Verlag, New York.
648 Bibliography Koekoek, R. and Swarttouw, R. F. [1994]. The Askey-scheme of hypergeometric orthogonal polynomials and its ^-analogue, Reports of the Faculty of Technical Mathematics and Informatics, no. 98-117, Delft. Kogbetliantz, E. [1924]. Recherches sur la sommabilite des series ultraspheriques par la methode des moyennes arithmetiques, J. Math. Pures Appl, (9), 3, 107-187. Kolberg, O. [1957]. Some identities involving the partition functions, Math. Scand., 5, 77-92. Koornwinder, Т. Н. [1972]. The addition formula for Jacobi polynomials, I, Summary of results, Indag. Math., 34, 188-191. Koornwinder, Т. Н. [1974]. Jacobi polynomials, II. An analytic proof of the product formula, SIAMJ. Math. Anal, 5, 125-137. Koornwinder, Т. Н. [1975]. Jacobi polynomials, III. An analytic proof of the addition formula, SIAMJ. Math. Anal, 6, 533-543. Koornwinder, Т. Н. [1978]. Positivity proofs for linearization and connection coefficients of orthogonal polynomials satisfying an addition formula, / London Math. Soc. B), 18, 101-114. Koornwinder, T. H. [1990]. Jacobi functions as limit cases of g-ultraspherical polynomials, / Math. Anal. Appl, 148, 44-54. Kummer, E. [1836]. Ueber die hypergeometrische Reihe, J. Reine Ang. Math., 15, 39-83, 127-172. Kummer, E. [1847]. Beitrag zur Theorie der Function Г(х), J. Reine Ang. Math., 35, 1-4. Lang, S. [1980]. Cyclotomic Fields II, Springer-Verlag, New York. Lanzewizky, I. L. [1941]. Ueber die Orthogonalitat der Fejer-Szegoschen Polynome, С R. (Dokl)Acad. Sci. USSR, 31, 199-200. Lewin, L. [1981]. Poly logarithms and Associated Functions, North-Holland, New York. Lewin, L. [1981a]. Structural Properties of Polylogarithms, Amer. Math. Soc, Providence, RI. Liouville, J. [1839]. Sur quelques integrates definies, / Math. Pures Appl, Ser. 1, 4, 229-235. Liouville, J. [1855]. Sur un theoreme relatif а Г integrate eulerienne de seconde espece, / Math. Pure Appl, Ser. 1, 20, 157-160. Littlewood, J. E. [1986]. Littlewood's Miscellany (B. Bollobas, ed.), Cambridge University Press, Cambridge, UK. Lorch, L., Muldoon, M. E., and Szego, P. [1970]. Higher monotonicity properties of certain Sturm-Liouville functions, III, Canadian J. Math., 22, 1238-1265. Lorch, L., Muldoon, M. E., and Szego, P. [1972]. Higher monotonicity properties of certain Sturm-Liouville functions, IV, Canadian J. Math., 2Д, 349-368. Lorch, L. and Szego, P. [1963]. Higher monotonicity properties of certain Sturm-Liouville functions, Ada Math., 109, 55-73. Lorentz, G. G. and Zeller, K. [1964]. Abschnittlimitierbarkeit und der Satz von Hardy-Bohr, Arch. Math. (Basel), 15, 208-213. Lorentzen, L. and Waadeland, H. [1992]. Continued Fractions with Applications, North-Holland, Amsterdam. Macdonald, I. G. [1995]. Symmetric Functions and Hall Polynomials, 2nd ed., Oxford University Press, Oxford. MacMahon, PA. [1917-1918]. Combinatory Analysis, Cambridge University Press, Cambridge, UK. Reprinted by Chelsea, New York [1984]. Magnus, A. [1988]. Associated Askey-Wilson polynomials as Laguerre-Hahn orthogonal polynomials, in Orthogonal Polynomials and Their Applications (Segovia, 1986), M. Alfaro et al, eds., Lecture Notes in Math. no. 1329, Springer-Verlag, New York, 261-378. Makai, E. [1974]. An integral inequality satisfied by Bessel functions, Ada Math. Acad. Sci. Hungaricae, 25, 387-390.
Bibliography 649 Mehta, M. L. [1991]. Random Matrices, 2nd Ed., Academic Press, Boston. Miller, W., Jr. [1972]. Symmetry Groups and Their Applications, Academic Press, New York. Milne, S. С [1988]. Multiple ^-series and U(n) generalizations of Ramanujan's 1i/r) sum, in Ramanujan Revisited, G. Andrews et al., eds., Academic Press, New York, 473-524. Milne, S. С and Lilly, G. M. [1995]. Consequences of the At and Ce Bailey transform and Bailey lemma, Discrete Math., 139, 319-346. Moak, D. S. [1980]. The g-gamma function for q > 1, Aequat. Math., 20, 278-285. Monsky, P. [1994]. Simplifying the proof of Dirichlet's theorem, Amer. Math. Monthly, 100, 861-862. Morita, Y [1975]. A p-adic analog of the Г-function, J. Fac. Sci. Tokyo, Sec. 1A, 22, 255-266. Morris, W. G. [1984]. Constant term identities for finite and affine root systems: Conjectures and theorems, Ph.D. Thesis, University of Wisconsin-Madison. Miiller, С [1966]. Spherical Harmonics, Lecture Notes in Mathematics, 7, Springer-Verlag, New York. Murphy, R. [1835]. Second memoir on the inverse method of definite integrals, Trans. Cambridge Phil. Soc, 5, 113-148. Nassarallah, B. and Rahman, M. [1985]. Projection formulas, a reproducing kernel and a generating function for ^-Wilson polynomials, SIAM J. Math. Anal., 16, 186-197. Natanson, I. P. [1965]. Constructive Function Theory, Vol. Ill, Frederick Ungar Publishing Co., New York. Nemes, I., Petkovsek, M., Wilf, H., and Zeilberger, D. [1997]. How to do Monthly Problems with your computer, Amer. Math. Monthly, 104, 505-519. Nevai, P. [1979]. Orthogonal Polynomials, Amer. Math. Soc, Providence, RI. Nevai, P. [1990]. Orthogonal Polynomials: Theory and Practice, P. Nevai, ed., Kluwer Academic Publishers, Dordrecht. Newton, I. [1960]. The Correspondence of Isaac Newton, 2, 1676-1687, Cambridge University Press, Cambridge, UK. Nielsen, N. [1906]. Handbuch der Theorie der Gamma Funktion, В. В. Teubner, Leipzig. Olver, F. W. J. [1974]. Asymptotics and Special Functions, Academic Press, New York. Papperitz, E. [1889]. Ueber die Darstellung der hypergeometrischen Transcendenten durch eindeutige Functionen, Math. Ann., 34, 247-296. Petkovsek, M., Wilf, H., and Zeilberger, D. [1996]. A = B, A. K. Peters, Wellesley, MA. Pfaff, J. F. [1797]. Disquisitiones Analydcae, I, Helmstadt. Pfaff, J. F. [1797a]. Observations analyticae ad L. Euleri Institutiones Calculi Integralis. Vol. IV, Supplem. II et IV, Historie de 1793, Nova Acata Acad. Scie. Petropolitanae. XI, 38-57. (Note: The history section is paged separately from the scientific section of this journal.) Poincare, H. [ 1886]. Sur les integrales irregulieres des equations lineares, Acta Math.. 8. 295-344. Poisson, S. D. [1823]. Suite du Memoire sur les integrales definies et sur la sommation de* series, Paris Jour, de I'Ecole Poly technique, 19, 404—509, especially, pp. 477—4~S. Polya, G. [1984]. Collected Papers, Vols. I-IV, MIT Press, Cambridge, MA. Polya, G. and Szego, G. [1972]. Problems and Theorems in Analysis, Vols. I and II. Springer-Verlag, New York. Rademacher, H. [1955]. On the transformation of log r](r), J. Indian Math. Soc. 19. 25-30. Rademacher, H. [1973]. Topics in Analytic Number Theory, Springer-Verlag. New YorV Rahman, M. [1981]. A non-negative representation of the linearization coefficient- of t.v product of Jacobi polynomials, Canadian J. Math., 33, 915-928.
650 Bibliography Rahman, M. [1986]. ^-Wilson functions of the second kind, SIAM J. Math. Anal, 17, 1280-1286. Ramanujan, S. [1927]. Collected Papers, Cambridge University Press. Reprinted by Chelsea, New York [1962]. Ramanujan, S. [1988]. The Lost Notebook, Narosa Publishing House, New Delhi. Raynal, J. [1979]. On the definition and properties of generalized 6-j symbols, /. Math. Phys., 20, 2398-2415. Riemann, B. [1857]. Beitrage zur Theorie der durch Gauss'sche Reihe F(a, /б, у, x) darstellbaren Functionen, K. Gess. Wiss. Gottingen, 7, 1-24. Riemann, B. [1859]. Uber die Anzahl der Primzahlen unter einer gegebene Grosse, Monatsb. Berliner Akad., 671-680. Reprinted in Gesammelte Mathematische Werke, paper 7, Springer-Verlag, [1990]. Rogers, L. J. [1888]. An extension of a certain theorem in inequalities, Messenger of Math., 17, 145-150. Rogers, L. J. [1894]. Second memoir on the expansion of certain infinite products, Proc. London Math. Soc, 25, 318-343. Rogers, L. J. [1895]. Third memoir on the expansion of certain infinite products, Proc. London Math. Soc, 26, 15-32. Rogers, L. J. [1907]. On function sum theorems connected with the series ?^1, x"/n2, Proc. London Math. Soc, 4, 169-189. Rogers, L. J. [1917]. On two theorems of combinatory analysis and allied identities, Proc. London Math. Soc, B), 16, 315-336. Roosenraad, С. Т. [1969]. Inequalities with orthogonal polynomials, Ph.D. Thesis, University of Wisconsin-Madison. Rothe, H. A. [1811]. Systematisches Lehrbuch der Arithmetic, Leipzig. Roy, R. [1990]. The discovery of the series formula for n by Leibniz, Gregory and Nilakantha, Math. Mag., 63, 291-306. Roy, R. [1993]. The work of Chebyshev on orthogonal polynomials, in Topics in Polynomials of One and Several Variables and Their Applications, Th. Rassias, H. M. Srivastava, A. Yanushauskas, eds., World Scientific, Singapore, 495-512. Rudin, W [1976]. Principles of Mathmatical Analysis, 3rd ed., McGraw-Hill, New York. Saalschutz, L. [1890]. Eine Summationsformel, Zeitschrift Math. Phys., 35, 186-188. Saff, E. B. and Varga, R. S. [1976]. Zero-free parabolic regions for sequences of polynomials, S/АМУ. Math. Anal, 1, 344-357. Salamin, E. [1976J. Computation of n using arithmetic-geometric mean, Math. Comput., 30, 565-570. Sapiro, R. L. [1968]. Special functions related to representations of the group SU(n), of class I with respect to SU(n — 1) (и > 3), Izv. Vyssh. Uchebn. Zaved. Matematika, 71 D), 97-107 (in Russian). Sarmanov, I. O. [1968]. A generalized symmetric gamma correlation, Dokl Akad. Nauk SSSR, 179, 1276-1278; Soviet Math. Dokl, 9, 547-550. Scharlau, W and Opolka, H. [1985]. From Fermat to Minkowski, Springer-Verlag, New York. Schiff, L. I. [1947]. Quantum Mechanics, Addison-Wesley, New York. Schur, I. [1918]. Uber die Verteilung der Wurzeln bei gewissen algebraischen Gleichungen mit ganzzahligen Koeffizienten, Math. Zeit., 1, 377^02. Schur, I. [1921]. Uber die Gaussschen Summen, Nach. Gessel Gottingen, Math-Phys. Klasse, 147-153. Schiitzenberger, M.-P. [1953]. Une interpretation de certaines solutions de l'equation fonctionelle: F(x + y) = F(x)F(y), С R. Acad. Sci. Paris, 236, 352-353.
Bibliography 651 Sears, D. B. [1951]. Transformations of basic hypergeometric functions of special type, Proc. London Math. Soc, B), 53, 138-157. Selberg, A. [1944]. Bemerkninger om et multipelt integral, Norske Mat. Tidsskr., 26, 71-78. Sheppard, W. F. [1912]. Summation of the coefficients of some terminating hypergeometric series, Proc. London Math. Soc, B), 10, 469-478. Siegel, C. L. [1945]. The trace of totally positive and real algebraic integers, Ann. Math., 46,302-313. Siegel, С L. [1954]. A simple proof of ??(-1/т) = ti(t)</tJ2, Mathematica, 1, 4. Simpson, T. [1759]. The invention of a general method for determining the sum of every second, third, fourth, or fifth, etc. term of a series, taken in order; the sum of the whole series being known, Phil. Trans. Royal Soc. London, 50, 757-769. Smith, H. J. S. [1859-1865]. Report on the Theory of Numbers, Reprinted by Chelsea, New York [1965]. Stanton, D. [1988]. Recent results for the g-Lagrange inversion formula, in Ramanujan Revisited, G. E. Andrews, R. Askey, B. Berndt, K. G. Ramanathan, and R. A. Rankin, eds., Academic Press, San Diego, 525-536. Stieltjes, T. J. [1993]. Collected Papers, Vols. I and II, Springer-Verlag, New York. Stirling, J. [1730]. Methodus Differential, London. Stone, M. H. [1962]. A generalized Weierstrass approximation theorem, in Studies in Modern Analysis, R. С Buck, ed., Math. Assoc. America, Sylvester, J. J. [1853]. On Mr. Cayley's impromptu demonstration of the rule for determining at sight the degree of any symmetrical function of the roots of an equation expressed in terms of the coefficients, Phil. Mag., 5, 199-202. Collected Papers, Vol. 1, 594-598. Sylvester, J. J. [1883]. Note on the graphical method in partitions of n, Proc. Cambridge Phil. Soc, 19,207-210. Sylvester, J. J. [1886]. Lectures on the theory of reciprocants, Amer. J. Math., 8, 196-260; 9, 1-37, 113-11661, 297-352; 10, 1-116. Szasz, O. [1950]. On the relative extrema of ultraspherical polynomials, Bollettino della Unione Matematica Italiana, 5 C), 125-127. Szasz, O. [1951]. On the relative extrema of the Hermite orthogonal functions, J. Indian Math. Soc, 25, 1340-1345. Szego, G. [1933]. Ueber gewisse Potenzreihen mit lauter positiven Koeffizienten, Math. Zeit., 37, 674-688. Szego, G. [1936]. Inequalities for the zeros of Legendre polynomials and related functions, Trans. Amer. Math. Soc, 39, 1-17. Szego, G. [1948]. On an inequality of Turan concerning Legendre polynomials, Bull. Amer. Math. Soc, 54, 401^405. Szego, G. [1950]. On the relative extrema of Legendre polynomials, Bollettino della Unione Matematica Italiana, 5 D), 120-121. Szego, G. [1975]. Orthogonal Polynomials, Amer. Math. Soc, Providence, RI. Szwarc, R. [1992]. Orthogonal polynomials and a discrete boundary value problem II, SIAMJ. Math. Anal, 23, 965-969. Takacs, L. [1973]. On an identity of Shih-Chieh Chu, Ada Sci. Math., (Szeged), 34, 383-391. Thomae, J. [1869]. Beitrage zur Theorie der durch die Heinesche Reihe..., J. Reine Ang. Math., 70, 258-281. Thomae, J. [1879]. Uber die Funktionen welche durch Reihen von der Form dargestellt werden: 1 + ^jf + ¦¦¦, Journal Math., 87, 26-73.
652 Bibliography Titchmarsh, E. С [1937]. Introduction to the Theory of Fourier Integrals, Oxford University Press, London. Todd, J. [1950]. On the relative extrema of Laguerre orthogonal functions, Bollettino della Unione Matematica Italiana, 5 C), 120-125. Tricomi, F. G. and Erdely i, A. [ 1951 ]. The asymptotic expansion of a ratio of gamma functions, Рас. J. Math., 1, 133-142. Turan, P. [1952]. On a trigonometrical sum, Ann. Soc. Polonaise Math., 25, 155-161. Tweddle, I. [1988]. James Stirling, Scottish Academic Press, Edinburgh, van der Poorten, A. [1979]. A proof that Euler missed.. .Apery's proof of the irrationality of f C), Math. Intelligencer, 2, 195-203. Van Loan, C. [1992]. Computational Frameworks for the Fast Fourier Transform, SIAM, Philadelphia. Viennot, G. [1983]. Une Theorie Combinatoire des Polynomes Orthogonaux Generaux, Lecture Notes, UQAM. Vietoris, L. [1958]. Ueber das Vorzeichen gewisser trigonometrischer Summen, Sitzungsber. Oest. Akad. Wiss., 167, 125-135. Vietoris, L. [1959]. Ueber das Vorzeichen gewisser trigonometrischer Summen, II, Sitzungsber. Oest. Akad. Wiss., 167, 192-193. Vilenkin, N. J. [1958]. Some relations for Gegenbauer functions, Uspekhi Matem. Nauk (N.S.), 13 C), 167-172. Vilenkin, N. J. [1968]. Special Functions and the Theory of Group Representations, Translations of Math. Monographs 22, Amer. Math. Soc, Providence, RI. Vilenkin, N. J. and Klimyk, A. [1992]. Representation of Lie Groups, Special Functions and Integral Transforms, Kluwer Academic, Amsterdam. Wallis, J. [1656]. Arithmetica Infinitorum, Oxford. Wang, Z. X. and Guo, D. R. [1989]. Special Functions, World Scientific, Singapore. Watson, G. N. [1918]. Asymptotic expansions of Hypergeometric functions, Trans. Cambridge Phil. Soc, 22, 277-308. Watson, G. N. [1925]. A note on generalized hypergeometric series, Proc. London Math. Soc, B), 23, xiii-xv (Records for 8 Nov. 1923). Watson, G. N. [1929]. A new proof of the Rogers-Ramanujan identities, J. London Math. Soc, 4, 4-9. Watson, G. N. [1944]. A Treatise on the Theory ofBessel Functions, 2nd Ed., Cambridge University Press, Cambridge, UK. Weil, A. [1949]. Number of solutions of equations in a finite field, Bull. Amer. Math. Soc, 55, 497-508. Weil, A. [1968]. Sur une formule classique, /. Math. Soc. Japan, 20, 400-402. Weil, A. [1974]. Two lectures on number theory, past and present, Collected Papers, Vol. Ill, Springer-Verlag, New York. Weil, A. [1976]. Elliptic Functions According to Eisenstein and Kronecker, Springer-Verlag, New York. Weil, A. [1983]. Number Theory, from Hammurabi to Legendre, Birkhauser, Boston. Whipple, F. J. W [1926]. On well-poised series, generalized hypergeometric series having parameters in pairs each pair with the same sum, Proc. London Math. Soc, B), 24, 247-263. Whittaker, E. T. [1904]. An expression of certain known functions as generalized hypergeometric series, Bull. Amer. Math. Soc, 10, 125-134. Whittaker, E. T. and Watson, G. N. [1940]. A Course of Modern Analysis, 4th Ed., Cambridge University Press, London. Wiener, N. [1933]. The Fourier Integral and Certain of Its Applications, Cambridge. Reprinted by Dover, New York [1958].
Bibliography 653 Wilkins, J. E. [1948]. Nicholson's integral for У„2(г) + Уи2(г), Bull. Amer. Math. Soc, 54, 232-234. Wilson, J. A. [1977]. Three-term contiguous relations and some new orthogonal polynomials, Pade and Rational Approximations, E. B. Saff and R. S. Varga, eds., Academic Press, New York. Wilson, J. A. [1978]. Hypergeometric series, recurrence relations and some new orthogonal polynomials, Ph.D. Thesis, University of Wisconsin, Madison. Wilson, J. A. [1991]. Orthogonal functions for Gram determinants, SIAM J. Math. Anal, 22, 1147-1155. Wong, R. [1989]. Asymptotic Approximations of Integrals, Academic Press, New York. Zagier, D. [1989]. The dilogarithm in geometry and number theory, in Number Theory and Related Topics, R. Askey et al., eds., Oxford University Press, Oxford, UK, 231-249. Zeilberger, D. [1982]. Sister Celine's technique and its generalizations, / Math. Anal. Appl, 85, 114-145. Zeilberger, D. [1994]. Theorems for a price: Tomorrow's semi-rigorous mathematical culture, The Math. Intelligencer, 16 D), 11-14.
Index Abel, N. H., 14, 106, 600, 605 Adams, J. C, 320 Agarwal, R. P., 586 Ahern, P., 36 Airy, G. В., 201 Al-Salam, W. A.,532 Amend, В., 61 Anastassiadis, J., 36 Anderson, G. W., 401, 402, 411, 430 Andrews, G. E., 171, 175, 574, 590, 592 Anno, M., 450 Aomoto, K., 401, 402, 406, 407, 428, 440 Apery, R., 355 Archimedes, 485 Artin, E., 34, 36, 53 Askey, R. A., 67, 185, 314, 322, 363, 369, 391, 394, 505, 519, 575, 592 Auslander, L., 43 Azor, R., 328 Baernstein, A., 355 Bailey, W. N., 93, 124, 143, 144, 147, 150, 152, 154, 170, 173, 174, 181, 182, 184, 185, 277, 319,386,398,400 Bak, J., 306 Barnes, E. W., 61 Bateman, H., 277, 313, 314, 475 Baxter, R. J., xv Bellman, R., 535, 542 Berggren, L., 139 Berndt, В., 532, 627 Bernoulli, D., 2 Bernoulli, J., 605, 615, 619 Bessel, F. W., 187, 200, 202, 203, 222, 225, 228, 230,231,288,321,464,475 Beukers, E, 355 Bieberbach, L., 355, 362, 382 Binet, J. P. M., 28, 29 Boas, R. P., 410 Bochner, S., 464, 542 Bohr, H., 363, 494 Borel, E., 608 Borwein, J., 139 Borwein, P., 139 Brafman, E, 400 Brent, R. P., 138 Bressoud, D. M., 401, 429, 590, 591 Bromwich, T. J., 630 Brown, G., 376 Brown, M., 552 Brualdi, R. A., 371 Burge, W. H., 171 Bustoz, J., 608 Butlewski, Z., 238 Caratheodory, C, 49 Carlson, F., 109 Cartier, P., 371 Cauchy, A. L., 263, 442, 491, 497, 523, 541, 542, 548 Cayley, A., 268 Cesaro, E., 355, 599, 604, 605 Charlier, С V. L., 349 Chebyshev, P. L., 117 Christoffel, E. В., 240, 261, 272, 373 Chu, S.-C, 67 Clausen, Т., 116 Connor, W. G., 576 Cooley, J. W., 43 Cooper, S., 409 Copson, E. Т., 463 Cox, D., 134 Darboux, J. G., 240, 243, 271, 272 Davenport, H., 430, 432 de Boor, C, 43 de Branges, L., 152, 355, 382, 520 de Bruin, N. G., 234, 282 deMoivre, A., 18 Dedekind, R., 1 655
656 Index DeSainte-Cathenne, M., 330 Din, A. M., 322 Dirichlet, P. L., 1, 2, 401, 413, 434, 463, 532, 601 Dixon, A. C, 72, 239 Dougall, J., 71, 143, 147, 182, 183 Durfee, W. P., 567 Dwork, В., 46 Dyson, F. J., 410, 426, 427, 590 Edwards, A. W. E, 243, 485 Edwards, С. Н., 485 Eisenstein, E G. M., 12, 42 Elliot, E. В., 138 Erdelyi, A., 68, 111,615 Euler, L., 1, 2, 4, 469, 471, 490, 493, 497, 501, 508, 521, 523, 524, 553, 564, 627 Evans, R., 415 Favard, J., 245, 248 Fejer, L., 269, 272, 313, 334, 355, 371, 372, 388, 602 Feldheim, E., 278, 291, 315, 382, 390 Fermat, P., 1, 485, 486 Ferrers, N. M., 320, 565, 566 Fields, J. L., 615 Fine, N. J., 506 Foata, D., 371 Forrester, P. J., 426, 441 Fourier, J., 445, 464, 466, 540, 600, 602, 623 Friedrichs, K., 365 Funk, P., 445, 458 Fuss, N., 2 Gasper, G., 355, 363, 369, 382, 390, 523, 530, 545 Gauss, С F, 1, 61, 63, 67, 78, 94, 95, 240, 254, 262, 355, 411, 431, 491, 496, 497, 499, 500, 508,521,523 Gautschi, W., 212 Gegenbauer, L., 187, 204, 302, 316, 318, 334, 464, 475, 478 Gelfond, A., 410 Gessel, I., 630 Gillis, J., 328 Godsil, С D., 266, 277, 328 Goldbach, C, 2 Good, I. J., 371,427, 630 Gordon, В., 576 Gosper.R. W., 116 Gould, H. W., 186 Graf, J. H., 239, 475 Gram, J.-P, 277 Gray, J., 84 Gronwall, Т. Н., 372, 380 Gross, В., 45 Guo, D. R., 29 Hahn.W., 331,332 Halmos, P., 468 Hamburger, H., 536 Hamilton, W. R., 268 Hankel.H., 121,289-291 Hardy, G. H., 363, 410, 532, 572, 578, 608 Hasse, H., 430, 432, 436 Hecke, E., 445, 458 Heine, E., 274, 491, 520-523 Heisenberg, W., 282 Helversen-Pasotto, A., 93 Herglotz, G., 1 Hermite, C, 100, 198, 266, 269, 271, 273, 299,531 Hewitt, E., 306, 376 Hickerson, D., 590 Hill, M. J. M., 63 Hille, E., 226, 496 Holder, O., 647 Hsu, H.-Y., 321 Hsu,L. C, 186 Hurwitz, A., 1,226 Hylleraas, E., 322 Ingham, A. E., 394 Ireland, K., 432 Ismail, M. E. H., 363, 370, 504, 532, 552 Ivory, J., 99 Jackson, D., 372, 375, 380 Jackson, F. H., 485, 486, 588 Jacobi, С G., 240, 277, 297, 298, 401, 411, 485, 496,500,501,508-510,548,591, 592 Jensen, J. L. W. V., 14 Jimbo, M., xv John, F, 544 Johnson, W., 542, 630 Jordan, C, 623 Karamata, J., 609 Karlin, S., 396 Kenko, Т., 183 Kirillov, A. N., 102 Klein, F, 65 Klimyk, A., xiii Knopp, M., 573 Koblitz, N., 540 Koekoek,R.,331 Kogbetliantz, E., 388 Kolberg, O., 573 Koornwinder, Т. Н., 322, 369, 371, 475, 491 Kummer, E., 1, 63, 73, 124, 144, 154, 191, 522, 526 Laguerre, E. N., 266, 273, 282, 290, 292, 306, 330,363,368,370,418,635 Lambert, J. H., 608
Index 657 Landen,J., ЮЗ Landsberg, M., 542 Lang, S., 46 Lanzewizky, I. L., 278 Laplace, P. S., 100 Laurent, P. A., 426, 496, 497 Lebesgue, H., 6, 69, 311 Legendre, A. M., 132, 136, 162, 199, 240, 252,277,311,334,496,535 Leibniz, G. W., 283, 485, 605 Lerch, M., 17 Lewin, L., 102 Lewy, H., 365 Lilly, G. M., xv Lindelof, E., 539 Liouville, J., 82, 99, 307 Lipschitz, R. O., 121 Littlewood, J. E., xvi, 648 Lorch,L., 188,229,231 Lorentz, G. G., 363 Lorentzen, L., 98 Maclaurin, C, 618, 626 MacMohan, P. A., 369, 553, 555, 565, 569, 572 Madhava, 59 Magnus, A., 552 Makai, E., 391 Markov, A., 240, 254, 255, 259, 275 Maxwell, J. C, 479 McGregor, J. L., 396 Mehler, F. G., 313 Mehta, M. L., 410, 411,425 Meixner, J., 349 Mellin, R. H., 61 Miller Jr., W., 445 Milne, S. C, xv, 649 Moak, D. S., 544 Mollerup, J., 494 Monsky, P., 608 Mori, Т., 450 Morita, Y., 45 Morris, W. G., 428, 442 Muldoon, M. E., 231 Muller, C, 445 Murphy, R., 240 Nassarallah, В., 519 Natanson, I. P., 252, 269 Nemes, I., 166 Neumann, C. G., 235 Nevai, P., 277, 312, 334 Newman, D. J., 306 Newman, E W., 4 Newton, I., 248, 534 Nicholson, J. W., 187,224 Nielsen, N., 4 Olver, F. W.J., 312, 615 Opolka, H., 55 Orr, M. McE, 184 PopperitzE., 73 Parseval, M. A., 265 Pascal, В., 482, 485 Petkovsek, M., 166 Pfaff, J. E, 63 Phragmen, E., 539 Plana, G. A., 55 Poincare, H., 649 Poisson, S. D., 401, 446, 531, 601, 602, 623, 625, 627 Pollaczek, E, 349 Rademacher, H., 622 Rahman, M., xvi, 322, 519, 523, 545 Ramanujan, S., 182, 485, 501, 505, 506, 509, 532,550,563,569,577, 589 Raynal, J., 124 Riemann, В., 1, 63, 65, 73, 160, 497, 532, 536 Rodrigues, O., 99 Rogers, J. В., 426, 441 Rogers, L. J., 72, 319, 530, 575, 577, 579, 582 Rosen, M., 432 Rothe, H. A., 490, 491, 497 Roy, R., 59, 100,183 Rudin, W., 10,61 Ryser,H.J.,371 Saff.E. В., 188,231 Salamin, E., 138 Sapiro, R. L., 475 Sarmanov, I. O., 368 Schutzenberger, M.-R, 485 Schafheitlin, P., 236 Scharlau, W., 55 Schiff, L. I., 199 Schlomilch, O., 4 Schur,I.,401,423, 578 Sears, D. В., 524 Selberg, A., 110, 401, 402, 410, 417, 428 Sheppard, W. E, 141 Siegel, С L., 401,419, 423 Simpson, Т., 14 Slater, D., 590 Smith, H. J. S., 542 Sole, P., 93 Sonine, N. J., 236, 291, 292, 466 Spence, W., 106 Stanton, D., 630 Steinig, J., 381 Stieltjes, T. J., 240, 254, 259, 401, 416, 419, 425, 496, 594 Stirling, J., 356, 496, 617, 618
658 Index Stokes, G., 223 Stone, M. H., 265 Subbarow, M. V., 574 Swarttouw, R. R, 331 Sylvester, J. J., 566 Szasz, O., 306 Szego, G., 238, 245, 274, 304, 312, 313, 342, 365,368,380, 390,419 Szego, P., 188,229,231 Szwarc, R., 323 Takacs, L., 70 Tamhankar, M. V., 370 Tannery, J., 72 Thiruvenkatachar, V. R., 569 Thomae.J., 142, 485,521 Titchmarsh, E. C, 89 Todd, J., 304 Tolimieri, R., 43 Tricomi, E G., 615 Tukey, J. W., 43 Turan, P., 342, 343, 384 Tweddle, I., 18, 652 van der Poorten, A., 394 Van Loan, C, 43 Vandermonde, A., 70 Varga,R. S., 188,231 Venkatachaliengar, K., 569 Victor, J. D., 328 Viennot, G., 277, 330 Viete, E, 243 Vietoris, L., 355, 375, 377, 380, 398 Vilenkin, N. J., 315,382, 445 Vitali, G., 496 von Staudt, C, 57 Waadelund, H., 98 Wallis, J., 4 Wang, Z. X., 29 Watson, G. N., 149, 179, 227, 237, 385, 512, 547,548,587,614 Weierstrass, C, 4, 463 Weil, A., 501, 538 Whipple, E J. W., 124, 130, 141, 143, 144, 146, 181,355,362,632-634 Whittaker, E. Т., 187, 195, 198, 512 Wiener, N., 289 Wiles, A., 509 Wilf.H., 125 Wilkins, J. E., 224 Wilson, J. A., 124, 152, 154, 157, 184, 277, 293,331 Wong, R., 614 Zagier, D., 102 Zeilberger, D., 125 Zeller, K., 363
Subject Index Abel summability, 599 absolutely monotonic series, 389 adjacency matrix, 266 arithmetic-geometric mean, 134 associated Legendre function, 456 asymptotic expansion, 611 Bailey chain, 586 Bailey pair, 586 Bailey's 9Fg transformation, 182 Bailey's lemma, 584 Barnes's beta integral, 90 basic hypergeometric series, 523 it-balanced, 524 well-poised, 524 Bernoulli numbers, 12 Bernoulli polynomials, 20, 615 generalized, 615 Bessel functions, 200, 204-206, 212 Bessel's formula, 212 Gegenbauer's formula, 205 generating function, 212 Hankel's formula, 206 Poisson integral, 204 second kind, 200 Bessel's equation, 200 Bessel's inequality, 264 beta integral, 4 Carlson's theorem, 110 Cesaro summability, 599 characters, 38, 57 additive, 38 even, 58 multiplicative, 38 odd, 58 primitive, 79 Charlier polynomials, 347 Chebyshev polynomials, 101, 102 first kind, 101 fourth kind, 102 second kind, 101 third kind, 102 Christoffel-Darboux formula, 246 Chu-Vandermonde identity, 67 Clausen's identity, 116 confluent hypergeometric equation, 188, 190, 194 asymptotic solutions, 190 recessive solutions, 194 contiguous 4 F3, 155 cosine integral, 235 Coulomb wave function, 199 Darboux's method, 310 Dedekind ^-function, 538 digamma function, 13 dilogarithm, 102 Dirichlet /.-function, 58, 59 functional equation, 60 Dirichlet problem, 463 discriminant, 418 Dougall's 7F6 formula, 147, 152 Bailey's 7F6 integral analog, 152 Dougall's bilateral sum, 110 Dougall's identity, 71 Dyson's integral, 426 elliptic functions, 508 elliptic integrals, 132, 136 first kind, 132 second kind, 136 Erdelyi's formula, 113 error function, 196 Euler's angles, 469 Euler's reflection formula, 9 Euler's transformation, 69 Euler-Maclaurin summation formula, 18 Eulerian integral, 6 first kind, 6 second kind, 6 659
660 Subject Index Fermat measure, 486 finite-dimensional representation of a group, 466, 467, 469 irreducible representation, 469 isomorphism of, 467 subrepresentation, 469 unitary representation, 467 fractional integral, 605 fractional integration, 111 Fresnel integrals, 235 Funk-Hecke formula, 459 gamma function, 3, 6, 23, 44 Gauss's multiplication formula, 23 integral representation, 6 p-adic, 44 product representation, 3 Gauss quardrature formula, 250 Gauss sums, 38, 627 reciprocity, 627 Gauss's summation formula, 67, 90 integral analog, 90 Gegenbauer polynomials, 302 Gegenbauer's addition formula, 215 Gegenbauer's product formula, 478 Gibbs phenomenon, 372 Gosper's algorithm, 163 Graf's formula, 215 Harr measure, 468 Hahn polynomials, 331, 345, 346 continuous, 331 continuous dual, 331 dual Hahn, 346 Hankel functions, 208 Hankel pair, 216, 290 Hankel transforms, 216, 289 harmonic polynomial, 446 Hasse-Davenport relation, 432 Heine's transformation formula, 520 Hermite polynomials, 278-280, 318, 419 discriminant, 419 generating function, 279 linearization formula, 318 orthogonality, 278 Poisson kernel, 280 Hurwitz zeta function, 15 hypergeometric differential equation, 75 hypergeometric functions, 64, 65, 87, 88, 94, 97, 125, 130 asymptotic expansion, 88 Barnes's integral, 87 contiguous relations, 94 continued fraction, 97 cubic transformations, 130 integral representation, 65 quadratic transformations, 125 hypergeometric series, 61, 70, 140, 146 balanced, 70 it-balanced, 140 nearly poised, 140 very well poised, 146 well-poised, 140 hypergeometric term, 163 incomplete gamma function, 197 Jackson's formula, 587 Jacobi elliptic functions, 510 Jacobi polynomials, 99, 298, 475, 476 generating function, 298 Koornwinder's product formula, 476 Laplace-type integral, 475 Poisson kernel, 385 Jacobi sums, 39 kernel polynomial, 260 Krawtchouk polynomials, 347 Kummer's first transformation, 191 Kummer's second transformation, 191 /.-function, 434 Lagrange interpolation, 249 Lagrange inversion formula, 629 Laguerre polynomials, 283, 288, 418 discriminant, 418 generating-function, 283 Poisson kernel, 288 Lambert summable, 608 Landen's transformation, 103 Laplace equation, 198 Laplace transforms, 536 Legendre polynomials, 252 Legendre's differential equation, 162 Legendre's duplication formula, 22 Legendre-Fejer polynomials, 334 Lerch's theorem, 18 logarithmic convexity, 34 logarithmic integral, 197 MacMahon's Master Theorem, 369 Markov-Stieltjes inequalities, 254 matching polynomials, 324 Meixner polynomials, 346 Meixner-Pollaczek polynomials, 348 Mellin transforms, 85, 534 Miller's algorithm, 211 modified Bessel functions, 222 modular forms, 540 modular group, 540 Nicholson's formula, 224 noncommutative binomial theorem, 485
Subject Index 661 orthogonal matrix, 452 orthogonal polynomials, 452 parabolic cylinder functions, 198 Parseval's formula, 264 partition analysis, 555 partitions, 553, 559, 566-569 congruences, 569 conjugate, 567 Durfee square, 567 Ferrers graph, 566 Frobenius symbol, 568 generating functions, 559 pentagonal number theorem, 501 Pfaff's transformation, 69 Pfaff-Saalschiitz identity, 69, 91, 92 integral analog, 91 nonterminating form, 92 Poisson summation formula, 623 g-beta integrals, 514 ^-binomial coefficients, 483 g-binomial theorem, 488 g-difference operator, 488 g-Dixon formula, 588 g-Dougall formula, 588 g-gamma function, 493 q -Gauss summation formula, 521 g-Hermite polynomials, 530, 531 generating function, 532 linearization formula, 532 Poisson kernel, 532 ^-integral, 486 g-Jacobi polynomials, 592, 593 big, 593 little, 592 g-Kummer sum, 522 g-Laguerre polynomials, 593 q-Pfaff-Saalschutz identity, 524 g-ultraspherical polynomials, 527, 530 connection coefficient formula, 531 linearization formula, 531 quadratic reciprocity law, 53 quintuple product identity, 545 Racah polynomials, 344 radial functions, 464 Ramanujan's i ф\ formula, 505 reciprocal polynomials, 150 resultant, 414 Riemann zeta function, 15, 16 functional relation, 16 Rogers-Ramanujan identities, 565, 578 partition theoretic interpretation, 565 Schrodinger equation, 199 Schur's inequality, 423 Sears transformation, 524 Selberg sum, 435 Selberg's integral formula, 402 Aomoto's extension, 402 shifted factorial, 2 Siegel's inequality, 420 sine integral, 235 Sonine's integrals, 218 spherical harmonics, 451, 456 addition theorem, 456 Stieltjes's problem, 415 Stieltjes-Wigert polynomials, 594 Stirling numbers, 356 Stirling's formula, 18 Sturm's comparison theorem, 227 theta functions, 509 triple product identity, 497 Turan's inequality, 342 ultraspherical polynomials, 302, 303, 316, 319, 462 addition formula, 462 generating function, 302 Laplace integral, 316 linearization formula, 319 Rodrigues's formula, 303 Vietoris's inequalities, 377 W-Z method, 167 Watson's 807 transformation, 587 Watson's lemma, 614 Weber's equation, 198 Whipple's^ transformation, 140 Whipple's jFf, transformation, 143 Whittaker functions, 195 Whittaker's equation, 195 Wilson polynomials, 158 orthogonality, 158 Wilson's integral, 151 Wronskian, 136 zonal harmonics, 455
Symbol Index (а)пЛ 1а(х),222 (a;q)k,48l /(/,»?), 39 B(x,y),4 Ja(x),200 В„,\2 /я, 325 Bn(x),20 K(k),tt2 Bq{a,p),494 Ка(х),22Ъ C(x), 235 Kn(x;p,N), 347 Ci(x),235 ЦХ, 0,433 сп(нД),511 И, 197 Cnx(x),302 Li2(x), 102 Cn(x;p\q),521 Umx, 106 С„(х;а),347 Lan{x), 283 C*,37 L^(x;?),594 4,-(n),506 L<")(x;?),593 dn(H,*), 511 M(a,b), 134 D«^)-322 М„(х;Ь,с),346 D"(x)'198 Mtm(x), 195 D4/,529 WWF 430 E(k), 136 ';¦ ' ч erf, 196 PSfll *' Cl xb75 erfc, 196 \аг bl °2 ) Eq(x),492 p(N,M,n), 561 F(a+), 96 p(m,n), 562 pFq(ai ap; b,...., bq; x), 62 PA(n), 554 g(<t>.e. vH.470 р„^х;и,р, q,,jy Gi(V').466 /f(x),457 W^'i.ti. 208 Pi"'^ (Jt), 99 //i:'m. 208 4(x),326 Я,. 472 б(«, М,п),561 //„1Д1.446 em(n), 559 //„(.» i. 2"8 QmU, n), 560 Я„1л- </». 530 6m'')(n),559 //<¦„(«!. 324 Qn(x;a,8,N),3' 663
664 Symbol Index p (p), 37 R(F,G), 414 ZQ>)*,38 »•*("). 506 Г f(x)dljx,4%6 Rn(k(x)),346 fx,y), 467 Rn(}.(xy,y,S,N),344 ["i 483 S(*),235 L*/2J>15 sn(«,t),510 {а,к),209 Si(x),235 a(G), 324 5t2(C),471 у з ,476 у'(в,л),197 , 466 Г(в,л),197 458 5„(Л ;с,Ь,с),331 Г,(л),493 Cte).473 д*/>488 ^.^(^,472 f(*) 15 Тп(х),Ш <(л,*),15 r^/F. 430 ч(г)) 538 С/"(л)' 101 Si(z,q),i = 1.....4, 510 Vn(cose), 102 ПС*) 3 ^C-1.3-2), 136 f0ii523 Wn(cos6>), 102 й-(л) 13 Wt.-W.196 ,^,505 Ya(x),200 fi>>556 Zp-45 ft(z, 9),i = 1,..., 4, 509