/
Author: Cassels J.W.S.
Tags: mathematics number theory numerical methods algebraic geometry
ISBN: 0-521-41517-9
Year: 1991
Text
London Mathematical Society Student Texts 24
Lectures on Elliptic Curves
J.W.S. Cassels
Department of Pure Mathematics and Mathematical Statistics,
University of Cambridge
The nght of the
University of Cambridge
to print ami sell
all manner of books
was granted by
Henry Vllt in 1534.
The University has printed
and published continuously
since 1584.
CAMBRIDGE UNIVERSITY PRESS
Cambridge
New York Port Chester
Melbourne Sydney
LONDON MATHEMATICAL SOCIETY STUDENT TEXTS
Managing editor: Dr C.M. Series, Mathematics Institute
University of Warwick, Coventry CV4 7 AL, United Kingdom
1 Introduction to combinators and X-calculus, J.R. HINDLEY & J.P. SELDIN
2 Building models by games, WILFRID HODGES
3 Local fields, J.W.S. CASSELS
4 An introduction to twistor theory, S.A. HUGGETT & K.P. TOD
5 Introduction to general relativity, L.P. HUGHSTON & K.P. TOD
6 Lectures on stochastic analysis: diffusion theory, DANIEL W. STROOCK
7 The theory of evolution and dynamical systems, J. HOFBAUER & K. SIGMUND
8 Summing and nuclear norms in Banach space theory, G.J.O. JAMESON
9 Automorphisms of surfaces after Nielsen and Thurston, A. CASSON & S. BLEILER
10 Nonstandard analysis and its applications, N. CUTLAND (ed)
11 Spacetime and singularities, G. NABER
12 Undergraduate algebraic geometry, MILES REID
13 An introduction to Hankel operators, J.R. PARTINGTON
14 Combinatorial group theory: a topological approach, DANIEL E. COHEN
15 Presentations of groups, D.L. JOHNSON
16 An introduction to noncommutative Noetherian rings, K.R. GOODEARL &
R.B. WARFIELD, JR.
17 Aspects of quantum field theory in curved spacetime, S.A. FULLING
18 Braids and coverings: selected topics, VAGN LUNDSGAARD HANSEN
19 Steps in commutative algebra, R.Y. SHARP
20 Communication theory, C.M. GOLDIE & R.G.E. PINCH
21 Representations of finite groups of Lie type, FRANCOIS DIGNE & JEAN MICHEL
22 Designs, graphs, codes, and their links, P.J. CAMERON & J.H. VAN LINT
24 Lectures on elliptic curves, J.W.S. CASSELS
Published by the Press Syndicate of the University of Cambridge
The Pitt Building, Trumpington Street, Cambridge CB2 1RP
40 West 20th Street, New York, NY 10011-4211, USA
10 Stamford Road, Oakleigh, Melbourne 3166, Australia
© Cambridge University Press 1991
First published 1991
Printed in Great Britain at the University Press, Cambridge
Library of Congress cataloging in publication data available
A catalogue record for this book is available from the British Library
ISBN 0 52141517 9 hardback
ISBN 0 521 42530 1 paperback
Contents
0 Introduction 1
1 Curves of genus 0. Introduction 3
2 p-adic numbers 6
3 The local-global principle for conics 13
4 Geometry of numbers 17
5 Local-global principle. Conclusion of proof 20
6 Cubic curves 23
7 Non-singular cubics. The group law 27
8 Elliptic curves. Canonical form 32
9 Degenerate laws 39
10 Reduction 42
11 The p-adic case 46
12 Global torsion 50
13 Finite basis theorem. Strategy and comments 54
14 A 2-isogeny 58
15 The weak finite basis theory 66
16 Remedial mathematics. Resultants 75
17 Heights. Finite basis Theorem 78
18 Local-global for genus 1 85
19 Elements of Galois cohomology 89
20 Construction of the jacobian 92
21 Some abstract nonsense 98
22 Principal homogeneous spaces and Galois cohomology 104
vi
23 The Tate-Shafarevich group 108
24 The endomorphism group 114
25 Points over finite fields 118
26 Factorizing using elliptic curves 124
Formulary 130
Further Reading 135
Index 136
о
Introduction
Diophantine equations, that is to say equations whose solution is to
be found in integers, or, alternatively, in rationale, have fascinated man
from the earliest times: a Babylonian clay tablet dated to between 1600
and 1900 B.C. lists 15 solutions of the “Pythagorean” equation
X2 +Y2 = Z2.
Diophantos himself lived in Alexandria in the 3rd Century A.D. We
shall meet some of his ideas. His work was continued by Hypatia, the
only female mathematician of antiquity whose name has come down to
us. (She was cruelly done to death by the Christians: their leader was
canonized.) Another mathematician whose ideas continue to play a key
role is Fermat (1601-1665). For a fuller historical account in a modern
context, see A. Weil Number theory: an approach through history from
Hammurabi to Legendre (Birkhauser, 1983). [For Hypatia, see Gibbon
Decline and Fall]
In this course we concentrate attention on rational solutions of Dio-
phantine equations. The study of integral solutions requires further
considerations, which we shall not touch on.
It is now clear that an appropriate language to discuss many aspects
of Diophantine equations is that of algebraic geometry: not so much the
classical algebraic geometry, which works over the complex numbers,
but a version working over a general ground field such as the field Q of
rationale and often called “Diophantine geometry”. Some of the argu-
ments and results of classical geometry go over to Diophantine geometry
unchanged, for some the conclusions are more limited, and for others we
2
Lectures on Elliptic Curves
must make further hypotheses which are automatically satisfied in the
classical theory.
Diophantine equations can be interpreted as questions about the ex-
istence of points on algebraic varieties. Here we will be concerned only
with curves. Geometers classify curves by a non-negative integer, the
genus. The Diophantine theory of curves of genus 0 is well understood.
For curves of genus 1, there is a rich body of well-established theory
and an equally rich corpus of conjecture which is currently beginning
to succumb to intensive research. The Diophantine theory of curves of
genus > 1 is in a rudimentary state (despite Faltings’ Theorem).
The main subject of this course is some of the basic Diophantine
theory of curves of genus 1. To set the scene, we start with an account
of genus 0. Here the situation is dominated by the local-global principle
(Hasse principle). This relates behaviour over the rational field Q to that
over its local completions, the p-adic fields Qp, where things are simpler.
A unifying theme for curves of genus 1 is the extent to which local
(i.e. p-adic) behaviour determines rational behaviour. This material
generalizes smoothly to algebraic number fields but we have restricted
attention to the rationals in the belief that new concepts are easiest
acquired in the simplest contexts.
The final three sections mark a change of goal. Two of them introduce
the more sophisticated theory over finite fields, culminating in the esti-
mates for the number of points known as the “Riemann hypothesis for
function fields” (of genus 1). The very last section indicates how these
ideas are used in the modern technology for factorizing large integers.
Prerequisites.
In this course the prerequisites have been reduced to a minimum. We
have spoken above about curves of genus 0 and 1, but the focus will be
on concrete classes of curves such as conics and plane cubics. The p-adic
numbers are introduced from scratch. A knowledge of algebraic number
theory is not required, provided that the reader is prepared to take one
statement on trust. Algebraic number theory is, however, indispensable
for many applications, as we shall indicate in optional passages. We do
require the rudiments of Galois theory: indeed one of the interests will
be its application in novel contexts.
1
Curves of genus 0. Introduction
We shall say that a point is rational, or defined over Q, if its co-
ordinates are rational. A curve is said to be defined over Q if it is given
by an equation or equations with coefficients in Q. [Unfortunately the
term “rational curve” was preempted by the geometers as a synonym
for “curve of genus 0”.] More generally we shall say that we are working
over Q, or that the ground field is Q, if all the coefficients of the algebraic
expressions involved are in Q.
Sometimes elementary geometric arguments continue to be valid when
we work over Q. For example, consider a cubic curve such as
С : X2 - У 2 = (X - 2У)(Х2 + У2),
which has a double point at the origin.
A line through the origin meets the curve in one further point, so giving
4
Lectures on Elliptic Curves
a description of all the points on the curve. More precisely, consider the
line
X = sY
for given s. This meets the curve where
Y\s2 - 1) = Y\s - 2)(з2 + 1),
and so in the point (z,j/) where
s(s2 — 1) з2 — 1
X = (з-2)(з2+1) ’ У= (з-2)(з2 + 1)'
Conversely, given (r, y) on the curve, it is of the above form with
s = x/y.
We say that C is birationally equivalent to the line [given by a single
variable and no equation]. In this case the birational equivalence is
defined over Q [i.e. the rational functions expressing the equivalence
have coefficients in Q. Note the unfortunate clash in the double meaning
of the term “rational”].
In general there is a 1 — 1 correspondence between the rational points
on the one curve and those on the other, the correspondence being given
by the birational correspondence. There are, however, exceptions. For
example s = 2 does not correspond to any point (x,y) and з = ±1 both
correspond to (x, j/) = (0,0). If we had had X2 —2Y2 instead of X2 — Y2
on the left hand side, then (x,y) = (0,0) would not correspond to any
rational value of s. It is not difficult to see however that if two curves
are birationally equivalent over Q there are only finitely many rational
points on the ones which do not correspond to rational points on the
other. To study the rational points on a curve, it is thus sufficient to
consider it up to a birational equivalence defined over Q.
A classical theorem working over the complex field C states that every
curve of genus 0 is birationally equivalent to the line: we could treat this
as a definition of “genus 0”. When the ground field is Q, this theorem
no longer holds. Instead we have the
Fact. A curve of genus 0 defined over Q is birationally equivalent over
Q either to the line or to a conic.
This reduces the Diophantine study of curves of genus 0 to that of
conics.
Theorem 1. A conic defined over Q is birationally equivalent to the
line if and only if it has a rational point.
1: Curves of genus 0
5
Proof. The “only if’ part is trivial. Suppose then that there is a rational
point. After a change of co-ordinates we may take it to be the origin, so
that the equation of the conic is
FI(X,y) + F2(X,y) = 0,
where Fj is homogeneous in X, Y of degree j. The birational equivalence
with the line follows by putting X = зУ, as in the cubic case discussed,
earlier.
The Diophantine theory of curves of genus 0 is thus reduced to de-
ciding when a conic defined over Q has a rational point. It is certainly
easy to write down conics without rational points. For a change, let us
use homogeneous co-ordinates. There is no rational point on
X2 + У2 + Z2 = 0,
since clearly there are no real points. Again, there are no rational points
on
х2 + У2-зг2 = о. (*)
For suppose (r,j/, z) were such a rational point. By homogeneity, we
may suppose that x, y, z are integers without common divisor. Now (*)
implies x2 + y2 = 0 (3) and so x = у = 0 (3). Then (*) gives z = 0 (3),
so x, y, z have the common factor 3: a contradicition.
For our purposes, it is convenient, and ultimately indispensable, to
express the last argument in a different way. We shall introduce the
fields Qp of p-adic numbers, where p is a prime (here p = 3); and what
we have just done can be expressed as proving that there are no points
on (*) defined over Q3.
2
p-adic numbers
Most of the familiar properties of the ordinary absolute value on the
real or complex fields are consequences of the following three:
(i) |r| > 0, with equality precisely for r = 0.
(ii) |rs| = |r||s|.
(iii) |r + s| < jr] + |s|.
A real-valued function |.| on afield k is said to be a valuation if it satisfies
(i), (ii) (iii). Since ( — I)2 = 1, properties (i)-(iii) imply that | — 1| = 1,
| — r| = |r| (all r).
The rational field Q has other valuations than the absolute value. Let
p be a fixed prime. Any rational r 0 can be put in the shape
r = ppu/v, p 6 Z, и, v g Z, p Л u, p / v.
We define
|r|p = p~13
and
|0|p = 0.
This definition clearly satisfies (i), (ii) above. Let
s = pam/n m, n g Z, p / m, p % n,
so
where without loss of generality
c > p, i.e. |s|p < |r|p.
2: p-adic numbers
7
Then
r 4- s = pp(un pIT~pmv)/vn.
Here p % vn. The numerator un +p!T~pmv is an integer, but, at least for
for p — a, it may be divisible by p. Hence
к + -s|P <P~p,
that is
(iii*) |r 4- s|p < max{|r|p, |s|p}.
Clearly (iii*) implies (iii), so | |p is a valuation. We call it the p-adic
valuation. The inequality (iii*) is called the ultrametric inequality, since
(iii), the triangle inequality, expresses the fact that |r — is a metric.
A valuation which satisfies the ultrametric inequality is said to be non-
archimedean.
We can transfer familiar terminology from the ordinary absolute value
to the p-adic case. For example, we say that a sequence {an}, n = 1, 2,.. .
is a fundamental sequence if for any г > 0 there is an По (г) such that
|am — an\P < s whenever m,n > n0 (e).
The sequence {a„} converges to b if
lan — 1>|P < г (all n > n0 (e)).
For example let
p = 5
and consider the sequence 33, 333, 3333, ....
{»>.} : 3,
Then (lyn = 0>n mod 5" (m > n)
i.e. | | s <5-" (m > n).
Hence {an} is a fundamental sequence. Indeed it is a convergent se-
quence, since
3a„ = 99 ...99 s —1(5”),
i.e.
|3on 4- l|s < 5
and so
an -» -1/3
5-adically.
8
Lectures on Elliptic Curves
As the above example shows, the main difficulties with the p-adic val-
uation are psychological: something is p-adically small if it is divisible
by a high power of p. Not every p-adic fundamental sequence is conver-
gent. Let us take p = 5 again. Then we construct a sequence of an g Z
such that
«2n + 1 = 0 (5n)
and
ttn+l = (5 )•
We start with = 2. Suppose that we already have an for some n and
put an+1 = an + J5", where b g Z is to be determined. We require
(n„ + &5")2 + 1 = 0 (5"+I),
that is
2anb 4- c = 0 (5), (*)
where we already have
c = (a2 + l)/5" € Z.
Clearly 5 / an and so we can solve the congruence (*) for the unknown
b.
The sequence {a„} just constructed is a 5-adic fundamental sequence
since
- “n|s < 5"" (m > n).
Suppose, if possible, that a„ tends 5-adically to some e g Q. Then
a2n + 1 e2 + 1.
On the other hand, by our construction,
a2 + 1 - 0.
Hence e2 + 1 = 0; a contradiction.
Just as the real numbers are constructed by completing the rationals
with respect to the ordinary absolute value, so the rationals can be
completed with respect to | |p to give the field Qp of p-adic numbers. In
fact the process can be simplified because | |p is non-archimedean. For
the reader who is unfamiliar with this way of constructing the reals, we
sketch a construction of Qp at the end of this section.
We say that a field К is complete with respect to a valuation |.| if
every fundamental sequence is convergent. A field К with valuation ||.||
is said to be the completion of the field k with valuation |.| if there is an
injection
A : к -> К
2: p-adic numbers
9
which preserves the valuation:
||Aa|| = |a| (a € fc)
and such that
(i) К is complete with respect to
(ii) К is the closure of Xk with respect to the topology induced by ||.| |
(A' is not “too large”).
The completion always exists and is unique (up to a unique isomor-
phism). We henceforth identify к with Xk and |.| with ]|.||, so regard fc
as a subfield of K.
We now discuss the structure of the p-adic field Qp with its valuation.
II,-
We note that
|a + b|p = |a|p if |b|p < |a|p.
For by (iii*) |a + b|p < |a|p and, since a — (a + b) ( — b), we have a
contradiction if |a + b|p < |a|p. It follows that the set of values taken
by | |p on Qp is precisely the same as the set for Q. Indeed if a € Qp,
a 0 then by (ii) of the definition of the completion, there is an a € Q
with |a - a|p < |a|p, so |a|p = |a|p.
The set of a g Qp with |a| < 1 is called the set of p-adic integers Zp.
Because | |p is non-archimedean, Zp is a ring:
|a|„ 1^1, < 1 => \afi\p <1, |a + 0\P < 1-
A rational number b is in Zp precisely when it has the form b = u/v,
where u, v g Z, p / v.
The numbers e g Qp with |e| = 1 are the p-adic units. From what
was said about the values taken by |.|p on Qp, every /3 0 in Qp is of
the shape ft = p"e, where n g Z and г is a unit. The units are just the
elements г of Qp such that г € Zp, г-1 g Zp.
As we have already noted, elementary analysis continues to hold in
Qp, but can be simpler; as the following lemma shows.
Lemma 1. In Qp the series 0n converges if and only if [jn —> 0.
Proof. By saying that the sum converges, we mean, of course, that the
partial sums tend to a limit.
That convergence implies /?„ —» 0 is true even in real analysis. To
10
Lectures on Elliptic Curves
prove the opposite implication, we note that
N M N
I57 - 571? =i 57 Pn I?
о о M+i
< max |^„|p
M<n<N e
by an obvious extension of the ultrametric inequality (iii*) to several
summands. Hence is a fundamental sequence, so tending to
a limit by the completeness of Qp.
We are now in a position to give an explicit description of Zp. We
write
A = {0,l,...,p-1}.
Lemma 2. The elements of Zp are precisely the sums
oo
a = 57a„pn,
о
where
an g A (all n).
Proof. By the proceeding lemma, the infinite sum converges, and its
value is clearly in Zp.
Now let a g Zp be given. There is a b £ Q such that |b — a|p < 1,
and it is easy to prove that there is precisely one a0 £ A such that
|«o — b\p < 1- Then
a = a0 +pa1
where |cni | < 1, i.e. £ Zp. Proceeding inductively, we get
a = ao + dip + ... + awpN +
with адг £ Zp.
For the final result we must distinguish between p = 2 and p 2.
Lemma 3 (p/2). Let a £ Qp be a unit. A necessary and sufficient
condition that a = for some f) £ Qp in that there is some ~f £ Qp
with
- y2|p < 1.
Proof. Necessity is obvious. We have already in effect given a proof in
the special case p = 5, a = — 1. That in the general case is similar: one
§£: Exercises
11
constructs inductively ffi = у, f}2, fh, • • . such that
|^-a|<p-
|^n+l - P„\ <p-n
If we already have /9„, we take /3„+1 = + <5, so
/^+I =/?" +w + *2
and it is enough to take
5 = (a - ^)/2^„.
This lemma ceases to hold for p = 2 (consider a = 5, /3 = 1). We have
Lemma 4 (p = 2). Let a € Q2 be a unit. A necessary and sufficient
condition that a = for some ft G Q2 is that |a — 1| < 2-3.
Proof. Here again, the necessity is obvious. For sufficiency we construct
a sequence ffi = 1, ffi, ffi,... as in the previous proof. The details are
left to the reader. 1
We conclude this section be the promised sketch of the construction
of Qp.
Denote by S the set of fundamental sequences {«„} for | |p, where
a„ € Q- Then S is a ring under componentwise addition and multipli-
cation.
{} T {bn} — {nn + bn} . J{bnJ — (anbn J.
A sequence {an} is a null sequence if an —♦ 0 (p-adically). The set Hl of
null-sequences is clearly an ideal in J.
Let {an} € S but {an} Hl. Then it is easy to see that there is at least
one N such that |адг — a„| < |a^|p for all n > N. Then |a„|p = |ajv|P
for all n > N. We write |{a„}|p = |a^|p. If an 0 for all n, it is now
easy to deduce that {a))1} g S'.
We show that Hl is a maximal ideal in S- For, if not, let ffil be a
strictly bigger ideal than Hl. It must contain an {an} Hl. Then only
finitely many of the a„ can be 0, and replacing them by (say) 1 merely
adds an element of Hl. Hence we can suppose that a„ 0 for all n. Then
{a"1} € S, and so € ®l- Hence we should have 9Л = S, a
contradiction. We conclude that Hl is maximal, and thus S/Hl is a field.
The field Q is mapped into S/Hl by
r —> {r} € S-
The function |{a„ } | on? induces a function on S/Hl which is easily seen
to be a valuation and to coincide with | |p on the image of Q.
12
Lectures on Elliptic Curves
Finally, it is not difficult to check that S'/Ш is itself complete by a
diagonal argument on a sequence of elements of J.
§2. Exercises
1. For each of the sets of p, m, r given, either find an x € L such that
|r - x\p <p~m,
or show that no such x exists.
(i) p = 257, r = 1/2, m = 1;
(ii) p = 3, r — 7/8, m = 2;
(iii) p = 3, r = 7/8, m = 7;
(iv) p = 3, r = 5/6, m = 9;
(v) p = 5, r = 1/4, m = 4.
2. Construct further examples along the lines of Exercise 1 until the
whole business seems trivial.
3. For given p, m, r either find an x € 1 such that
I 21 — m
|r - X Ip <p
or show that no such x exists.
(i) p = 5, r = —1, m = 4;
(ii) p = 5, r = 10, m = 3;
(iii) p = 13, r = —4, m = 3;
(iv) p = 2, r = —7, m = 6;
(v) p = 7, r = —14, m = 4;
(vi) p = 7, r = 6, m = 3;
(vii) p = 7, r — 1/2, m = 3.
4. As Exercise 2.
5. Let p > 0 be prime, p = 2 (3). For any integer a, p / a, show that
there is an x E with x3 = a.
3
The local-global principle for conics
We have seen that the theory of curves of genus 0 over Q turns on
deciding whether a given conic has a rational point.
We use homogeneous co-ordinates. A conic C defined over Q is given
by an equation
т(Х) = £/1;ед = о
where X = (Хг.ХгЛз),
fii = fji e Q
and the quadratic form F (recall a form is a homogeneous polynomial)
is nonsingular, i.e.
det(/,y) 0.
In our initial discussion we noted that, apart from reality considera-
tions, we could disprove the existence of rational points by congruence
considerations. These we now replace by reference to p-adic numbers.
A criterion for the existence of a rational point on a conic was given,
by Legendre. It was left to Hasse to give it the following succinct for-
mulation.
Theorem 1. A necessary and sufficient condition for the existence of a
rational point on a conic C defined over Q is that there is a point defined
over the real field R and over Qp for every prime p.
Necessity is trivial. We shall prove sufficiency, but it will require some
time and preparation. First we introduce some conventional terminol-
ogy.
14
The real field R is somewhat analogous to the Qp and is conventionally
denoted by Qqq. When we write Qp we will not include p — oo unless
we explicitly say so. The fields Qp (including p — oo) are called the
localizations of Q. In contrast, Q is called the global field. We say that
something is true “everywhere locally” if it is true for all Qp (including
oo). In this lingo the theorem becomes “A necessary and sufficient con-
dition for the existence of a global point on a conic is that there should
be a point everywhere locally”.
The local-global theorem for conics implies a local-global theorem for
curves of genus 0 but some care must be taken in the formulation [“point"
must be interpreted as “place”]. We do not pursue this further.
In the rest of this section we transform the theorem into a shape better
suited for attack1.
A transformation
T-. =
i
with
tij G Q, det(tjj) / 0
takes the quadratic form T’(X) into a quadratic form G(Y), say. Then
T takes points defined over Q on T’(X) = 0 into points defined over Q
on G(Y) — 0 and, similarly, the inverse T~x takes points on G(Y) = 0
to points on T’(X) = 0. Likewise for points defined over Qp for each p
(including oo). Hence the theorem holds for T’(X) = 0 if and only if it
holds for GCY1 = fi.
By suitable choice of transformation T we thus need consider only
“diagonal” forms
F (X) = + /2X22 + f3X3.
By substitutions Xj —* tjXj (tj G Q) we may suppose without loss of
generality that the
4ez
are square free.
If /1! fi, fa have a prime factor p in common, we replace T’(X) by
p~1F(X). If two of the fj, say fly f2 have a prime p in common but
p f f3t we replace X3 by pX3 and then divide F by p. Both of these
1 The details of the proof of Theorem 1 will not be required for the treatment
of elliptic curves. The reader who is interested only in the latter should omit
the rest of this § and also omit §§4,5.
3: The local-global principle for conics
15
transformations reduce the absolute value of the integer /1/2/3- After a
finite number of steps we are reduced to the case when /1/2/3 is square
free. We have thus proved the
Metalemma 1. To prove the Theorem, it is enough to prove it for
conics
FIX = /1^? + + /3Х2 = 0,
where fj ET. and /1/2/3 i-s square free.
The next stage is to draw conclusions from the hypothesis that a conic
as described in the Metalemma has points everwhere locally. There is a
point defined over Qp when there is a vector a = (a!, <22,03) f. (0,0,0)
with oj € Qp such that F(bl) = 0. By multiplying the <2y by an element
of Qp we may suppose without loss of generality that
max|a3|p = 1. (*)
For our later purposes we have to consider several cases.
First case, p / 2, p | /1/2/3. Without loss of generality p | /1, so p / /2,
p / /3. Then 1/ioJlp < 1. Suppose, if possible that |аг|р < 1- Then
|/заз1р — l/iai + faal\p < 1
and |a3|p < 1. Now
l/iailp — \faaz + /заз1р < P-2
and so |oi|p < 1 since /1 is square free. This contradicts the normaliza-
tion (*), and so |a2|p = |a3|p = 1. But now
1/2^2 + /заз1р < 1-
On dividing by the unit <22, we deduce that there is some rp £ 7 such
that
/2 + fp/з = 0 (p)-
Second case, p = 2, 2 / /1/2/3- It is easy to see that precisely two of
the aj are units, say a2 and a3. Now a2 = 1 or (I (4) for a £7; and so
/2 + /3 н 0 (4).
Third case, p — 2, 2 | /1/2/3, say 2 | fa. Now |а2|г = |аз|з = 1- Now
а2 = 1 (8) for a G 1, 2 / a; and so
/2+ /з = 0(8)
or
/1 + /2 + /3 = 0 (8)
16
according as |ai|з < 1 or |ai|2 = 1.
In the next two sections, we show that the conditions just derived are
sufficient to ensure the existence of a global point on E(X) = 0.
§3. Exercises
1. (i) Let p > 2 be prime and let 6, c G Z, p / 6. Show that bx2 + c takes
precisely |(p + 1) distinct values p for x G Z. (ii) Suppose that, further,
a G Z, p / a. Show that there are x, у G Z such that bx2 4- c = ay2 (p).
2. Let a, b, c G Zp, |a|p = |6|p = |c|p — 1 where p is prime, p > 2. Show
that there are x, у G Zp such that bx2 + c = ay2.
3. Let p > 2 be prime, G Z (1 < i, j < 3), aj; — and let
d = det(aij). Suppose that p / d. Show that there are Xi, rr2, x-j G Z,
not all divisible by p, such that . aijX(Xj = 0 (p).
4. Let a, b, c G Z, 2 / abc. Show that a necessary and sufficient condition
that the only solution in Q2 of ax2 + by2 + cz2 — 0 is the trivial one is
that a = b = c (4).
5. For each of the following sets of a, b, c find the set of primes p
(including 00) for which the only solution of ax2 + by2 + cz2 — 0 in Qp
is the trivial one:
(i) (a,b,c) = (1,1,-2)
(ii) (a, b, c) = (1,1, —3)
(iii) (a, b, c) = (1,1,1)
(iv) (a, b, c) = (14,-15,33)
6. Do you observe anything about the parity of the number N of primes
(including 00) for which there is insolubility? If not, construct similar
exercises and solve them until the penny drops.
7.(i) Prove your observation in (6) in the special case a — 1, b = —r,
c = — .s, where r, s are distinct primes > 2.
[Hint. Quadratic reciprocity]
(ii) [Difficult], Prove your observation for all a, b, c G Z.
4
Geometry of numbers
At this stage we require a tool from the Geometry of Numbers, which
we shall develop from scratch.
A generalization of the pigeon-hole principle (Schubfachprinzip) says
that if we have N things to file in H holes and N > mH for an integer
m, then at least one of the holes will contain > (m +1) things. We start
with a continuous analogue.
Let Rn denote the vector space of real n-tuples г = (rj,... , rn). It
contains the group Zn of г for which rj G Z (all j). By the volume
V(<S) of a set 5 C Rn we shall mean its Lebesgue measure, but in the
applications we will be concerned only with very simple-minded S.
Lemma 1. Let m > 0 be an integer and let 5 CR" with
У(<$) > m.
Then there are m + 1 distinct points So,..., sm of S such that
s, — Sj G Z" (0 < i, j < m).
Proof. Let W C R" be the “unit cube” of points w with
0 < Wj < 1 (1 < j'< n).
Then every x G Rn is uniquely of the shape
x = w + z,
where z G Zn. Let ^(x) be the characteristic function of S (= 1 if x G <S,
18
Lectures on Elliptic Curves
= 0 otherwise). Then
m < 7(5) = / i[>(x)dx
= / / V’(w + z) I dw.
Jw W" /
Since V(W) = 1, there must be some w0 g W such that
V’fwo + z) > m,
zg2n
so > m + 1.
We may now take for the Sj the w0 + z for which i/>(w0 + z) > 0.
The set S is said to be symmetric (about the origin) if —x g S when-
ever x g S. It is convex if whenever x, у g 5, then the whole line-
segment
Ax + (1 —A)ygS (0 < A < 1)
joining them is in S. In particular, the mid-point |(x + y) is in S.
Theorem 1. Let Л be a subgroup of Z" of index m. Let С C Rn be a
symmetric convex set of volume
V(C) > 2nm.
Then C and Л have a common point other than 0 = (0,. . . , 0).
Proof. Let S = yC be the set of points |c, c g C. Then
V(-C) =2-"7(C) >m.
2
By Lemma 1, there are m + 1 distinct points c0,.. . , cm EC such that
-С,- — -Cj gZ” (0 < i, j < m).
2 2
There are m + 1 points
1 1 . .
-С,----Co (0 < i < m)
2 2
and tn cosets of Z" modulo Л. By the pigeon hole principle, two must
be in the same coset, that is there are г, j with i j such that
i i
-C,----C; g Л.
2 2
Now —Cj g C by symmetry; and so
1 1 1 1 / . z,
-c;----Cj = -Ci + -(-Cj) g C
2 2 2 2
by convexity.
Exercises
19
Note. Lemma 1 and Theorem 1 with m = 1 are due to Blichfeldt and
Minkowski respectively. The generalizations to m > 1 are by van der
Corput.
As a foretaste of the flavour of the application in the next section, we
give
Lemma 2. Let N be a positive integer. Suppose that there is are / G Z
such that
I1 2 = -1 (IV).
Then N = u2 + v2 for some u, v £~L.
Proof. We take n = 2 and denote the co-ordinates by x. y. For C we
take the open disc
x2 + y2 < 2m
of volume (= area)
V(C) = 2?rm > 22m.
The subgroup Л of Z2 is given by
x, у G Z, у = lx (m).
It is clearly of index m. Hence by the Theorem there is
(0,0) + (u,v) G AflC.
Then
0 < u2 + v2 < 2m
and
u2 t>2 = м2(1 I2) = 0 (m).
Hence u2 + v2 = m, as required.
We note, in passing, that the condition of the lemma is certainly
satisfied for primes p with p = 1 (4).
§4. Exercises
1. Let m G Z, m > 1 and suppose that there is some f G Z such that
f2 -|- f + 1 = 0 (m). Show that m = u2 + uv + v2 for some и, v G Z.
2. Find a prime p > 0 for which there is an f G Z such that
1 + 5/2 = 0 (p)
but p is not of the shape u2 + 5v2 (и, v G Z).
5
Local-global principle. Conclusion of proof
We now complete the proof of the local-global principle for conics
using the theorem of the last section. We recall that we had reduced the
proof to that for
f.xl + f2X2 + f3X23 = 0
where ft, f2, /3 6 Z and /1/2/3 is square free. We assume that there
are points everywhere locally and we showed that this implied certain
congruences to primes p dividing 2/i/2/3.
We first define a subgroup Л of Z3 by imposing congruence conditions
on the components of x = (а?!, z2, £3).
First case, p / 2, p/ /1/2/3, say p | /]• We saw (end of §3) that then
there is an rp € Z and that
/2 + r2f3 = 0 (p).
We impose the condition
£3 s rpx2 (p).
Then
•f(x) = Ml + + /з^з
= (/2 + r2f3)x22
sO (p).
Second case, p = 2, 2 / /1/2/3- Then without loss of generality
/2 + /з = 0 (4).
5: Local-global principle. Conclusion of proof
21
We impose the conditions
(2)
= £3 (2)
which imply
F(x) = 0 (4).
Third case, p = 2, 2 | /1/2/3, say 2 | /1. Then
s2fi + /2 + /3 = 0 (8),
where s = 0 or 1. We impose the conditions
X2 = xs (4)
£j = fx3 (2)
which imply
F(x) = 0 (8).
To sum up. The group Л is of index m (say) = 4|/i f2 f31 in Z3, where
throughout this section | | is the absolute value. Further,
F(x) = 0 (4 |Л/2/з|)
for x e л.
We apply the theorem of the previous section to Л and the convex
symmetric set
С:|/1|^ + |/2|^+|/з|^<4|/1/2/з|.
School geometry shows that
V(C) = (тг/3).23. |4/1/2/з I
>23|4/1/2/3|
= m.
Hence there is an c 0 in Л П C. For this x we have
F(x) = 0 (4|/1/2/з|)
|F(x)| < |/1 |£1 + |/2|£2 + |/з|£2
< 4|/г/2/з|;
SO
F(x) = 0,
as required.
We conclude with some remarks.
22
Lectures on Elliptic Curves
Remark 1. We have not merely shown that there is a solution of
F(x.) = 0, but we have found that there is one in a certain ellipsoid.
This facilitates the search in explicitly given cases.
Remark 2. We have made no use of the condition of solubility in Qp for
p ]( 2f\ f2f3. In fact this condition tells us nothing [cf. §3, Exercises 2,
3]. It is left to the reader to check that for any J\, f2, f3 and p with
p / 2f3f2f3 there is always a point defined over Qp on
fiX2 + f2X22 + f3X23 = 0.
Remark 3. We have also nowhere used that there is local solubility for
Qoo = R-
Hence solubility at is implied by solubility at all the Qp (p Y- oo).
This phenomenon is connected with quadratic reciprocity. In fact for
any conic over Q, the number of p (including oo) for which there is not
a point over Qp is always even [cf. §3, Exercises 6,7]. See a book on
quadratic forms (such as the author’s).
§5. Exercises
1. Let
F(X, У, 2) = 5X2 + ЗУ2 + 8У2 + 6(У2 + ZX + ХУ).
Find rational integers r, y, z not all divisible by 13, such that
Е(т,у,г) = 0 (mod 132).
[Hint. cf. Hensel’s Lemma 2 of §10.]
2. Let
F(X, У, Z) = 7X2 + ЗУ2 - 2Z2 + 4YZ + QZX + 2XY.
Find rational integers x, y, z not all divisible by 17 such that
F(x, y,z) = 0 (mod 173).
6
Cubic curves
In this section we consider curves given by
C-. F(X) = F(X1,X2,X3) = 0,
where F is a homogeneous cubic form. The case of interest is when
the ground field is the rationale Q, but our initial remarks apply to any
ground field.
A point x on C is said to be singular when
dF
—(x) = 0 (j = 1,2,3).
(7 A j
If we choose co-ordinates so that x = (0,0,1), this is equivalent to F
not containing terms in X3, XjXj,
A singular point counts with multiplicity at least 2 as an intersection
with a line. More precisely, if a, b are two points on the line, the general
point on it is
Aa + /2b,
where the numbers A, /2 are not both 0. The intersections with C are
given by
F(Aa+/rb) = 0, (*)
a homogeneous cubic in А, //. What is claimed is that if one of the
intersections is a singular point of C then the corresponding ratio A : /2
occurs as a multiple root of (*). An easy way to check this is to take
b = x.
24
Lectures on Elliptic Curves
Suppose that C has two distinct singular points x, y. The line joining
them cuts C at both x, у with multiplicity > 2. This can happen only if
P(Ax + ^y) vanishes identically, i.e. if C contains the whole line. If we
suppose, as we shall, that C is irreducible (i.e. that F does not factorize),
this cannot happen. An irreducible cubic curve has at most one singular
point.
Now take the ground field to be Q. If there is a singular point over the
algebraic closure Q, there is at most one. By Galois theory2 it must be
defined over Q. Hence, as we have already seen in §1, C is birationally
equivalent over Q to the line.
From now on we restrict attention to non-singular cubic curves, i.e.
those which have non-singular points over Q. Let a, b be rational points
on C. The line joining them meets C in a third point, in general distinct:
it is also rational since it is given by a cubic equation, two of whose roots
are rational. This process was used already by Diophantos to find new
unobvious points from known obvious ones. The variant in which one
takes the third point of intersection with the curve of the tangent at a
rational point was, according to Weil, first noted by Newton. An older
generation of mathematicians refer to these as the “chord and tangent
processes”.
In general, starting from one rational point a on C one obtains in-
finitely many by the chord and tangent processes. If this is not the case,
a is said to be exceptional. For example we have
Lemma 1. Let a > 1 be a cubic-free integer and let
С : X3 + Y3 - aZ3 = 0.
The point (1, —1, 0) is exceptional. For a = 1 the points (0,1,1), (1, 0,1)
are also exceptional. For a = 2 the point (1,1,1) is exceptional. No other
rational point is exceptional.
Proof. We first show that the given points are indeed exceptional. The
tangent at (1,—1, 0) is X -|-У = 0, which meets C only at (1, —1,0). The
other cases for a = 1 are similar. The tangent at (1,1,1) for a = 2 is
X + Y — 2Z = 0, which meets C again only at (1, —1,0).
Let x = (r, y, z) be a rational point other than those named. We may
2 For the cognoscenti. If the ground field is not perfect, the conclusion does not
necessarily hold. See Note at end of §9.
§5: Exercises
25
suppose that x. y, z are integers without common factor. The equation,
for C implies that then x, y, z are coprime in pairs.
Let Xj = (x3 , yi, zj) be the third point of intersection, where again.
Xj, j/j, zj are integers without common factor. It may be verified3 that
: У1 : Zj = x(x3 + 2y3) : -y(2x3 + y3) : z(x3 - y3)
Let d be the greatest common divisor of the three terms on the right
hand side. If a prime p divides both x and d it must also divide y, a
contradiction. Hence d divides x3 + 2y3 and 2x3 + y3. It thus divides
3x3 and 3y3, so d = 1 or 3. Hence
z3 = iz(x3 — y3) or zi = ±z(x3 - y3)/3.
In either case, it is readily verified that |zj | > |z| except for the x listed
in the enunciation. By repeating the tangent process we thus get a
sequence of points x, Xj, x2,. • with
H < |zj| < |z2| < ... .
Hence the Xj are distinct, and x is not exceptional.
§6. Exercises
1. (i) Show that the cubic curve
Y2Z = X3 + AXZ2 +BZ3
is non-singular provided that
4.43 +27B2 yt 0.
(ii) If 4.4 3 + 27B2 = 0, find a singularity and decide whether it is a
cusp or a double point with distinct tangents.
2. (i) Let
F(x) = aiX3 + a2X3 + a3X3 + <DGX2X3,
where
al a2a3 7^ 0.
Show that F(x) = 0 is non-singular provided that
a1a2a3 + d3 ф 0.
(ii) If aj = a2 = a3 = 1, d = —3, show that any point (xj, x2, x3) with
a3 = x2 = x3 = x3x2x3 = 1 is a singularity.
3 This is essentially a special case of elegant formulae of Desboves for the chord
and tangent processes. See Exercises and Formulary.
26
Lectures on Elliptic Curves
(iii) How does the result of (ii) square with the result proved in the text
that a cubic curve has at most one singularity?
3. Let F(x) be as in the previous question and suppose that F(x) = 0
is non-singular.
(i) Let F(x) = 0. Show that the third intersection t of the tangent at
x is given by
tj = xj+i — aj+'2xj+i) (J = 1,2,3),
where the suffixes are taken mod 3.
(ii) Let x, у be distinct points on F"(X) = 0. Show that the third
intersection z of the line joining them is given by
zj = xjUj+iyi+i ~ yjxj+ixj+'2-
[Formulae of Desboves].
4. Starting with the solution (2, —1, —1) of X3 + Y3 + 7Z3 = 0, find 10
distinct solutions.
7
Non-singular cubics. The group law
Let C be a non-singular cubic curve and let о be a rational point on
C. We show that the set of rational points on C has a natural structure
of commutative group with о as neutral element (“zero”).
Hence the ground field is arbitrary, the curve C is defined over it; and
by rational point we mean point defined over the ground field.
The group law is defined as follows. Let a, b be rational points. Let
d be the third point of intersection with C of the line through a, b. Let
e be the third point of intersection of the line through o, d. Then we
write
a 4- b = e.
The construction has to be interpreted appropriately if two or more of
the points involved coincide. For example if b — a we take the tangent
at a.
28
Lectures on Elliptic Curves
We have to show that this operation gives a structure of commu-
tative group. Clearly
a -J- Ь = b + a
and
о + a = a
for all a.
Next we construct the inverse. Let the third intersection of the tangent
at о be k. Let a- be the third intersection of the line through a and k.
Then by definition
a + a- = о
The crunch is to show that + is associative:
(a + b) + c = a 4- (b 4- c).
We give two proofs; the first geometric, the second more fundamental.
Let a, Ь, c be given. Consider the diagram
a w V
b f c u
d e о
1 m n
Here r, s, t, 1, m, n are the names of lines and the remaining symbols
7: Non-singular cubics. The group law.
29
are points on C. All except f, w are intersections of two of the lines.
The whole figure is determined once a, b, c and о are given.
We have (a+ b) — e, and so (a-f-b) + c is the third intersection of the
line through o, f. Similarly a + (b + c) is the third intersection of the
line through o, w. To prove associativity, we thus have to show that f,
w are not as shown but coincide with the unlabelled intersection of the
lines r, m.
We now recall a geometrical
Lemma 1. Let Xi,... , x8 be 8 points of the plane in general position4.
Then there is a 9th point у such that every cubic curve through X],... , x8
also passes through y.
We briefly recall the proof of the lemma. A cubic form F(X), X =
(Xi, X2, X3) has 10 coefficients. An equation F(x) = 0 imposes a linear
condition on the coefficients. Passing through Xj,.. . ,Xg imposes 8 con-
ditions. Hence if Fi(X), F2(X) are linearly independent forms through
the 8 points, any other F is of the shape
F(X) = AF1(X) + ^F2(X).
Now Fi = 0, Fj — 0 have 9 points in common; and clearly F = 0 passes
through them all.
Now to the application of the Lemma. Let an equation for the line 1
be Z(X) = 0 etc. and consider the two (reducible) cubics
Fi(X) = Z(X)m(X)n(X) = 0
F2(X) = r(X)s(X)/(X) = 0.
Our nonsingular cubic C passes through 8 of the points of intersection of
Fi = 0, F2 = 0 and so by the Lemma must pass through the 9th. Hence
f = w, as required.
We now present a second proof of the associativity of the relation
for points which is more basic.
A linear form Z(X) (say) does not give a meaningful function on the
curve C because the coefficients X are homogeneous. On the other hand,
if Z(X) is another linear form, then the quotient
ff(X) = Z(X)/Z(X)
does give something meaningful. In the situation just discussed, the line
4 Thia is the geometer’s way of saying “such that the proffered proof works”. In
this case, what is needed is that the xj give linearly independent conditions
on the coefficients of F: so no 4 on a line and no 7 on a conic.
30
Lectures on Elliptic Curves
1(X) = 0 passes through a, b, d and t(X) = 0 through d, o, e, all being
points on C. The function j(X) thus has a zero a, b and a pole at o, e.
At the point d there is neither a zero nor a pole, as the zeros of the
linear forms cancel out.
There is the notion of the order of a pole or zero at a nonsingular point
of an algebraic curve which generalizes in an obvious way the notion of
the order of a zero or pole of a rational function of a single variable. In
our case, ff(X) clearly has simple poles at a, b and simple zeros at o, e.
The equation e = a-f- b is equivalent to the existence of such a function.
Similarly, the equation
x = (a + b) + c
is equivalent to the existence of a function with simple poles at a, b, c,
a double zero at о and a simple zero at x. The equation
(a + b) 4- c = a + (b + c)
is now obvious.
This point of view shows that the group law is unchanged under bi-
rational equivalence, since it depends only on the function field of the
curve. The geometer would say that a + b = c precisely when the divisor
{a, b} is linearly equivalent to the divisor {o, c}.
We conclude with an informal explanation of what is meant by say-
ing that a nonsingular cubic curve is of genus 1. Let г > 2 and let
Xj,... , хг,У1,... ,yr-i be points on C, for simplicity all distinct. By
manipulating linear forms in X, as we did in the construction of <?(X),
one can construct a function /г(Х) on the curve where only poles are
simple poles at Xi,. .. ,xr and which has zeros at yj,.. . , yr_j. Then
h(X) has one further zero, which is completely determined.
Contrast the position on the line. Let Cj,... , cr, dl,... , dr be any 2r
distinct numbers. Then the function
пм/пм
has simple zeros at the dj, simple poles at the Cj and no further zeros
or poles (even at infinity).
The genus of a curve is a measure of the freedom in imposing the zeros
and poles of a function. The precise statement, which we shall not need,
is slightly complicated and is called the Riemann-Roch Theorem.
§7; Exercises
31
§7. Exercises
1. Let o, a be rational points on the nonsingular cubic C. Construct
the point —a with respect to the group law for which о is the neutral
element.
2. Let o, Oj be rational points on the nonsingular cubic C. Show how
the group law for which O; is the neutral element can be expressed in
terms of that for which о is the neutral element.
3. Let o, a be rational points on the nonsingular cubic C and suppose
that За = о with respect to the group law based on o. Let b = 2a.
Show that each side of the triangle o, a, b meets the tangent to C of the
opposite vertex at a point of C. Take о, a, b as the triangle of reference
and express this condition in terms of the coefficients of the cubic form
determining C.
4. Let C be the curve
X3 + Y3 - XZ2 - YZ2 + 7XYZ = 0
and let x = (ar, y, z) be a point on C defined over some Qp. Show that
y/x —+ —1 as x —+ (0,0,1) (with respect to the p-adic topology).
5. In this question everything is defined over Qp for some p. Let a be a
nonsingular point on the cubic curve
Г(Х,У,2) = 0
and let i(X) = 0 be the tangent. Let /(X) = 0, m(X) = 0 be lines
through a distinct from the tangent. Show that there are d, e, f such
that
dl(X) + em(X) + ft(X) = 0
(identically) with d ф 0, e 0. Show that
m(x)//(x) —» — d/e
as x -+ a.
8
Elliptic curves. Canonical Form
We are concerned with algebraic curves defined up to a birational
equivalence over the ground field. For genus 0 we saw that every curve is
equivalent to a conic (or line). For genus 1 no such reduction to a special
form or forms is possible. The situation changes when we are also given
a point on the curve which is defined over the ground field (a “rational
point”)- It is convenient to have a special name for this situation: an
elliptic curve is a curve of genus 1 together with the specification of a
rational point on it.
As canonical form we take
C : Y2 = X3 + AX + В
or, in homogeneous co-ordinates
Y2Z = X3 +AXZ2 +BZ3.
The right hand side does not have multiple roots provided that
4Л3 + 27B2 0.
The specified rational point о is the point (X, Y, Z) = (0,0,1) at infinity.
Since the line at infinity is an inflexional tangent at o, the group law
on C is especially simple:
-(*,!/) =
and a + Ь + с = о precisely when a, b, c are collinear.
We shall find this choice of canonical form particularly convenient
when the ground field is Q. When the ground field is of characteristic 2
or 3, we can no longer use C as a canonical form but must use
Y2 + a,iXY &3 Y = X3 T C12X T a4X as.
8: Elliptic curves. Canonical Form.
33
However this is quite peripheral to our purposes and we leave it to the
reader, if she wishes, to deal with these cases.
As we have not formally defined curves of genus 1, we will not give a
formal proof that elliptic curves are birationally equivalent to the canoni-
cal form. In compensation we will give detailed algorithms for converting
certain kinds of elliptic curves to that form. These could well be omitted
at first reading.
Fact, (characteristic 2,3). Any elliptic curve is birationally equivalent
over the ground field to the canonical form for some A, B.
More precisely the curve is equivalent to C and the equivalence takes
the specified rational point О on ii into the point at infinity on C.
Proof for the Cognoscenti. By the Riemann-Roch theorem, the set of
functions on the curve with at worst a pole of order 2 at О has dimension
2. Let a basis be 1, £. Similarly the set of functions with at worst a triple
pole is of dimension 3 at O, with basis say 1, £, rj. Then the functions
all have at worst a pole of order 6. By the Riemann-Roch Theorem, there
must be a linear relation between the 7 listed functions. The relation
must involve both and rfi. A transformation
£ Cj£ + c2
4 -» C37] + C4£ + C5
reduces the relation to
q2 = C + + В
for some A, B.
Note for the Cognoscenti. The reason why there is no canonical form,
or finite family of canonical forms for curves of genus 1 is that
2(y — 1) = 0 for g = 1.
For every other genus we can use the divisor of the differential of a
function defined over the ground field to give a birational map. For
example, for genus 2, there is always equivalence with some curve
У 2 = sextic in X.
Particular cases. The above proof does not, in any case, usually provide
a practical algorithm. We discuss some special cases. Note that it is
34
Lectures on Elliptic Curves
enough to transform the curve into the shape C. For if it takes О into
a, we can make the translation x —» x — a on C.
(i) Cubic curve T>. Rational point О has inflexional tangent. Here a
linear tranformation of co-ordinates is enough, taking О to о and the
tangent to be line at infinity.
For example
T> : X3 + У3 + dZ3 = 0
О = (1, —1,0).
Put
X = и + V, Y = U~V.
Then
бПУ2 = ~2(73 +dZ3,
so
Y12Z1 = X3 - 24,33-d2Z3,
where
Xj = -6dZ, У1 = 62dV, Zx = U.
(ii) Cubic curve T>. Rational point О not on inflexional tangent5.
The tangent at О meets T> again at a rational point P, say. We
may take an affine system of co-ordinates with P as origin and with the
tangent as У-axis
5 The argument is due to Nagell: Sur les proprietes arithmetiques des cubiques
planes du premier genre. Acta Math. 52 (1928-9), 92-106. Older geometrical
techniques (adjoint curves etc.) had shown that every elliptic curve is bira-
tionally equivalent to a cubic, but he was the first to show that it can be
reduced to the canonical form.
8: Elliptic curves. Canonical Form.
35
Then the curve T> is given by F(X, Y) = 0, where
F(X,Y) = F^X,Y) + F2(X,Y) + F3(X,Y),
with Fj is homogeneous of degree j.
The У-axis meets the curve at (0, y), where
0 = yF^O, 1) + y2 F2 (0,1) + y3F3(0,1).
Since the У-axis is a tangent, we have a double root:
F2(0,l)2 -4F1(0,l)F3(0,l) = 0. (*)
Now consider the intersection of the curve with У = tX. Then
0 = zTVM) + T2F2(l,t) + x3F3(l,t).
Discarding the solution x = 0, we have
s2 = F2(l,i)2 - 4JF1(l,/)^’3(l,i)
= G(t) (say),
where
s = 2Tj(1, f)ar + -F2(l, f).
Now G(f) is a cubic by (*); and we achieve the canonical form by a linear
transformation on s,f.
(iii) Curve T> is Y2 = Quartic in X with rational point.
Let the rational point be (a, b). By a transformation
1 У
v , _______ у ) ______________
X — a' (X-a)2’
we may suppose that the rational point is at infinity:
У2 = fo + fiX + f2X2 + f3X3 + f,X\
where /4 is a square. On dividing by /4, we have without loss of gener-
ality
A = l.
We can write the right hand side as
G(X)2 +H(X),
where
G(X) = X2 + g.X + gQ
H(X) = h1X + /i0,
and the gj, hj are easily given in terms of the fj.
The equation of the curve is now
(У + С(Х))(У-С(Х) = Я(Х).
Put
у + G(X) = T,
36
Lectures on Elliptic Curves
Y-am^
and
2G(X) = T-^H).
Multiply by T2 and put TX = S. We get
2S2+2fflTS + 2ff0T2 =T3 -h'S-hoT.
This is readily brought to the canonical form.
(iv) Intersection of two quadric surfaces with a rational point.
We use homogeneous co-ordinates X, У, Z, T and may suppose that
the common rational point is (0, 0, 0,1). The two quadric forms are thus
of the shape
Qi = TL + R
q2 = tm + s
where L, M are linear in X, У, Z and R, S are quadratic.
Suppose, first, that L and M are linearly dependent. Then without
loss of generality M = 0. The intersection is
S(X,Y,Z) = 0, T = R(X,Y,Z)/L(X,Y,Z)-,
which is of genus 0.
Otherwise, eliminating T, we have
C(X,Y,Z) = LS-RM = 0,
where C is a homogeneous cubic. It has the rational point
L(X,Y,Z) = M(X,Y,Z) = 0.
Hence we are reduced to an earlier case.
§8. Exercises
1. Transform the following curves to canonical form:
(i) X3 + Y3 + dZ3 = 0
(ii) X3 + Y3 + Z3 - 3mXYZ = 0
(iii) У2 - kT2 = X2, У2 + kT2 = z2
(iv) X2X2 - X,X2 - X-LX2 + A'2X3 = о
2. [Difficult]. Show that the group law on
J^"2 ___________________ -y-2 _ rp2 r?2 _ y-2 । rp2
§5: Exercises
37
with (1,1, 1, 0) as neutral element is given by X3 = Xi + x2, where
X3 = Xit-lVlZ-l - 1^^222
Уз = У2^2^1Х1 - y1tlZ2X2
Z3 = Z2t2Xiyi - Z1t1X2y2
- t2T2 - /2т2 - A/2 - A,2 - /А2 - t2r2
S3 — t-2xl llx2 — ‘'2У1 41У2 — l2z1
3. (i) Find all the points defined over the field Fs of 5 elements on each,
of
Y2Z ~X3 +XZ2
Y2Z = X3 + 2XZ2
Y2Z = X3 + Z3.
Check in each case that they form a group under the group law, with
(0,1, 0) as neutral element.
(ii) As (i) but with other Fp and other curves
Y2Z = X3 + AXZ2 +BZ3.
Find an example where the group is not cyclic. Can you find an.
example where the group requires more than 2 generators?
4. In the curves considered below, the point at infinity is taken as neutral
element for the group law.
(i) Let Y2 = (X —a)(X2 + aX + 6) be an elliptic curve. Show that the
transformation x —» x + (a,0) induces a fractional-linear transfor-
mation
T : x -> (tuar + ti2)/(i2i^ + <22).
Check that T2 : x —» x.
(ii) Consider Y2 = (X — a1)(X — a2)(X — a3) and let T), T2, T3 be
as in (i) with a — aj (j = 1,2,3). Show that T), T2, T3 commute
and that
TiT2T3 : x —> x.
(iii) Let Tj be the 2x2 matrix of coefficients (J with a = otj
(j = 1,2,3). Show that
717г + 7г 7) = 0.
(iv) Find the fixed points of 7) and show that they are interchanged
by T2.
5. Find a necessary and sufficient condition that a line Y = IX -f- m
38
Lectures on Elliptic Curves
should be an inflexional tangent to
Y2 = X3 + AX + B.
Hence find a general formula for the curves in canonical form having a
rational point of order 3.
6. Find a necessary and sufficient condition that a line Y = IX + m
should be an inflexional tangent to Y2 = X(X2 + aX + b).
Hence find a general formula for curves in canonical form having a point
of order 6.
7. Let
F(X, Y, Z) = X2Y + XZ2 + 2У3 + Z3.
Find a birational transformation defined over Q taking the curve F = 0
into canonical form with the point (1,0,0) going to the point at infinity.
8. Find a birational transformation defined over Q taking
X2 - 2X? + X2 = 0, X22 - 2X? + X2 = 0
into canonical form, with (1,1, 1,1) going to the point at infinity.
9. Invent similar exercises to the two preceding, and solve them.
9
Degenerate laws
In. this section we consider the curve
C : Y2 = X3 + AX + В (1)
when
4A3 + 27B2 = 0. (2)
There is then precisely one singular point. We recall that if (2) does not
hold, there is a group law on the curve given by6
a + b + c = 0
whenever a, b, c are the intersection of a line with C. We show that this
continues to give a group law on the nonsingular points in the degenerate
case (2), and we find out what it is.
There are two cases, the second with two subcases.
First case. Cusp. Suppose A = В = 0, so
C : Y2Z = X3
with a singular point at the origin. Any line not passing through the
origin can be written
Z = IX + mY.
It meets C where
X3 -Y2(IX + mY) =0
6
We write indifferently 0 or о for the neutral element of the group law.
40
Lectures on Elliptic Curves
If the three points of intersection are (xj,yj,Zj) (j = 1,2,3), it follows
that
«1 + «2 + “3 = 0,
where
ui = xilVi-
We therefore have the additive group, the zero being the point (0, 1, 0)
at infinity.
Second case1. Double point. (Characteristic yt 2). If not both A, В
vanish, then, after a transformation X —> X+ constant, we have
C : Y2 Z = X2(X + CZ) (CVO),
i.e.
(У2 -CX2)Z = A3.
Suppose, first, that C = 72 is a square. Put
U = Y+yX, V = Y-yX-
so C is given by
873UVZ = (U- V)2.
Any line not passing through the origin can be written
Z = IU + mV.
It meets C where
(C - V)3 - 8y3UV(lU + mV) = 0.
If the points of intersection are (uy, vy, zy) (j = 1, 2, 3), then
( U1 \ i
\vi J \v2 J \v3 J
We have the multiplicative group.
Now suppose that C is not a square. Adjoin 7 to the ground field,
where -у2 = C- For a point (t, y. z) on C, put
У + ix
------ = r + s~f (say),
У - ~tx
where
r2-s2C = l. (*)
We now have a “twisted” multiplication law on (*). Compare the mul-
tiplication of the complex numbers x + iy with x2 -f- у2 ~ 1.
7
We shall not require the details about this case in later work.
9: Degenerate laws 41
Note for the Cognoscenti. In characteristic 2 the curve
C: Y2Z =X3 +AXZ2 +BZ3
is always singular. Write the equation as
(У2 - BZ2)Z = X(X2 + AZ2).
Over a finite (or, more generally, a perfect) field, we have
B=/32, A = a2
for some a, /3. Then the curve is
(Y + /3Z)2Z = X(X + aZ)2\
which is clearly singular.
If the ground field is not perfect, we may have an example of a singu-
larity defined over an inseparable extension, compare footnote in §6.
10
Reduction
The philosophy is to approach the rational field Q through the local
fields Qp and, similarly, to approach the Qp through the finite fields
Fp by reduction modulo p. We do no more than is required for the
applications.
The mod p map —» Fp is denoted by a bar a —► a. This is extended
to the corresponding 2-dimensional projective planes V, V as follows.
Let (ai,a2, аз) be projective co-ordinates of a point a of V. By multi-
plying ai, аг, аз by the same element of Qp, we have without loss of
generality
maxfloil, |а21, |a3|} = 1,
where || = ||p. Then (aj", а7,аз) are the co-ordinates of a well-defined
point a of V.
In a similar way, we define the reduction 1 of a line
1 : liXi + /2-^2 + I3X3 = 0-
If the point a lies on the line 1, then clearly a lies on 1.
We need only the least sophisticated of the many ways of reducing a
cubic curve
C : F(X) = 0
defined over Qp. Here
F(X) = Y, fiikXtXjXk e Qp[X]
where the f ijk. € Qp are not all 0 and without loss of generality
10: Reduction
43
Then
Ж) = E e fp[X]
is not the zero polynomial, and defines the reduced curve
C : F(X) = 0
over Fp. It may, of course, be reducible8.
If a point a lies on C, then clearly a lies on C. There is a weak converse
Lemma 1. Let b be a nonsingular point ofC. Then there is an a on C
such that a = b.
Note. The notation b is intended to denote a point defined over F? not
necessarily derived from a b. We say that b lifts to a. It is easy to see
by examples that a singular point on C may or may not lift to a point
of C (cf. Exercises).
We construct a by successive approximation a la Newton. The generic
term for such constructions in p-adic analysis is Hensel’s Lemma.
Lemma 2. Lei G(T) € Zp[T] and let to € Zp be such thai
|G(t0)|<l, |G'(t0)| = 1,
where G' is the formal derivative of G. Then there is a t € ~Lf such that
G’(t) = 0 |t-t0| <G(t0).
Assuming the truth of Hensel’s Lemma for the moment, we complete
the proof of the Lemma. Since b is nonsingular on C, we may suppose
that
S“(b) °'
0Л. i
Pick any bj € Zp such that b = (bi,. . . , b„). Then the conditions of
Hensel’s Lemma apply to
G(T) = F(T,b2,... ,bn), t0 = b1.
Put a = (t, b2,... , bn), where t is provided by Hensel. Clearly F’(a) = 0,
a = b, so a does what is required.
It remains to prove the Hensel’s Lemma. Let U be an indeterminate.
8 In the sense that F(X) factorizes. There is an unfortunate clash of meanings
between “reduced” (mod p) and “reducible”.
44
Lectures on Elliptic Curves
Then
G(T + U) = G(T) + UG^T) + U2G2(T) + ...
where Gj € Zp[T] and Gi = G1. Now define
и = —G(io)/G/(to),
so
G(io + и) = w2G2(to) + и3Gs(fo) + ... .
Hence
|G(ii)l<|G(io)|2,
where
ti = to + u.
Clearly
IG'GJI = |G'(*o)l = 1.
We may therefore iterate the process and get a fundamental sequence tj
(t > 0). The limit t clearly does what is required.
We shall also need information about the behaviour of the intersection
of a line and a cubic curve under reduction. From what we have already
proved, if 1 meets C in a, then 1 meets C in a. But suppose that 1 meets
C in a, b with a b: if a = b, can we be sure that it has multiplicity
> 2 in the intersection?
The following lemma confirms expectations.
Lemma 3. Suppose that the line 1 meets the cubic curve C in a, b, c,
multiple points of intersection being given with their multiplicities. Then
either
(I) the entire line 1 is in C or
(II) I meets C in H, b, c) multiple points occuring with the correct mul-
tiplicities.
Proof. We have without loss of generality
1з = 1 = maxd/J, |I2|, R3I)-
Consider
G(X1,X1) = F(Xl,X1,-llX1-l2X2)
= IP[X1,X2].
Its reduction is
G(X1,X2) =F(X1)X2-T1X1 -12X2).
$10: Exercises
45
If G(Xi, X2) — 0, we have case (I) of the Lemina, so we may suppose
that
ед,х2)^о.
We normalize the coefficients of a, b, c so that
maxdaj, |a2|, |a3|) = 1.
Since la = 0, it follows that
(aI;-a2) (0,0)
etc.
By hypothesis, there is some A E Qp such that
С(Х1;Х2) = A(a2Xi — ajX2)(b2X1 — 61X2)(c2X1 — CjX2)
= ЛЯ(Х1,Х2).
Now
Я(Х1;Х2) = (a2Xi -alX2)(b2Xl -b1X2)(c2X1 -qX2)
^0.
Hence G, H differ only by a scalar multiple, which is what we needed
to prove.
§10. Exercises
1. (i) Let C be the curve Y2 = X3 +p over Qp. Show that the point
(0, 0) on the mod p curve does not lift to a point of C.
(ii) Find an example of an elliptic curve C over Qp such that the mod
p curve has a cusp which is the reduction of a point on C.
2. Find examples of curves C over Qp such that the mod p curve has a.
double point with distinct tangents which (i) lifts, (ii) does not lift, to
11
The p-adic case
Let
С : У2 = X3 + AX + В
be an elliptic curve defined over Q?, so
4.4 3 -I- 27B2 0
and, without loss of generality,
A,В € Zp.
In this section we study the group <3 of points on C defined over Qp.
Our tool will be the theory of reduction developed in the proceeding
section. For this, we write C homogeneously
C: Y2Z — X3AXZ2 + BZ3.
The reduced curve
C: Y2Z = X3 + AXZ2 + BZ3
over Fp may be singular but (with an eye to Lemma 3 of §10) we note
that C does not contain a line.
Let ® denote the set of points on C defined over Fp and let 25^ C ®
be the non-singular points. Write C ® for the set of points which
reduce mod p to в*' \ The map
<&<°) 0*O)
is surjective by Lemma 1 of §10.
How does the group structure behave? Let a, b, с E в with
a + b + c = o.
11: The p-adic case
47
This holds if and only if a, b, C are the intersection of C with a line
1. Then the reductions a, b, c are the intersections of C with 1. On C
we have defined a group law only for the non-singular points. If a, b,
0* \ then
a + b + c = o.
To sum up so far, we have a subgroup of ® such that there
is a group homomorphism ®(°) —» ®' onto ®' \ The kernel of this
homomorphism is the set of points which map into o, that is, in inhomo-
geneous co-ordinates, о itself together with the (x,y) g ® with x Zp,
у Zp. This is called the kernel of the reduction.
Next, we look at the structure of the kernel of reduction. If (ar, у) g ®,
x, у £ Zp, then clearly |г/|2 = |t|3 and so
l^l=p2n, M=p3n
for some n > 1. We call n the level of (ar, y). For (ar, y} not in the kernel
of reduction the level is 0, by definition. The level of о is oo.
Now for integer N > 1 make the transformation
Xw = p2NX, Yn = p3NY, zn = z,
so the equation of C becomes
CN- Yf,ZN = X^+p^AXnZ^+p^BZ^.
We may use the new co-ordinates for a reduction mod p: the reduced
curve is
cN-. y^zn = x3n.
We can now transfer what was done earlier to the new situation. A
point (ar, y) maps into the singular point (0, 0) of if its level is < N. It
is in the kernel of reduction for C,v if its level is > N. Finally, the group
of the non-singular points on the C?j defined over Fp is the additive
group of Fp. They are in the image of ®, as before.
For N > 1 define to be the set of points of ® of level > N. We
have proved
Lemma 1. The ©W are groups and
® D ®<°> D ®^ D • • • D ®<N> D • • • .
The quotient graphs of ®(,v</®(,v+1j for N > 1 are cyclic of order p.
The quotient ®(°)/®(1) is isomorphic to the group of nonsingular points
on C. Further,
n®^ = {o}.
N
48
Lectures on Elliptic Curves
The sequence of groups is called the p-adic filtration.
Corollary. Let x = (ar,y) € <5 be of finite order prime to p. Then x,
у e zP.
Proof. Otherwise x is of some level n > 1. Then x £ x ©(n+1)
and so maps into a non-zero element of /Q5^n+1\ But this is of order
P-
Our next aim is to free the statement in the Corollary from the re-
quirement that the order is prime to p.
The homomorphism of 0(,v)/0(,v+1) to the additive group mod p is
given by
(ar, y) —> p~Nx/y mod p.
For x € we introduce и(х) defined by
и(х) = xfy (x = (x,y)),
u(o) = 0.
Note that |u(x)| = j?-n, where n is the level of x.
Lemma 2. Let x1; x2 € Then
|u(xi + x2) - u(xi) - и(х2)| < max {|u(xi)|s, |(x2)|5}.
Proof. We may suppose that none of xj, x2, Xi T x2 is o. Without loss
of generality
l«(Xl)l > |и(х2)|.
Define N to be the level of Xj. We use the co-ordinates and
the curve C,v introduced above.
Since neither x1; nor x2 maps into the singularity (0,0) of Сдг, the
line joining them has the shape
ZN = IXN + шУу,
where
|m| < 1.
This meets C where
0 = -y£(/AW+myN) + A^
+ piN AX pfilX n + тУдг)2
+ p6NB(IXn + myN)3
=С3-^ДГ + + C! X-^Yf + СцУу (*)
11: The p-adic case
49
(say). Here
c3 = 1 +piNAl2 +p6NBl3
c2 = 2p4NlmA + 3p6Nl2mB,
SO
|c3 I = 1, |c2 I < p~4N.
The roots Ху/Уу of (*) are — p~Nu(x.1 + x2), p~Nu(x.i'j and
p-Nti(x2). Since the sum of the roots is —с2/сз, the result follows.
Corollary 1.
|«(sx)| = |s| |u(x)|
for all x € and all s ^,1.
Proof. By induction, for s > 0 we have
|n(sx) — su(x)| < |u(x)|5.
This proves the result for p f s and for s = p. Now use induction on the
power of p in s.
Corollary 2. Ь is torsion-free.
Corollary 3. Suppose that p ^2, |4A3 4- 27B21 = 1. Then the torsion
subgroup of 05 is isomorphic to a subgroup of 05.
Proof. For 05 = Q5(°); and so
05 = Q5/Q5(1),
where 05^ is torsion free.
Note for the Cognoscenti. This all generalizes to algebraic extensions
of Qp. The proof that torsion points of order prime to p have integral
co-ordinates continues to hold, but that for points of p-power order may
break down if there is ramification.
There is a power-series in и = и(х) which gives a parametrization
of the group (25^) for large enough N. This was originally shown by
transferring the formulae from the complex variable case. A modern
approach is by formal groups and formal logarithms, see, for example,
Silverman’s book.
12
Global torsion
Let
C : Y2 = X3 + AX + В
be an elliptic curve over Q, so
4.43 + 27В2 ± 0
and without loss of generality
A, В € Z.
Theorem 1. The group of rational points on C of finite order is finite.
If (x,y) о is of finite order, then
ж, у € Z
and
y = 0 or y2 | (4A3+27B2).
Proof. Let <5 be the group of points on C defined over Q and let &p be
the group for Q?, where p runs through the primes.
Let (x, у) о be torsion. Since © C ©p we have
x € Zp, у € Zp
for all p, and so
x € Z, у € Z.
Now let p be any prime with p fi 2, p / (4A3 + 27B2). Then by the
last Corollary of §11, the torsion group of © is isomorphic to a subgroup
of the group of points over F? = Z mod p. Hence the torsion group is
finite. By looking at different p, one can in general restrict the order of
§12: Exercises
51
the torsion group severely. But the following argument makes it easy to
find the torsion points themselves.
If '2(x,y) = o, then у = 0. Otherwise, 2(т,^) = (гсз, г/2) (say) is also
torsion, so X2, У2 € Z.
Now taking the tangent at (t, 3/), we have (cf. Formulary)
( 3т2 + A\2 (3т2 + A)2
t2 + 2x = I -------- 1 =-----------------.
\ 2y J 4(т3 + Ax + B)
and so y2 — x3 + Ax + В divides (3т2 + A)2.
But now,
(3X2 + 4A)(3X2 + A)2 ~ 4A3 + 27B2
mod (X3 + AX + B)
in Z[X, A,B], as in readily verified. Hence
у2 I (4A3 + 27B2),
as required. [For more on identity (*), see §16].
Note. There are stronger statements about the torsion of C when
AB = 0, see Exercises. Mazur has determined all possible forms of
the torsion group. It is one of
Z/nZ 1 < n < 10 or n = 12
or
Z/2Z x Z/2nZ 1 < n < 4;
all of which occur.
§12. Exercises
1. Find the torsion groups over Q of the following elliptic curves:
(i) Y2 = X3 + 1
(ii) Y2 = X3 - 43X + 166
(iii) Y2 = X3 - 219X + 1654
(iv) E2 = X(X - 1)(X + 2)
(v) Y2 =X(X + l)(X+4)
(vi) X3 + Y3 + Z3 - 15XFZ = 0
(vii) Y2 = X(X + 81)(X + 256)
(viii) X2X2 - XjX2 - XjX2 + X2X3 = 0
[Xote: not a random sample!]
52
Lectures on Elliptic Curves
2. Fill in the details of the sketched proof of the following theorem 9 [or
find a better one!].
Theorem. Lei A € Z be i-th power free. Then all the torsion points on
C:Y2 = X(X2 + A)
are given by (I), (II), (III) below:
(I) (0, 0) of order 2.
(II) If A = 4, the points (2, ±4,1) of order 4.
(Ill) If A = — C2, C € Z, the points (±C, 0) of order 2.
Sketch proof.
(i) If (ar, y) = 2(a, b), then
x = (a2 - A)2/4b2.
(ii) The points of order 2 are as stated.
(iii) (0,0) = 2(a,b) for some (a, b) precisely when A = 4. The (±C,0)
are never of form 2(a, b). From now on, let (a, 6) be a point of odd
order.
(iv) a = □
(v) If d = gcd(a, A) then a = daj, A = dAj b = dvbx where
b2 = (da2 Ai).
(vi) There exists f, g, h such that gcd(/,g) = 1 and a, — if2,
da2 + Aj = ig2, bj = fg, d = ±/i2
(vii) a2 — A = 2/t4/4 h2g2, b = h2fg.
(viii) a2 — A = 0 (mod 2b).
(ix) Hence f — 1, g = 0 (2), h = 0 (2). [Hint. First show that f | g],
(x) Hence 24 | A.
(xi) Contradiction!
3. Fill in the sketched proof of the following theorem10 [or find a better].
Theorem. Lei В Elbe 6-th power free and let
C . Y2 = X3 + B.
All the torsion points are given by the following.
(I) If В = C2, the points (0, ±C) of order 3.
(II) If В = D3, the points (-D, 0) of order 2.
9 cf. T.Nagell. Solution de quelques problemes dans la theorie arithmetique
des cubiques planes du premier genre. Skrifter vig. av det norske viden$k.-akad i
Oslo, Mai.-natvrv. kl. 1935, No 1, 1-25.
10 The result is due to R.Fueter: Ueber kubische diophantische Gleichungen.
Comm. Math. Helv. 2 (1930), 68-89; but the argument suggested is based
loosely on L.J.Mordell. The infinity of rational solutions of 4- k. J.
London Math. Soc. 41 (1966), 523-525.
§12: Exercises
53
(III) If В = 1, the points (2, ±3) of order 6.
(IV) If В = —432 = —24.33, the points (12, ±36) of order 3.
Sketch proof.
(i) If (ar, y) = 2(a, b), b 0 then
x = (w — 2)a, w = 9ar3/4г/2.
(ii) the elements of 2-power order are as stated.
(iii) Elements (0,6) are of order 3.
From now on, let (a, 6) be of odd order with a 0. The strategy is to
show that w g Z. The cases with w = 1, 2, 3 are then easily dealt with.
Otherwise, |г|со > |o|oo and so on repeated duplication lajoo -» oo a
contradiction. We sketch a proof that w € Z.
(iv) If p | B, p f a then p / x.
(v) If p | B, p / x then p f a [Hint. Consider repeated duplication.]
(vi) If 3' || b, 3m || a then I = 0, 1 or I = 2, m > 1. [Hint. If I = 3
deduce that either 3 / x oi 36 | B]
(vii) Hence w E Zj.
(viii) w € Z2-
(ix) w € Z, for q|B, q 2, 3.
(x) Hence w € Z.
4. Show that
X3 + Y3 ± dZ3 = 0
is birationally equivalent to
Y2 = X3 - 24.33.d2
If d > 0, d E Z is cube free, deduce from the preceding exercise that the
only cases of torsion are
d = 1, (1, 0, —1) and (0,1, —1) of order 3.
d = 1, (1,1, —1) of order 2.
Compare with results of §6 on exceptional points.
5. Let s € Q. Show that if there is one к £ Q such that
X3 ± sX ± к = 0
has 3 rational roots, then there are infinitely many.
[Hini. Let и be a rational root. Find the condition, in terms of s, u, k:
that the two remaining roots are rational.]
6. Let к € Q, к / 0. Show that if there are two s £ Q such that
X3 +sX ± к = 0
has 3 rational roots, then there are infinitely many.
13
Finite Basis Theorem. Strategy and
comments
The objective of the next few sections is the following.
Theorem 1. The group (S of rational points on an elliptic curve defined
over Q is finitely generated.
The theorem is due to Mordell and it was generalized to number fields
by Weil. It is usually referred to as the Mordell (or Mordell-Weil) Finite
Вазгз Theorem.
For example11, when C is
Y2 = X(X2 + 877)
the group <S is generated by (0,0) of order 2 and (u/v,r/s) of infinite
order, where
и = 37 5494 5281 2716 2193 1055 0406 9942 0927 9234 6201,
v = 6215 9877 7687 1505 4254 6322 0780 6972 3804 4100,
r = 256 2562 6798 8926 8093 8877 6834 0455 1308 9648 6691 5320
4356 6034 6478 6949,
s = 4900 7802 3219 7875 8895 9802 9339 9592 8925 0960 6161 6470
7799 7926 1000.
The proof of Theorem 1 subdivides into two parts requiring different
ideas and techniques.
11 A. Bremner, J.W.S. Cassels: On the equation У2 = X(X2 +p). Math. Comp.
42 (1984), 257-264.
13: Finite basis Theorem. Strategy and comments
55
(i) The “weak finiteness theorem” that
®/2®
is finite. The proof depends on the construction of a map of ®/2® into
a finite group. The proof is in some ways easier if ® has a point of order
2 and we do this first. For this we need to know about isogenies.
It is rather remarkable that the proof of the weak theorem is not
constructive - that is, it does not give an infallible procedure, even in
principle, for determining ®/2®. Even today no algorithm is known.
(ii) The second part of the proof of the finite basis theorem is a “de-
scent”. Suppose that we have a set of representatives bi,. . . , br of the
classes of 0/2®. Let a be any point. Then there is some s, 1 < s < r
such that
a — b3 g2®,
i’e’ a = b3 + 2c, eg®. (*)
The height measures the size of the numbers involved in a point of ®.
For example if x = (a;, y) and x = и/v with u, v g ~L in its lowest terms,
we can take Л(х) = max(|u|, |v|) (absolute values). Now it follows from
(*) that Л(с) < Л(а); at least if Л(а) is greater than some Ha. It
follows that ® is generated by the b3 and the finitely many a with
J(a) <Я0.
We conclude this section by giving one of Fermat’s own descent argu-
ments.
He wished to show that there are no integer solutions of
X4+K4=Z4 X^0,y^0.
This is a curve of genus 3 (not that Fermat knew about the genus), but
he remarked that it is enough to disprove
X4+F4=Z2 X/0,Y/0 (*)
On writing (*) in the shape
(Z/У2)2 = 1 + (А7У)4
one sees that we have an elliptic curve, though not given in canonical
form. However, following Fermat, we consider integer solutions of (*).
If (*) has an integral solution, we take one (x,y) for which
тах(|г|,|!/|)
is > 0 and as small as possible. (|| is the absolute value). Then x. y, z
have no common factor, and indeed are coprime in pairs. Since x4 = 1
mod 4 if г is odd, one of x, у must be odd and the other even. We
56
Lectures on Elliptic Curves
suppose that
2 К 2 [у, 2/z.
Write (*) in the shape
(z + y2')(z ~ у2) = xi-
Since z, у are both odd, the two factors on the left are divisible by 2 but
only one is divisible by 4. Hence (taking z > 0) we have two possibilities,
where u, v g Z:
First Case Second Case
z + y2 = 8u4 2u4
Z - У2 = 2v4 8u4
The first case gives
2 л 4 4 у = — V ,
which is impossible mod 4. Hence we have the second case:
y2 — u4 — 4u4.
Now
(u2 + t/)(u2 - y) = 4v4,
and so
и2 + у = 2v4
и2 — у = 2s4
for some г, з g Z. Hence
V4+? = u2.
This is another solution of (*). Further,
a;4 = 16u4v4 = 16u4r4s4.
Hence rs ф 0 and
max(|r|, |s|) < |ат| < max(|a;|, |т/|).
This contradicts the assumed minimality of the original solution, and
so we have a contradiction.
Note that (r, s,u) —> (ar, ?/, .z) is multiplication by 2. Thus Fermat’s
descent is essentially a converse of Diophantos’ ascent.
Note also that multiplication by 2 has been divided into two steps via
another curve
X4 -4У4 = Z2.
This is the phenomenon of isogeny, which we explore in the next section.
§13. Exercises
1. Let C : Y2 = X3 + AX + В be defined over Q. Let Q(\/d) be
§7У; Exercises
57
a quadratic extension of Q and let the non-trivial automorphism be
denoted by (') Let x be a point of C defined over Q(\/d). Show that
x + x' is defined over Q and that x — x' = (u, v) where и and v/yfd are
in Q.
Deduce that the group of points on C defined over Q(v/d) may be
determined once the groups over Q on C and dY2 = X3 + В are known.
2. This question assumes knowledge of the arithmetic of Q(p) where
P3 = 1, P Ф L
Fill in the details of the sketched proof of the
Theorem. Lei d = qiq2 where qx > 0, q2 > 0 are rational primes
with qt =2 (9), 92 = 5 (9). Then the only rational point on
С: X3 + X3 + dXf = 0
is (1,—1, 0).
Sketch proof.
(i) It is enough to prove that the only points on C defined over Q(p)
are those with X3 = 0
(ii) If x = (а?1, x2, X3) is defined over Q(p) and on the curve, without
loss of generality a?!, ^2, хз are coprime in pairs in Z[p].
(iii) (a;i + x2)(pxi + p-1 x2')(p~1x1 + px2) = — 9392^3. There are 03,
«2, «3, 6; G € %[p] such
Хз +x2 =
p-1^i + px2 = 0-3^3,
or
хг-\- x2 =
p-1^i + px2 = Аа3£з,
where A = p — p”1 [= x/—3].
either
pxi + P 1 *^2 = 02^2 1
010'203 = d,
pxl + p 1 X2 = Aa2^2,
030203 = d
(iv) Ojtf + O2^2 + 03^3 = 0, 010203 = d,
(v) Any rational 93-adic unit is congruent to a cube mod 9, but p is
not congruent to a cube. And similar for 92-
(vi) After multiplying 03,02,03 all by p, or by p2, if necessary, we may
suppose that {03,02, a3} is a permutation if {±1, ±1, ±93 ±92} or
{±1,±9з,±92}.
(vii) The equation £3 +93^ + q2f,3 = 0 is impossible mod 9 [and indeed
mod A3].
(viii) If {03,02,03} is a permutation of {±l,±l,±d}, then
I666I00 < kl^ZsIoo-
14
A 2-isogeny
An isogeny is a map
C -> V
of elliptic curves defined over the ground field and taking the specified
rational point oc on C into that on T>. Clearly the kernel of the isogeny,
i.e. the set of points mapped into oj> is a finite group and is defined
over the ground field as a whole.
In this section we consider the case when C has a rational point of
order 2. It is convenient to modify our canonical form to
C : Y2 = X(X2 + aX + b),
the point of order 2 being (0,0). The function on the right hand side
may not have a double root, so
b 0, a2 — 4b 0.
We take Q to be the ground field. Let x = (a;,g) be a generic point
of C; that is, x is transcendental and у is defined by
y2 = x(x2 + ax + 6).
The field Q(ar, y) is known as the function field of C over Q.
Let
Xi = X + (0,0).
The transformation
x —> Xj
is an automorphism of Q(z, y) of order 2. We will find the fixed field.
Ц: >1 2-isogeny
59
The line through (0,0) and (z,3/) is
X = tx, Y = ty,
which meets C in (0,0), x and —Xi = We get
xi = b/x
У1 = -by/x2.
One invariant under x —> Xi is clearly t2, which is
.2 ( , x2 z2 + az+6
t = (j//z) = -
= A
x
(say) [= x 4- X! + a].
Another is
У + У1-У (say).
To find an algebraic relation between A, у we compute
y2 = y2(l-b/x2)2
— x2 ax ^—(x2 — 2b + b2/x2).
X
Here the first factor is just A. The second is
(z + b/x)2 — 46 — (A — a)2 — 46
= A2 - 2aA + (a2 - 46).
Hence
y2 = A(A2 — 2aA + (a2 — 46)).
Conversely, we can express z, у in terms of A, у and
A1/2 = y/x,
since
A-1/2/j — x — b/x
A = x + (b/x) + a.
Hence
x - |(A + X~i/2y - a), у = A1/2z. (*)
The field extension Q(z, y)/Q(A, y) is of degree 2 and so by Galois
theory Q(A,^) is the complete field of invariants.
The point (A, y) is a generic point of
T> : Y2 = X(X2 — 2aX + (a2 -46)).
The map
ф . C-+T>
given by
x = (x, y) -<• A = (X,y)
60
preserves the group law12. For let a, b be points on C and let f € Q(x)
be a function with simple poles at a, b and simple zeros at o, a4- b. Let
fl be the conjugate under x —» Xi. Then ffi g Q(A): as a function of
A it clearly has simple poles at 0(a), 0(b) and simple zeros at 0(o) — о
and 0(a+ b). Hence
0(a + b) = 0(a) + 0(b).
The equation for 7? has the same general shape as that for C. On
repeating the process with A and 7?, we get p, cr with
<r2 = p(p2 + 4ap + 166);
and so
f = P/4, T) = <r/8
is a generic point of C again.
The points mapping into (A,p) — (0,0) are just the 2-division points
other than (0, 0). Hence the kernel of the map (a?, y'j —> (£, r?) is just the
2-division points and o. So the map must be multiplication by ±2.
We now consider the effect of the isogeny
0: C —> 7?
on rational points. Denote the rational points on C, 7? by <S, Sj respec-
tively.
We denote the multiplicative group of nonzero elements of Q by Q*.
Lemma 1. Let (u, u) € f). Then (u, i>) € 0® precisely when either
и € (Q*)2 or и = 0, a2 — 46 € (Q*)2.
Proof. For и 0, this follows by specializing A —► u, p —> v in (*). The
point (A, p) = (0, 0) comes from the points (a, 0) where a2 + aa + 6 = 0:
and a € Q if and only if a2 — 46 £ (Q*)2.
This suggests the map
<? = fl-QV(Q’)2
given by
g((u,u)) = u(Q*)2 (u/0)
= (a2 - 46)(Q*)2 (u = 0)
5(o) = (Q*)2.
12 The argument is quite general for isogenies of any degree. Note that Jfi is
the norm of f for the extension Q(x)/Q(A), cf. §24, Lemma 1.
1^: A 2-isogeny
61
We note that the equation
v2 = u(u2 — 2au + a2 — 46)
implies that
g((u, v)) = (u2 — 2au + a2 — 46)(Q*)2
whenever the right hand side is defined.
Lemma 2. The map
q :f) -Q’/(Q’)2
is a group homomorphism.
Proof. Write the equation of © as
P: V2 = U(U2 + a1P + 61).
Let Uj = (j = 1,2,3) € T) with
U1 + U2 + U3 = O,
so they are the intersection of P with a line
V = IU + m.
Substituting in the equation for P, we have
U(U2 +<21P + 61)-(lP + m)2
= (17-u1)(P-u2)(P_u3).
Hence
2
U1U2U3 = m .
This implies that
g(u1)5(u2)g(u3) = (Q*)2
except, possibly, when one of the uy is (0,0). The verification in this
case is left to the reader.
Lemma 3. The image of
g-.
is finite.
Proof. Without loss of generality
&i C Z, 61 C Z.
An element of Q*/(Q*)2 may be written r(Q*)2, where
r E Z, square free.
62
Lectures on Elliptic Curves
We show that r(Q*)2 is in the image of q only when r | Zq.
Suppose that g((u,i>)) = r(Q*)2. Then there are s, t g Q such that
rs
и = rt2.
Put t = Z/m, where
l,mgZ, gcd(Z,m) = 1.
Then, on eliminating u,
r I pa^rl m + btm = rn ,
where n = m2s g Z.
Suppose that there is a prime p with p | r, p / 6j. Then p | m, so
p2 | rn2 and hence p | n because r is square-free. Then p3 | r2Z4, so p | Z,
contrary to jcd(Z,m) = 1.
Putting the three lemmas together, we get the
Theorem 1. is finite.
Corollary. ®/2® is finite.
Proof. Consider the exact triangle
P
where and are both finite.
By considering in detail the equations arising in the Lemma 3, we can
get more information about <25/205; e.g. by looking at the equations lo-
cally. There is, however, no local-global theorem and indeed even today
there is no algorithm for deciding whether or not there is a solution. We
shall come back to these questions in a late section. So one should not
conclude from the fact that we can determine ®/2® in the examples
that one can always do so.
1J: A 2-isogeny
63
We first enunciate more precisely what was proved.
Lemma 4. The group Sj/<p<S is isomorphic to the group of ?(Q*)2 in
Q*/(Q*)2 where
(i) q g Z is square-free and q | Iq
(ii) The equation
ql* + a,il2m2 + (bt/q'jrrA = n2
has a solution in I, m, n£Z not all 0.
Further, the point (0,0) of f) corresponds to q = the square-free kernel
of bi.
Example 1.
C -. У2 = AT A"2 - A'+ 6)
V- Y2 = A'(A''2 + 2X - 23)
For Jj/</>® we have q | (—23). Since —23 corresponds to (0,0), we need
look at only one of q = +23, q = —1, say the latter. The equation of
Lemma 4 is
-I4 +2l2m2 + 23m4 = n2
i.e.
— (I2 — m2)2 + 24m4 = n2,
which is impossible in Q3. Hence $/</>0 is generated by (0,0).
For we have q | 6, so q = — 1 or q — ±2, ±3, ±6. Since the
form A'2 — X + 6 is definite, we must have q > 0. Hence q = 2, 3 or 6;
and 6 belongs to (0,0). Thus it is enough to look at one of 2,3, say 2.
The equation is
2I4 — I2 m2 + 3m4 = n2,
which is seen to have the solution (I,m,n) = (1,1,2). This corresponds
t° (x,y) = (2,4).
It follows that is generated by (0,0) and (2,4). To find gener-
ators for (S/2® we need to look at the effect of </> on the generators of
fj/</>®. In this case </>(0,0) = o, so ®/2® is also generated by (0,0) and
(2Л).
Second example. This is related to Fermat’s equation
Ui + V4 = v4.
Then
y = v2w2/u\ x = w2/u2
64
Lectures on Elliptic Curves
satisfy
C-. Y2 = X(X2 — 1),
so
D : Y2 = X(X2 + 4).
For we have q | 4, so q = —1, ±2. Since X2 + 4 is definite, we
need q > 0, so only q = 2 needs to be looked at. The relevant equation
is
2/4 4-2m4 = n2,
which has the solution (Z, m,n) = (1,1,2), giving (X, Y) = (2,4) as the
generator of The point (0,0) is in Ф&.
For we have q | ( — 1). Since —1 belongs to (0,0), there is
nothing to do. Then is generated by (0, 0) and ®/2® is generated
by (0, 0) and ^>(2,4) — (1, 0).
§14. Exercises
1. Find
(i) a set of generators for ®/2®, where ® is the group of rational
points and
(ii) the 2-power torsion, for the following curves
Y2 = X(X2 4-3X + 5)
Y2 = X(X2 - 4X + 15)
Y2 = X(X2 + 4X - 6)
Y2 = X(X2 -X4-6)
Y2 = X(X2 +2X 4-9)
Y2 = X(X2 -2X4-9)
2. Invent similar questions to 1 and solve them. [Xote. You cannot
expect to determine ®/2® in every case, but you can majorize its order.
It might be helpful to write a Mickey Mouse program to look for points
with small co-ordinates.]
3. Let C : Y2 = X(X2 4- aX 4- 6), D : Y2 = X(X2 4- c^X 4- 6L) with
eq = — 2a, bi = a2 — 46.
(i) Show that the odd torsion groups are isomorphic
(ii) Assuming the finite basis theorem, show that the ranks [= number
of generators of infinite order] are the same
§1^: Exercises
65
(iii) give a.n example to show that the orders of the groups of 2-power
torsion need not be the same. Determine what the possibilities
are.
4. (i) Construct an elliptic curve with a torsion element of order 8.
(ii) Show that no torsion element can have order 16.
(iii) Determine all abstract groups of 2-power order which can isomor-
phic to the 2-power torsion of an elliptic curve. Give elliptic curves
in the possible cases and give a proof of impossibility for the others.
5. (Another kind of isogeny). Let
C : Y2 = X3 + В
be defined over Q and let /?2 = 5, /? E Q.
(i) Show that Y — ±/3 are inflexions and that 2(0, B) = (0, — /3).
(ii) Let x = (x,y) be generic and put
x1=x + (0,/3), x2 = x 4-(0,-/9).
Show that
£ = x + arj + x2, г] = у + У1 + y2
are functions of (a;, y) defined over Q and that
T> : rj2 = f3 - 27B.
(iii) Show that the repetition of the above map is (essentially) multi-
plication by 3.
(iv) Denote by 0, Sj the groups of rational points on C, T> respectively.
Denote by Q(/3)* the multiplicative group of non zero elements of
Q(/3). If (x,y) € <0 and
y + ?e{W}3
show that x is in the image of Sj under T) —» C.
[Hint. Put у + /3 — (u + v/З)3 and equate the coefficients of /?.]
(v) Show that
(x,y^ (y+n){m*}3
is a homomorphism
whose kernel is the image of Sj.
(vi) (Requires algebraic number theory). Show that the image of p is
finite [Hint. cf. §16].
(vii) Deduce that 0/30 is finite.
15
The weak finite basis theorem
In this section we show that 0/2® is finite, where ® is the group of
rational points on the elliptic curve
Y2 = F(X),
where
F(X) = X3 + AX + B, 4A3+27B2y^0.
The argument has similarities with that in the previous section, where
we made the addition assumption that F(X) has a rational root.
Here we treat in a uniform manner the cases when F(X) has 3 rational
roots, one rational root, no rational root. We work with the commutative
ring
Q[0] = Q[T]/F(T),
where T is a variable and 0 is the image of T. Then Q[0] is the direct
sum of as many fields as F(T) has irreducible factors13 * is.
There is a norm map
Norm : Q[0] —* Q
defined as follows. Let a g Q[6). The map
£ -> £ g Q[0]
13 The preceding section has proved the weak finite basis theorem when F(T)
has a rational root, so it would be enough to consider the case when Q[0]
is a field. This brings some minor simplifications to the proof.
15: The weak finite basis theorem
67
takes Q[0] into itself. If Q[0] is regarded as a 3-dimensional vector space
over Q, the map is linear and its determinant is defined to be Norm(a).
Clearly
Norm(a/?) = Norm(a) Norm(/?);
and a is invertible (i.e. has an inverse) precisely when Norm(a) ф 0. It
is readily checked that
Norm(a — 0) = J’(a) (a 6 Q).
Denote by Q[0]’ the multiplicative group of invertible elements of
Q[0]. We shall work with the group
M c Q[0)7(Q[0]*)2
consists of the a(Q[0]*)2 for which Norma € (Q*)2.
There is a map
p : ® M
defined as follows.
(i) p(o) = 1(Q[0]‘)2
(ii) if a = (a, J) g 0, li / 0, then
p(a) = (a — 0)(Q[0]*)2
(iii) if14 a = (a,0), then F(a) = 0, so one of the summands in the
expression of Q[0] as a sum of fields is a copy of Q arising from
the map 0 —> a. Hence this component of a — 0 is 0. We replace
(patch) this component with any element of Q* such that the norm
of the new element of Q[0] is in (Q*)2.
Lemma 1. The map p is a group homomorphism.
Proof. Let aj = (<2j,bj) (j — 1,2,3) be elements of ® with
ai + a2 + аз = 0,
so that they lie on a line
Y = IX + m l,m g Q.
Then
F(X) - (IX + m)2 = (X - ai)(X - a2)(X - a3).
Replace X by 0:
(aj — 0)(a2 — 0)(“з - 0) = (L0 + m)2.
14
cf. preceding footnote.
68
Lectures on Elliptic Curves
If all the bj ф 0, then the ai — 0 are invertible and we are done.
It remains to deal with the case when F(T) is reducible and at least
one of the roots is among the ay. If only one of the roots, e (say), of F(T)
is among the ay, then Q[0] is a direct sum K3 ф К? or Ki ф К? ф К3 of
fields, where Ki is the copy of Q given by 0 —» e. The given proof shows
that the Lemma holds for the components in Kj (j ф 1). Since we have
patched things so that the norms are always a square, the Lemma must
hold for the Ki~components as well.
The remaining case is when all the by are 0 and the ay are the roots of
F(T). Then Q[0] is the direct sum of three copies Kj of Q by 0 —> ay
(y = 1, 2, 3). The components of 0 — a! in A’2, K3 are a2 — a3, a3 — a3
respectively. Hence the patch for the zero compound of 0 — a! in Кг is
(a2 — a!)(a3 — <2i)(Q*)2- Now the truth of the Lemma follows by direct
calculation.
Lemma 2. The kernel of p is 2(5.
Proof. Since M has exponent 2, the kernel certainly contains 2®. We
have to show it is no bigger.
Suppose that
Ha) = (Q‘[©])2, a = (a, b).
Then15
a — 0 = (p2 02 + pi 0 + po )2
for some po, pi, pi € Q. Further,
_ P2 Ф 0,
since 0 does not satisfy any equation of degree < 3.
We can find s0, -Si, r0, ri € Q such that
(S10 + s0)(p2Q2 +Pi& +po) = Г10 + r0,
since the vanishing of the 02-component on the right hand side is a
linear condition on So, «1• If si = 0, sa 0, we would have p2 = 0.
Hence, without loss of generality,
Sj = -1.
Now
(s0 - 0)2(a - 0) = (riQ + Го)2.
15 A moment’s consideration shows that this statement remains true when
6 = 0, though then 4" Pl ® + Po is not invertible.
15: The weak finite basis theorem
69
On replacing 0 by an indeterminate X, we have
(r.X + r0)2 - (s0 - X)2(a - X) = F(X),
since the coefficient of X3 is 1.
Hence the line
Y = r1X + r0
meets У2 = F(X) in (a, ±6) and (so,t) (twice) for some i. It follows
that (<i,b) g 2<S, as required.
Theorem 1. <S/2<*5 is finite.
Proof. It is enough to show that the image of /j. : > Л1 is finite.
We may suppose without loss of generality that
А, в e t.
Let x = (a:,y) g (S. Then y2 — F(x) implies that
x = r/t2, у = s/t3
where
r,s,t 6 Z, gcd(r, f) = gcd(s, f) = 1,
and
s2 = r3 + Art4 + Bi6.
To illustrate ideas, suppose for now that the roots ej, e2, e3 of F(X)
are rational, and so in Z. Then
•s2 = (r - eit2)(r - e2f2)(r - e3f2). (*)
Now
gcd{(r - ejt2),(r - e2t2)}
divides (ej — e2)t2 and (e3 — e2)r, so divides (e3 — e2): and similarly for
the other pairs of factors. Hence and by (*)
r — ej = djV2,
where dj is square free,
dj|(ei - e2)(e2 - e3)(e3 - ej,
and
did2dj — square.
There are thus only finitely many sets {d1;d2,d3}; which proves the
theorem in this case.
70
Lectures on Elliptic Curves
Before leaving this special case, we note that (tq,г>2,u3, t) lies on the
curve given redundantly by
' (ei — е2)12 = d2v% — d3v2
T> : < (e2 - e3)f2 = d3v3 — d2v2
. (e3 - ei)t2 = dxv2 - d3v2
We may therefore get further information about ®/2® by looking whe-
ther there is a rational point on T). In particular, one may be able to
show that there is not a rational point by local considerations.
Now consider the general case. Denote the roots of F(X) by £j g Q
(j = 1,2,3). We work16 in К = Q(ei,e2,£3). As in the rational case,
the ideal [r — Cit2,r — e2t2] divides — e2. Hence each principal ideal
[r — Cjt2] is a square up to one of a finite number of ideal factors. The
finiteness of the class-number and the finite generation of units now
imply that
r - Sjt2 = 8jX2,
where 8j. Xj g Q(£j) and {^i, £2,^3} is from a finite set. This clearly
shows that the image of ц is finite and so completes the proof of the
Theorem.
We now find a curve T> with properties analogous to those of the T>
constructed above in the case when the roots are rational.
We have shown that if (x,y) € ®, then
г_0 = гд2,
where <5, A g Q[0] and 8 is one of a finite set. Write
A = v0 + «1© + u2©2 £ Q).
Then the right hand side becomes
Яо(у) + Я1(у)0 + Я2(у)©2,
where Hj(v) g Q[v] is a quadratic form depending on 8. Hence there is
a rational point (v,t) on
( Я2(у) = 0
D : <
l Я3(у) = -t2.
16 This is the only place where the use of algebraic number theory is unavoid-
able. If she does not know the theory, the reader should take it on trust that
it is very like the rational case. But see next footnote.
15: The weak finite basis theorem
71
Again, we can get further information on <S/2<25 by examining whether
there is a point on T) everwhere locally. If not, then f> cannot occur. If
there is, we can make s further useful transformation. If T) has a point
everwhere locally, there is always a point everwhere locally on the conic
#2(v) = 0.
There is a point on = 0 globally by Theorem 1 of §3, and so (see
exercises)
#2(v) = hL2 -MN
identically, for some h g Q and some linear forms
L(v), M(v), N(v) e Q[v],
Hence the rational points on Hfiv) = 0 can be parametrized in terms
of r, s (say) by
Vj = Vfir,s) (j = 1,2,3)
where the Vj(r, s) € Q[r, s] are quadratic forms.
It follows that T> is birationally equivalent to
P': t2 = G(r,s),
where G is a quartic form. It would be possible to describe the possible
equivalence classes of quartic forms G in terms of its invariants instead of
the detour through algebraic number theory17. In fact this is what Birch,
and Swinnerton-Dyer did in their historic computations. [B.J. Birch and
H.P.F. Swinnerton-Dyer. Notes on elliptic curves I, II. J. reine angew.
Math. 212 (1963), 7-25; 218 (1965), 79-108].
We conclude this section by looking at a couple of examples.
First Example.
Y2 = X(X2 - 1).
We considered this already as an example of isogenies. Let (r/t2, s/t3')
be on the curve, so
r(r + f2)(r — i2) = s2.
The greatest common divisor of (r ± t2) is 1 or 2: that of r and r + t2
or r — t2 is 1. Further,
r + i2 > r > r — t2.
17 This line of argument proves the finiteness of <15/205 without algebraic num-
ber theory at the expense of a fairly substantial study of binary quartic
forms.
72
Lectures on Elliptic Curves
Hence if
r + t2 = di v2, r = d2v%, r — t2 = d3i>2
with the dj square free, the only possibilities are
(di,d2,d3-) =(1,1,1),
(2,1,2),
(1,-1,-1),
(2,-1,-2).
These are all realized by the points of order 2. Hence ®/2® is gener-
ated by them.
Second example. Most applications require algebraic number theory. We
give one such application, to which we will want to refer later.
The curve
Y3 + уз + 6QZ3 = Q
is birationally equivalent to
Y2 = A'3 — 33(30)2.
We shall work in Q(£) where <53 — 30. This has class number h = 3 and
fundamental unit18 rj = 1 + 96 — 362.
The roots of F(X) = X3 -33(30)2 агеЗб2, 3p<52, 3p2<52, where p3 = 1.
In our usual notation, if (r/f2,s/f3) g ®, a prime common ideal divisor
of any two of
r-362t2, r — 3p62t2, r-3p262t2
must divide 2.3.5. Since 2, 3, 5 ramify completely, r — 362t2 must be a
perfect ideal square.
In the real embedding, clearly
r - 362i2 > 0,
and so either r — 362t2 = a2 or r — 362t2 = rja2 for some a 6 Q(6).
We disprove the second alternative. Put
a = и + v6 + w62.
Equating coefficients of powers of 6 in
r — 362t2 = r]a2,
18 As it can be mildly troublesome to check that a unit is fundamental, all we
actually use is that т/ > 0 in the real embedding and r) is not a square. The
last fact follows by looking at rj modulo [2 — 6, 11].
§15: Exercises
73
we get
0 = 9u2 4- 2uv — 9ov2 — 180uw + 540uw + 30w2
—3t2 = —3u2 + 18uv + v2 + 2uw — 180uw + 270w2.
On putting
и = —28e + 90/,
v = —9e 4- 29/,
w = g — 3e 4- 9/,
in the first equation, it becomes
0 = 30ff2 -4e/.
Hence there are m, n such that
e : / : g = m2 : 30n2 : 2mn.
On substituting in the second equation, we get for some I:
—3l2 = 3m4 — 112m3n 4- 1620m2n2
- 10800mn3 4- 27900n4.
But this is impossible in Q2. (Consider |n|2 < |m|2 and |n|2 > |m|2
separately).
Hence ®/2® is the trivial group.
§15. Exercises
1. Determine the 2-power torsion and sets of representatives ®/2® for
У2 = E(X) in the following cases.
(i) F(X) = X(X - 3)(X 4- 4)
(ii) ДХ) = Х(Х-1)(Х + 3)
(iii) F(X) = X(X 4- 1)(X - 14)
(iv) etc.
(v) etc.
2. (i)Give the general form of an elliptic curve with a rational point of
order 4. [Hint'. use isogenies.]
(ii) Show that an elliptic curve cannot have two independent rational
points of order 4, i.e. points a, b such that 4a = 4b = о, 2a ф 2b,
2a ф o, 2b ф o.
3. Make more explicit the algorithms of the text for the case of rational
roots. More precisely, let
Г(Х) = (Х-е1)(Х-е2)(Х-ез)
74
Lectures on Elliptic Curves
where ej g Q and let
<rj• : 0 —» ej (j = 1,2,3)
be the homomorphisms of Q[0] into Q.
(i) For given tx, t2, t3 g Q, find an explicit A = /о + ^б+^г©2 (0 € Q)
with
<7;(А) = <; (j = 1,2,3).
Show that A is unique.
(ii) Let x g Q be such that
x — ej = t2 (J = 1,2,3).
Show that the A constructed in (i) satisfy A2 = x — 0.
(iii) Find in terms of the tj, ej the sq g Q such that
(s0 - 0)A = r0 + 7i0 (say)
has no terms in 02.
(iv) Show that (ar,<1<2<з) = 2(sq,?) for some ? g Q.
(v) On replacing tj by ±tj (independent signs) show that one gets in
general further Xj g <5 with 2xj = x. What is the relation between
the different Xi?
(vi) Using the above with F(X) = X(X — 3)(X + 5) and x = (4,6),
find all the Xj with 2xj = x.
4. [Fermat, Euler]. By transforming it to canonical form, or otherwise,
show that the only rational points (arj, x2, а'з, X4) on the curve
X2 - 2X2 + X2 = 0, X2 - 2X2 + X2 = 0
are those with x2 = x2 = x3 = х±.
If nj < n2 < n3 < П4 < are integers in arithmetic progression, deduce
that they cannot all be perfect squares.
16
Remedial mathematics. Resultants.
As they are often not included nowadays in undergraduate courses,
we give here some basic facts about resultants on discriminates. The
ground field is arbitrary.
Let
F(X) = fNXn + fn^Xn~l +... + /„
G(X) = gmXm + g^X™-1 +... + g0
be polynomials. The polynomials
F(X)
XF(X)
Xm-lF(X) >
G(A')
Xn-1G(X) .
can be regarded as m+n linear forms in the m+n variables Xm+n~1,..., 1
(the “forgetful functor”). The determinant R(F, G) is the resultant of
F, G. It is defined only up to sign.
By eliminating _X’m+n~1,... ,X determinantally, we express R(F, G)
as a linear combination of the rows (*), that is
A(X)F(X) + B(X)G(X) = R(F,G), (1)
where A(.Y), B(X) have degrees < m — 1, < n — 1 respectively. If
76
Lectures on Elliptic Curves
F, G have coefficients in a ring, say Z, then R(F, G) € Z and A(X),
5(X) eZ[X],
If F(X), G(X) have a common zero x (in the algebraic closure), then
(1) implies that R(F, G) = 0. Conversely, suppose that R(R G) = 0.
Then the (*) axe linearly dependent, and so there are A(X), B(X) of
degrees < rn. — 1, n — 1, not both zero19, such that
A(X)F(X) + B(X)G(X) = 0.
If we suppose that F(X), G(X), have precise degrees n, m (i.e. fn 0,
9m ф 0), it follows that FIX'), G(X) have a common factor, and so a
common zero in the algebraic closure.
If fn = gm = 0, then clearly R(R G) = 0. If fn 0 but gm = 0, then
clearly
R(F,G) = fnR(F,G*),
where
G* = gm-\Xm 1 + ... + gg.
Hence the elegant formulation is that the homogeneous forms
fnXn + fn-1Xn~1U + ...+f0Un
gmXm + ...+g0Um
have a common zero (x,u) (0,0) in the algebraic closure if and only
if R(F, G) = 0.
Revert to the inhomogeneous polynomials and let
Г(Х) = /„П(^-^)
С(Х) = !Л„П(Х-М
я
If fm, gn, 0i,. . • , 0n, ., фт are taken as variables, R(F, G) is a poly-
nomial in them. It vanishes when any 9j is equal to any ф^. Hence and
from considerations of degree,
R(F,G) = ±f™ ~ Фк)
i,k
= ±л,Пад
19 The particular A(X), B(X) given by the determinantal elimination which
gave (1) may, of course, both be 0.
§16: Exercises
77
Let H = H(X) be a further polynomial. Then it readily follows that
R(F,GH) = ±R(F,G)R(F,H).
Further, if Gi, G2 have the same degree m and G'i — G2 is divisible
by F, we have
R^G.) = ±B(F,G2).
Finally, we put G = F', the (formal) derivative. Since
F'W = fn № -
we have
Я(Г,Г') = ±/Г-’ П^-^)2
The function on the right side with 4- is the discriminant D^F). It
vanishes precisely when F has a multiple root.
For example, when F(X) = X3 + AX + B, we have D = 4A3 + 27B2,
and (1) gives
(6AX2 - 9BX + 4A2)(3X2 + A) - (18AX - 27B)(X2 + AX + B)
= 443 + 27B2.
§16. Exercises
1. Let F(X) g have discriminant D and let a g lp. If
|F(a)|P < |B|„,
show that |F'(a)|p > |D|p.
17
Heights. Finite Basis Theorem.
We are now in a position to introduce the notion of height, and so to
complete the proof of the Finite Basis Theorem.
Let u = (u0, .. ., un) be a point of projective n-dimensional space over
Q. As the co-ordinates axe homogeneous, we may suppose without loss
of generality that
Uj E Z, gcd(u0,...,un) = 1. (1)
The height H(u) of u is defined to be
H(u) = max |uj |
J
with the above normalization. In this section || = is the absolute
value.
We shall mainly but not exclusively be concerned with the projective
line. We identify x g Q with the point (x, 1) on the line, and so write
H(x) = max{|u0|, luj}
if x = Uq/uj with uq, ui S Z as a fraction in its lowest terms.
Lemma 1.
(i) Let D(Uo,Ui), E(Uij, Ui) E Q[I7o, be forms of the same degree
n. Let и = (uo,Ui) be a point on the rational projective line, and
suppose that D(u), £(u) do not both vanish. Then
H(D(u),E(u)) < cH(u)",
where c is independent of u.
11: Heights. Finite Basis Theorem.
79
(ii) Suppose, further, that the resultant of D, E is not 0. Then there
is a 7 > 0, independent of u, such that
Н(Л(и),£(и)) >7H(u)n.
Note. The additional hypothesis in (ii) is equivalent to supposing that
D, E do not have a common zero over the algebraic closure Q.
Proof. By homogeneity, we may suppose that
Л(17о,1Л),ад,^1)
and that u = (uq,^) is normalized by (1). Clearly
|-D(u)|, |E(u)| < c{max(|u0|, |“i|}n
for some c. In general D(u), -E(u) will have a common factor, but in
any case this implies the conclusion of (i).
Now suppose that the hypotheses of (ii) hold and let R be the re-
sultant. Then there are homogeneous forms Lj(Ug, U\), Mj(Ug, Uij g
Z[Co, Ci] (j = 0,1) such that
LjD + MjE = RU^'1 (j=0,l). (*)
On substituting u for U we deduce that
gcd{D(u),E(u)}|77.
Further, as in the proof of (i), there is a c' such that
l-^j(u)l, (u)| < c'{max(|u0|, |uj|}n-1 (j = 0,1).
On substituting in (*) (with u for U), we obtain the conclusion of (ii)
with 7 = |Я|/2с'.
Now let u, v be two points on the projective line and let
W = (uOVo,UOVi + UjVOjUiV!)
= (wo,wi,w2) (say).
Lemma 2.
1 < H(w) „
2 - H(u)H(v) - -
Proof. Let u, v be normalized by (1). Then the right hand inequality
is immediate.
It is readily verified that w0, uq, w2 have no common factor, so it will
be enough to show that
max(|w0|, IwJ, |w21) > ^{max(|u0|, |«i|)}{max(|v0|, |iq|};
80
Lectures on Elliptic Curves
which, is a simple exercise left to the reader.
Back to the elliptic curve
С : У2 = X3 + AX + в
with
A, Be Z, 4 A3 + 27 B2 / 0.
It is convenient (and conventional) to define the height H(x) of a
rational point x = (ar, y) on C to be the height H(r) of its X-co-ordinate.
In other words, if x = (x,y, z) in homogeneous co-ordinates, we have
B(x) = Н(.т, г). (x о)
Я(о) = 1.
Lemma 3. There are constants c1; 7 > 0 depending only on C such
that
Proof. Writing x = (x,y), 2x = (х2,У2)> we have (cf. Formulary)
x2 = D(x)/Е(х),
where
Л(т) = (За: + A)2 — 8ж(а:3 + Ax + B)
E(x) = 4(r3 -f- Ax + B)
Now the resultant of 3a:2 + A and a:3 + Ax + В is 4 A3 + 27B2 0,
and the formulae of the previous section show that the resultant R of
D(x~), E(x) is a power of 4 times (4A3 + 27B2)2. Hence the conditions
of both parts of Lemma 1 apply with x = Uq/uj and n = 4; and the
result follows.20
Lemma 4. Let x1; x2 g di. Then
H(*i + X2)B'(x1 - x2) < c2B'(x1)2B'(x2)2,
where C2 depends only on C.
Proof. Write
Xi + x2 = x3, Xi - x2 = x4
2° In fact in the proof of Lemma l(ii) in this case one may take 4A3 + 27B^
instead of R since a factor 4A3 4- 27B2 cancels from the Lj, Mj. Compare
the formula (*) at the end of §12. Detailed formulae are in Silverman’s book.
11: Heights. Finite Basis Theorem.
81
and xy = (xj,yj) as usual. Then (cf. Formulary)
(1,x3+x4,^4) = (Wo,JV1,JV2)
as elements of the projective plane, where
wo =(x-2-n)2
VTi = 2(зг 1 x 2 + А)(зг 1 +3:2)+ 4B
W2 = ^1^2 — 2Азг1зг2 — 4В(ЗГ] + зг2) + A2
On writing a?!, x2 as quotients of integers and homogenizing, it is
readily seen that
H( Wo, Wi, W2) < c3H(x: )2Н(зг2 )2
for some c3. On the other hand,
H(W0, W:,W2) = H(1 ,X3 + X4,X3X4)
> |н(з:з)Н(з:4)
by Lemma 2. The truth of the lemma follows with c2 = 2c3.
Corollary.
Мш(Я(Х1 +Х2),Я(х1 - X2)) < С4Я(Х!)Я(Х2)
with C4 = c\'2.
In another direction we have
Lemma 5. Let A be given. There are only finitely many x G ® with
Я(х)< A.
Proof. For x = (r, y~) with Н(з’) < A; that is x = u0/“i with u0, “1 € T.
and |u01, |“i| < A.
We are now in a position to prove the
Finite Basis Theorem. The group ® of rational points is finitely
generated.
Proof. By the “weak” theorem (§15, Theorem 1], ®/2® is finite. Let
bj,..., b, g ® be representatives of the classes of ® modulo 2®.
Now let a G ®- There is some j such that a ± by G 2® for both
choices of sign. By Lemma 4, Corollary, there is one choice of sign such
that
Я(а±Ьу) < с4Я(а)Я(Ьу).
82
Lectures on Elliptic Curves
Now a ± by = 2c, c € <&, and so
Я(а±Ьу)>71Я(с)4
by Lemma 3. Putting everything together, we have
Я(с)4 <
< кЯ(а),
where
к. = yj-1C4 max LZ(by).
j
Hence either
Я(с) < ^Я(а)
or
Я(а) < (16k)1/3
= A (say).
It follows readily that ® is generated by the by and the a with
Л(а) < A. But the latter are finite in number by Lemma 5.
We 'conclude this section with a brief review of the properties of
heights.
The inequality in Lemma 4 is supplemented by one in the other di-
rection:
Я(Х! + Х2)Я(хх - x2) > 72Я(х!)2Я(х2)2, (*)
where y2 > 0. Indeed the Wo, JVi, W2 of the proof of Lemma 4, consid-
ered as functions of indeterminates xx, ar2, have no common zero in the
algebraic closure: for Wo = 0 implies x2 = x^ and then Wi, W2 become
the functions D, E used in the proof of Lemma 3. Now (*) follows from
an appropriate generalization of Lemma 1. Note that Lemma 3 is now
just the case x2 — Xi of the extended Lemma 4.
We now move over to the logarithmic height
h(x) = log-H(x),
so that the extended Lemma 4 gives
|A(x! + x2) + A(x! - x2) - 2fi(xj) - 2A(x2)| < c
for some constant c. In particular,
|A(2x) — 4fi(x)| < c.
It follows that
h(x) = lim A(2nx)/4n
n—>oo
exists, and satisfies
h(x! + x2) + h(x! - x2) = 2fi(xJ + 2fi(x2).
§17: Exercises
83
It is now an undergraduate exercise (cf. Exercises) to deduce that
A(xj + x2) - A(xj) - Л(х2)
is bilinear in Xj, x2; and so that h(x) is a quadratic form on ®.
The function h(x) is called the22 canonical height.
In particular,
h(nx) — n2h(x),
so h(x) — 0 if x is torsion: the converse holds by Lemma 5 and since
h(x) — h(x) is bounded.
§17, Exercises
1. (i) Let a £ Q, а ф 0. Show that |a|p — 1 except for at most finitely
many primes p and that
П и? — 1-
p inc oo
(ii) Let «о,..., un £ Q, not all 0. Show that max |p — 1 except for
at most finitely many p and that
JJ тах|иу|р = H(u)
p inc oo
is the height of the point u = (uq, • • •, un) in projective space.
2. (Required in text.) Let /(z) be a function defined for x in a group
SUt and taking values in a field of characteristic 2. Suppose that
f(x + y) + f(x — y) — 2f(x) + 2f(y)
for all x, у £ SUt. Show that
/(z) = B(z, z),
where B(z, p) is a symmetric bilinear form.
[Hint. Take
B(x, У) ~ + У) - /(*) - f(.y)}-
One has to show that
B(z + z, y) - B(x, y) + B(z, y),
i.e. that
f(x + у + z) + /(z) + /(p) + f(z)
- f(y + z) + f(z + z) + /(z + p).
22 There are different definitions of the canonical height. They differ by a
constant factor.
84
Lectures on Elliptic Curves
One opening gambit is to observe that
(x + У + z) + X = (x + z) + (?/ + z).]
3. Let С : X3 + AX + В and suppose that Xj, x2 are independent
generic points. Let x3 = Xi + x2, x4 = X; — x2. Show that
01 - зг2)2(^1 + x2 + x3) = (2/1 - У2)2
(a.-! - зг2)2(т1 + x2 + 1-4) = (2/1 + У2)2.
Deduce that Ti + x2 + зг3, x'i + x2 + X4 are roots of an equation
(aq — x2)2T2 + uT + v = 0,
where u, v are polynomials in aq, x2.
Deduce that a similar result holds for x3, X4.
4. (Required in text.) Let G(X) € Q[X] be a nonsingular quadratic form
in X = (X, Y, Z) and suppose that there is an x = (aqi/,z) (0,0,0)
such that G(x) = 0. Show that there are linear forms L(X), A/(X),
X(X) € Q[X] and a d g Q* such that
G(X) = L(X)M(X) + <LV(X)2.
[Hints.
(i) Without loss of generality x = (1,0,0).
(ii) After a linear transformation on У, Z, we may suppose G(X) =
XY + form in У, Z.
(iii) Complete the square with respect to Z.]
5. Let h be the canonical height on some curve C and suppose that there
are representatives of all classes of <5/2(5 in h(x) < t for some t.
Show that <5 is generated by the a g <5 with h(a) < t.
18
Local-global for genus 1
Our attention now moves from elliptic curves to curves of genus 1
in general. In this section we give a couple of examples to show that
there is no local-global principle for rational points on curves of genus
1. Subsequently, we shall give a structure to the “obstruction” to a
local-global principle, namely the Tate-Shafarevich group.
The two examples we shall discuss are
ЗА3 + 4У3 + 5Z3 = 0, (1)
due to Selmer, and
A4 — 17 = 2У2, (2)
due (independently) to Lind and Reichardt. The techniques we have
developed so far enable us to disprove the existence of rational points.
We have not, however, developed techniques to show that there are
solutions everwhere locally. This is because we have left a fairly highbrow
discussion of curves of genus 1 over finite fields until the end (§25). The
reader may, of course, verify for any given p that there is a point defined,
over Qp but this can never disprove the existence of some P > IO10 (say)
such that (1) or (2) has no solution in Qp. We shall assume without
present proof that a curve of genus 1 over a finite field Fp always has a
point defined over Fp (§25, Theorem 2). If, therefore, a curve such as
(1) or (2) reduced mod p is still of genus 1, then there is a point mod.
p which can, by Lemma 1 of §10, be lifted to a point defined over Qp.
86
Lectures on Elliptic Curves
Assuming this22, the only Qp to be considered for (1) are p = 2, 3, 5
and the only ones for (2) are p = 2, 17. It may confidently be left to the
reader to confirm that there are points for these p.
The disproof of rational points on (1) uses
Lemma 1. Let a, b, c be distinct integers > 1 and suppose that d = abc
is cube free. Suppose that there are u, v, w gZ not all 0 such that
au3 + bv3 + cw3 = 0.
Then there are x, y, z tzl. with z 0 such that
a:3 + y3 + dz3 = 0.
Proof. Let p3 = 1, p 1 and put
£ = au3 + pbv3 + p2cw3
у = au3 + p2bv3 + pew3.
Then
£ + у = 3au3
p£ + p2 у = 3cw3
p2£ + py = 3bv3
and so
£3 + y3 + d£3 =0, £ = —3uvw.
Now the two points (£,pt?,£), (’?,/’2£,£) are conjugate over Q. Hence
the line joining them meets X3 + Y3 + dZ3 = 0 in a point defined over
Q and distinct from (1,—1,0).
Lemma 2. The only point defined over Q on
X3 + У3 + 60Z3 = 0
w (1, —1,0).
Proof. There is no torsion, e.g. by the discussion of exceptional points
on cubic curves (§6, Lemma 1). The curve is birationally equivalent over
Q to
У2 =X3 — 24.33.602,
22 For the specific curves (1), (2) the number of points mod p may be computed
(or estimated) by other fairly elementary means, e.g. by the use of finite
Fourier analysis.
18: Local-global for genus 1
87
for which. ®/2® is trivial by the proof at the end of the section on the
weak theorem (§15, Second example). It follows from the Finite Basis
Theorem that <6 is trivial.
Theorem 1. There are no rational points ondl).
Proof. The last two lemmas.
The preceding proof used the theory of algebraic numbers. The next
proof works entirely in the rationals.
Theorem 2. There are no rational points on (2).
Proof. If not, suppose (x,y) is on (2). Let x = a/c as a fraction in its
lowest terms. Then
a4 — 17c4 = 2b2, gcd(a, c) = gcd(6, c) = gcd(a, b) = 1.
Putting
A = a2, C = c2
we have
A2 - 17C2 = 2b2.
This equation is soluble everwhere locally, so globally, and in fact
52 - 17.12 = 2.22.
Now
(5A + 17C + 46)(5.4 + 17C - 4Ь) = 17(A + 5C)2.
If there is a common odd prime divisor of the two factors on the left
hand side, it divides 5A + 17 C and A + 5C, so divides SA and 8C: a
contradiction. The two factors on the left hand side have the same sign,
which for A = a2, C = c2 must be positive. Hence for integers u, v there
is one of two possibilities
First Case Second Case
5a2 + 17c2 ±46 = 17u2 34u2
5a2 ± 17c2 ±46 = v2 2v2
a2 ± 5c2 = uv 2uv
In the first case
10a2 ± 34c2 = 17u2 + v2
a2 + 5c2 =uv.
88
Lectures on Elliptic Curves
We show that this is impossible in Qit- Write || = ||it• By homogeneity
max(|a|, |c|, |u|, |v|) = 1.
Since 10 is a quadratic non residue mod 17, we have
|a| < 1, |v| < 1.
The second equation gives
|c| < 1.
Finally, the first equation gives
|u|<l.
Contradiction.
The second case gives
5a2 + 17c2 = 17u2 + v2
a2 + 5c2 = 2uv.
The proof that this is impossible in Qi? is similar.
§18. Exercises
1. [Uses algebraic number theory.] Supply the details of the following
alternative proof of Theorem 2.
(i) The field Q(\/17) has class number 1. A basis of integers is 1,
|(1 + \/17)- A fundamental unit is 4 + \/17 of norm —1. The
prime 2 splits into (5 ± \/17)/2.
(ii) Suppose a4 — 17c4 = 2b2 with a, b, c £ Z, gcd(a,c) = 1. Then a, c
are odd and
а2 ±с2л/Г7
2
axe coprime.
(iii)
a2 + У17с2 /5 ± v/17 \ 2
----------------------2---- = j W (*)
for some unit r) and some integer ц.
(iv) 7} > 0 in both real embeddings. Hence r) is a square and so can be
absorbed in ц2.
(v) Put t] = 1, /I = (u + v\/i"7)/2 in and equate terms independent of
\/17. Then 4a2 = 5(u2 + 17v2) ± 34uv, which is impossible in Q3
(and in Qit)-
19
Elements of Galois cohomology
In the next section we have occasion to consider two curves which are
both defined over Q and which are birationally equivalent over Q. Here
we consider a simpler case and then set up some general machinery.
The conic
A : X'f + X2 = 3
has no rational point and so is not equivalent over Q to the line (co-
ordinate Y, no equation). They are, however, equivalent over Q(a/3),
for example by the equations
У = (^i - V3)/x2
_ 73(1-y2) _ -2^32/
21 t/2 + 1 ’ X2 y2il'
Let у be transcendental, so rq, x2 is a generic point of A.
The Galois group Gal(Q(73)/Q) can be made to act in two different
ways on Q(a/3, У) = Q(T3, x-L, x2). We can either make it act trivially
on у or we can make it act trivially on (xq,iq).
In the first case, the non-trivial element of the Galois group induces
the automorphism
Xi —> — xl, x2 — x2
of A. In the second case, it induces the automorphism
У -> -1/y
of the line.
90
Lectures on Elliptic Curves
In the example, we have used the birational equivalence to identify the
two function fields. In the general theory it is better to make it explicit.
Let A, В be two curves defined over Q and let
ф; A-+B
be a birational equivalence defined over Q. Let a £ Gal(Q/Q). We
can let <r act on the coefficients in ф and so obtain another birational
equivalence
<гф : A —» B.
Then
0CT(say) = (,о’ф')ф~! : В —► В
is a birational automorphism defined over Q.
We can act on by г £ Gal(Q/Q). Then
тда = (таф^тфу1
= [(Г^Ф)Ф~1][Ф(.ГФ)~1]
= етав;1.
Hence
= (r0J0r.
This is the23 cocycle identity and {$<,} is a cocycle.
Let there be another birational equivalence
ф’ : A -► В
defined over Q, so
ф' — иоф
for some automorphism
w : В -» В.
Then
^(say) = (аф'^ф'-1)
The two cocyles {$<,} and {0^} are said to be cobounding.
If ф is defined over Q, we have
a coboundary. In this case A, В are birationally equivalent over Q; but
we have chosen to use a different equivalence.
23
We owe the rococo terminology to the topologists.
§19: Exercises
91
Given В and the cocycle {f)a }, we can reconstruct A (up to a birational
equivalence defined over Q). For let x be a generic point of B. We define
an action a of the a g Gal(Q/Q) on Q(x) as follows:
a acts on Q by a
ax =
Then for т g Gal(Q/Q) we have
g(dx) = (t0o.)(Fx)
= (T0ff)0Tx
= 9™*-
Thus
(та) = та.
The fixed field of the a is a function field over Q, and so gives A up
to birational equivalence over Q.
§19. Exercises
1. Let a run through Gal(Q/Q). Find a cocyle {#<,} of birational auto-
morphisms which twist the line into
Xj + X2 = n,
where n is any given element of Q*.
When n = 5 give an explicit representation of your cocycle as a
coboundary.
Is your cocycle a coboundary when n = 3?
20
Construction of the jacobian
Let P be a curve of genus 1 defined over Q. In this section we construct
an elliptic curve C, also defined over Q, which is closely related to it.
This relationship will be exploited in subsequent sections.
We must initially consider birational equivalences between elliptic
curves. We work at first over a general field.
Let
Cj : Y2 = X3 + AjX + Bj (j = 1, 2)
and let
Ф Ci —» C2
be a birational correspondence. By considering </>(x) — </>(°i) instead of
</>(x), we may suppose without loss of generality that
^>(01) = o2,
where oj is the point at infinity on Cj.
The correspondence must take functions with poles of order 1 into
such functions. Hence
ф(Х) = aX + b
for some a, b. Similarly
ф(У) = cY = cY + dX + e.
The form of the equations for Cj imply that
d = e = 0, b = 0, a3 = c2
and so
2 3
a = s , c = s
20: Construction of the jacobian
93
for some s. Hence
A2 = s4A1, B2 = siB1. (1)
In particular, A^/B^ = A^/B? is invariant under birational equiva-
lence.
It is conventional to work with, the birational invariant
J ; ' 4A3 + 27B2
of
С : У2 = X3 + AX + B.
The notation j is standard. The constant 1728 = 123 is suggested by
the complex variable theory. Note that every elliptic curve gives a finite
value of j: it is the degenerate curves that send j to infinity
Lemma 1. Two elliptic curves in canonical form which are birationally
equivalent are related by (1) for some s. In particular, they have the
same j-invariant. Further, s is in any field over which the curves and
the equivalence are defined.
Corollary. Any birational equivalence of the elliptic curve
C: Y2 = X3 + AX + В
taking о into о is of the form
Y -> s3Y, X -> s2X.
If AB ф 0, then s2 = 1. If В = 0, then s4 = 1 and if A = 0, then
s* = 1.
Proof. Clear from (1) with C = Ci = C2.
Let us return to the main topic of the section. Let P be a curve of
genus 1 defined over Q. In general it will not have a rational point and, if
it has, we may not be able to find one: but there is no difficulty in finding
a point defined over Q. Hence there is a birational correspondence
</>: B->C
defined over Q, where C is in canonical form but defined over Q.
Let a £ Gal(Q/Q). We can act on the birational correspondence with
a and obtain
аф : В —> aC,
where
aC: Y2 = X3 + a AX + aB.
94
Lectures on Elliptic Curves
Now C and aC are birationally equivalent over Q by (сгф)ф 1. Hence
that is, j(C) g Q or equivalently А3/В2 g Q if AB / 0. Hence by a
transformation X -> t2X, Y -> t3Y (t g Q) we may suppose without
loss of generality that C is defined over Q. Of course in general ф is
defined only over Q. Now
9a = (cr</>)</>-1
is an automorphism of C.
Suppose, first, that
AB /0.
Then by Lemma 1, Corollary, the automorphism of C must be
9 a x -+ £„x + &a
for some point defined over Q and = ±1. We are in the position
discussed in the previous section, so
9ra = (r9a)9T.
In particular, since Sa € Q, we have
Ста = £a£r',
so £a is a gr°uP character.
We would like to ensure that is always 1.. If not, there is some
d g Q such that
<z('/d) = Ca^/d-
The transformation
X -> dX, Y -> d\/dY
gives a new C defined over Q: and with this we do indeed have = 1
always.
If AB = 0, the same conclusion holds but the argument is a little
deeper24. Suppose that В = 0, so e4 = 1, where we define
x —» ex
by
X -> e2X, Y -> eY.
Now Gal(Q/Q) acts on e, and
era = (rea)er-
24
And may be omitted at first reading.
§20: Exercises
95
By “Hilbert 90” (see Exercises) there is a 8 € Q with S4 g Q such that
ab = ea8.
We can now modify C, as before, so that = 1 identically on the new
C. Similarly for A = 0.
Thus in every case we have found a C defined over Q and a birationad
equivalence
ф: T>—> C
defined over Q such that
(аф)ф^1=0а: x->xfa„
for all a g Gal(Q/Q).
To sum up, we have proved:
Theorem 1. Let T> be a curve of genus 1 defined over Q. There is an
elliptic curve C defined over Q and a birational equivalence
ф:
defined over Q such that, for every a g Gal(Q/Q), the map
0a = (сгф)ф-1 : C-+C
is of the form
0„ : x -+ x +
for some aa g (5.
Further, C is unique up to birational equivalence over Q.
The elliptic curve C is the jacobian of T>.
Before exploring this situation further, we require some new machin-
ery, introduced in the next section.
§20. Exercises
1. Construct the jacobian of
(i) Y2 = aX4 + ЬХ2 + с (a, c g Q*, b g Q, b2 — 4ac / 0).
(ii) aX3 + bY3 + cZ3 = 0 (a, b, c g Q*).
(iii) aX3 + bY3 + cZ3 + mXYZ = 0 (a, b, c, m g Q*).
(iv) Y2 = aX4 + bX3 + cX2 + dX + e.
2. Let T> be the curve of genus 1 given by the redundant equations
(e2 - ej)t2 = djv2 — d2v2
(e3 - e2)t2 = d2v2 - d3vj
(ej - e3)t2 = d3v3 - djv2,
§20: Exercises
where ei, e2, e3 are distinct and dj g Q*, d^d^dz = 1. Show that there
is a point of T> defined over к. = Q(dy2, d^2) and hence find a map
ф'. T>—> C
defined over к. into
С: У2 = (X - e1)(X - e2)(X - e3).
Show that the cocycle
=9a-. C-^C
for a g Gal(/t/Q) is of the type
x -> x + aff
where 2a,, = o. Deduce that C is the jacobian of T>.
3. In this exercise the ground field is Q(p), where p3 = 1, p 1.
Let a, b, c g Q(p), and let
V : aU3 + bV3 + cW3
С : X3 + Y3 + abcZ3 = 0.
Put к = Q(p, a1/3, b1/3) and let ф : T> —» C be given by
X=a1/3U, Y = b1/3V, Z = a~1/3b~1/3W.
Show that the corresponding 0a is
9a : x —> x, or x + (p, — p2,0) or x + (p2,— p, 0).
Deduce that C is the jacobian of T>.
The remaining exercises fill in the proof that (in the notation of the
text) one can arrange to have ea = 1 when AB = 0.
4. Let K./k be a finite normal (separable) extension of fields of degree n.
Let ,..., an be a basis of к./ k and let crj,..., an be the elements of
the Galois group. Show that
det[<7;aj] / 0.
[Hint. к. = fc(/2) for some (3.
Note. In what is still the finest introduction to Galois theory, [Galois
Theory. Notre Dame Mathematical Lectures 2, 1942. Second edn.,
1948.) E. Artin proves this at the onset by an induction argument.]
5. Let к/k be a finite normal (separable) extension. For <r g Gal(/t/fc)
let 9a g k* be given satisfying the cocycle identity
9Ta = [r9a)9T.
Show that {^o-} is a coboundary, i.e. that
^<7 = (o’?)?-1 (all cr),
§£0: Exercises
97
for some 7 g к*.
[Hint. Let A g к.. Show that
7 = ^(<7 A)
does what is required provided that 7 0. Use Lemma 2 to show that
A can be so chosen.
Note. This result is usually known as Hilbert 90 because it is Satz 90
in Hilbert’s Zahlbericht - his report on algebraic number theory to the
German Mathematical Society at the end of the last century.]
6. Let n > 1 be an integer. For <r € Gal(Q/Q) let be an nth root of
1 and suppose that is a cocycle. Show that there is a 8 g Q such
that = cr&l8 and 8n g Q.
21
Some abstract nonsense25
Let Г be a finite group which acts on an abelian group A (written
additively). The action is written aA (cr g Г, a g A).
A cocycle is a map Г —» A, say
<r -» aa
which satisfies the cocyle identity
та„ = aTa - ат (ст, r g Г).
note that for r = 1 (the identity of Г) this implies
ai = 0.
If b g A, then it is easy to see that
ar = crb — b
is a cocycle. Cocycles of this type are called coboundaries.
Cocycles form a group under elementwise addition
{a<z} + {b<z} = {a<z + b<z}.
The coboundaries are a subgroup. The quotient group is
№(T,A),
25 This is a self-contained account of what is needed from the cohomology of
groups and commutative Galois cohomology. For how it fits into a wider pic-
ture, see, for example, Chapters IV and V of J.W.S. Cassels and A. Frohlich
(Editors) Algebraic number theory, Academic Press (1967). The treatment here
is suggested by that in C. Chevalley Class field theory, Nagoya (1954).
21: Some abstract nonsense
99
the first cohomology group.
Now Г acts on the whole situation (“transfer of structure”). Г acts
on itself by inner automorphisms. So r acts on the map (cocycle)
{a,,} : cr-ta,
to give
r{a<7} : -> ra„
= dTtT Qr.
Or, writing a for rar-1,
r(a<,} : cr aaT - ar.
This is a cocycle, as it has to be; and indeed
r{a„} ~ {a<4 : —* aar - aT — aa
= aaT — aT
is a coboundary. Hence
Lemma 0. Г acts trivially on №(Г, A). .
Lemma 1. Every element of H1 (Г, A) is of finite order dividing26 ЦГ.
Proof. Let the element be represented by the cocycle {aff}. Then, from,
what we have seen, it is also represented by the cocycle
r{a„} = {a„r - ar}.
But now
= {о}
r
[Recall that аг = 0.]
Lemma 2. Let m g Z, m > 1. Denote by Am C A the set of elements
of order dividing m. Suppose that every element of A is divisible by m
in A.
Then every element о/Н1(Г, A) of order m is representable by a co-
cycle {(!„}, da g Am.
Proof. Let the given element of №(Г, A) be represented by {aff}. By
hypothesis, is a coboundary, say
ma„ = crb — b (b g A).
Under the hypotheses of the Lemma, b = тс, c g A so
maa = mac — me
26
We use Ц for the cardinality of a set.
100
Lectures on Elliptic Curves
that is
m(aa — ac + c) = 0.
Hence the element of H1 is represented by
<7 —» aa — ас + c € Am,
as required.
Denote by Ar the set of elements of A fixed by Г:
a g Ar aa = a (all a g Г).
Lemma 3. Notation and hypotheses as in previous Lemma. Then
Аг I mAF
is canonically isomorphic to a subgroup of H1 (Г, Am).
Proof. Let a g Ar. By hypothesis
amb b g A.
On applying cr g Г, we have
a = aa = mab,
and so
md„ = 0, d„ = ab — b.
Hence {d<z} is a cocycle with values in (indeed it becomes a cobound-
ary in A).
For given a, any other choice of b is of the type b + с, c g Am. Hence
the element of №(Г, Am) given by {do-} is uniquely determined by a.
If a g mAr, we may take b g Ar, so d„ = 0 for all a, and the image
in Н\Г,Ат) is 0.
Conversely suppose that the cocycle constructed above is a cobound-
ary, so
da = ae — e V<7 g Г, some e g Am
Then
a(b — e) = b — e (all <7 g Г) :
and so
b — e g Ar, m(b — e) = a.
We can put the last two lemmas together. We repeat the hypothesis.
Theorem. Suppose that m > 1 is an integer and that every element
of A is divisible by m. Then the sequence
0 -> Аг/тАг - Н\Г, Am) - [W1(r,4)]m 0
21: Some abstract nonsense
101
exact, where [. . .]m denotes the group of elements of order dividing m,
and the third map is induced by Дт » A.
Proof. After Lemmas 2, 3 we need only prove exactness at Н’1(Г, Дт),
i.e. that the image of
Аг/тАт №(Г,Дт)
is exactly the kernel of
тэ^г, дт) [№(r, A)]m.
Consider first an element of the image, given (say) by the cocycle
{her} By hypothesis, da = crb — b, b € A and so {d^) considered as
taking values in A, is a coboundary. Thus Image C Kernel.
Now let the cocycle represented by {d„} be the kernel, i.e. {da} is a
coboundary for A: d„ = ab — b some b g A. then
cr(mb) — mb = md, — 0 (all <r)
and so mb g Ar. Hence Kernel C Image.
Galois cohomology. Let к be a field and к its separable closure ( =
algebraic closure in characteristic 0, the case of interest). Put
Г = Gal(T/fc).
We say that the action a —» era (<z g Г, a g A) of Г on the abelian group
A is continuous if:
For every a g A there is an extension k. of к of finite degree [re : fc] < oo
(depending on a) such that
aa = a (all a g Gal(fc//t) C Gal(fc/fc)).
Note 1. An example is: к = Q, C a curve У2 = X3 + AX + В defined
over Q, A = C5.
Note 2. If A has any natural topology, this is disregarded. For us the
word “continuous” is just a term of art. The action is continuous in
the usual sense if Г is given an appropriate topology and A the discrete
topology.
A continuous cocycle is a map
a -+ a„ (<z g Г,астА)
which
(i) satisfies the cocycle identity
ra„ = aTa — ar (a, г g Г)
(ii) is continuous in the sense that there is a normal extension к/к of
102
Lectures on Elliptic Curves
finite degree [к : fc] < oo such, that a„ depends only on the action
of a on к. [Of course к. may depend on {«„}].
In particular,
aT = 0 (all r g Gal(fc//t)),
so
rria = aTa — aT = aa — 0 (all r g Gal(fc/к)
and hence
aa g к. (all a in Galfc/fc).
If {a<z}, {ba} are continuous cocycles, then clearly {aa + (>„.} is con-
tinuous.
A coboundary {<rc — с} c g A is automatically continuous, by our
hypothesis that Г acts continuously on A.
Definition. is the group of continuous cocycles modulo co-
boundaries.
By following the proofs of the Г finite case it is straightforward to
prove
Theorem 1. Н1(Г,А) is torsion (i.e. every element has finite order).
Theorem 2. Let m > 1 lie an integer and suppose that every element
of A is divisible by m. Then the sequence
0 -> Ar/m4r №(r,Am) -> [Я1(Г,А)]т - 0
13 exact where (as in the previous section)
(i) Ar is the set of a g A fixed by Г.
(ii) is the set of elements of A of order dividing m.
(iii) [№ (Г, A)]m is the set of elements of №(Г, A) of order dividing m.
Appendix.27 Localization
Let p be a fixed prime. Choose a fixed embedding
A : Q Qp.
Write
May be omitted at first reading. As will be explained, the result obtained
here is obvious from another point of view in the context of the course.
§21: Appendix. Localization
103
Г = Gal(Q/Q)
Гр = Gal(Qp/Qp);
so A induces an embedding
А*: Гр-+Г.
Let 4 be a continuous Г-module. Then it is via A* a continuous
Гр-module.
Let {a<z}, <7 g Г be a continuous cocycle. By restricting a to Гр, we
have a continuous Гр cocycle. Hence we have a group homomorphism
XI: №(r,A) -* H'(Yr,A)
[localization: A special case of the “restriction map”].
Ostensibly XI depends on the embedding A, but we show that it does
not.
Any embedding A of Q <-+ Qp is of the shape
A = Xp,
where p is an automorphism of QIQ. By the analogue of Lemma 0 of
the “Finite Г” section, p acts trivial on Н1(Г, A), and so Л/ = XI. Thus
the map
Н\Г,А) -> №(rp,A)
is canonical.
In the context of the course, we have an elliptic curve
Y2 = № + AX + В
defined over Q. Let (5, be the points defined over Q, Qp respectively.
We are concerned with the map
№(r,®)-*№(rp,®p),
which may be regarded as
№(Г,Ф) - Я\Гр, Ф) №(rp,0p),
the second induced by the embedding
Ф-+ Фр.
Later we interpret an element of .ff'fT, Ф) as a curve T> defined over
Q together with a choice of structure as a principal homogeneous space.
A curve T> defined over Q is certainly defined over Qp. with its struc-
ture of principle homogeneous space it thus corresponds to an element
of //’'(Гр, (Sp). The resulting map Ф) —» //’'(Гр, Фр) is precisely
the one constructed above.
22
Principal homogeneous spaces and
Galois cohomology
Let D be a curve of genus 1 defined over Q. We have seen (§20,
Theorem 1) that there is an elliptic curve
C : Y2 = A3 + AX + В
defined over Q and a birational equivalence
ф: T>-+C
defined over Q. Further, for any a g Gal(Q/Q) the map
: C -> C
is of the type
x —> x + a^,
where ®.
The elliptic curve C is unique up to a transformation
A -> ?A, Y -> s3K, s g Q*.
Of course ф and the aa are far from being unique. C is the jacobian of
T>.
We have to discuss how far the elements of the above situation are
arbitrary. We note first that (by the previous discussion) the aff satisfy
the cocycle identity
т*а.(у — ~ .
Now the aCT are in the commutative group ®, and we may invoke the
machinery of §21.
22: Principal homogeneous spaces and Galois cohomology
105
On replacing the map ф by фф. where
ф : C —>C, x—» x + b (be ®),
we replace {a„} by
a„ + (<rb - b)
where crb — b is a coboundary. In the commutative case, the cobound-
aries are a subgroup of the cocycles and so {ar} determines an element
of the quotient group
cocycles/coboundaries = №(Г, ®)
- the first cohomology group, where Г = Gal(Q/Q).
We now look at the information which an element of №(Г, 0) gives
us about T>.
In the first place, we certainly can construct a curve T> and a birational
equivalence ф by our general machinery. To remind: let x be a generic
point of C. There is an action a of Gal(Q/Q) on Q(x) given by
(i) cr acts like <r on Q
(ii) 5-x = x + a<7.
Then the fixed field is the function field of a curve T> defined over Q and.
Ф, defined over Q, is given by the identification of the two function fields
over Q.
The map ф gives T> a structure of principal homogeneous space over C
in the following sense.
Let £2 be independent generic points on 1?, which we treat as fixed
under Gal(Q/Q). Put
A(G,G)M(GW(G)-
Then
= (<^(€1) + a<z ~ (</>(G) +
= Д(Ш
That is, the algebraic map from two copies of T> to C given by A is
defined over Q. Clearly
д(е1,е2) + д(е2,е3) = д(е1,е3).
Hence the cocycle {aa}, or the corresponding elements of №(Г,®),
determines the pair (1?, A). The cocycle {—aa} determines the pair
(T>,— A). Thus to get a group structure we must consider not just the
curves T> with given jacobian, but the pairs (T>, A) where A is a structure
of principal homogeneous space.
The above account overlooks one tricky point. An element of H1 (Г, 0)
determines the function field of T>, and so determines only up to
106
Lectures on Elliptic Curves
birational equivalence defined over Q. Now it can happen that there is a
birational automorphism of T> defined over Q which interchanges A and
— A (!). A trivial example is when C is regarded as its own jacobian.
Consider two maps
Ф, (V=)C-^C, (j = l,2)
where <f>i is x —» x and is x -> —x. In both cases the cocycle
is identically 0. In the first case, АДх^Хг) = Xi — X2; and in the
second А^х^Хг) = X2 — Xi. The two are taken into one another by
the automorphism x —» —x of C = T>.
In the example just above, we have the trivial element of ЕРСГ, 0).
There is the same phenomenon for elements of order 2 (and only for
them) [Exercise for reader!].
To deal with this difficulty, we shall identify two structures of principal
homogeneous space which are birationally equivalent. With this conven-
tion each element of /Р(Г, 0) defines a unique principal homogeneous
space.
Conversely, a structure of principal homogeneous space determines
the element of №(Г, (5). Consider the map
ф : T> -> C.
By our initial construction, the corresponding cocycle is
a<z = = <№),
where £ is a generic point of T> fixed under Galois.
Now let a be any algebraic point on T> (i.e. defined over Q). Then
cr(^(a)) = (<7</>)(cra) = ф(сга) + a.a,
since a acts both a and on the coefficients of the map ф. Hence
A(a, era) = ф(а) — ф(аа)
= </>(<*) - cr(</>(a)) + a„
Thus {A(a, cra)}^ is a cocycle, and differs from [aa}a by a coboundary.
To sum up:
Theorem Th ere is a canonical isomorphism between principal homoge-
neous spaces (1?, A) (up io birational equivalence over Q) and elements
of Н\Г, 0). The element corresponding to (T>, A) is given by the cocyle
{A(a, cra)}^, w/iere a is any algebraic point on T>.
Note 1. Principal homogeneous spaces were introduced by Weil. He
defined their group structure directly, not by refernce to №(Г, 0).
<>22: Exercises
107
Note 2. For the cognoscenti. The “jacobian” defined here is a refinement
of the classical notion defined over the complex numbers.
Recall that a divisor a on D is a map from the algebraic points a on
7? to Z which is 0 for all except at most finitely many a. It is defined
over Q if it is invariant, (in an obvious sense) under Gal(Q/Q). The
degree is £)na, where a is a —» na.
Suppose that a is of degree 0. The jacobian map is the map from a
to
Jac(a) = ^2na<Ka) €
a
the summation being that on C.
The divisor a is in the kernel of the map precisely when the ф(а) with
their multiplicities are the poles and zeros of a function on C. Identifying
T> and C via ф, this is the same as saying that a is the divisor of a function
on T> [a principal divisor].
If a is defined over Q, then Jac(a) is defined over Q, as follows easily
from the formula for аф(сг'). Hence we have group monomorphism.
Divisors of degree 0 on P defined over Q
Principal such divisors
A final point. If the divisor a of degree 0 is defined over Q and
is principal, then it is the divisor of a function on T> defined over Q.
For suppose that f is a function with divisor defined over Q. Let
cr g Gal(Q/Q). Then a is also the divisor of erf and so
It is readily checked that er —> erf /f is a cocycle with values in Q ; and
so is a coboundary by Hilbert 90 [§20, Exercise 5]. Hence erf/f = crX/X
for some A g Q and all a. Then A-1 f is fixed under Galois, i.e. defined
over Q, and has divisor a, as required. [Of course this remark is general,
and applies to curves of any genus.]
§22. Exercises
1. If £, x are generic points of T>, C respectively, fixed under Galois,
show that the function A(£,x) = Ф~\Ф(£) + x) is defined over Q and
investigate its properties.
23
The Tate-Shafarevich group
We put together the results of the two previous sections.
As before, let
C : Y2 =X3 + AX + В
be an elliptic curve defined over Q. The groups of points defined over
Q, Q respectively are (5, ®; and Г is Gal(Q/Q). We have seen that the
first cohomology group Н}(Г, ®) is canonically isomorphic to the group
of equivalence classes of {77, A} where 77 is a curve of genus 1 and A
is a structure of principal homogeneous space on it. This group is often
referred to as the Weil-Chatelet group and denoted by WC = WC(C).
Let 771. > 1 be an integer. The group 0 is divisible by m since finding
a b such that mb = a g ® is just a matter of solving some algebraic
equations. The exact sequence of the previous section is now
0 ®/m® -> Н\Г, Am) [№(T, ®)]m - 0,
where Am C ® is the group of elements of ® of order m and the [.. .]m
denotes the subgroup of elements of order dividing m.
We now have an approach to the weak Mordell-Weil theorem. We
would like to find the elements of Am) which are the images of
®/m®. By the exactness of the sequence these are precisely the kernel
of the map
Я1(Г,Ди)-4Я1(Г,в) = 1КС(С).
Being in the kernel means that the image is a trivial principal homoge-
neous space {77, A}; i.e. that there is a point on 77 defined over Q.
For m = 2 we are back in the situation discussed in the proof of
The Tate-Shafarevich group
109
the Weak Mordell-Weil Theorem. There we displayed the curve 7? in
the image {77, A} of an element of H1 (Г, A2) 218 the intersection of two
quartic surfaces28.
As we have already emphasised, there is even now no algorithm for
deciding whether or not there is a rational point on T>. There is, however,
no difficulty in deciding whether or not there is a point on T> everywhere
locally. As we shall see in a moment, the elements of WC for which
there is a point on D everywhere locally form a subgroup. It is known
as the Tate-Shafarevich group and is usually denoted29 by the Russian
letter Ш (“sha”).
To show that III is a subgroup we must discuss localization. For any
prime p (including 00) we use a suffix p to denote an object defined over
Qj, instead of over Q. There is an obvious map
WC-^WCp
which takes the equivalence class of a principal homogeneous space
(7?, A) defined over Q into the class of the same {7?, A} considered over
Qj,. The non-cohomological description of the composition of principal
homogeneous spaces works entirely over the ground field: thus it shows
immediately that the localization jp respects the group law; but we have
not explained that description. From the cohomological point of view,
we have a map
induced by the inclusion © C • This situation was discussed at the
end of §21, where it was shown that jp is a group homomorphism and
is independent of the choice of inclusion Q C Qp.
Clearly Ш is the intersection of the kernels of all the localization
maps jp (including p = 00). For given m, denote by Sm the group of
elements of Ff'(F, Am) which map into III C H'fT, ®). It is called the
mth Selmer group. Now we have the exact sequence
0 —» —> Sm —> [ III ]m —> 0.
For m = 2, which we encountered in the proof of the weak Mordell-
Weil Theorem, we saw that S2 is finite and effectively constructible. It
28 The author apologizes for the clash between A denoting a structure of prin-
cipal homogeneous space and the group of elements of order 2 in ®.
29 This is the author’s most lasting contribution to the subject. The original
notation was TS} which, Tate tells me, was intended to continue the lavato-
rial allusion of WC. The Americanism “tough shit” indicates the part that
is difficult to eliminate.
по
Lectures on Elliptic Curves
can be shown by a more sophisticated version of the same argument that
the same things hold for Sm and general m, though now the effective
constructibility tends to be not very practical.
To sum up. The Selmer group is knowable. It majorizes (5/m(5 and
the “error” is given by Ш, which can be called the obstruction to the
local-glocal principle for curves of genus 1 with the given jacobian C.
This is as far as we shall go in this direction with the theory. We
conclude with background comments.
Before all this theory was invented, Selmer embarked on a massive
programme to find the Mordell-Weil groups of elliptic curves, especially
those of the type
C . X3 + Y3 + dZ3 = 0,
where d g Z. He used descent arguments to bound the Mordell-Weil
rank. Also, by a direct search, he found rational points on C and so
bounded the Mordell-Weil rank from below. Most often the upper and
the lower estimates for the rank coincided, but when there was a dis-
crepancy the difference was always even. Moreover, estimates for the
rank derived from different types of descent (e.g. majorization of (5/2(5
and (5/3(5) always differed, if at all, by an even integer.
After the group III was discovered by Tate and Shafarevich, it was
natural to look for the explanation of this phenomenon in the structure
of III. It turns out that there is a skew-symmetric form on III whose
kernel is the group of infinitely-divisible elements of III. It always
looked improbable that there are infinitely-divisible elements and by now
there is much evidence (but no proof) that they do not exist. If there
are no infinitely divisible elements, the existence of the skew-symmetric
form shows that the order of [ III ]m is a square. This explains Selmer’s
observation.
There is not merely a local-global principle for curves of genus 0, but
it has a quantitative formulation (and also, more generally for linear
algebra groups. The modern formulation is in terms of the “Tamagawa
number”). On the basis of massive calculations (this time on a com-
puter) Birch and Swinnerton-Dyer proposed what can be regarded as
a quantitative local-global theorem for elliptic curves. In their formula
there is a number, not otherwise accounted for. In all their calculations
the mysterious number turned out to be an integer and indeed a perfect
square.
It was natural to interpret this integer as the order of III (supposed
The Tate-Shafarevich group
111
finite), and, once made, this identification was supported on other gr-
ounds.
The Birch-Swinnerton-Dyer conjectures were widely generalized and
further evidence for their plausibility were adduced. It is only in the last
few years that progress has been made with their proof. Until the very
recent work of Rubin and Kolyvagin there was not even a single elliptic
curve for which III had been proved to be finite.
§23. Exercises
1. Let m, n be integers, m | n. Show that there is a group homomor-
phism A such that
0
commutes.
Hence show that there are /.i, v such that
0 —» <S/n<9 W.A") [^C]„ 0
к b b
о [ЖС]т 0
is exact and commutative.
Describe /г, iz explicitly.
24
The endomorphism ring
In this section, the ground field k is any field, possibly of characteristic
p 2, 3. [This last restriction solely because of our choice of canonical
form.] The main objective is the application to the estimation of the
number of points over finite fields, but we do a little more, to set things
in context.
Let
С : Г 2 = X3 + AX + В
be an elliptic curve defined over k. An endomorphism of C (over k) is a
rational map
ф : C-^C
defined over fc, for which
^(o) = o.
One endomorphism is the constant isomorphism which maps C entirely
onto o. Otherwise, if x is a generic point of C, then so is
£ = </>(*)
and k(x)/fc(£) is an algebraic extension. We define the degree of ф to be
ЦФ) = [fc(x) : fc(£)].
By convention, the degree of the constant endomorphism is 0.
The first lemma shows that ф respects the group structure of C. It
is not really needed for what follows, but it helps to set ideas. In the
application to finite fields, the conclusion will be obvious.
2Jf: The endomorphism ring
113
Lemma 1. Let a, b be points ofC. Then
ф(л + b) = ^(a) + ^(b).
Sketch proof. By extending the ground field if necessary, we may suppose
that a, b are defined over к. If ф is the constant endomorphism, there
is nothing to prove. Otherwise, let x be a generic point f = ф(х). By
the definition of the group law, there is a
A = A(x) g Ar(x)
whose only zeros are simple zeros at a, b and whose only poles are simple
poles at o, a + b. Let
A = Д(£) = Ыогт*(е)/*(Х) A.
Then the zeros of Л are just simple zeros at ф(а.'), ф(Ъ) and the poles of
Л are just simple poles at ^>(a + b) and at о = </>(o).
Note. cf. §14, Lemma 1. The proof above follows that in Silverman,
Theorem 4.8 (p. 75), where it is proved for isogenies and the treatment
is fuller. For the corresponding result for abelian varieties of any dimen-
sion, see D. Mumford, Abelian Varieties (Oxford, 1970), p.43, Corollary
3 or H.P.F. Swinnerton-Dyer, Analytic theory of abelian varieties (Cam-
bridge, 1974), Theorem 32 or S. Lang, Abelian varieties (New York and
London, 1959), Chapter II, Theorem 4.
All we shall need is the
Corollary. Let x, be as above and let
Then f g k(x), and
[k(X) . fc(O] = [fc(x) : fc(O] = d(^).
Proof. For ф(—x) = —<^>(x) = —£.
For any two endomorphisms ф, ф, we defined the sum ф + ф and the
product фф by
(^> + V’)(x) = ф(х) + ф(х),
(^)(x) = ^(V>(x)),
where x is a generic point. It is readily verified that this gives the set of
endomorphisms the structure of a (not necessarily commutative) ring.
114
Lectures on Elliptic Curves
Lemma 2.
Л(ФФ) = <1(ф)<1(ф).
Proof. Clear.
Lemma 3.
с1(ф + ф) + <Рф — ф) = 2d(</>) + 2d(i/i).
Proof. Let x = (ar, у) be a generic point, and put
</>W = Cl,
V’W = C2,
(</> + l/r)(x) =C3,
(</> - l/,)(x) = C4,
so
Сз = Ci + Сг, C4 = Ci — Ca-
Then
c, e fc(rr), (j = 1,2,3,4)
where
С; =(Сулу)-
We argue as in the corresponding results for heights (§17, Lemma 4).
The degree of an element of fc(ar) corresponds to the height of an element
of Q. As fc(ar) has no archimidean valuations trivial on fc, the results are
more precise.
By the formula for sum and difference, we have
L : Сз 1- C- •• СзС » =
(Cl -C2)2 :2(CiCa+A)(Ci+C2)+4B
: C1C2 — 2AC1C2 — 4B(Ci + C2) + ^42-
A similar argument to that for heights30 gives
<ШСз) + deg(C4) = 2deg(Ci) + 2deg(C2),
where “deg” is the degree as a rational function of x (= maximum of
the degrees of numerator and denominator.)
This result now follows from Lemma 1, Corollary.
30
cf. also (*) of §17
The endomorphism ring
115
Corollary. There are r,s,i£ ~L, depending on ф, ф, such that
сЦтф + пф} = rm2 + smn + in2
for all m, n G Z.
Further,
r > 0, i > 0, s2 — rt > 0.
Proof. The first part follows exactly as for heights.31 For the second,
d(.) > 0 by definition, so the quadratic form in m, n is positive semi-
definitive or definite.
The rest of this section is not needed for the application to finite fields.
By abuse of notation we denote the constant endomorphism by 0 and
the identity endomorphism </>(x) = x by 1.
Lemma 4. Every endomorphism ф satisfies a quadratic equation
ф2 — зф + t = 0,
where s, t € Z.
Proof. By the preceding Lemma,
d(m + пф) = m2 + smn + tn2
for some s, t g Z and for all m, n g Z.
Let I € Z. Then
cl(fi> + I) = d\f — s — I) = I2 + si + t.
Hence by Lemma 2
с1^ф + 1)(ф-з-1))=(12+sl + t)2. (*)
But
(ф + 1)(ф-3-1) = ф2 -зф-1(з + 1). (**)
Hence and by Lemma 3, Corollary, with ф2 — зф, 1 for ф, ф, we have
d(</>2 — зф + n) = (— n + t)2
for all n € Z. In particular,
d(</>2 — зф + t) = 0.
But the only endomorphism of degree 0 is the constant endomorphism
0.
31
cf. §17, Exercise 2.
116
Lectures on Elliptic Curves
Note. As was shown by Deuring, the endomorphism ring is isomorphic
to one of:
(0 Z.
(ii) a ring of integers in an imaginary quadratic field
(iii) a ring of integers in a generalized quaternion skew field.
The last case can occur only in characteristic p O', and the skew field
is very special.
§24 Exercises
1. Suppose that the ground field contains an element i with i2 = —1
and that its characteristic is not 2. Let C be Y2 = X3 + AX for some
A 0. Show that
г : Y -> iY, X -» X
is an endomorphism.
Construct the endomorphism 1 + e and check that (1 + e)2 = 2e as
endomorphisms.
2. Suppose that the characteristic of the ground field is not 2 or 3 and
that it contains p with p3 = 1, p 1. Let C be Y2 = X3 + В for some
В 0. Show that
A : X -> pX, Y-*Y
is an endomorphism. Construct the endomorphism A —A2 and show that
(A-A2)2 = —3 as endomorphisms.
3. Suppose that the characteristic of the ground field is not 2. For a 0
determine the b such that the isogenous curves
C: Y2 = _Y(X2 + aX + b)
Ci : Y2 = X(X2 - 2aX A a2 - 4b)
are birationally equivalent over the algebraic closure. Show that they
are equivalent over the ground field provided that —2 is a square in it.
Denote the isogeny, considered as an endomorphism of C, by p. Show
that p2 = —2 as endomorphisms.
4. Let
ф'.С-^С
be an endomorphism and suppose that
ф2 — зф + t = 0 s,t G ~L.
Exercises
117
For positive integer m show that ф = фт satisfies
Ф $ тф 4" — 0,
where sm, im € Z are defined as follows. Let a, /9 e Q be the roots of
T2 - sT + t = 0. Then
„ m i /ЭТП i „ m nm
H" P i CE p
5. (i) Let ф be an endomorphism and define </>' by
</>' = </> if ф S Z;
otherwise ф' = s — </>, where ф2 — зф + t = 0. Show that
фф1 = ф'ф = d(</>).
(ii) Let x be a generic point and let f be the points of C defined
over t(x) (fc = ground field) such that </>(^y) = x (with appropriate
multiplicities if ф is inseparable). Show that
= 52 X>
(addition on C).
(iii) If ф is another endomorphism, show that
(ФФУ = ф'ф'
and
(</> + ф)' = ф + ф'.
25
Points over finite fields
We denote by F? the field of q elements and denote its characteristic
by p, so q is a power of p. Our objective is the
Theorem 1. Let
C: Y2 = X3 + AX + В
be an elliptic curve over a finite field F?. The number N of points of C
defined over F? satisfies
|V-(? + l)| <2//2.
We shall give the main idea of a proof but will have to be impressionist
on one of the ingredients. Because of our canonical form, we shall assume
that p fi 2, 3. Note that N includes the point о “at infinity”. At the
end of the section we shall indicate the proof of a couple of other results.
Let x = (a:,y) be a generic point. We show that ^>(x) = (xq,y4') is
also on the curve. Indeed, since we are in characteristic p |
(у9)2 =г3+Аа; + В
= (a;’)3 + Agxg + Bg
= (г’)3 + Axg +B,
as A4 — A, Bg = B. This is the Frobenius endomorphism.
Now let u = (u,v) be a point defined over the algebraic closure Fp.
Then
</>(u) = (us, V*),
25: Points over finite fields
119
so u is defined over F? precisely when it is a fixed point of ф or, what is
the same thing, when
(</> — l)u ~ o,
where 1 is the identity endomorphism and ф — 1 is defined in terms of
the endomorphism ring. 3
In the notation of the previous section, clearly
d(</>) = q
and so by §24, Lemma 3, Corollary
d(</> — 1) = g — s + 1
where
s2 < 4g, |s| < 2g1/2.
We have seen that a point defined over F? is actually defined over
F? precisely when it is the kernel of ф — 1. But the degree of an endo-
morphism is equal to the number of algebraic points in the kernel, each,
counted with its multiplicity. If therefore we can show that the points
of the kernel of ф — 1 have multiplicity 1, we are done.
It is here that we have to leave a lacuna. One argument, which can be
made precise, is to observe that dxg/dx = qx4-1 = 0 in characteristic p,
and so the differential of the map ф — 1 is the same as that of the map
— 1, and hence never 0.
Note. The result is due to Hasse by essentially the same proof. It is of-
ten referred to as the “Riemann hypothesis for function fields” (of genus
1) because of an analogy with Riemann’s notorious unproved conjecture
about the zeros of the usual (“Riemann”) zeta function. It was general-
ized to curves of any genus by Weil and to algebraic varieties by Deligne.
The analysis of the action of the Frobenius map ф is still a central theme
of modern arithmetic geometry.
Theorem 2. Let T> be a. curve of genus 1 defined over F?. Then it has
a point defined over F?.
Proof. We developed the theory of the jacobian in characteristic 0, but
it holds for general characteristics. Let C be the jacobian of T> and let ®
be the group of points on C defined over F?. It is enough to show that
Я^Г,®)
is trivial, where
Г = Gal(F,/F,).
120
Lectures on Elliptic Curves
The group Г is generated32 by the Frobenius automorphism 7 (say):
a —> ag. We have to show that any cocycle {ao} is trivial. It is enough
to show that
a7 = 7b — b
for some b £ (S. Now
7b — b — (</> — l)b
where </> is the “geometrical” Frobenius, so ф — 1 is not the constant
endomorphism. For any c g 6 we can thus solve (</> — l)b = c for b,
since we are working in the algebraic closed field. In particular, this
holds for c = a7. The cocycle identity gives inductively that
a, = <zb — b <7 = 7,72,73,...
and we are done.
Note. For a broad generalization, see S. Lang, Algebraic groups over
finite fields. Amer. J. Math. 78 (1956), 535-563.
The Theorem is due to F.K. Schmidt and the idea behind his proof
is amusing. He used analytic means to estimate the number of points
defined over the extension fields F?„. In particular, he showed that the
number is > 0 for all large enough n.
Let bj,... , bn be n conjugate points defined over F?„ and Ci,..., cn_|-i
be similar conjugates defined over F?n+i. Then by Riemann-Roch there
is a function whose poles are simple poles at the c; and which has simple
zeros at the by. It has one further zero; which must be defined over F?.
Theorem 3. Let
A : Cj C2
be an isogeny of elliptic curves, everything defined over F?. Then Ni =
N?, where Nj is the number of points on Cj defined over F?.
Note. An isogeny is defined to be a rational map onto such that
A(oj) = o2. Lemma 1 of the preceding section extends to isogenies,
which gives compatibility with the usage earlier in the course.
Proof. Let ф} be the Frobenius on Cj. Clearly the diagram
SI “topologically”, that is the group generated by -y is everywhere dense in the
galois topology.
§25: Exercises
121
Ci A Ci
ь Iх
c, c,
is commutative, and hence so is
Ci Ci
Iх Iх
c, c,
It follows that the degrees
d(<fa ~ 1) = 4^2 ~ !)
are equal. But (proof of Theorem 1), this is just Nj = N?.
Example. The numbers of solutions of
y2 = x(x2 + ax + b) (mod p)
and
i/2 = x(x2 — 2ax + a2 — 4b) mod p)
are equal, where a, b are integers and p is any prime with
2b(a2 — 4b) 0 (mod p).
§25. Exercises
1. Let p be prime, p = 2 (3). Show that the number of points on the
elliptic curve
Y2 = X3 + В
defined over F,, is p + 1.
[Hint. Given Y, solve for X].
2. Let p be prime, p = 3 (4). Show that the number of points on the
elliptic curve
Y2 = X(X2 + A)
defined over Fp is p+ 1.
[Hint. Consider ±X together].
122
Lectures on Elliptic Curves
3. Let C be an elliptic curve defined over F? and let IV(n) be the number
of points defined over F?, where q = pn. Show that there are a, /3 G Q
such that
a/3 = p
and
N(n) = pn + 1 - an - /3n.
Hence show that all the IV(n) are determined by the value of Af(l).
Hence determine Д1(2) for
Y2 = X3 +X + 1,
with p — 3.
[Hint. §24, Exercise 4].
4. [Preparation for next exercises.] Let A 2> Z be a commutative ring
without divisors of 0 [an integral domain]. Suppose that every A 6 A
satisfies an equation A2 + “A + Ь = 0 (a, b G Z, depending on A). Show
that either A = Z or A = Z[a] for some single element a G A.
5. Let p = 1 (4) be prime and
C: Y2 = X(X2 + A)
an elliptic curve defined over F?. Let 9 G F?, 92 = — 1. Show that
£ : Y 0Y, X -X
is an endomorphism of C, and that e2 + 1 = 0.
Let ф be the Frobenius. Show that Фе = еф and deduce that
ф = и + vs
for some u, v G Z with u2 + v2 = p-
Show, further, that the number N of points on C defined over F? is
N — p + 1 — 2u.
Evaluate for some A andp and check that и (say) = |(p+l— N) G Z
and satisfies u2 + v2 = p for some v G Z.
6. Let p = 1 (3) and let
C : Y2 = A'3 + В
be an elliptic curve defined over F?. Let в £ F,, S3 = 1, в / 1. Show,
that
A : Y -> Y, X OX
is an endomorphism of C and that A2 + A + 1 = 0.
§25: Exercises 123
Show that the Frobenius ф satisfies фХ = Хф. Now continue as in the
previous Exercise.
7. Let
С: У 2 = .¥(A2 +4CX + 2C2)
be an elliptic curve defined over F?, where p is prime and —2 is a.
quadratic residue. Show that the number N of points is of the shape
N ~ p + 1 — 2u,
where и £ ~L and there is a v 6 I such that
u2 + 2v2 — p.
[Hint. §24, Exercise 3.]
26
Factorizing using elliptic curves
The problem of finding a factor of a given large integer has fascinated
mathematicians through the ages. Recently the question has assumed
practical, and indeed political, significance with the use of the products
of large primes in cryptology.
It is usually (but not always) easy to prove that a given composite
integer n is composite, e.g. if there is an a > 1 with a"-1 1 mod n.
But finding an actual nontrivial factor is a completely other matter!
For the logician, of course, the problem of factorizing an integer n
is constructive. All one has to do is to test all integers m < n1/2 for
divisibility. When, say, n has 100 decimal digits, this could take longer
than the age of the universe. What are needed are practical methods.
Recently H.W. Lenstra Jr. has shown that elliptic curves provide
powerful methods for this problem. We will sketch one of his attacks.
Lenstra’s method is suggested by Pollard’s “p — 1 method”. Let n be
a large integer with an unknown prime factor p. Let a be an integer and
consider
m = gcd(at — 1, n)
for some integer k. If к | (p — 1) then p | m. Unless we are unlucky, not
all the other primes q | n will divide m; and so m would be a nontrivial
factor of n.
One does not evaluate a1, of course, but works modulo n. There is
an algorithm which works in O(log к) steps (cf. Exercises). Evaluating
the ged is cheap, using Euclid’s algorithm.
Pollard’s method is particularly effective if n is divisible by a prime
26: Factorizing using elliptic curves
125
p for which all the prime factors of p — 1 are comparatively small. The
accepted recipe is to take к of the shape33
к = fc(b) = JJ pe(s),
where q runs through the primes and is the longest power of q which,
is < b. Here b is chosen suitably, in a way which will be described later.
The chances of success with this method of Pollard’s appear to be
best when the smallest prime factor p of n is substantially smaller than
n1/2. But even, then, we may be out of luck if p — 1 has some largish
prime factors. One can try to find a value of a whose exponent mod p
is substantially smaller than p — 1, but that is not very promising.
Lenstra observed that Pollard’s method depends on the fact that the
residue classes mod p have a group structure, and that elliptic curves
provide other groups which can be used for the same purpose.
Let
C: Y2Z = X3 +AXZ2 +BZ3
be an elliptic curve and let (r,i/,z) with x, y, z G Z be a point on it.
Let
k(x,y,z) = (xk,yk,Zk),
where к > 1 is an integer and Xk, у к, Zk G Z. Now let p be a prime, and
suppose that C mod p (in an obvious sense) is an elliptic curve over Z
mod p. The mod p points form a group whose order
N = Np = Np(A,B)
satisfies
H — (p + 1) |< 2-^/p.
If N | к, the point (n, Ук, ?k) mod p is the “point at infinity”, that is
P I zk-
Given A, B, x, y, z, values of (xk,yk,Zk) can be computed in O(logfc)
steps involving addition, multiplication subtraction. Since we are using
homogeneous co-ordinates, there is no need to divide. The resulting
values of Хк, Ук, zk may have a common factor, but this does not disturb
the conclusion that Np | к implies p | Zk-
Now let n be the large integer to be factorized and let к = k(b) for
some suitable b, as before. Then we can evaluate Xk, Ук, zk mod n in
O(log fc) steps of addition, multiplication, subtraction modulo n. The
33
That is, к is the gcd of the integers < b.
126
Lectures on Elliptic Curves
unknown prime divisor p of n will divide z^ mod n provided that | k:
and then p divides
m = gcd(n,zt).
If z^ = 0 (mod n), we axe out of luck. Otherwise m will be a nontrivial
divisor of n: which is what we want.
It can, of course, happen that m = 1, if Np / к for all p | n. If this
happens, we select other values of A, B, x, y, z (and, possibly, fc) and
try, try, try again.34
The above account leaves a couple of questions unanswered.
(i) How do we choose the initial curve C and the point (a;, y, z)l Since
all the calculations axe mod n, it is enough to find A, B, x, y, z G Z
such that
y1 = x3 + Axz2 + Bz3 mod n.
An obvious way is to put z = 1, choose A, x, у at random and use the
equation to determine B. Since we naturally suppose that we started
off' by checking that n has no small divisors, the chance that C is not
an elliptic curve for any p | n is negligible. In any case, there is no
harm in running through the algorithm: at worst we will draw a blank.
Alternatively, one can compute
I = gcd(n, 4A3 + 27B2).
If I — 1, we axe OK. If 1 < I < n, we have a non-trivial factor of n,
which is what we want. If I = n, which is highly unlikely, we abort the
run and choose fresh A, B, x, y, z.
(ii) What is the optimal choice of b in к = к(Ь)? It turns out that
this depends on the smallest prime divisor p of n: which is, of course,
unknown. We argue heuristically.
Let 1 < s < t, where t is an integer. We say that t is s-smooth if
every prime divisor q of t is less than s. It is known that the number
of integers t < T, for given Г, which are T^-smooth is very roughly
u~^uT. Put
L = L(T) = eXp(y(logTloglogT))
and let 0 < a < oo. On putting T1/” = L“, we deduce that the number
of t < T which are L“-smooth is roughly L~1f2aT. We shall paraphrase
З4 For the distribution of Np over curves, see B.J. Birch: How the number of
points of an elliptic curve over a fixed prime varies. J. London Math. Soc. 43
(1968), 57-60.
26: Factorizing using elliptic curves
127
this to the statement that the probability P that a random integer t in
the neighbourhood of T is L“-smooth is P = L~xl2a.
We shall choose the best value of a later. Let p be the unknown
smallest prime factor of n. Put L = L(p). We have seen that the order
Np = Np(A, B) of the points mod p on C is approximately p. Assuming
that Np behaves reasonably randomly as A, В vary, the probability P
that Np is ^“-smooth is P =
Take
b = La, k = k(b).
Then all the prime factors of Np divide k. The practitioners of the
mystery of factorization assume that it is highly probable that indeed
Np | k, which we suppose. The number of steps in one run of the
algorithm is O(log k), which is very roughly b = La.
To sum up. The amount of work in a run of the algorithm is about
La. The probability of success in a single sun is about L-1/2“. Hence
the expected work to find a nontrivial factor is about
Ila
This is minimized at
a = 1/^/2,
which is therefore the optimal choice.
The above estimates depend on the size of the unknown least prime
factor p of n. The worst case scenario is when p is nearly n1^2. However,
one expects the Lenstra algorithm to be most effective when the small-
est prime factor is much smaller. Thus it works better on “naturally
occurring” integers n than on the integers n used in some cryptosys-
tems, which are the product of two nearly equal primes. If nothing is
known a priori about the primes in n, a good strategy is to start with a
comparatively small b and to increase it gradually if necessary.
We have chosen a version of the Lenstra algorithm which is easy to
describe, rather than one which minimizes computation time. In prac-
tice, further devices and stratagems are brought into play. We do not
go into this here, but conclude with a variant in the spirit of the course.
In the variant, one considers the elliptic curve
C : CY2 = X3 + AX + В
for some C 0, where we now take the inhomogeneous form. Recall
that if (ariji/i) and (яг2,1/2) are points on the curve and
(1'з.Уз) = Oi,yi) + (2.-2,y2),
(1'4,У4) = (a?i, Vi) — (я?2, У2)
128
Lectures on Elliptic Curves
then гз, X4 are the roots of a quadratic equation whose coefficients
are polynomial in arj, x2. A, В (but not C). If now к is a positive
integer and if, to change the notation, (a;, y) is a rational point on C and
1/fc) — k(x,y), then the classical algorithm for computing gk can
be modified to give an algorithm to compute x^ in O(log fc) steps (cf.
Exercises).
Now write X = U/V and work homogeneously. If x = u/v, then
= Xk/v^ where щ., are obtained from u, v by O(log fc) additions,
subtractions and multiplications, but no divisions.
Now, as before, let n be the number to be factorized and p an unknown
prime divisor. Suppose that A, B, u, v £ ~L and work mod n. then, as
before, if Np | к then p | Vk and we can expect that gcd(v*,n) is a
non-trivial divisor of n.
In this version of the algorithm we may choose A, B, u, v entirely
arbitrarily. Put x = u/v, у = 1. Then, unless we are strikingly unlucky,
the point (.t, y) lies on C for some С E Q* which need not be evaluated,
as it is never needed.
Elliptic curves are used also in primality testing and in other unex-
pected ways: for example, finding square roots modulo a large prime.
See A.K. Lenstra and H.W. Lenstra Jr., Algorithms in number theory.
Chapter 12 (pp.673-715) of: Handbook of theoretical computer science,
vol. A (ed. J. van Leeuwen), Elsevier, 1990.
§26. Exercises
1. [Motivation for next question.] Let G be an abelian group and n a
positive integer. For g E G show that the following algorithm computes
gn in O(logn) operations.
(i) N = n, Y = 1 e G, Z = g
(ii) IF N = 0, GOTO END
(iii) M = [N/2], E = N -2M
(iv) IF E = 1 THEN У = YZ
(v) N = M, Z = Z'1
(vi) GOTO (ii)
END [У = gn],
2. Let С : У2 = X3 + AX + В be an elliptic curve. For positive
odd integer n and a = (a,b) on C, check that the following algorithm
computes u, where na = (u,d), *n O(logn) steps.
§26: Exercises
129
We recall that there is a rational function cf(ar) such that if x = (a:, y)
then 2x = (d(x), ?) for some ?.
Further, there is a quadratic q(T) = q(T-, a?i, £3) whose coefficients are
rational in xj, a;2 and whose roots are Г3, r4 if X3 = Xj+хз, x4 = xj—x2.
(i) N = n, X = a, Y = a, Z = a
(ii) IF N = 0, GOTO END
(iii) M = [W/2], E = N - 2M.
(iv) Z = d(Z).
(v) IF E = 1 GOTO (viii)
(vi) [Check that Y is a root of g(T; X, Z).] Y IS THE OTHER ROOT
OF q(T-,X,Z)
(vii) GOTO(ix)
(viii) [Check that X is a root of Q(T; Y, Z).] X IS THE OTHER ROOT
OF q(T-Y,Z).
(ix) N = M
(x) GOTO (ii).
END [X = u, where n(a,b) = (u,v).]
3. Suppose that (a,b) lies on
C* : EY2 = X3 + AX + B
for some E 0. Let n(a, b) = (u, v) on C*. Show that u is given by the
algorithm in (2). [i.e. the algorithm is independent of £.]
Formulary
Desboues’ Formulae*5. These are for
cii A3 T T a3A3 4" dAj A2A3 = 0.
This is nonsingular if 27«ja2a3 4- d3 ф 0. The residual intersection t of
the tangent at x is
tj = xj(aj+ixj+i — aj+2-'rj+2) (j taken mod 3)
The third intersection z of the line joining x, у is
zi = x2jyj+iyj+2 - y2jxj+ixj+2 (j mod 3).
Canonical curve.
Y2 = A3 4- AX 4- B.
If x = (г, у), then —x = (ar, — y).
Addition formula. Let
X1 = x2 = (Х2,У2)-
and
x = Xi + x2, x = (rr, y).
З5 A. Desboves. Resolution en nombres entiers et sous sa forme la plus generale,
de I’equation cubique, homogene a trois inconnues. Nouv. Ann. de la Math.,
Ser. Ill, vol. 5 (1886), 545-579.
Formulary
131
If X2 = —Xi, we have x = o. If X2 = Xi, we apply the duplication,
formula, given below. Otherwise, we may suppose that
2*2 7^ 1 *
The line joining Xj, X2 is
Y = IX + m,
where
, У1-У2 Х1У2-Х2У1
I = -------, m = -----------------.
X1 — 2?2 2*1 — 2*2
This line cuts the curve in Xi, X2 and
-(xi + x2) = -x = (2, -y).
The roots of
X2 + AX + В - (IX + m)2
are 2i, 1*2 and x. Hence
x = I2 — ii — 22;
and so
(21 - 2*2)22 = XlX2 + x2x2 - 21/11/2 + + 22) + 25.
Further,
у = —lx — m;
and so
(2*i - 22)3t/ = W2J/2 - Wiyt,
where
Wj = 3212*2 + 2j + A(21 + З22) + 45
W2 = symmetric.
Duplication formula. Here we consider
(х-2,у2) = x2 = 2x = 2(2,1/).
If у = 0 we have X2 = o. Hence we may suppose
У /0.
We need the tangent
Y = IX + m
at x. Since formal differentiation on the curve gives
JV
2H-7V = 3-Y2 + A,
we have
I = (З22 + A)/2y.
132
Lectures on Elliptic Curves
Hence (as for addition formula)
x2 = I2 — 2x
(3a;2 + A)2 — 8ary2
4y2 ’
i.e.
a;4 — 2Aa;2 — 8Bx 4- A2
2 4(x3 + Ax + B)
To find y2 we need the value
—a;3 + Ax + 2B
m = ------------------,
2y
which is determined by у = lx + m. Now
y2 = —lx2 — m;
which gives
(2j/)3i/2 = xs + 5Aa;4 + 20Ba;3
- 5A2a;2 - 4ABx - A3 - 8B2.
Formulae in X only. Let
xi = (»i, 1/1), x2 = (x2,y2)
with
a;i ®2.
Let
x3 = xj + x2 = (а:з,2/з)
x4 = Xj - x2 = (a:4,j/4).
Then
(а?! - ат2)2(;г3 + x4) = 2(.т1а;2 + A)(a;1 + a:2) + 4B,
(ari — а.-2)2а.’за;4 = a;2a;2 — 2Ax1x2 — 4B(a;i + a;2) + A2.
This follows from the expression for x in the addition formula. The
value of a;3 is x as given and that of a;4 is obtained from it merely by
changing the sign of yiy?- Hence the formula for ar3 + a;4 is immedi-
ate. That for aa3ar4 comes by substituting for yjy2 in the product and
cancelling (arj — a;2)2. [Alternatively, cf. §17, Exercise 3.]
Formulary
133
Multiplication36. Let (Хт,Ут) = m(JX,Y) where m G Z. Then
„ _ Хфт ~ Фт-1Фт+1
A M 12 ’
Фт
Y —
m %Фт’
where
Фо = 0,
Ф1 = 1,
ф2 = 2У,
= 3W4 + 6AA2 + 12-BX - A2,
ф4 = 4У(Х6 + 5АХ4 + 20ВХ3 - 5А2Х2 - 4АВХ - 8В2 - А3),
ф2п+1 = Ф^фп+2 - Ф^+1Фп-1,
УФ2п = Фп{Фп-1Фп+2 - ф2п+1фп-2}-
This is an exercise on the fact that a function is defined up to multi-
plicative constant by its zero and poles. We determine the constants by
looking at the behaviour at о using the local uniformiser
t = X/Y.
фт is defined by
(i) it has a simple zero at all a о with ma = 0. (a defined over Q).
(ii) it behaves like mt~m +1 at o.
More precisely
(I) if m is odd, there are |(m2 — 1) pairs (ay ±bj) of m-division pairs
and
Фт = m JJ(W “ aj)-
(II) If m is even, the three 2-division points are m-division points, and
there are |(m2 —4) pairs (а,,±Ьу), bj 0. Then
фт = mY ]J(X - aj).
Now for all m, even or odd, we have
Xm~rn“2t-2, Ут~т~3Г3
36
cf. II. Weber, Algebra Ш, §58; but we have adjusted the sign of ‘фгп so that
the leading term is always positive.
134
Lectures on Elliptic Curves
at o, and
Ф2тхт
has no poles except at o.
Further, Xm — X vanishes at a only if (m + l)a = о or (m — l)a = o.
Hence
X — Xm — — х 1фт, (*)
where the constant is right since both sides behave like (m2 — l)/m2t2
at o. This gives the formula for Xm. That for Ym follows immediately
from the specification of the poles and zeros.
It remains to give the recurrence relation. For integers I, m we have
Xj = Xm precisely when either (I + m)(X, У) = о or (I — ттг)(Х, У) = o.
Hence
Xl - Xm = фт+1фт-1/ф2ф2т‘
the constant being determined by the behaviour at o. But
X, - Xm = (X - Xm) - (X - X()
Hence by (*)
ф2фт+хФт-1 ~ ф2тф1+1ф{-1
— Фт-$-1фт — 1-
Put I = 7i, m = n + 1, so фт-1 = 1 and
Фгп+l = ФпФп+2 ~ Фп+1Фп-1
Put I = n — 1, m = n + 1 so ф,п-1 = фз = Y. Then
i Ф?п = Фп{Фп — хФп+2 ~ Фп+1 Фп —1 }
Further Reading
Cassels, J.W.S. Diophantine equations with special reference to elliptic
curves, J. London Math. Soc. 41 (1966), 193-291.
Husemoller, D. Elliptic curves, Springer, 1987.
Koblitz, N. Introduction to elliptic curves and modular forms, Springer,
1984.
Lang, S. Fundamentals of diophantine geometry, Springer, 1983. [The
first edition is less complete but more coherent: Diophantine geome-
try, Interscience, 1962.]
Serre, J.-P. Lectures on the Mordell-Weil theorem, Vieweg, 1989. [Notes
of a course given in 1980-81]
Silverman, J.H. The arithmetic of elliptic curves, Springer, 1986.
Tate, J. The arithmetic of elliptic curves, Invent. Math. 23 (1974),
179-206.
INDEX
birationally equivalent 4
Birch 71, 110, 126(fn)
Blichfeldt 19
Bremner 55(fn)
canonical form 32 et 3eq
canonical height 83
Chatelet 108
chord and tangent processes 24
coboundary 90
cobounding 90
cocycle 90, 98
cocycle identity 90, 98
cocycle (continuous) 101
cohomology (Galois) 89 et seq
cohomology group 98 et seq
complete, completion 8
continuous (action), (cocycle) 101
convex (pointset) 18
cubic curves 23 et seq
defined over 3
degenerate (laws) 39 et seq
Deligne 121
Desboves 25(fn), 26, 130
Deuring 116
Diophantine geometry 1
Diophantos 1, 24
discriminant 77
elliptic curve 32
endomorphism 112 et seq
everywhere locally 14
exceptional (point) 24
Fermat 1, 55, 63
filtration (p-adic) 48
finite basis theorem 54 et seq
finite basis theorem (weak) 55
forgetful functor 75
form 13
Frobenius endomorphism 118
Fueter 52(fn)
function field 58
fundamental sequence 7
Galois cohomology 89 et seq, 101
et seq
general position 29
generic point 58
genus 30
genus 0 4 et seq
genus 1 30, 32
globally 14
group law 27 et seq
H1 99
Hasse 119
Hasse principle:
see local-global principle
height 55, 78 et seq
height (canonical) 83
height (logarithmic) 82
Hensel 43
'Hilbert 90’ 95,97
homogeneous spaces: see princicpal
homogeneous space
Hypatia 1
integer (p-adic) 9
invertible 67
irreducible (curve);
see also reducible 24
isogeny 58
jacobian (of curve of
genus 1) 92 et seq, 95, 107
j-invariant 93
kernel of reduction 47
Kolyvagin 111
Lang 120
Lenstra 124, 128
level (of point in p-adic case) 47
lift 43
Lind 85
locally 14
local-global principle 2, 13 et seq,
85 et seq
localization 14, 103
logarithmic height 82
Mazur 51
Minkowski 19
Mordell 19
Mordell Theorem, Mordell-Weil
Theorem; see finite basis theorem
Index
137
multiplicity 23, 44
Nagell 34(fn), 52(fn)
neutral element (of group) 27
Newton 24, 43
nonsense 98 et seq
non-archimedean 7
non-singular 24
norm (map) 66
patch 67
pole 30
Pollard 124
principal homogeneous spaces 104
et seq
p-adic filtration 48
p-adic integers 9
p-adic numbers 6
p-adic units 9
p-adic valuation 7
rational curve
(= curve of genus 0) 3
rational (point etc.) 3
reducible (curve):
see also irreducible 43(fn)
reduction mod p 42 et seq
Reichardt 85
resultant 75 et seq
“Riemann hypothesis for
function fields” 2, 119
Riemann-Roch theorem 30
Rubin 111
Schmidt 120
Selmer 87, 110
Shafarevich 85
singular (point) 23
Swinnerton-Dyer 71, 110
symmetric (pointset) 18
Tamagawa number 110
Tate 85, 109(fn)
Tate-Shafarevich group 85, 109 et
seq
torsion 102
triangle inequality 7
ultrametric inequality 7
unit (p-adic) 9
valuation 6
valuation (p-adic) 7
van der Corput 19
weak finite basis
theorem 55, 66 et seq
Weil 1, 54, 108, 119
Weil-Chatelet group 108
INDEX
birationally equivalent 4
Birch 71, 110, 126(fn)
Blichfeldt 19
Bremner 55(fn)
canonical form 32 et seq
canonical height 83
Chatelet 108
chord and tangent processes 24
coboundary 90
cobounding 90
cocycle 90, 98
cocycle identity 90, 98
cocycle (continuous) 101
cohomology (Galois) 89 et seq
cohomology group 98 et seq
complete, completion 8
continuous (action), (cocycle) 101
convex (pointset) 18
cubic curves 23 et seq
defined over 3
degenerate (laws) 39 et seq
Deligne 121
Desboves 25(fn), 26, 130
Deuring 116
Diophantine geometry 1
Diophantos 1, 24
discriminant 77
elliptic curve 32
endomorphism. 112 ei seq
everywhere locally 14
exceptional (point) 24
Fermat 1, 55, 63
filtration (p-adic) 48
finite basis theorem 54 et seq
finite basis theorem (weak) 55
forgetful functor 75
form 13
Frobenius endomorphism 118
Fueter 52(fn)
function field 58
fundamental sequence 7
Galois cohomology 89 et seq, 101
ei seq
general position 29
generic point 58
genus 30
genus 0 4 et seq
genus 1 30, 32
globally 14
group law 27 et seq
H1 99
Hasse 119
Hasse principle:
see local-global principle
height 55, 78 et seq
height (canonical) 83
height (logarithmic) 82
Hensel 43
‘Hilbert 90’ 95,97
homogeneous spaces: see princicpal
homogeneous space
Hypatia 1
integer (p-adic) 9
invertible 67
irreducible (curve);
see also reducible 24
isogeny 58
jacobian (of curve of
genus 1) 92 ei! seq, 95, 107
j-invariant 93
kernel of reduction 47
Kolyvagin 111
Lang 120
Lenstra 124, 128
level (of point in p-adic case) 47
lift 43 '
Lind 85
locally 14
local-global principle 2, 13 ei seq,
85 et seq
localization 14, 103
logarithmic height 82
Mazur 51
Minkowski 19
Mordell 19
Mordell Theorem, Mordell-Weil
Theorem; see finite basis theorem
Index
137
multiplicity 23, 44
Nagell 34(fn), 52(fn)
neutral element (of group) 27
Newton 24, 43
nonsense 98 et seq
non-archimedean 7
non-singular 24
norm (map) 66
patch 67
pole 30
Pollard 124
principal homogeneous spaces 104
et seq
p-adic filtration 48
p-adic integers 9
p-adic numbers 6
p-adic units 9
p-adic valuation 7
rational curve
(= curve of genus 0) 3
rational (point etc.) 3
reducible (curve):
see also irreducible 43(fn)
reduction mod p 42 et seq
Reichardt 85
resultant 75 et seq
“Riemann hypothesis for
function fields” 2, 119
Riemann-Roch theorem 30
Rubin 111
Schmidt 120
Selmer 87, 110
Shafarevich 85
singular (point) 23
Swinnerton-Dyer 71, 110
symmetric (pointset) 18
Tamagawa number 110
Tate 85, 109(fn)
Tate-Shafarevich group 85, 109 et
seq
torsion 102
triangle inequality 7
ultrametric inequality 7
unit (p-adic) 9
valuation 6
valuation (p-adic) 7
van der Corput 19
weak finite basis
theorem 55, 66 et seq
Weil 1, 54, 108, 119
Weil-Chatelet group 108
17
Heights. Finite Basis Theorem.
We are now in a position to introduce the notion of height, and so to
complete the proof of the Finite Basis Theorem.
Let u = («о, • , un) be a point of projective n-dimensional space over
Q. As the co-ordinates are homogeneous, we may suppose without loss
of generality that
Uj £ Z, gcd(u0,.-.,wn) = 1. (1)
The height H(u) of u is defined to be
H(u) = max |uy |
j
with the above normalization. In this section || = Цоо is the absolute
value.
We shall mainly but not exclusively be concerned with the projective
line. We identify x € Q with the point (т, 1) on the line, and so write
= max{|u01, |ui |}
if x = uo/uj with Uq> 6 Z as a fraction in its lowest terms.
Lemma 1.
(i) Let D(Uo, U1), E(U0,Ui) € Q[Uo-,Ui] be forms of the same degree
n. Let u = (uojUj) be a point on the rational projective line, and
suppose that D(u), -®(u) do n°i both vanish. Then
H(D(u),S(u))<cH(u)n,
where c is independent of u.
11: Heights. Finite Basis Theorem.
79
(ii) Suppose, further, that the resultant of D, E is not 0. Then there
is a 7 > 0, independent of u, such that
H(D(u),E(u)) > 7H(u)n.
Note. The additional hypothesis in (ii) is equivalent to supposing that
D, E do not have a common zero over the algebraic closure Q.
Proof. By homogeneity, we may suppose that
and that u = (ti0,Ui) is normalized by (1). Clearly
|B(u)|, |S(u)| < c{max(|u0|, |ui|}n
for some c. In general -D(u), E(u) will have a common factor, but in
any case this implies the conclusion of (i).
Now suppose that the hypotheses of (ii) hold and let R be the re-
sultant. Then there are homogeneous forms Lj(Ua,U\'), Mj(U0,Ui) G
Z[I7o, СЛ] (j = 0,1) such that
LjD + MjE = RUjn~1 0=0,1). (*)
On substituting u for U we deduce that
gcd{D(u),S(u)}|fi.
Further, as in the proof of (i), there is a c' such that
lAG(u)l < c'{max(|u0|,|ui|}n~1 (j =0,1).
On substituting in (*) (with u for U), we obtain the conclusion of (ii)
with 7 = |Я|/2с'.
Now let u, v be two points on the projective line and let
w = («o^o, «0^1 + «1^0,«1^1)
= (wo,wi,w2) (say).
Lemma 2.
1 < H(w) <
2 - H(u)H(v) - ’
Proof. Let u, v be normalized by (1). Then the right hand inequality
is immediate.
It is readily verified that w0, uq, w2 have no common factor, so it will
be enough to show that
max(|w0|, |uq|, |w2|) > |{max(|u0|, |tq |)}{max(|tq |, |tq|};