Arithmetic and Geometry I PDF
Arithmetic and Geometry I PDF
Vol. 35
Edited by
J. Coates and
S. Helgason
Arithmetic and
Geometry
Papers Dedicated to
I.R. Shafarevich on the Occasion
of His Sixtieth Birthday
Volume I Arithmetic
Michael Artin,
John Tate,
editors
Birkhauser
Boston - Basel. Stuttgart Birkhauser
Boston Basel Stuttgart
Editors:
Michael Artin John Tate
Mathematics Department Mathematics Deparhnent
Massachusetts Institute of Technology Harvard University
Cambridge, MA 02139 Cambridge, MA 02138
Igor Rostislavovich Shafarevich has made outstanding contribu-
This book was typeset at Stanford University using the TEX document preparation tions in number theory, algebra, and algebraic geometry. The flour-
system and computer modern type fonts by Y. Kitajima. Special thanks go to ishing of these fields in Moscow since World War I1 owes much to his
Donald E. Knuth for the use of this system and for his personal attention in the de- influence. We hope these papers, collected for his sixtieth birthday,
velopment of additional fonts required for these volumes. In addition, we extend will indicate to him the great respect and admiration which mathema-
thanks to the contributors and editors for their patience and gracious help with im- ticians throughout the world have for him.
plementing this system. Michael Artin
Igor Dolgachev
Library of Congress Cataloging in Publication Data John Tate
Main entry under title: A.N. Todorov
Arithmetic and geometry.
To Z.R. Shafarevich
1. Introduction
where
AOKI AND SHIODA GENERATORS OF THE N ~ R O N - S E V E R IGROUP 3
L= { 1
0
(m :
(m :
even)
odd)
Futhermore, it is expected that a similar approach should be applicable
to Fermat varieties of higher dimension for the explicit construction of
Next we recall the following structure theorem of B&, which has been
formulated and partially proven in Shioda [4] and recently fully proven by
More precise statements will be given in $3. Aoki [I]. The formula (1.2) is a consequence of this result in view of (2.3).
Concerning the structure of the Ndron-Severi group of the (complex) +
We call an element a = (ao, a l , az, a a ) E B:n decomposable if a; a j =
Fermat surface X$,, the following problems remain to be studied: 0 (mod na) for some i # j , and indecomposable otherwise.
i) to find curves corresponding to the "exceptional" term ~ ( m ) ; T h e o r e m (8%). (i) If (m, 6) = 1, then 8; consists of decomposable
ii) to find genwators of NS ( X k ) over P. elements.
4 AOKI AND SHIODA
If a = ( a o ,a l , a 2 , a 3 ) is an element of 8 % with G C D ( a i ) = d > 1, Proof. See the proof of [4, Thm. 71.
set a: = a i / d and m' = mid. Then a' = (ab,a:,aL,a:) is a n element
f: X % -
of Bk, with GCD(a:) = 1, and one checks easily that the morphism
X L , given by ( x i ) H ( 2 9 ) induces the map
f *: H 2 ( x & , ) - + H 2 ( x : )
In the next section, we shall exhibit certain curves on X & which rep-
resent the "standard" indecomposable elcments of 8 % stated in Theorem
( 8 % ) . Note that, for this purpose, it sufices to consider the case a = a l l
P I , and 7 1 ,because in general we have V(a)" = V ( t a )for any automor-
such t h a t f * V ( a l )= V ( a ) .Thus, in order to construct explicit curves on phism a of C inducing at : < H ct on the subfield of m-th roots of unity
X k corresponding to V ( a ) ,it sufices to consider the case G C D ( a ; ) = 1. ( t E ( Z / m ) X )while
, cl.(C)O = c l . ( C O ) ,C 0 being the conjugate of C urder
Now define, for any character a of G k , o (this niakes sense since our surface X : is defined over Q).
3. Main Results
Then p, is the projector of H;,.,,,(Xk) to V ( a ) .In particular, if a E BL
and if E = cl.(C) is the class of an algebraic cycle C on X;, then p,(E) ia Theorem 1. Let m = 2 d ( d > 1 ) and a1 = ( 1 , d + 1, m - 2, d). Let
a n ulgebraic cohomology class generating the subspace V ( a )provided that C denote the curve of degree m i n P3 defined by
p,(t) # 0. Note that p a ( ( ) # 0 if and only if the "intersection number"
p a ( t ) . E Z I # 0. (3.1) z2 - q i x y = 0 , xd + yd + G w d = 0.
T h e n C is a nonsingular irreducible curve lying o n X L which represents
Definition. Given a E 8% and a curve C on X L , we say (for short) t h a t the Hodge class a1 and which satisfies
C represents the IIodge class a if the following two conditions are satisfied.
Let G = G&.
T h e n C is a nonsingular irreducible curve lying o n X& which represents 4. Proof of Theorems 1 and 2
4
the IIodge class 71 and which satisfies
First of all look at the polynomial identities:
T h e n C is a nonsingular irreducible curve which represents the Hodge class By the adjunction formula, the self-intersection number of C is given respec-
pl and which satisfies tively by
Since Gc is contained in I'Cer(al), we can define the element w c of Now we are ready to compute w c . ~ j cby (2.6). Let K ( a ) denote the
NS(Xk) @I C m in (2.6): image of Ker(u) in GIGc, and let K(u)* -- K ( u ) - {id.). Similarly we
define K(r), K(p), etc. Then
First we note that the curve C' in P 3 defined by (3.7) actually lies on
X& in case m = 2d with d even. This follows from the elementary identity
Case In this case, we have a(g) f 1, 1 = 3 = *I. The inter-
(b).
section C n
g(C) consists of the 2d points P,, = (0, q, 0 , l ) and
-m)
if 1 -
Qq = (q,0,0,1) (qd = if 1 = I , and of the d points P,, only
-1. The intersection multiplicity of C n g(C) a t each P,, or Qq
is computed as before and is equal to 2. Therefore (C . g(C)) = 4d or 2d, by substituting z, . . . by xdI2,.. . . The curve C' has the d singular points
according as 1 = I or -1. P,, = ( l , q J0,O) where 9d/2 = (- 1 & -)/a. To find the normalization
of C', observe that xy is a square on C' as the first equation of (3.7) shows.
Case (c). In thiscase, w e h a v e & ) # 1, 1 = -1 a n d c 3 = 1. Suggested by this, we consider the following curve 6' in P4:
There are d intersection points (1,0,0, c) (cd = -1) in C n g(C), and the
multiplicity a t each of them is equal to 2. Thus we have (C - g(C)) = 2d.
Since el = u ~ ( ~the
) , case (b) and (c) can be summarized as follows: It is easy to check that C is nonsingular and that the projection of P4
to Pn (x, y, z , w , t ) I--+ (2, y, z, w) defines a birational morphism of ?I onto
Z'
C' in case d is even, and an isomorphism of to C in case d is odd. As a
10 AOKI AND SNIODA
nonsingular complete intersection curve in P4, the genus of 6 is given by Therefore, with the same notation as in the proof of Theorem 1 (54),
+ +
d . 2 . 2 . (d + 2 + 2 - 5)/2 1 = 2d2 - 2d I . Therefore we have by the we have (write p = PI):
adjunction formula
and
Then we have
This proves part (i) of Theorem 3. Part (ii) is similar (and even simpler),
so it will be omitted. Q.E.D.
For each g E G - Gc, the intersection of the curve C and its transform
g(C) is described as follows:
(1) Assume g &/Ker(a). (1) If g E Ker(r), there are d points (0, y, z,O),
yn + a z d = 0, with intersection multiplicity 4. (2) If g E Ker(adr),
there are d points (x, 0, z, 0), xd +
-zd = 0, with multiplicity 4.
(3) If g E ~ e r ( o or
~ )g E Ker(a2), then there are m points (x, y, O,O),
+
xm ym = 0, with multiplicity 2. (4) If g E G - Ker(ad) U Ker(u2), there
+
are m points (x, y, O,O), xm ym = 0, with multiplicity 4.
(11) Assume g E Kcr(a), g &/Ker(r). Then there are m points (x, y,O, O),
+
xm ym = 0, with multiplicity 2, and m2 points of the form (x, y, z , w),
+
xd yd = 0, ZW f _ 0, with multiplicity 1.
AOKI AND SI-IIODA
are representation spaces for G which are known to give the correct bctti
numbers for V.
Concerning the structure of these cohomology groups one has s basic
conjecture formulated by Tate and proved by him for II' when V has good
reduction. Let C be the completion of W .By colitinuity G a c k on C and
p-ADIC ETALE COIIOMOLOGY 15
14 S. BLOCH
inducing a G-stable filtration Case 3. Let Bn C R;Lr be the sheaf of locally exact Kahler n-forms on
Y. Y is said to be ordinary if Hq(Y, Bn) = (0) for all q and n. Roughly
(0.4) FoI3;,(V, Z,), grnfIq = FnHq/~ ~ " 1 1 ~ . speaking, "ordinary" is equivalent to "IIodge-Witt" plus equality of Hodge
and Newton polygons. For a more careful discussion see $2 below. Let
Since the pcoho~~~ological
dimension of Y is equal to d i m P [l L] one has
' ) { X E H!,JS(P/W) I / X = p i X ) . Note G acts on this
H R ~ ~ ( Y / W ) (=
group via the action of G a l ( x / k ) on IZ~,,,(r/W).
To analyse the spectral sequence (0.3) it is necessary to understand the 0 = (2, -2) = {x, y) + {y, 2) = {2,1 - 1)
local structure of the sheaves ML. Our results in this area rest on Kato's
calculation of the Galois cohomology of quotient fields of henselian discrete for local sections x, y, z such that 1 - z is invertible.
valuation rings [17] and Gabber's theorem [lo]. We need Y to be smooth Define UmLA for m 2 1 to be the image
but make no hypothesis about Y ordinary. Note that the geometric stalks
of ML are the ktale cohomology groups
For simplicity I will restrict attention to the case n = 1. Define {n, 21,. . . , x,-l) H dlog(zl) A - . A d l o g ( ~ ~ E- ~R;);tog
(1.4)(iii) (1 + nmy, X I , . . . ,xr-l) +-+ pdlog(~1A
) . - .A dlog(zr-l)
B' = Im(d : n;-I -+ n;)
(1.4)(iv) (1 + xmy, XI,. . . , x,-1) t-+ dB A dlog(z1) A . . A dlog(~,-1) E Br
{x, 1 + r m y , x l , . . . ,z,-1)
C: 2' 4 Qr Cartier operator [5] djj A dlog(T1) A . A dlog(;z',-z) E Br-'
(1.4)(v) (1+ xmy, . . ,~ ~ - 1 )jjdlog(~1)A . A d l o g ( ~ , - ~ )
XI,. +-+
(iv) UmL; = (0) for m > el. T h e o r e m (1.7). There exists a unique homomorphism of sheaves
ML -, (R>ls)/p"(R>l,) given by { X I , . . . ,2,) H dlog(x1) A . . . A dlog(x,)
Remark (1.5). The deItham-Witt complex (141 is a complex of pro- for xi local sections of 0; and trivial on symbols with one element constant
sheaves W'RQ = {WnRQ)n21. The pro-object WebZglOgis defined to be (ie. in K).
the kernel of 1-frobenius on W'Rq. Multiplication by pn induces exact
sequences (op.cit.) Applied to the spectral sequence (0.3), this result gives a map
T
The maps in (1.4) can be explicated in terms of symbols as follows (here T h e o r e m (1.8). Let M- S be a smooth morphism of schemes,
xi E 02, y E Ox, F = ~ ( m o d x ) ) : s E S , 9 a bounded below complex of torsion e'tale sheaves on a retrocom-
pact open set V C S. Write
20 S. BLOCH p-ADIC ETALE COHOMOLOGY 21
where j: x-'(V)+ M . Let g t a r ( K ) be th,e Zariski sheaf o n M , associated is zero and then one argues using the structure theory (1.4). An analysis of
to the presheaf the structure of IIm(H, uK) in the non-ordinary case is the key to further
U H d ( u i t ,K lu). progress in this area.
For z E M,, let i, : Sp(lc(x)) 4 M , and write L; for the Zariski sheaf
$2 Ordinary Varieties
Recall the variety Y is ordinary if Hm(Y, Bn) = (0) for all m and n.
T h e n the Cousin complex gives a flasque resolution
The following proposition overlaps with results of Illusie, and we would like
(1.8.1) O+zd( K )
-zar 5 Lo-+L1-+L~-+... to acknowledge considerable inspiration from conversations with him. The
formulation here is due to Kato.
In our situation, if A is the henselization of the local ring of X a t a point
y of Y and E is the quotient field of the henselization of X a t the generic Proposition (2.1). The following conditions (1)-(5) are equivalent.
point of Y, then the injectivity of 6 in (1.8.1) implies (1) Y i s ordinary.
(2) H4(Y, Rk, BFPK HYT, 0%) is a n isomorphism for a n y q, 7 .
(3) IZq(Y,WnRjqlOg) @z/pnzWn(E) H q ( Y , WnR') is a n isomorphism
In this way, the stalks of M L a t points other than the generic point can be for any q, r , n.
controlled. (4) Hq(Y, WRb, ,og) @ z pW(K) -+ Hq(P,WR') is a n isomorphism
Concerning the proof of (0.6), we have by (1.6) for any q, r.
( 5 ) T h e frobenius f : HQ(Y,WR;)-+ Hq(Y, WR;) is a n isomorphism
for any q, r.
Moreover, Y is ordinary and H&,(Y/W) is torsion-free for all q i f and
adjoining
-
ap-" to A "kills" symbols {x, z l , . . . ,zr-1)) so writing only i f the following holds:
M: = {M:)n21 for the pro-system associated to the limiting situation
(6) For any q, the Newton polygon defined by the slopes of frobenius o n
x / S , we get an exact sequence (defining UM:) of pro-systems of ktale H&y,(Y/W) coincides with the Hodge polygon defined by the h q - ' ( ~ , R k l k ) .
shkaves on-P,
If IZq(Y, WR') is torsion-free for a n y q , r , the conditions (1)-(6) are also
o -+ uM: -+ M: -+ w.fl$,,og -+ o. equivalent t o
The key result is then ( 7 ) For any q, the slopes of frobenius o n H&ys(Y/W) are all integers.
Proposition (1.9). Assume P is ordinary. Then By way of example, it is possible to show that ordinary hypersurfaces
of degree d fill out a non-empty open set in the moduli space. This result
( U ) =( 0 all m, r. is due to Deligne (unpublished). As a second example, a general polarized
abelian variety is ordinary. (Gabber points out that no one has published a
proof of connectedness for the moduli space of polarized abelian varieties.
One reduces to showing the map Failing this, the above assertion is to some extent conjectural.)
R m ( Y , u'M;) 4 Hm(p,
u r l )
S. BLOCH
$3 Open Questions and Conjectures where = Image (B,',,p,c,(M - cusps) -+ H1(M)). Let 7'1 be the Hecke
operator corresponding t o 1. Let g be a modular form of weight k 2 for +
Here are some directions one might try t o explore. I' which is an eigenfunction for Hecke and suppose T1(g) = alg for 1 / N
with p / a p . Let w k ( g ) = 0, Ker(Tl - a[) C wk.
The idea is t h a t w k ( g )
-
p-adic functions on moduli: It follows from (0.6) (i) that when Y is should correspond to a submotive of
ordinary, the G-module gr;IIQ(V, Q p ) is independent of the lifting X of
Y.The extensions k times
H k + l ( E x M ...xME)= Hk+'(Ek)
We obtain
References
Kato, K., A generalization of local class field theory by using K - The Mordell-Weil Group
groups. I, 11, J. Fac. Sci. Univ. Tokyo, 26, 303-376 (1979), and 27,
603-683 (1980). of Curves of Genus 2
Koblitz, N., p a d i c variation of the zeta function over fanlilies of
varieties defined over finite fields, Comp. Math. 31, 119-218 (1975).
Milnor, J., Algebraic K-theory and quadratic forms, Inv. Math. 9, J. W. S. Casselv
318-344 (1970).
Oort,F., Lenstra, H., Simple abelian varieties having prescribed
To Z.R. Shafarevich
formal isogeny type, J. Pure Appl. Algebra 4, 47-53 (1974).
Serre, J-P., and Tate, J., Good reduction of abelian varieties, Ann.
Math. 88, 492-517 (1968). 0.0 Introduction.
Steenbrink, J., Limits of Hodge structures, Inv. Math. 31, 229-257
(1975). In 1922 Mordell [2] proved Poincark's conjecture that the group of ra-
Stienstra, J., The formal completion of the second chow group; a tional points 011 an abelian variety of di~nension1 (= elliptic curve with
K-theoretic approach, in Proceedings of the Rennes Conference in rational ~ o i n t )is finitely generated. His proof was somewhat indirect. In
Algebraic Geometry. 1928 Weil [5] in his thesis generalized Mordell's result t o abelian varieties
Tate, J., pdivisible groups, in Proceedings of a Conference on local of any dimension and to any algebraic number field as ground field. At
fields, Springer-Verlag (1967). the same time, Weil [6] gave a very simple and elegant proof of Mordell's
original result. I observed some time ago t h a t Wcil's simple proof admits a
further simplification. He uses the explicit form of the duplication and ad-
dition theorems on the abelian variety, but these can be avoided by, roughly
Received July 19, 1982
speaking, the observation t h a t every element of a cubic field-extension k(0)
Supported in part by N.S.F.
+ +
of a field k can be put in the shape (a0 b)/(cO d) (a, b, c, d E k). This ad-
ditional simplification has little interest in itself, but suggested that similar
Professor Spencer Bloch
ideas might be usefully exploited in investigating the Mordell-Weil group
Department of Mathematics
of (the Jacobians of) curves of genus greater than 1. This is done here for
University of Chicago
curves of genus 2.
Chicago, Illinois 60637
The literature dealing with individual curves of genus 1 is vast. Very
little, if anything, has been done with curves of higher genus. In fact the
machinery which has been developed, while admirable for deep and general
inveutigatiorls, appears (pace Mumford [3,4]) little suited to the explicit
treatment of explicitly given curves. I cannot cven find in the literature
an explicit set of equations for thc Jacobian of a curve of genus 2 together
with explicit exprcssioris for the group operation in a forrn amenable t o
calcula.tion: nor do I give one, since it is not needed for my present purposes.
Although I do not give any nu~nericalexamples, I hope and believe that the
theorems proved here will make it practicable to compute the Mordell-Weil
group of explicitly given curves of genus 2.
THE MORDELL-WEIL GROUP 20
28 J. W. S . CASSELS
where F(X) E k[X] and k is the ground field. When P has degree 5 or
0.1 It is useful to recall the main steps of Weil's simple proof of Mordell's
6, the genus of C is 2. Lines X = const. meet C in effective divisors of
theorem in a shape suited to later generalization. The ground field is k,
degree 2. They are all linearly equivalent, and their equivalence class will
and an abelian variety of dimension 1 is given by
be denoted by 0.The Mordell-Wcil group $ is defined to be the group of
divisor classes of degree 0 defined over k . By the Riemann-Roch theorem,
every element of $ contains a divisor a - @, where a is effective, defined
where over k, of degree 2. The divisor a is uniquely determined by the element of
9 , except that the whole divisor class QD corresponds to the zero element.
We shall show that there is a map p from 9 to an abeliari group M giving
does not llwe multiple factors. There is a single point 0 a t infinity. Every a situatiori very similar to t h a t used by Weil [6].
divisor class of degree 0 defined over k contains a uniquely defined divisor In a sense, the case when F ( X ) is of degree 5 is very special. A bilinear
8 - 0, where R is a point on C defined over k. The group 9 of divisor classes transformation of X and the appropriate change of Y takes a curve (0.2.1)
of degree 0 defined over k (the Mordell-Weil group) is thus also the group with F of degree 5 into one with F of degree 6. IIowever, the case of degree
of points on C defined over k with the induced law of conlposition [C is its 5 is somewhat simpler, so we treat it separately first.
own Jacobian]. The case when F ( X ) has degree 4 also fits into the picture, a t least when
Weil's proof that 9 is finitely generated when k is an algebraic number the coefficient of X4 is 1. Then C is of genus 1 and is birationally equivalent
field falls into three parts: to a C with F of degree 3. We shall, however, give this case separately, as
it motivates some features of the degree 6 case and because of the unified
(I) IIe constructs an abelian group M (depending on C and k) and picture presented by degrees 3, 4, 5, 6 together.
a honlo~riorphism
0.3 It is convenient to introduce here notation which will be common to
all four cases. There follows a synopsis which will bring out the common
pattern, although detailed enunciations will have to await the relevant later
whose kernel is precisely 2 9 . This step is valid for any field k of charac-
sections.
teristic 0.
The ground field is any field k of characteristic 0. Its algebraic closure
(It) If k is an algebraic number field, the image CL(G) is a finite group. is % We are concerned with curves
Hence 5 / 2 5 is finite. ("Weak Mordell- Weil Theorem.")
(111) Finally, the proof that 9 is finitely generated uses an "infinite
descent."
where
In this paper we are concerned with the generalization of (I), and so our
ground field is any field k of characteristic 0. Given the definition of p,
the generalization of (11) is immediate: we do not prove it because it can has no multiple factors. We write
be confidently left to the reader. Finally, the generalization of (111) in a
shape amenable to practical calculation might well be difficult: I have not
yet attempted it. where O is the image of the indeterminate T in the quotient ring, so
= (k[Ol*/k*{k101*f2 ( N even). This is (effectively) the case treated by Weil [6]. For a point a on
C : Y2 = F(X) defined over k we define
More precisely, the image will be in the kernel M of the map
given by the components except, possibly, the K1-components. Since the values of
p are in M,they are determined by the Ki components ( j # I), and so
(1.1.2) (1.1.5) holds also for the K1 components. There remains the case when,
say, d = e = 0 and a # b. Then a, b, t are the intersection of C with Y = 0,
is a homomorphism. and (1.1.5) follows by direct calculation. [Alternative arguments for these
special cases are in 51.3.1
Proof. If a = (a, d), then -EL = (a, -d), so clearly
1.2 T h e o r e m 1.2. The kernel of p: 5 -+ M is precisely 2 5 .
(1.1.3) P(-a) = P(a).
Proof. Since M C L is of exponent 2, the kernel certainly contains 25.
Hence it is enough to show t h a t We have therefore to show that p(a) = 1 implies EL = 26 for some b E 5.
Since 0 = 20, we need consider only
(1.1.5)
where a , b, c, d, el f E k .
(using F(0) = 0), where rz, r l , ro are linear forms in 31, so. We may
The condition (1.1.4) is equivalent to the statement t h a t a,b, t are the
therefore find s l , so not both 0 such that
intersections of C with a line:
on comparing coefficients of T 3 . IIence, on changing the signs of T I , TO if Comparison with (1.0.5) shows that the new definition of p(a) coincides with
necessary, the line the old.
We now show how to prove Thcorcm 1.1 using the new defiriition of p.
This time we show directly that the K1-components satisfy (1.1.5). Let
passes through = (a, cl). The residual intersection is 2b, where b = (so, t ) + +
a = (a, 0) be as above, let b # and let t satisfy n b t = 0. Let X be
and t is given by (1.2.9). Hence E 2 5 , as required. a generic point, and work with the ground field k ( ~ ) .As in the proof of
Theorem 1.1, there are 1 1 ( ~ )lo(x)
, E k ( ~ such
) that the line
1.3 The previous treatment of the case when a - O is not invertible was
somewhat ad hoe. We now sketch a rather more highbrow treatment which
has the advantage that it carries over automatically to the cases we shall meets C in x arid b. Let the third intersection be y. Then, as before,
consider later.
Let n = (a,0) and choose the notation in (0.3.4) so that the projection
of k[O] onto the field K 1 is given by O + a. Let ;Y be a generic point of C
over k and choose t to be a local uniforrnizer for X at 8. Then X(X)- a has Let x specialize t o a, so E, specializes to t. The left-hand side has a double zero
a double zero a t x -+ a; and so and so, if we divide by t 2 before specializing, where t is a local uniformizer,
we have
M(a)(b - a)(c - a ) E ( I C * ) ~ .
where M(x) specializes to a finite non-zero M(n) E k*. Replacement of t This is just the assertion of the K1-component of (1.1.5)) as required.
by another local unifor~nizer~nultipliesM(R) by an elcmcnt of We
show that p(n) can be defined by replacing the K1- component 0 of a - O
by M(n) before taking the image in L.
In fact, Y(x) has a simple zero a t 8, and so 2.0 Genus 1, Degree = 4.
Here we consider
where N(B) E k*. We have
(1.3.3) F ( X ) = (X - a)Fo(X),
There is a double point a t infinity. We denote the two branches by O+, 0-,
where where
Here the definition of L is different from that in $1. For a E 9 we define Here one of the four intersections is always a t o+. We suppose, firsl, lhat
given b y
is 20-, so 0 + b = 20- in 9, and hence
(2.1.2)
Proof. Let A = (a, d). The function X - a has zeros a and a' = (a, 4)
and poles O+, ti-. Hence a' + a = 0- in 9. Clearly p ( d ) = p(a), so and so p(a) depends only on n niodulo &.
We now abandon the assumption (2.1.6). Let a=(a, d) and
b = (b, e) # (a, -d). Then we can choose b l J bo so that (2.1.4) meets C in
fi and b [a double intersection a t a if b = a]. Let t = (c, f ) be the residual
Now consider the intersections of intersection. Then
a t least if def
$1.3. Since
# 0.
A
(2.2.7)
Then we get s l
-?Q, 8 1 . We may choose
t3 = 0.
loss of generality,
(2.2.11) Y * ( X 2 + t l x + to)
for some I E k* and some c3, c2, C I , CQ E k. ,We cannot have c j = c2 = 0,
since the defining relation F ( O ) = 0 of Q is of degree 4. for one choice of sign. There is a double zero a t (a0, r ) for appropriate
Suppose, first, that c3 = 0. Then without loss of generality c2 = 1 and r E k. The remaining zeros and poles are at O+, O-. Hence the co~~clusion
of the theorem holds.
A curve
for orle choice of sign. The only points other than occurring as zeros or
poles of the left-hand side of (2.2.4) arc 0+ and 0-. Hence the conclusion
of the theorem holds.
where F has degree 5 or 6, is of genus 2. Any line X = constant cuts C
We are left with
in a divisor which is linearly equivalent to the divisor cut out by the line
a t infinity. This class of divisors of degree 2 defined over k will be denoted
by @.
For 31,30 E k we have The differentials of the first kind are
40 J. W.S. CASSELS THIS MOILDELL- WEIL GROUP
Hence the divisors in @ are the only divisors which can be the zeros of Then
such a differential. It follows from the Riemann-Roch theorem that every Y ; = ( 7 2 1 x 1 + 722)6J'((711X1 + r12)/(721X1 + 7 2 2 ) )
divisor class of degree 2 defined over k, other than 8, contains precisely one = Fl(X1) (say)
effective divisor, and it is defined over k. An effective divisor a of degree E k[Xl].
2 defined over k is an unordered pair of points {a1, a2) such that either (i)
81 and R2 are each defined over k or (ii) al, a2 are defined over a quadratic
If F is of degree 5, then in general Fl is of degree 6. Coriverscly, If F is
extension of k and conjugate over k. Either a1 or 82 may be a t infinity, and I of degree 6 and has a zero in k, then we can use such a transformation to
a2 = R1 is permitted. obtain r of degree 5.
The Mordell-Wcil group 5 is, by definition, the group of divisor classes
3.2 If a, b, E k with a # b are zeros of F ( X ) ,then the divisor {(a, 0), (b, 0))
of degree 0 defined over k. If a E 5, a # 0, we represent it by the unique
effective divisor g in a + 8. The set of such together with 8 thus give is of order 2 in the Mordell-Weil group since
an isomorphic image of 9. Here
and both the divisors on the right-hand side are in 8. If E' is of degree 5,
where a,
@, (a are effective of degree 2, means that
necessarily effective) divisor in the class 8. If
a+@- -- is a (not
we may take the point 0 at infinity instead of one of (a, 0), (b, 0).
If F splits completely over k (or if we work over the algebraic closure q,
we thus get
then, clearly,
divisors of order 2. We have therefore obtained them all.
(3.0.5) -g = {(a, -4, (b, -f 1). I
3.1 The curve (3.0.1) may be transformed birationally into other curves
of the same shape but with in general different E'. Let
where, after multiplying X, Y by elements of k* if necessary, we have
I
Put
There is a single point a t infinity. The poles of X, Y are 20 and 50,
x = (711x1 + ~12)/(721X1+ 722) respectively. We retain the notation of 50.3. In particular,
and
YI = ( +
T ~ ~ xT~ ~ ~ ) ~ Y .
42 J. W. S. CASSELS THE MORDELL-WEII, GROUP
(4.1.2) a -+ ~ ( a )
Adjoin two independent generic points X i , X2 over k1. We extend the non- iu a group homomorphism.
trivial automorphism a (say) of k l / k to k l ( ~ 1 , ~ 2
by) u X ~ = X2. Let t l
be a local uniforrnizer for xl a t = (al,O), so a t l = t 2 (say) is a local
Proof. We note, first, that the definition of p gives p ( a ) = 1 for all
uniforrnizer for X2 a t all1 = 8 2 .
Now
a @.
The automomorphism Y -+ -Y of k(X, Y) over k(X) clearly maps a
into - a . Hence
where Nj(at, a2) E k;. Clearly uNl(al, 82) = N,(n,, a,), and so
(4.1.3)
Now let
This provides us with the missing K1-component. Clearly choice of another
local uniformizer instead of t l multiplies (4.0.6) by a11 element of (K;)2,
(4.1.4) a={(a1,d1)1(a21dz))1 @ = {(blle~),(b2,e2))
and so CL(%)is well defined.
We can give an explicit form of (4.0.6). Let
be defined over k. If, say,
Then
(h2x2 + h l X + h0)2- F ( X )
= -(X - a l ) ( X - a 2 ) ( X - bl)(X - bz)(X - c),
directly from the definition of p. Hence we may suppose t h a t the situation where c E k. Then 6 = (0, (c, f)), where f is given by (4.1.7), and (4.1.13)
exemplified by (4.1.5) does not occur. We may then find h3, h2, h l , ho E k follows agan in the obvious way.
such that There remains the case when a or @ contains O. If o occurs twice, we
argue as for (4.1.5). If 0 occurs only once, we make the obvious modifications
(4.1.7) Y = h 3 x 3 + h 2 x 2+ h l X + hO to the above argument with hg = 0. This concludes the proof.
meets C in a
and @. The total intersection is of degree 6, and so the
4.2 T h e o r e m 4.2. The kernel of
residual intersection (n; (say) is of degree 2. Clearly (a is defined over k, and
(4.1.8) Q[+@+(a=O.
and
for some cl,cz E F, and
(4.1.12) d l d z e ~ e z f l f z# 0, implies a E 25, and will do this first for IS[ in general position.
Put
since both sides have degree 5 4. Hence G(T)is a perfect square, a case
Suppose, first, t h a t we have already eliminated.
There is, however, a pair (I,, M ) = (L1, M I ) (say) for which 12 = 0, since
this is a single condition. Then m3 # 0 and, by homogeneity, we may take
m3 = 1. Similarly, there is a pair (L, M) = (L2, M 2 ) for which m3 = 0
Then and l2 = 1. IIence in general
where we consider both choices of sign together for the moment. Then
The condition that the coefficient of CI4 in (4.2.12) vanishes is a single
homogeneous linear condition on 12,I l l lo, which is rlon-vacuous since (4.2.7)
does not hold. Hence the set of pairs {I,, M ) is a 2-dimensional k-linear
space. On eliminating p between (4.2.5) and (4.2.12), they all satisfy
1 If w = 0, then G(T) is a square, which we have already excluded. Hence
(4.2.21) w E k*.
Here dl # 0 by general position, and so IIitherto we have allowed both choices of sign in (4.2.19). We should have
reached the comclusion of the theorem if (4.2.24) holds for either choice of
(4.2.23) 0 # Jw E k(al, d l ) = k(a2, d2). sign. Hence we are left with the situation when(4.2.30) holds for u = +1
and both v = +I and v = -1. It follows from (4.2.16) and (4.2.30) t h a t
Suppose, first, that
(4.2.24) w E (k*)2.
On substituting u = 1, v = 0, T = a1 in (4.2.18), we get
Then we can choose s E k with 32 = w so that
(i) If (4.2.25) also passes through R2, the intersection of (4.2.24) with C is
9 + 2@, where @ is given by L ( X ) = 0 and (4.2.25). IIence the conclusion a contradiction. This concludes the proof when 9 is in general position.
of the theorem holds. [We note in passing that we should also have reached a contradiction from
(ii) If (4.2.25) does not pass through 82, then it passes through supposing that (4.2.24) holds for both choices of sign in (4.2.19).]
4 = (a2, -d2). This can happen only if 81,82 are not conjugate over k, Now suppose that p ( 9 ) = 1 b u t is not in general position. Suppose
and so are each defined over k. Arguing as in (i), we have t h a t there is a @ such that
and
There is a natural injection of L into L, and we regard L as a subset of
(4.2.29) w = g s ,2 sEk*.
-
L. We denote by p the analogue of p, but for the ground field k-. If 9 is
defined over k, then clearly
Then by (4.2.20) and (4.2.23) we have
is a group homomorphism.
5.0 Genus 2. The Sextic Case.
Proof. The proof is very similar to that of Theorem 4.1 for the quintic
case, so we only briefly indicate how the different definition of L comes in.
Let
There is a double point a t infinity, with two branches O+, 0- (say) defined
ovcr z.
If a6 E (k*)2,then O+, 0- are ixidividually dcfined over k, but in where, for simplicity, a l l a2, bl, b2 are distinct. There are h3, hp, hl, ho E k
any case {o+, 0-1 is defined over k. such that
VJe retain the notation of $8.3. In particular, now
as required.
When d = 0 in (ii) or dld2 = 0 in (iii), the definition is extended precisely The remaining cases are left to the reader.
as in the quintic case [$4.0]. As always, p ( g ) E M.
5.2 The kernel of p contains 2 9 as usual. In general it contains other
5.1 Theorem 5.1. The map elements. Let
where the point is defined over k. Then p ( 8 ) = 1. Two divisors g and k. In particular, none of the t j are at infinity, and the t j are distinct, where
= {b, b) of this type are in the same class of 5 modulo 2 5 since t j = (cj1 fj).
We may choose i l l i o , h 4 , . . . ,ho E k, not all 0, such that
(5.2.2) {a, a) + {It, b) = 2{a, It).
One has curves C for which (5.2.1) is in 2 g and others for which it is not.
Indeed, let as = 1 and let 0 be one of the two points a t infinity defined over meets C in 26; where
k. Then it is easy t o see that {o, O) E 2 5 precisely when
for some Ciearly not both i l l io vanish. The intersection of (5.2.6) with C has degrce
8, and so the residual intersection is B.
Suppose, first, that both points of B = {(dl,gl),(d2,62)) are finite.
Then
and H ( X ) ,G(X) are of degree 3 and 5 2, respectively.
It is possible for there to be no points on C defined over k, and so no
divisors of type (5.2.1). We now describe further elements of the kernel. Let
a be an effective divisor of degrce 3 defined over k. Then by Riemann-Roch for some I E k*. Hence p(B) = 1. If a point of B is a t infinity, the
there is an effective divisor B of degree 2 defined over k such that modification to the proof is obvious.
L e m m a 6.2. The of type (5.2.1) and the B given by (5.2.5) are in with
the kernel. They all lie in the same class of 5 modulo 2.9, which may or
may not be 2 5 itself.
Proof. After what has been said above, all that is needed is to show Put
that B is in the kernel. We may suppose that there are no points on C
defined over k, and so bhat 6 = {tl, f 2 , f a ) where the t i are conjugate over
THE MORDELL- WEIL GROUP
J. W. S. CASSELS
There are
Then for one choice of u with w = nu2 the curve
(5.3.8) +
M ( T ) = m 3 ~ 3m 2 T 2 + m l T + mo E k[T]
passes through R1. Suppose, first, that it passes through 82. Then, by a
such that +
now familiar argument, the total intersection of (5.3.16) is a 2@, where
@ is given by L ( X ) = 0 and (5.3.16). Hence a E 29.
If (5.3.16) does not pass through 82, then it passes through & = (a2, -d2).
This can happen only if k ( a l , d l ) = k(a2, d2) = k, since otherwise, al,a2
since the absence of terms in 05, O4 imposes two linear conditions on arc conjugate over k. The preceding argument shows that
12,11,10. Hence
Now
and so
for some w E k. If w = 0, then G(T) is a square contrary to the general since {a2,8:) E 8 : and again is in the specified kernel.
position hypothesis, so (11) If (5.3.15) does not hold, we must have
We note for later use that the substitution of a1 for T in (5.3.11) gives
On substituting a1 for T , wc have
56 J. W. S. CASSELS THE MORDELLWEIL GROUP 57
for appropriate choice of u. We now go over to the (XI,Yl) coordinates and, for simplicity of nota-
We shall show that we now have the situation described in the proof of tion, drop the suffix "I ." It is clear that the change of coordinates does not
Lemma 5.2. Our first objective is to define a bilinear transformation of alter the definition of the p function. Further, by (5.3.5) and (5.3.25) we
X which, as it will turn out, will ensure that I ( X ) (in the notation of the may now suppose that
proof of the lemma) can be taken to be a constant.
On comparing the coeficients of T6 in (5.3.11) and using (5.3.20)) we
have
After a substitution X -+X + constant, we may eliminate the linear term
(5.3.22) nm32 - I: = w = gnu 2 . in G(X), and so
and, in addition,
for some
There are three linear conditions on the four 1:) so an L'(T) f 0 certainly
exists.
On putting If m i = 1; = 0, we could take L = L', M = MI in (5.3.11) and then
- 1,I contradicts (5.3.12 bis), (5.3.30). Hence, by homogeneity, we may
(5.3.27) y1 = Y / ( ~ X I+ q3, suppose that m&= 1; = 1, and so
the equation of C becomes
(5.3.36) Lt(T) = T 3+ u T 2 + linear E k[T],
(5.3.28) Y: = Fi(X1) E XI], (5.3.37) M1(T) = T 4 + u T 3 + quadratic E k[T],
for some a E k. On squaring (5.3.34) and using (5.3.30) we have
(5.3.38) L'(~)~G(Q
=) M'(Q)~,
for some w' E k, since terms in T8, T7 cancel by (5.3.33)) (5.3.36),(5.3.37). IIere dl # 0 by "general position," so
IIere wt = 0 is eliminated by the usual argument, and so
also passes through an. IIence the total intersection of (5.3.42) with C is and + % is in general position. Hence, by what has already been proved,
d + 2a1 where (a: is the divisor of degree 3 given by L 1 ( X )= 0 and (5.3.42). +
either d + 2x E 2 5 or d 2x is of the type described in Lemma 5.2
Hence lir is in the specified kernel. (but with & as grouKd field).
There remains the possibility +
Arguing as at the end of $4.2, if a 2x E 2 9 then d E 29.
e
Suppose now that d + 2X 2 5 but that theye is a B of the type (5.2.5),
+
where 6 of degree 3 is defined ov& k. Then B g+2x E 2 5 , so B +a E 2 9
-
and the conclusion of the theorem holds.
We shall show that this leads to a contradiction. By (5.3.39) we may suppose
that
In fact if e
+ 2X 25, then such a B always exists. It is enough to
5
show that if there is a dzisor of degree 3 defined over then there is
(5.3.44) M1(al) = vdlJg. a divisor 6 of degree 3 defined over k, and this follows by a specialization
argument. Let b be any element of k and let e E satisfy e2 = F(b). Then
With L, M given by (5.3.7), (5.3.8), we have, by (5.3.9), (5.3.34), that -
k specializes to k under XI + (6, e); X2 + (b, -e). We can take for any
5
specialization of over this.
(5.3.45) {L1(0) + L(O))p = M t ( 0 ) + M(O)
[4] D. Mumford. On the equations defining abelian varieties II. Invent. Number Theoretic Applications
Math. 3 (1967), 75-135.
[5] A. Weil. L'arithmktique sur les eourbes algklriques. Acta Math. of Polynomials with Rational Coefficients
5 2 (1928), 281-315.(= CEuvres Scicntifiques I, 11--45).
[GI A. Weil. Sur un thkort?me de Mordell. Bull. Sci. Math. (11) 54 Defined by Extremality Conditions
(1929), 182- 191. (=CEeuvres Scientifiques I, 47-56).
G. V. Chudnovsky
Received April 23, 1982
To Z.R. Shafarevich
Professor J. W. S. Cassels
Department of Pure Mathematics and Mathematical Statistics Introduction
16 Mill Lane, Cambridge CB2 lSB, England
It is well known that classes of polynomials in one variable defined by
various extremality conditions play an extremely important role in complex
analysis. Among these classes we find orthogonal polynomials (especially
classical orthogonal polynomials expressed as hypergeometric
and polynomials least deviating from zero on a given continuum (Chebicheff
Orthogonal polynomials of the first and second kind appear
as denominators and numerators of the Pad6 approximations to functions of
classical analysis and satisfy familiar three-term linear recurrences. These
poly~iomialswere used repeatedly to study diophantine approximations of
values of functions of classical analysis, especially exponential and logarith-
q [2], [3], [4], [5]. The methods of
mic functions a t rational points x = ~ / [I],
Pad6 approximation in diophantine approximations are quite powerful and
convenient t o use, since they rcplace the problem of rational approxima-
tions to numbers with the approximations of functions. There arc, however,
arithmetic restrictions on rational approximations to functions if they are
to be used for diophantine approximatioris. The main restriction on poly-
nomials here is to have rational integer coeflicients or rational coeficients
with a controllable denominator. Such arithmetic restrictions transform a
typical problem of classical analysis into an unusual mixture of arithmetic
and analytic difficulties. For example, recurrences defining orthogonal poly-
nomials must be of a special typc to guarantee that their solutiorls will
have bounded denominators. In this paper wc consider various classes of
polynomials generat,ed by imposing arithmetic restrictions on classical ap-
proximation theory problems (orthogonal or Chebicheff
62 G . V. CHUDNOVSKY NUMBER TIIEORETIC APPLICATIONS 63
In the first part of the paper we consider polynornials with rational While classes of polynomials with rational integer coefficients studied in
coeficients arising from rational approximations t o logarithmic and al- Chapter I arise from the orthogonality condition and the Pad6 approxima-
gebraic functions. 'These polynomials are defined through linear recur- tion, polynomials studied in Chapter I1 can be characterized .w Chebicheff
rences with integer coefficients, and they have denonlinators growing not polynomials with rational integer coefficients. 'rhey can be characterized
faster than in geometric progression. Together with the ordinary three- as polynomials with rational integer coefficients least deviating from zero
term linear recurrences, we consider more general matrix linear recurrences on a continuum E. Number-theoretic interest in these polynomials was
that are more convenient for the determination of arithmetic properties of originated in 1936 by the famous Gelfond-Schnirelman approach to the
polynomials generated by them. All the recurrences under consideration prime number theorem, in which deviations from zero of Chebicheff poly-
are representations of the so-called contiguous relations, i.e., linear rela- nomials with rational integer coefficients were directly related to the dis-
tions I~etweensystems of analytic functions having the same monodromy tribution of pri~nes.We refer the reader to a recent book by Ferguson [lo],
properties [2], [6], [7]. The transformatio~~ from one contiguous system of where this approach and related results are presented.
functions to another, described by linear recurrences, is broadly called the We study the original Gelfond-Schnirelman approach in the context of
Backlund transformation because its appearance in completely integrable the "integer" transfinite diameter of a continuum E in the cases E = [O, 11
systems as a canonical transformation [6], [7]. In our case, contiguous rela- and G = [O, l/4], which are important for the prime number theorem.
tions are analyzed in the simplest cases of Fuchsian linear differential equa- We include an exposition of the results of Fekete and the recent results of
tions with regular singularities a t O,1, oo, and several apparent singularities Aparicio on upper and lowcr bounds on the "integer" transfinite diameter.
(generalizations of Gauss hypergeometric functions). Though the one-dimensional version of the Gelfond-Schnirelman
The new recurrences are applied to logarithmic and inverse trigonometric approach is obviously insufficient, we extend the remarks of Trigub and
functions, and lead t o new LLdense"sequences of rational approximations examine the nlultidinie~isionalgeneralization of the Gelfond-Schnirelman
t o logarithms of rational numbers. Using these "dense" sequences of ap- method. Analysis of the simplest cases of polynornials with rational integer
proximations (see Lemma 1.1), we obtain new improved measures of ir- coefficients. least deviating from zero on [0, lId shows that this approach
rationality of particularly important numbers like In 2, n/&, r,fi.These works. Interesting developments here (such as an approximation of $J(x)by
measures are considerably better than those [I] furnished by the ordinary solutions of singular integral equations) coming from statistical mechanics
Pad6 approximation technique. We present here only those applications and the many-body problems are presented.
of the method of Backlund transformations [8] where all the recurrence It is quite remarkable that the a~lalyticmethods employed here are
relations are given explicitly. essentially those of the Petersburg school (Chebicheff, Markov, Bernstein),
Another series of recurrences arises from Thue polynomials introduced with some arithmetic adjustments.
by Thue [9] a s a means of approximation of algebraic numbers a of degrees This work was supported by the U.S. Air Force under Grant AFOSR-
n 2 3. These polynomials P,(x), Q,(x) with rational integer coefficients of 81-0190 and the John Simon Guggenheini Memorial Foundation.
degree r n + [f]are defined by the "Padk-type" relations
References
for a polynomial S ( x ) and arbitrary r 2 1. We pay particular attention t o [l] G . V. Chudnovsky, C. R. Acad. Sci. Paris, v. 288 (1979), A-607-
the cubic a, when the three-term recurrence connecting P,(x), Q,(x) is given A-609; C. It. Acad. Sci. Paris, v. 288 (1979), A-965-A-967.
explicitly in terms of invariants and covariants of the cubic polynomial P(x) [2] G. V. Chudnovsky, in Bifurcation phenomena in mathematical
whose root is a . Applications to measures of diophantine approximations of physics and related topics, D. Reidel, Boston, USA, 1980, 448-510.
cubic irrationalities (including cubic roots m, D > 0 and solutions f (a), [3] K. Alladi, M. Robinson, Lecture Notes in Mathematics, Springer,
1980, v. 751, 1-9.
d < 0 of Weber's modular equation) are presented.
G . V. CIIUDNOVSKY NUMBER TIIEORETIC APPLICATIONS
F. Beukers, Lecture Notes in Mathematics, Springer, 1981, v. 888, 1 - alb. Namely, (Lemma 1.1) the measure of irrationality is
90-99.
G. V. Chudnovsky, Lecce Lectures, Lecture Notes in Physics,
Springer, v. 120, 1980, 103-50. for all c > 0 and rational integers p,q; 191 >
D. V. Chudnovsky, G. V. Chudnovsky, Lett. Math. Phys., 4 (ISSO), "Dense" sequences of rational a.pproxirnations are established using linear
373-80. recurrences generated by contiguous relations. Asymptotics of solutions of
D. V. Chudnovsky, G. V. Chudnovsky, J. Math. Pures Appl., Paris, these recurrences define "density constants1' and measures of irrationality.
France, 61 (1982)) 1-16. We start with examples of Padk approximants and corresponding orthogonal
D. V. Chudnovsky, Festschrift in honor of F. Giirsey, 1982 (in polynomials. We examine Pad6 approximations to binomial functions and
press). powers of logarithmic functions. Here the recurrences are consequences
A. Thue, J. Reine Angew. Math. 135 (1909), 284--305. of contiguous rclations between generalized hypergeometric functions. We
Le Baron 0.Ferguson, Approximation by polynomials with integral describe separately Thue polynomials generating "dense" sequences of
coeficients, Mathematical surveys #17, American Mathematical rational approximations to algebraic numbers. This way we obtain new
Society, Providence, 1980. effective measures of diophantine approximations to cubic irrationalities.
In $4 we study new, rapidly convergent, improved Padk approximations.
Improved Pad6 approximations provide new bounds for measures of ir-
rationality of values of the logarithmic (inverse trigonometric) functions.
Chapter I. Padbtype Approximtions and Measures Particular attention is devoted to two special numbers In 2 and TI&. For
of ~ r r i t i o n a l i tof~ Logarithms and Algebraic Numbers these two numbers we present linear recurrences defining new "denseJ' se-
quences of rational approximations. Better "density constants" for these
We study diophantine approximations to numbers that are values of the sequences considerably improve measures of irrationality for ln2, ?r/d
logarithmic (or inverse trigonometric) or algebraic functions using polyno- given in [2], [3].
mials with rational coeflicients that are denominators and numerators or
rational approximations to corresponding functions. The main tool here
is given by recurrences and linear ordinary differential equations (1.o.d.e.) $1. Proofs of irrationality of the number 0 follow basically the same pattern:
with rational function coefficients satisfied by these polynomials. Linear
one establishes the existence of approximations p/q to 0 such that
recurrences themselves are reformulations of contiguous relations [I]
between systems of functions with the same monodromy group. They were
for
analyztd in the papers of D. V. Chudnovsky and G. V. Chudnovsky, see
e.g., [16]-[19], as Backlund transformations.
and $ ( N ) / N + co for N -+ co. Moreover, if we approximate 0 by a
These contiguous relations between approximants and functions furnish
"dense" sequence of rational numbers, then we obtain a simple method for
US with "denseJJsequences of rational approximations pn/gn to numbers a
finding the measure of irrationality of 0.
that are values of functions (at rational points). These "denseJ' sequences
We state the quite obvious and repeatedly reproduced
of approximations to a allow us to estimate the measure of irrationality
of a. The "dense" sequence in this paper means the following:
Lemma 1.1. Let us assume that there exists a sequence of rational
integers Pn,Q , such that
ai(n) -+ ai when n -+ oo. Definition 2.1. Let fl(x), . . . , fn(x) be functions analytic at
x = 0 and let m l , . . . , m, be non-negative integers (called weights). Then
Suppose the roots of the "limit" characteristic equation polynomials Pl(x), . .. ,P,(x) of degrees a t most m l , .. . ,mn are called Pad6
approzimants to f l(z), . . . ,fn(x) if
For systems of functions with the simplest monodromy (like an Abelian Then for any x # O,1, oo where r,(x) < r$(x), we have
group as a monodroniy group) wc know explicit expressions for the remaind-
er function and the Pad6 approximants [5]. These expressions are given as
simple contour integrals in the complex plane or multiple integrals in the
real domain. Thesc explicit expressions can be used in order to obtain an
asymptotics of the remainder function and Pad6 approximants by means of
the Laplace (steepest descent) method. Alternatively, asymptotics can be
i = 1,.. .,n as M
2 x m . j
obtained from contiguous relations between solutions of 1.o.d.e. satisfied by -+ m. Here 1; = exp(_).
the remainder function. The corresponding 1.o.d.e. and contiguous relations
themselves can be obtained as well from integral representations.
We start with the asymptotics of the remainder function and Padk Following [5] we present the contour integral representation for the
approximants in the Pad6 approximation problem for binomial, logarithmic remainder function and Pad6 approximants in the most general case iii).
and similar hypcrgcometric functions, see [I], [5], and then present explicit Moreover, our expressions include the inhomogeneous case of n 1 func- +
integral representations. :
tions fo(x), f ~ ( x ) ,. . ,fn(x):
for
problem iii) its well. In problerii iii) we put: analyzed. These recurrences for systems of functions satisfying Fuchsian
linear differential equations are called continguous relations, following
Riemann 111. We concentrate here only on the case of logarithmic functioris
as a particular case of the Gauss hypergeometric function.
The recurrences that are consequences of Gauss contiguous relations
between p F l functions can be prcserited in the form:
def
&,(a) = F 3 ( n + l , n + 1 , n + 2;z).
iv) coefficients of Pn(z) are rational integers; and rational numbers. For this we siniply use Lemma 1.1. The "arithmeticJ' and
v) cocficients of Qn(z) are rational numbers with the common denomi- "analytic" asymptotics are givcn by the properties iv)-v) and thc recurrence
nator dividing the least common multiplier of 1,. . . , n, denoted by (2.3), together wit11 Lemma 1.2. \Ne start with the number ln2, which
lcm{l, . . . ,n) (i.e., growing not faster than e('+O(l))n as n -r m). corresponds in the above-menticned scheme to z = -1. Lemma 1.1 on
"dcnse" sequences of approximations immediately gives us, as in [2], [8],
Statements iv), v) do not follow immediately from (2.3), since one may [15]:
-3.660137409...-c
suspect the denominator to grow like n!, because of the division by n 1+ IqIn2-pl > q
in order to find Xn+l. However, previous matrix recurrences together for the rational intcgers p, q, providcd that 191 2 ql(t) for any E > 0.
with (2.3) immediately imply iv), v). Hence one needs all the recurrences Similarly, Padk approximations to the logarithmic function a t points of
simultaneously. the Gaussian field Q ( i ) give us a measure of the diophantine approximation
Comparing (2.3) with the classical recurrence for Legendre polynomials
to ../&[21, [81, [151.
[7], an immediate relation is established:
Similarly, -
for all rational integers p, q, with 191 > 2'. Historically the first explicit
Rn(z) . z-" = Qn(x), x = 1- 2z-', published result on the measure of irrationality of belongs to Dariilov
[Ill. Later, similar bounds with dinerent exponents had been obtained
where g,(x) is a Legendre function of the second kind and, consequently, independently by several rescarchcrs (including Wirsing and Reukcrs), see
[2], [8]. All these results were based on the same system of Pad4 approxima-
tions of the logarithmic function and, hence, on the Hermite technique.
The difference in the exponent is explained by different accuracy in the
Because of the connection with the classical tlioery of orthogonal poly- computation of asymptotics. We prcsent now the recurrence leading to the
nomials one can write exact asymptotics together with explicit solutions to the recurrence: the
exponent 7.309.. . in the nieasurc of irrationality of r / f i is connected with
the following nice three-tcr~nlincar recurrence:
This shows that coefficients of Q,(z) do have denominators, but they are
of the form : m = 1,.. . ,n, which proves the properties of Pn,Q,. The
asymptotics of Pn, Qn follows from (2.3) according to Poincark's lemma. + +
As n -+ oo, x 2 72 1 = 0, where Xn -
xn as n -+ oo.
The properties i)-v) of Padk approximants and the remainder function Then there are two solutions p, and q,, of this recurrence such that
in the Pad6 approximation problem for the logarithmic function enable us
to estimatc the measure of diophantinc approximations of the logarithms of gn E Z for all n
74 G. V. CHUDNOVSKY NUMBER TIIEORETIC APPLICATIONS 75
and diophantine approximations to n / & and ln2. These new measures are
p, - l c m { l , . . . , 2 n ) E Z for alln; remarkable because for the first tirnc they do not make use of the Hermite
technique of 1873 [4].
by the Poincark theorem Ncverthcless, IIermite's technique of Pad6 approximations to powers of
the logaritlirnic function provides a nontrivial bound for the measure of
irrationality of n. For this wc use Pad6 approximiations to the system of
functions in i) of Theorem 2.2 for n = 6, cf. [3]. The bound obtained so
Then far is the following:
IqT
- > lql-18.8899414 ...
The expression for q, (E Z) is rather simple: for rational integers p, q with Iq1 2 q4(6) and any 6 > 0.
Similarly, an expression exists for a "near integer" solution p,: where C,(x) is a polynomial. These polynomials provide a "dense" sequence
of rational approximations p,/q, to a, if one starts with an initial good ra-
tional approxirnatiori x = plq to a, and puts p,/q, = B,(p/q)/A,(p/q). In
particular, if the initial approximation p/q is too good, then the exponent
of the measure of irrationality of a will be less than n, i.e., we obtain an
for o(m) = 1 +4 + + $. This explains why the denominator of pn improvement over the Liouville theorem. It is in this way that A. Thue
divides lcm{l, . . . ,2n). [22] proved his first ineffective theorem on diophantirie approximations to
The asymptotics of p,,, q, give a weak measure of irrationality of TI&: algebraic numbers according to which there are only finitely many rational
approximations lo a with an exponent of irrationality larger than n/2. This
method of Thue ay well as his theorem was subsequently generalized to the
multivariable case by Siegel, Schneider, Gelfond, Dyson, and fi~iallyRoth
[23]. Itoth's theorem states that there are only finitely many approxirna-
for 191 L 93(6) and X = In
In (( 22+- ffii))++1l = -7.3099863. +
tions to a with an exponent of irrationality larger than 2 E for every
6 > 0. However, this theorem as well as the original one of Thue does not
-
Thounh the r>ossibilitiesof l'add approximations
-- to the logarith~nicfunc-
give an cffective bound on the size of these approximations. Only Baker's
tion are co~npletelyexhausted by the bounds above, the improved rational
approximatioris to the logarithmic function give new measures of irrational- theorem gives us a slight effective improvement over tlie Liouville theorem,
ity. In 54 we present new recurrcIices and new bounds of the measure of and since any effcctivci~nprovcmcntin the exponent is important, attention
76 G . V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS 77
to the old Thue method has been revived in the hope that it can give new, We present recurrences and expressions for Thue polynomials associated
l ~,n), = 1, is
powerful results. Typically the case of roots a = ( ~ / b ) ~( m with a in the homogeneous form, when an initial approximation to a is z/y
considered since in this case Thue polynomials are expressed explicitly in (see y = 1 above). The approximants A,(x, y) and Ur(x, y) are homogeneous
terms of Pad4 approxir~lantsX,(z), Y,(z) to the function z", v = m/n, a t polynomials in x, y of degree 37 + 1 with rational coeficiemts and satisfy
z = 1, [24], [25], [26]. The Pad6 approximants X,(z), Y,(z) are polynomials .C,(x, y) for a polynomial C,(x, y). The
odi,(x, y) - B,(x, y) = (x - ay)2ri-1
of degree r and satisfy a simple three-term linear recurrence following three-term linear recurrence
(and the same recurrence is satisfied by Y,(z)), r = 1,.. . . Polynomials - 9(3r + 1 ) ~(x,f Y)~z,-I(x,Y) = 0
Xr(z), Yr(z) are, in fact, expressed as hypergeometric polynomials (Jacobi
has both A,(x, y) and B,(x, y) as its solutions. This enables us to express
We have
A,(x, y) and B,(x, y) in terms of Jacobi polynomials X,(z) from (3.1) for
Y,(z) = z'X,(z-I). v = 113. We put for r 2 0
(3.1) X,(z) = 2F1(-r, -r - v ; 1 - v ;z),
Thue polynomials A,(x) = A,(x, I), B,(x) = B,(x, I), similar to other
classical extrernal polynomials, satisfy Fuchsian 1.o.d.e. of the second order
Using properties of polynomials X,(z), Y,(z), many interesting results in addition to the three-term recurrence (3.1). If we put f(x) = f ( x , I),
on the diophantine approximation to roots a = ( ~ / b ) were ~ l ~ obtained then y(x) = A,(x), B,(x) are two linerly indepeudent solutions of the 1.o.d.e.
(251 [26], (31 (see specially [3] for new results on denominators of X,(z)).
Below we consider another class of algebraic numbers: cubic irrationalities,
following [27] (cf. [28]).
We consider cubic irrationalities a that are solutions of the algebraic It is interesting to note that, though f (x) has three regular singularities,
equation f ( a ,1) = 0 for the binary cubic form there are no accessory parameters in this equation.
For small Ix - ayl, polynomials A,(x, y), U,(x, y) are approxirnants t o a.
It, follows from Poincark's Lemma 1.2 that $ loglA,(x, y)l, $loglB,(x, y)l
are asymptotically logI&1, while $ loglA,(z, y)a - B,(x, y)l is asymptoti-
with rational integers a,b, c, d. With this form we associate an invariant
+ +
D (discriminant): D = -27a2d2 18abcd b2c2 - 4ac3 - 4b3d, and two
+
cally logltll where (1, 6 2 are roots of t2 2C(x, y ) t - 27D f(x, y)2 = 0,
(quadratic and cubic) covariants
It1( < I t 2 [ .According to [3] and (271, the common denominator A, of the
coefficients of the polynomials A,(x, y) and B,(x, y) satisfies
1
- log A, -+
4
-
r 6 .
78 G . V. CHUDNOVSKY NUMBER TI-IEORETIC APPLICATIONS
Also there are additional divisibility roper ties [26], [28]: coemcients of Example.
the polynomial X,(1 - 3 3 / 2 z )arc divisible by 3[3'/21. IIence according to 1) D = 2 : ~ = 1 . 4 2 9 7 0 9 ...; 2 ) D = 3 : ~ = 1 . 6 9 2 6 6 1 ...;
Lemma 1.1 we obtain the following 1271: 3) D = 6 : x = l . 3 2 0 5 5 4 ...; 4) D = 7 : ~ = 1 . 7 2 7 5 0 3 ...;
5) D = l O : x = l . 4 1 3 8 8 6 ...; 6 ) D = l 2 : ~ = 1 . 9 0 7 8 4 0...;
Theorem 3.3. Let x , y be rational integers and X / Y be a n approxima-
7) D = 13 : x = 1.824735.. .; 8 ) D = 15 : x = 1.493153.. .;
tion to a : 1x - ayl = min{lx - B y ] : f ( P , 1) = 0 ) and let D l G ( x ,Y ) = G ,
H ( x , y) = H be as above. For a rational integer M such that 9)
11) D = 19 : x -
0 = 17 : x = 1.198220.. . ; 10) D = 18 : x = 1.907841.. . ;
1.269963.. .; 12) D = 20 : x = 1.194764.. .:
We note that once the bound (3.4) is established for a given cubic I. Let d = -3. Then f - 2 = 0 , so n = 2.429709.. ..
irrationality a , the measure of irrationality with the same exponent is true 11. Let d = -11. Then f 3 - 2 f 2 + 2 f - 2 = O a n d n = 2.326120 ....
for any other irrational number fro111 the cubic field Q ( a ) . Namely for an
111. Let d = -19. Then f - 2f - 2 = 0 and rc = 2.535262.. ..
arbitrary irrational p E Q ( a ) we have
IV. Let d=-43. Then f 3 - 2 f 2 - 2 = O a n d n = 2 . 7 3 8 3 8 7 ....
V. L e t d = - 6 7 . Then f 3 - 2 f 2 - 2 f - 2 = O a n d n = 2 . 8 0 2 3 7 0 ....
VI. Let d = -163. Then f 3 - 6 f 2 + 4 f - 2 = O and n = 2.882945 ....
for an effective constant c , > 0 depending only on c > 0 and the ficld
Q ( a ) . IIence in all examples it is sulficient t o give only an exponcnt for a
singlc generator of the field Q ( a ) . The first series of cxamples we prcscnt 54. Here we present recurrences and their solutions that provide "dense"
gives the measure of diophantine approximations for cubic irrationalities sequences of rational approximations to In 2, x/fi and x with "density con-
from a given pure cubic extension ~(m) for a rational integer D > 0. stants" better than rational approximations given by the Pad6 approximrr-
80 G . V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS 81
tion t o the logarithmic function. Following Lemma 1.2, one can determine Lemma 1.2 by the roots of the quartic polynomial. Numerically one has
the "analytic" and "arithmetic" asyrnptotics of solutions of these recur-
rences. We follow the gcneral scheme of 52 but with matrix and scalar
recurcnces of a more complicated form reflecting a different monodromy
structure.
The new, better measure of irrationality of In 2 is based on a new set and
of contiguous relations, reflecting the presence of apparent singularities. loglPn In 2 - &,I -- -1.77602924 -n
Thcse recurrences are the following:
as n -+ 00.
G(m + 1, n , k; z) = G(m, n, k - 2; z) + (22. - l)G(m, n, k - 1; z ) Hence one has the following measure of irrationality of In 2:
(4.1) + (z2 - z ) ~ ( mn,, k; z);
G(m, n + 1,k; z) = G(m, n, k - 2; z) + G(m, n , k; z)(z - z2).
Solutions to these recurrences are completely determined by the initial The best exponent in the measure of the diophantine approximation to
conditions G(1,1, k; 2). In 2 we can achieve this way requires more complicated recurrences than
These are two kinds of initial conditions t h a t determine the sequences (4.1), correspponding to more apparent singularities. The properties iii), iv)
P, and Q,: hold, but tlic asymptotics of P,, Q, and R, are substituted by the following
ones:
and
((1 - z)2-k - ( - z ) ~ - ~ )for k # 2,
ii) G 2 ( l l1,k; z) = k-2
dzf
G z ( l , I , 2; z) = 0. Then Q, - G2(N1, N2,N3; -1);
logIP, In 2 - &,I - 1.93766649 . n
N1 = [0.88n], N2 = [0.12n], N3 = n. as n -+ co. Hence the measure of the diophantine approximation of In 2 is:
and b(i; n, m) : i = 1,2,3; a, c, el a', c', e' dependent on n and z have the
following form:
For n / & one should take z = -3, and for normalization we put
xn= 12["/.'1 . y[n/41. for all integers p, q with lql > qo.
84 G. V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS 85
These and similar measures of the irrationality of n / f i arise from three- Theorem 2.2 are n+l-term linear recurrences with coefficients that are poly-
term linear recurrences similar to the one satisfied by Y, above. Expressions nomials in N and x. They are deduced from matrix contiguous relations
of P, (or y t n ) )are slightly different but are expressed x
i sums of products for the generalized hypergeometric functions. The asymptotics of Padk
of three binomial coeflicients, unlike the case of Pad6 approximations to approximants and the remainder functions which follow from lemma 1.2 is
logarithms, when they are sums of the products of two binomial coefficients presented in Theorem 2.2. Sirnilar recurrences can, in general, be presented
only PI, PI, 1151. for the generalization of Thuc polynomials from $3, which are defined as
New dense sequences of approxi~nationsare found for n 2 , improving on polynomials PN(x, y) E Z [x, y] such that
the measures of irrationality for r2and T . They have expressions like above
but with the "integer" solution being a sum of a large number of binomial
coefficients. Also the three-term linear recurrence is substituted by multi-
term linear recurrences. We obtained the following measure of irrationality
of n 2 (improving that of Apdry) for ilBl+ i282 < N (with parameters 01,02). These Gelfond-Dyson poly-
nomials, as similar Roth polynomials in d variables, lead to better effective
measures of irratibnality of an algebraic number a, provided one knows
d - 1 initial good approximations to a. In particular cases of algebraic
roots a, Gelfond-Dyson polynomials can be determined explicitly through
so that (nq - p( > (ql-16 for rational integers p, q with Iql 2 ql. differential equations and the recurrences which they satisfy. A special
case of Gelfond-Dyson polynomials is the one considered by Siege1 [24],
which can be reduced to the Padk approximations of theorem 2.2. case ii)
with ml = . . . = m, = N , w l = 0, w2 = v ,...,w, = ( n - l ) and ~
$5. For number theoretical applications, it is important to use simul- a = (1 - x)" with x, v E Q; z # 0. The asymptotics of the remainder
taneous approximations to several numbers and, in particular, linear forms function and Padk approxirnants as N -+ co is given in 2.2, but for diophan-
in 1 , 0 , . . . , en-l for a given number 0. For example, to study diophan- tine applications we need to know the common denominator of Padk ap-
tine approximations to a value f(xo) of a function f(x), one has to apply proximants Pi(x) : i = 1, . . . ,n. This denominator was estimated in 53 for
Pad6 approximations to a system of functions 1, f(x), . . . ,f (x)"-' (see 2.1). n = 2 and Y = 113. We now present the bounds on denominators in the
Theorem 2.2.) cases i) and ii), furnish 11swith analytic tools to examine these general case, cf. [W].
Pad6 approximations. Integral representations (2.1) or (2.2) give us infor- For n = 2 and an arbitrary rational I/ = s / r , (a, r) = 1, the common
mation about the arithmetic properties of coefficients of Pad6 approximants. denominator AN of Pad6 approximants Pl(x), P*(x), in Pad6 approxima-
This way we can study rational approximations (or, in general, approxima- tions to (1 - x)" a t x = 0 with weight N, has the following asymptotics
tions by algcbraic numbers of f i e d degree) t o the numbers (1 - x)" or
In (1 - x) for v E Q(v 4 Z) and x E Q, x # 0. Moreover, since weights
in Padk approximations can vary, various similar diophantine approxima-
tion problems can be studied using Pad6 approximations. One of them is
the p-adic diophantine approximation to algebraic numbers and algebraic
roots, and another is the diophantine approximation by numbers whose as N -r co. Similarly, for n > 2 the asymptotics of the common
denominators have primc factors from a fxed set. Padk approximatiorls denominator of coefficients of Padk approximants can be determined ex-
to a system of functions lead to matrix (and linear) recurrences generaliz- 4
plicitly through sums of values of d log I-( with rational z [27]. However,
ing the ones given for n = 1. Linear recurrences connecting Pad6 ap- a simple upper bound can be given for the common denominator A% of
proxirnants and the remainder function for equal weights m l = =
e . .
coenicients of Pad6 approximants Pl(x), . . . ,P,(n) in the Padk approxima-
m, = N for the Pad6 approximation to functions in cases i), ii), iii) of tion to functions 1, (1 - x)", . . . ,(1 - x)(~-')" a t x = 0. For v = s / r ,
86 G . V. CIIUDNOVSKY NUMBEIL THEORETIC APPLICATIONS 87
(3, r) -I 1 and prime r we have D. V. Chudnovsky, Festschrift in honor of F. Giirsey, 1982 (in
press).
F. Bcukers, Lecturc Notes in Math., Springer, 1981, v. 888, 90-99.
D. V. Chudnovsky, G . V. Chudnovsky, Lett. Math. Phys. 4 (1980),
373.
D. V. Chudnovsky, G. V. Chudnovsky, J. Math. Pures Appl. 61
as N -+ oo. (19821, 1-16.
This lcads to new effective measurcs of irrationality (or measures of D. v.' Chudnovsky, G. V. Chudnovsky, Phys. Lett., 82 A (1981),
approximations by algebraic numbcrs of fixed degree) for algebraic roots 271.
i/rd with D > 0 (see the examples of $3 for r = 3). For example, D. V. Chudnovsky, G. V. Chudnovsky, Phys. I,ctt., 87 A (1981),
estimates of denominators from [27], $6, give uninflated bounds for an 325.
effectivc improvement OF the Liouville thcorem for numbers 0
with a K. Mahler, Math. Ann. 105 (1931), 267-76.
fixed D and r >
ro(D). We mention that for D = 2 one can take ro = 83. K. Mahler, Math. Ann. 168 (1967), 200-27.
A. Thue, J. Reine. Angew. ~ a t h . ~ 1 3(1909),
5 284-305.
W. J. LeVeque, Topics in number theory, Addison-Wesley, v. 2,
1957.
C. L. Siegel, Abh. Prcus. Akad. Wiss. No. 1 (1929).
References C. L. Sicgel, Akad. d. Wiss. in Gijttingen 11, Mat.-Phys. I., 1970,
no. 8, Gott. (1970), 169-95.
A. Baker, Quart. J. Math. Oxford 15 (1964), 375-83.
B. Riemann, Oeuvres Mathematiques, Blanch'ard, Paris, 1968, pp.
G. V. Chudnovsky, Ann. of Math. 117 (1983), No 3.
353-63.
D. K. Faddeev, Studies in number theory, ed. by A. V. Malyshev,
G. V. Chudnovsky, C. R. Acad. Sci. Paris, 288 (1979), A-607-A-609.
N.Y. 1968, 51-56.
G. V. Chudnovsky, C. R. Acad. Sci. Paris, 288 (1979), A-965-A-967.
H. M. Stark, in Applications of computers in algebra, Academic
Ch. IIermitC, C. R. Acad. Sci. l'aris, 77 (1873), 18-24, 74-79, 226.
Press, 1970, 21-35.
G. V. Chudnovsky, Cargesi: lectures in Bifurcation phenomena in
mathematical physics and related topics, D. Reidel, Boston, 1980,
448-510.
G. V. Chudnovsky, J. Math. Pures Appl. 58 (1979), 445-76. Chapter 11. On the Notion of the
G. Szego, Orthogonal polynomials, AMS, Providence, 1938. "Integer" Transfinite Diameter. and the
K. Alladi, M. L. Robinson, Lecture Notes Math., v. 751, Springer, el fond-~chnirelman Method in the Prime Number Theorem
1979, 1-9.
G . A. Baker, Jr., Essentials of Padk approximants, Academic Press,
1975. $1. We remind the readcr of the definition of the transfinite diameter of
C. V. Chudnovsky, Lecture Notes Physics, v. 120, Springer, 1980, a continuum E: defined by Fekete [I]. We put
103-150.
V. Danilov, Math. Zarnetki, 24 (1978), No. 4.
d,(E) =
21,
max J-JI z ~
...,z,EE I.< j
- zj1 2/n(n-1)
the absolute value of a deviation on E of polynomials of growing degree. Hence a . e*(2nN+1) is a non-zero rational integer and a < 62nN, so that
Among all polynomials P,(z) = zn + . + a0 which have the leading ~ ' 7- <~ " ~ or -2nN . log 6
e$(2nN+1) <G(2nN + 1) for an integer N 2 1.
coefficient one, we choose the polynomial with the minimal value of its norm This lower bound for $(x) would lead to the prime number theorem, if as
lPnlE= maxzEEIPn(z)l. The value of this norm is denoted by p", Then Gelfond-Schnireleman had hoped, 6 -+ l / e as n -+ oo. Unfortunately, a s
d(E) = limn,, p,. For an interval of length 1 in the plane, the transfinite Gelfond found much later, 6 > l / e and even 6 > (l+fi)-' # 0.414213.. .
diamter is 114. (more precise results on values of 6 are presented below).
Polynomials least deviating from zero were proposed as an instrument Despite the failure of the straightforward application of the original Gel-
for the elementary proof of the prime number theorem by Gelfond and fond-Schnirelrnan idea, it was soon realized that the same analytic method
Schnirelman in 1936 in their analysis of Chebicheff's proof of the Bertrand could still be used for the proof of thc prime number theorem if one
postulate. An account of this extraordinary attempt can be found in [2] and considered only polynomials with several variables with integer rational
[3]. The modern reader can find an account of the Gelfond-Schnirclman coefficients least deviating from zero on a d-dimensional cube [0, lId.
method for the prime number theorem in the historical section of the book The multidimensional approach has essentially been proposed by
by Le Baron Ferguson [4]. Less readily available, but not less interesting Trigub [5] and Ferguson [4]. Moreover, Trigub [5] has suggested that, per-
are the works of other representatives of the Russian school in constructive haps, an appropriate polynomial P(x1, . . . , xd) can be constructed using an
function theory (see Trigub [5] and Aparicio [ll]). analog of the Roth method by demanding that P(x1,. . . ,xd) have zeros
We repeat briefly the essence of the original simple attempt of Gelfond of different orders at given points inside the cube [0, lId. This avenue
and Schnirelman. The main number theoretical function under considera- has not yet been properly pursued, and instead the examples of polyno-
tion is the Chebicheff function +(x): mials P ( x l , . . . ,xd) considered here are given explicitly, rather than proved
t o exist by the Dirichlet box principle. Results for polynomials arising
from different discriminants and connected with the Selberg integrals are
presented below. Though we do not prove the prime number theorem in
its full generality, our constants seem to be close to the expected 1.
where the summation is extended over primes p sich that pm 5 x. The
prime number theorem itself is equivalent t o the statement +(x) -z for
large x[6]. The function +(x) is a logarithm of a more familiar arithmetic
52. We start with the notion of the integer transfinite diameter of a
function: e*(') = lcm(1,. . . , x ) for an integer x. The idea of Gelfond-
Schnirelman is to estimate +(x) from below by taking an integral of a
continuum E. For a given n >
1 we put
polynomial with rational integer coeficients least deviating from zero on
the interval [0,1]. Namely, let P ( x ) =xy=o pixi be a non-zero polynomial
pn(E) = rnin
f',.(z)EZ[z]
maxl~(x)l'l~,
zEE
of degree n with integer rational coefficients such that
where the nlin is taken over the set of all polyno~nialsPn(x) with integer
rational coeficients and of degree of at most n. Then the quantity
is a lower boaid on 6 ( 6 ) for B = [a,b] given by the Fekete theorem 171, see
Gelfond [8]:
90 G . V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS Q1
P r o p o s i t i o n 2.1. For b > a, b - a < 1 and E = [a,b] we have using different methods, one of which is based on the transformation of
6(E) 5 d w . cyclotomic polynomials [!I], and another of which is based on rational trans-
formations of an interval into itself [lo].
The first bound for 1(E) for E = [O, 1/41 is from polynomials Pm(x)
The precise value of 6(E) is unknown and it essentially depends on the
arithmctic properties of a and b. It is of utmost interest for us to consider with rational integer coefficients, whose zeros form a complete systern of
corij ugate algebraic numbers:
the case of E = [0, I].
The lower bound from Corollary 2.4 is in many cases better than that
given by d(E). One may even conjeclure that S(1C) = 1(E) at least for E where deg(Q) = d. In this way one defines polynomials Qn(x) of degrees 2n
being an interval with rational ends, for example for E = [O, 1) or starting from the initial polynomial Qo(z) = 1 - 52. This sequence of
f3 = [O, 1/41. The construction of a polynomial P,(x) satisfing the condi- polynomials can be used via Lemma 2.3 to get a lower bound for 6(E),
tions of Lernnia 2.3 and such that (aillln- is decreasing can be achieved E = [O, 1/41. For this, one has Lo evaluate the lower bound of 1(E) given
92 G.V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS 93
by a limit of lqn12-" with qn being the leading coeficient of Q n ( x ) . Using Remark 3.1. Let E = where I is an interval. There exists a
an iterative formula for the computations of Q , ( x ) , we obtain a product polynomial P ( x 1 , . . . ,xd) with rational integer coefficients in absolute value
less than 1 in E, if and only if the length of I is less than 4.
expansion [ l o ] for X = limn,,lqn)-2-". This formula, following [ l o ] ,has
the form X = nr=o +
- ~ O0 = 1, dk+' = Ok 6;'. This gives the
0 ~ where
We consider polynomials P ( x 1 , . ..,xd) E Z [ x l , . . . , z d ]having different
following estimate of the constant X :
degrees in different variables in each monomial in P ( x l , . . . ,x d ) . This
approach seems more rewonable since it involves sums of values of the
+function such as S : $(Y) dy, which are typically easier to evaluate than
The constant X-' gives us the lower bound for 1(E) and, hence, for 6 ( E )
$k). - We remind [6] that the prime number theorem is equivalent to
+ ( ~ ) d y x 2 / 2 for large x . The relationship between the values of
with E = [O, 1/41. One may assume that X-' is, in fact, a correct value of the $-function and polynomials least deviating from zero is based on the
b ( E ) for E = [ 0 , 1 / 4 ] . following consequence of the Gelfond-Schnirelman method. Let
The upper bounds on 6 ( E ) that are close to the value X-' are achieved
by considering polynomials of the following form P ( x 1 , . . . ,*d) E XI,. . . ,~ d and
] max
O<Zi<l
XI,.. . ,xd)l < 6.
where a, b are positive and v j are non-zero rational integers. Examples with
J = 0 , 1 , 2 are the following:
where lcm is taken over all d-plets ( i l , . . . , i d ) such that x f ' . . .x y is a
monomial in P ( x l , . . . , x d ) . Formula (3.1) is here the basis for obtain-
ing lower bounds for values of the $-function. Another auxiliary state-
ment deals with the interval [O, 1/41. If P ( x l , . . .,x d ) E Z [ x l , . . . ,x d ] and
max~<z;~l/.ilP(~l,...t~d)l < 1, then <
where Q o ( x ) = 1 - 5 2 ) Q 1 ( x )= 29x2 - l l x + 1. This gives the following up-
< <
per bounds for b ( E ) for E = [O, 1/41; 6 ( E ) 0.2; 6 ( E ) 0.186165509.. .;
where lcm is taken over all d-plets ( j l , . . . ,j d ) such that jk 5 2ik +
6(E) < 0.1840868.. . as compared with the lower bound given by
X-' : S ( E ) 2 0.17701068. Better upper bounds are achieved if J 2 2.
1
for k = 1 , . . . , d and 2 : . . . x y is a monomial in P ( x l , . . . , x d ) . In bounds
(3.1) and (3.2) it is essential that the corresponding i ~ ~ t c g r aare l s non-zero.
Usually it can be achieved by taking an arbitrary polynomial P o ( x l , . . ,xd) .
and taking its powers P F .
$ 3 . The multidimensional generalization of the Gelfond-Schnirelm meth-
The simplest polynomial with integer coefficients least deviating from
od gives the possibility of an alternative analytic proof of the prime num-
zero on I d is suggested by the definition of the transfinite diameter of
ber theorem with the possibility of improving the error term. According
to the method outlined in $1, the main problem is to find polynomials
I : A(x1,. . . ,xd) = ni< j ( x i - xj)'. Here
P ( x l , . . . , x d ) with rational integer coefficients least deviating from zero in
[0, l I d . For one matter, however, this generalization cannot be too much
diITercnt from the one-dimensional case.
94 G. V. CHUDNOVSKY NUMBER THEORETIC APPLICATIONS 95
for an arbitrary N and limd,, vd = length(I)/4(= d(1)). In order to (3.1) as its only arithmetic part. Since we are interested only in the
apply the bound (3.1) for the lower bound of the values of the +-function, asymptotic law of the distribution of primes, instead of estimating the
we express the right-hand side of (3.1) for the polynornial A ( x l , . . . ,xd)N integral
1
So So
. . . 1 P(xl, . . ., xd) dxl.. . dxd for P(xl, . . . ,z d ) E Z [xl, . . . ,xd]
in terms of values of the $-function: least deviating from zero on [0.lId, we consider integrals
with vd -t 4 as d 4 oo. This gives an inflated lower bound for the integral
of the +function: contained in [0, lId.Then the asymptotics of
log 16
attains its maximal value in [-I, lId. Then X I , . . . ,xd are all zeros of the Hence for N 4 oo (sufficiently large with respect to d) we obtain
Jaeobi polynomial Pf")(x) for a = 2p - 1, /3 = 2q - 1. This maximal
value T is:
We take E = [O, 1/41 (see (3.2)), and in this case the polynomial
P ( x l ) . . . , ~ d ) EZ [ X ~ ~ . , . b
JX~] Hence, in this case the value of "Chebicheff's" constant is 0.990358.. . . It
is, however, unexpected that b = 0 would be an optimal choice in view
of $2, where the corresponding polynomial 1 - 4t for E = [O, 1/41 plays
an important role. This phenomenon can be properly understood only by
analyzing integral equations determining the asymptotics of the integral
(critical point of P(xl,. . . ,xd)) as d -+ oo.
We consider, as above, the polynomial
- def (1 + a)2 (1 + b)2 i = 1,.. . ,d and xi E [-I, +I]. As d -+ oo we can apply classical methods
A r ~d~ = d2{log(l + a) . - 4 -
+ log(1 + 6) . - 4 -
of statistical mechanics, and for d -+ oo we can consider a distribution
a2 b2 1 function p(a) of xi on [-I, +I]. Classical methods [l4] suggest that x; fill
(34 - - log a - - log b - -(2 + a + b)2 log(2 + a + 6) the subinterval [-a, a] c [-I, I] and that the density p(a) is characterized
4 4 4
as $ ~ f f(xi) = = ~ ,$: f(a)p(a)da + o(1) for d -+ oo and an arbitrary
integrable function f (a). Hence the equations determining the critical
98 G . V. CAUDNOVSKY NUMBER THEORETIC APPLICATIONS
points of Q(xl, . . . ,xd) turn into a singular integral equation: for X E [-a,a] a n d 0 < a < 1,and
for x E [-a, a]. Following the Laplace method, we have An example above (with b = 0) shows that the direct transfer of the
one-dimensional considerations of $2 to multidimensional integrals of $3
is impossible. We present instead a different general scheme for the im-
provement of the "Chebicheff constantJ' and the proof of the prime number
theorem. This scheme is based on the solution of singular integral equa-
tions similar to (3.8) in terms of hyperelliptic integrals. For this we start
with poly~lomialsQl(x), . . . ,Qk(x) with rational integer coefficients having
all their zeros in the interval E = [O, 1/41. Then the d-dimensional integral
that can be used for improvement of the lower bound of the +-function as
in (3.2) is the following one
asd-+mand N + m .
We return now to the solution of the singular linear integral equation
which is a particular case of the Prandtl equation (see [15]). This equation
is of considerable interest for quantum field theory and plasma physics and
the same equation appears in the determination of the asymptotics of Pad6
approximations (161. The solution is given by the following explicit formula: for ti = xi(1 - xi) E [O, 1/41 and xi E [0, I] and vl 2 0,. . . ,vk 2 0. Let
{al,. . .,a,) be a complete set of (distinct) zeros of Ql(t), . . . , Qk(t). As
1 d -r oo, this system of algebraic equations determining the asymptotics
a t-x of (3.12) turns into a single singular integral equation for the distribution
(3.10)
1
function p(a). Let 0 5 a1 < a 2 < . . . < an 5 114. Then there are n 1 +
P= intervals E, = [a,, b,] such that a0 = 0, bo < a l ; an < a,, bn = 114;
a, < a, < .
6, < a,+l (3 = 1, . . ,n). The singular integral equation takes
the following form
where f (x) = - i{vlln--'--
11-21
+
v-llnl)
11+4
+ co, and where the constant co
is determined from the equation S:u
p ( t ) l n l d t = /(x). We present an
It-4
explicit expression for p(x) in the following form:
An explicit expression is the following one factorial as a sum of the values of the +function. For an arbitrary x 21
we, following Chebicheff, put T(x) = log{[x]!). Then
(-l)n-#+l
~ ( =4 X
~ 2 m ~ O I-(as)(.
x - b,)l
(3.14)
bn(xr + e=O
~ ( - i ) na, - ~ Jt - b
2 * ~ ~ ] Upper and lower bounds (4.2) for $(x) can be deduced simultaneously from
where
the integral representaion, since (r) 2n -1
and (,) both can be written as
13-function integrals. Namely,
54. Si~nultaneouslywith a lower bound for values of the +function, This example shows how an integral of a polynomial with integer rational
the Gelfond-Schnirelman method gives an upper bound for values of the
+function, and hence for the number of primes. For this we again use coefficients over the interval [-I, 11 can be inverted and represented as an
integrals over the polynomials with integer rational .coefficients, but this integral over a unit circle of a rational function with a singularity inside
time taken over a different path (domain) 'of integration. The essence of a circle. Namely, as in $5 1-2, looking on the term-by-term integration of
the method can be illustated by an example of the integral So
1
xn(l - x ) ~ ~ x ,
+
P ( z ) (1 ~ ) " z - ~with
, d(P) 5 2k - n, we get:
which is an equivalent of P. Erdos' method of obtaining upper and lower
bounds for a(x) used widely in all books on number theory (see [18]). In
this estimate we look a t(2) written as
n
k<pln
p divides
dz
From this we derive simultaneously upper and lower bounds for values of
the +function:
+
n . log 4 o(n) <
+(2n); for P ( z ) E Z[z], d(P) < 2k - n. In this formula, as above in $2, it is
+(2n) - $(n) 5 n log 4 + o(n). assumed that the integral S P ( z ) ( l + z)n% is non-zero. Since to bound
this integral we are using t k k h t i c a l points method, we can always insure
Thus we obtain the nonvanishing of the integral by considering ~ ( 2 instead ) ~ of P(z), for
a large integer N. We can instead consider integrals
for large n. Identities like (4.1) form a basis for the original Chebicheff-
Sylvester method [19], and they all follow from the representation of a
102 G. V. CHUDNOVSKY NUMBER TIIEORlZTIC APPLICATIONS
TI p 5 max l(t
t[-2,21
+ 2)"I2~(t)l,
k<p<n
2k-n Also we have
P ( t ) E Z[t], d ( P ) 5 7. A similar consideration exists in the multi-
dimensional situation. This way we achieve
(4.3) n
k<pSn
pd < - 2 <max
ti<2 I P ( t ~ , . . . , t d )H ( 2 + t i ) n / 2 1 ,
i=l
Hence, using the identity (4.4) (see the method of [20]), we obtain
for every 6 > 0 and rational integers a, b with Ibl" > ~ ~ I a 1 ~ ~ ~ , P51 D. V. Chudnovsky, G. V. Chudnovsky, Letters Nuovo Cirnento, 19,
1f ( a / b )- P/Ql > where P and Q are arbitrary rational integers No. 8 (1977), 300-02.
with IQI > QO(e,a, b). Here cl = cl(e) > O and Qo are effective constants. PJ J. Nuttall, ~ifuurcationphenomena in mathematical physics and
related topics, D. Reidel Publishing Company, Doston, 1980, 185-
202.
N. I. Muskhelishvili, Singular integral equations, P. Noordhoff,
References N. V. Croningen, Holland, 1953.
W. J. LeVeque, Topics in number theory, v. 1, Addison-Wesley,
L. V. Ahlfors, Conformal invariants, McGraw-Hill, 1973. 1956.
A. 0. Gelfond, L. G. Schnirelman, Uspekhi Math., Nauk, No. 2, J. J. Sylvester, Messenger. Math. (2)) 21 (1891)) p. 120.
(1936) (Russian). E. Landau, Sitz. Akad. Wissen. Wien, Math-Nat. Klasse, bd. CXVII,
A. 0. Gelfond, Comment "On determining the number of primes Abt. IIa, 1908.
not exceeding the given valuesJ' and "On primesJ' in P . L. Chebicheff, M. Nair, The American Mathematical Monthly, 89, No. 2, (1982),
Complete Works, v. I., Academy, Moscow, 1946, 2nd printing 126-29.
(Russian). M. Nair, J. London Math. Soc. (2), 25 (1982)) 385-91.
Le Baron 0. Ferguson, Approximation by polynomials with integral H. G. Diamond, K. S. McCurley, Lecture Notes Math., v. 899,
coeficients, Mathematical surveys #17, American Mathematical Springer, 1981, 239-53.
Society, Providence, 1980. D. G. Cantor, J. Reine Angew. Math., 316 (1980), 160-207.
R. M. Trigub, Metric questions of the theory of functions and
mappings, No. 2., pp. 267-333, Naukova Dumka Publishing House,
~ i e v 1971
, (Russian). Received August 6, 1082
A. E. Ingham, The distribution of prime numbers, Cambridge Tracts,
#30, cambridge Univ. Press, 1932. Dr. G . V. Chudnovsky
M. Fekete, C. R. Acad. Sci. Paris. 1923, 17. Mathematics Department
A. 0. Gelfond, Uspekhi Math. Nauk. 10 (1955), No. 1, 41-56 Columbia University
(Russian).
New York, New York 10027
E. Aparicio, Revista Matematica Hispano-Americana, 4 Serie, t.
X X X ~ I I NO.
, 6, 259-70 Madrid, 1978.
E. Aparicio, Revista de la Universidad de Santader, Numero 2,
Parte 1 (1979), 289-91.
E. Aparicio, Revista Matematica Hispano-Americana, 4 Serie,
v.XXXVI (1976), 105-24.
A. Selberg, Norsk Mathematisk Tidsskrift, 26 (1944), 71-78.
G. Szego, Orthogonal polynomials, American Mathematical Society
Colloquium Publication #23, Providence, Rhode Island, 1939.
L. HulthCn, Arkiv for Mat. Astron. Fysik, 26A, Hafte 3, No. 11
(1938), 1-106.
Infinite Descent on Elliptic Curves with
Complex Multiplication
John Coates
To I.R. Shafarevic
1. Introduction
properties are presumably typical of all elliptic curves. Namely, we consider 1. Descent Theory over the Base Field F
elliptic curves with complex multiplication, arid a certain non-cyclotomic
Zp-extension of the base field, whose existence is closely associated with We begin by recalling that the TatcSafarevic group LUF of E over F is
the hypothesis of complex multiplication, and for which the elliptic curve defined by the exactness of the sequence
is ordinary (see $2 for the precise definition). Much of the material in this
article is not rccent work and was presented in the Hermann Weyl Lectures
a t the Institute for Advanced Study, Princeton, in 1979, and will eventually
form part of a morc detailed set of notes on these lectures. However, the
where the product on the right is taken over all finite places v of F, and Fv
crucial results of $3, which are due to Bernadette Perrin-Riou [12], were
denotes the completion of F a t v . Now take a to be a non-zero element of
only proven after these lectures took place. Finally, there have been two
0 = Endp(E), which is not an automorphism. Taking Galois cohomology
important recent developments on the problems discussed in this paper.
of the exact sequence of G(F/F)-modules
Firstly, P. Schncider (see [13], [l4], and a manuscript in preparation) has
now established the equality of the analytic height and algebraic height for
arbitrary abelian varieties and Zp-extensions of the base field, for which
the abelian variety is ordinary. Secondly, Mazur and Tate (see their article we obtain the exact sequence
in this volume) have found some striking new descriptions of the analytic
height attached to a n elliptic curve over a Zp-extension, subject always to
the hypothesis that the elliptic curve is ordinary for the Zp-extension.
Notation. Let K be an imaginary quadratic field, 0 the ring of integers
of K , and F a finite extension of K. Throughout this paper, E will denote By dclinition, (IIIF), is a subgroup of (H1(F, E)),. The Selrner group sP)
a n elliptic curve defind over F which has complex multiplication by K in of E over F relative to a is defined to be 'the inverse image of (LUF), in
the scnse that its ring of F-e~ldomorphismsis isomorhic to 0 . As usual, we H'(F, E,), under the map a t the right hand end of (2). Thus we have the
fix an isomorphism of 0 with EndF(E) such that the embedding of K in funda~nentalexact sequence
F induced by the action of EndF(E) on the tangent space to E/F a t the
origin is the given embedding of K in F. Note also that the hypotheses
that EndF(E) is the maximal order of K involves no real loss of generality
si11ce every elliptic curve E over F with Q BzEndF(E) isomorphic to K We now introduce two modified forms of the Selmer group, whose defini-
is isogenous over F to one with this property. If N is an extension field of tions are motivated by the Iwasawa theory of 52. Indeed, in both cases,
F, we write E ( N ) for the group of N-rational points on E. If L / N is a we modify the local conditions in precisely the primes of F which are
Galois extension of fields, we write C(Z,/N) for the Galois group of L over ramified in the Zp-extension introduced in 52. We first specify the condi-
N . Letm denote a fixed algebraic closure of N, and A a discrete module tions that will be imposed on the prime number p for the rest of this paper.
Henceforth, we assume that p satisfies:-
for G ( m / N ) . The Galois cohomology groups of G(W/N) acting on A will
be denoted by H i ( N , A); when A = E(N), we simply write H i ( N , E). For
each 0-module D l we put D, for the kernel on D of a non-zero element H y p o t h e s i s on p. (i) p splits in K into two distinct primes, say
a of 0 . If J! is an integral ideal of K , let Dh = naEh D,. As usual, if ( P ) = pp*; (ii) E has good reduction at every prime o/ F above p;
(iii) p # 2.
D = E(F), we simply write E,, and Eb. Finally, if /3 EK, /3* will denote
its conjugate over Q.
As is well known, we note that (i) and (ii) are equivalent to the assertion
that lii has good ordinary reduction a t each prime above p. Also the
110 J. COATES INFINITE D E S C E N T O N ELLIPTIC CURVES
condition p # 2 is not needed in $1, but it is essential for the later parts
of the paper. Write Kp(= Qp) and Op(= Zp) for the completion of K a t
the first being the natural inclusion, and the second being given by multi-
p and its ring of integers, and put Dp = Kp/Op. Now take a # 0 t o be
plication by r*. We now pass to the limit and define
any element of 0 = EndF(E), which is not an automorphism. The first
modification of the Selrner group is t o neglect t o impose local conditions at
the primes above p, as follows. Define LUb by the exactness of the sequence
Here the inductive limits are taken with respect to the maps induced form
where the product is now taken over all finite primes v of F, which do
not divide P. We then define sP) to be the inverse image of (LUb), in
(a), and the projective limits with respect to the maps induced from (9).
We introduce more notation. If X is a finite set, #(X) will denote its
II1(F, E,) under the map on the right of (2). The second modification of
cardinality. Let A be an 0-module. We write
the Selmer group is to impose more stringent local conditions aL the primes
above p. Indeed, if v is any place of F, we have the local analogue of (2),
namely
We also put T,. (A) = A,.-, where the projective limit is taken with
Writing r,,, for the restriction map from EI'(F, E,) to H1(Fv, E,), it is respect to the maps induced by multiplication by T*.
plain from the definition of the Selmer g o u p that the image of SF) under If v is a place of F, write k, for the residue field of v, and Nv for the
cardinality of k,. When l3 has good reduction at v, we denote the reduction
r,,, is contained in the image of E ( F v ) / a E ( F u )under the map on the left
of (5). Hence the restriction map defines an application of E modulo v by E,/k,. Let $F denote the Grossencharacter of E over F
in the sense of Deuring-Weil. Thus $F is a Grossencharacter of F, which is
s,,: s$)- E(Fv)/aE(Fv). unramified precisely a t the places where E has good reduction, and which
has the property that, for each v where E has good reduction, GF(v) is
We now define RF)by the exactness of the sequence the unique element of O = EndF(E), whose reduction modulo v is the
Frobenius endomorphism of $ relative to k,. If a,b are elements of Kp,
we write a -- b if a = ~ bwhere
, E is a unit in Kg; we use the same notation
for elements of K, viewing them in Kp via the canonical inclusion.
where h, is given by the product of the s,,, over all v dividing P.
These modified forms of the Selmer group are only interesting for suitable Lemma 1. For each prime v of F dividing P, we have
choices of the endomorphism a, which we now explain. For the rest of the
paper, we fix an element T of 0 satisfying -
(i) #(Ev(kV)) 1 - $JF(v)/Nv;
(ii) Reduction modulo v induces an isomorphism from
Proof. The definition of $JF implies easily that If v is any element of Ex.-, we define
By the theory of complex multiplication, p divides $JF(v) because v divides Plainly w,(u, v ) belongs to the group pqn of qn-th roots of unity, and it is
P, and + F ( ~ ) + F ( ~ )=
* Nu. Hence the right-hand side of the displayed well known that the map
~ , to multiplication by a unit in Kp.
expression is equal to 1 - + b . ( ~ ) / Nup
This proves (i). Let E1,,(FV) denote the group of Fv-rational points in the
kernel of reduction modulo v. Since v lies above p, (7) implies that ?ran
is an automorphism of E1,,(FV), and so reduction modulo v induces an given by pn(v)(u) = w,(u, v ) is an isomorphism of G(P/F)-modules. More-
isomorphism >
over, it is easy t o verify that, for all n l, we have
E(F,)I~*"E(F,) =
2v(kv)/~*nR,(kv).
Assertion (ii) of the lemma now follows, since one sees easily that the
pprimary subgroup of Ev(kv)is killed by T * " for all sufficiently large n. We now begin the proof proper of Proposition 2. Noting that the image
Putting a = A*" in (6)) and passing to the projective limit, we conclude on the right hand side of (1) actually lies in the direct sum rather than the
from (ii) of Lemma 1 that we have an exact sequence product, we define Bn by the exactness of the sequence
where v runs over all finite places of F. Simple diagram chasing shows that
there exits a map
P r o p o s i t i o n 2. We have
gn: H ( H 1 ( ~ vEJ ) ) ~ . -+Bn
v IP
such that the sequence
Proof. We first recall the definition of the Weil pairing, suitably nor-
malized for the proof of Proposition 2. If u Ern, let 6 be the unique
element of Ern satisfying n * " = ~ u. Write 7; = F(E,n). There exist is exact. We can now pass to the inductive limiL in the obvious sense. It
rational functions f,,,, g,,, on E, which are defined over 3;) and whose follows that the index of IUF(p) in I&@) is the order of the kernel of g,
divisors are given respectively by, in an obvious notation, where g = lim g, is the induced map between
-+ nvlP
H1(Fv,E)(p) and
B = 1 5 B,. To calculate the order of this kernel, we dualize the map g,.
@ general, if A is a discrete, we write A for the Pontrjagin dual of A, and
f for the dual of a homomorphism between two such groups. By Tate local
where 0 denotes the zero point of E. Since f,,, o ?ran and gz:u
have the duality, the dual of the discrete group H1(FvJE) is E(Fv), and this in turn
ind;ces a duality between (H'(F,, E ) ) ~ and
, E(F~)/X*"E(F,). Moreover,
same divisors, we can multiply f,,, by a non-zero element of 3; so that
a theorem of Cassels ([I], p. 198) shows that the dual of B, is canoni-
cally isomorphic to Further, an analysis of the proof in [I] shows
114 J. COATES INFINITE DESCENT ON ELLIPTIC CURVES 115
that : - (i). When E,.n is identified with Hom(E+ , p q n ) via the map p, It follows from (16) and (19) that there is a natural injection O F <-+ T,.(UF),
above, then 5, is precisely the map h,.. of (6); (ii). By virtue of (Id), the and the following lemma is then clear.
dual of the natural map from Bn to B,+l is the map from SF' to
Lemma 3.
' ~ ) by multiplication by n*. Hence ij = l i p Gn is none other
s ( ~induced
than the map on the right of the exact sequence (12). Thus
where the two groups on the left are regarded as subgroups of SF.Define
eF via the exact sequence
(in general, if m is an integer > 1, p, will denote the group of m-th roots
of unity) gives rise via X , to a corresponding dec~mposit~ior~ of the Galois
116 J. COATES INFINITE DESCENT ON ELLIPTIC CURVES 117
group G(3,lF) = A X I', where x,(A) C pp-1 and xoo(T')C (1 p Zp). + We omit the proof of this lemma, which is based on a slight variant
Thus the fixed field of A is a Zp-extension F, of F, whose Galois group of the criterion of Nkron-Ogg-Safarevic (see [3], or [2] for a more detailed
can be identified with I'. It is obvious that the fixed Geld of I' is the field proof).
3 = F(EP). For siniplicity, we write x (respectively, K ) for the restriction We now begin the study of the arithmetic of E over F,. Recall that
of X, to A (respectively, I'). Ep C E ( F ) if and only if EpwC E(F,). Also, as always, I' = G(F,/F).
By class field theory, there is a unique Zp-extension of the imaginary
quadratic field K , which is unramified outside y. We denote this Zp- L e m m a 6. Assume that Ep C E(F). Then, for all i > 1, we have
extension by K,. The classical theory of complex multiplication shows
immediately that the Geld F, defined in the previous paragraph is in fact
the cornpositum of F and K,. It follows that F,/F is unramified outside
the set of primes above p, and that each time above p is ramified in F,/F.
Indeed, both assertions are plainly true for K,/K, and so they remain Proof. We only sketch a proof of this well-known fact. Putting
true for the translated Zp-extension Fw/F. G, = G(F(E~,)/F), we have
We next give two important technical lemmas. If v is a prime of F,, we
define F,,, to be the union of the completions a t v of all finite extensions -+ H~(G,,E~,).
Hi(I',Epm)= lim
of Q contained in F,. Note that F,,, will not, in general, be complete.
Now Ep, is a cyclic group of order pn with p # 2, and, as G, is a group
L e m m a 4. For each prime v of F, lying above p, the group of automorphisms of Ep,, it follows that G, is also cyclic. A simple direct
calculation then shows that Hi(Gn, EP,) = 0 for all i > 1.
Remark. Lemma 4, together with the Weil pairing (13)) shows also that we obtain the exact sequence
pp- n F Z , , is a finite gronp for all v dividing 9.
~2 is a finitely generated free abelian group for all n 2 0. If m 1 n, Proof. We first remark that it suffices to prove the result when
a simple argument (see [ll],p. 403) proves that vLm/ ~ r , is " torsion free. Ep C E(F). Indeed, if this is not the case, the restriction maps induce
Hence, if m >_ n , Vr," is in fact a direct summand of vLrn , and the assertion isomorphisms
of the proposition follows easily. sl,
7 (s;)A, Skm (S$JA,
We can now repeat the definitions of the Tate-Safarevic and Selmer because the order of A is prime to p, and so, for any A-module A, H ~ ( AA)
,
groups given in $1,simply replacing F in the earlier definitions by the field has no ptorsion for all i 2 1.
F,. In particular, we now write We now assume that Ep C E ( F ) . It follows from Lemma 6, and the
usual restriction-inflation exact sequence, that the restriction map yields
In fact, unlike the situation over the base field F, it is immaterial over F, P3) H'(F, E ~ 7~H ~) ( F , , E ~ ~ ) ~ .
whether we impose local conditions a t the primes above P.
Let V be any place of Fw, and w the restriction of v to F. We plainly have
a commutative diagram
Lemma 8. We have LUFm (P) = UIkm(p), and SF- = S IF,.
L e m m a 11. We have a canonical isomorphism Proof. Note that the second iomorphism is obvious. Also, by an en-
tirely similar reasoning to that given in the proof of Theorem 9, we can
assume that Ep C E(F), so that Ep, is rational over F,. P u t V, =
Hom(X,, Ep,). Hence both Vm and SF, are subgroups of
of F,, let H,,, denote the maximal unrarnified extension of F,,,, and let in f
l,, a simple calculation with commutators shows that
D, = G(H,,,/F,,,). Since E has good reduction a t v, we have
+
where w, = (1 T)P" - 1. On the other hand, global class field theory
(see p. 204 of [lo]). Suppose now that x E V,. By the definition of X,, for gives the following explicit description of G(fln/3,) as a G(3,lF)-module.
all places v of F, not lying above P, the restriction of x to the deconlposition If v is a place of 7, above p, let U,,, denote the local units 1 mod v of
group of v factors through the Galois group D,. It then follows easily from the completion of 3, a t v. Write &, for the group of global units of 7,)
(26) that the image of x in H1(FW,,, E ) is necessarily 0 for all v which do which are r 1 mod v for each place v of 7, above p. Let in be the diagonal
not divide P, whence x E Sjr,.
To prove that SF, C Voo,let n be an integer > 1, and take
embedding of &, in nvlP U,,,. Then Artin's reciprocity law gives the exact
sequence of G(?, IF)-modules
x, E s$,") C HO~(G(F,/F,), E , ~ ) By the definition of the Selmer
group, for each finite place v of F,, the restriction x,,, of x, to the
decomposition group of v is contained in the image of E(F,,,)/anE(Fm,,)
in 11om(G(F,,,/l;bo,,), kn). Suppose now that v does not lie above P.
Since E has good reduction a t v and (v, A) = I , Ern is mapped injectively
under reduction modulo v, whence it follows easily that, if x,,, = where A, denotes the p-primary subgroup of the ideal class group of 3,, and
I',,, modulo snE(FW,,), the extension F,,,(Q,,,)/ Fa,, is unramified, where the bar over in(&,) signifies its closure in the p-adic topology. Taking
where Q,,, satisfies .~r~&,,, = P,,,. In other words, x,,, must factor n = 0 in particular, we conclude that X,/TX, is a finitely generated
through the Galois group D, of the maximal unramified extension of Zp-modulc, whence a standard argument shows that X, is a finitely gener-
FW,,. Hence the global element x, factors through Hom(X,, Ern). Thus ated A-module. This completes the proof of the lemma.
SF,"' C Hom(X,, Ern), and, passing to the direct limit, we obtain
Remark. It follows easily from Theorem 9, Theorem 12, and Lemma 13
SF- C V,. This completes the proof of Theorem 12.
that the Pontrjagin dual of is a finitely generated Zp-module. In
fact, this is well known to be true for arbitrary abelian varieties defined
We can now use global class field theory to study the G(3,lF)-module
over a number field.
X,. Let A = Zp[[T]]be the ring of formal series in an indeterminate T
We write r, for the Zp-rank of in(&,) modulo torsion, so that
with coefficients in Zp. As usual in the theory of Zp-extensions (see [9]), it is
best to interpret the r-action on the compact Zp-module X, as a A-action,
in the following manner. Fix once and for all a topological generator 7, of
,'I and then defne T.x = (-yo- 1)x for x in X,. This extends to an action
of Zp[T] on X, in the obvious fashion, and finally to an action of A by by Dirichlet's theorem. Let 6, = [Y, : K ]- 1 - r,.
continuity.
Lemma 14. (i) X, is A-torsion if and only if 6, is bounded as
Lemma 13. The Galois group X, is a finitely generated A-module. n -+m; (ii) If 6, = 0, then X , is A-torsion.
Proof. The argument being standard (see [9], p. 255), we only give it Proof. Let p, be the Zp-rank of X,/w,X, modulo torsion. It follows
very briefly. Write 3, for the n-th layer of the Zp-extension 3,/?, and easily from the structure theory of finitely generated A-modules that X,
fl, for the maximal abelian p-extension of 3,, which is unranlified outside is A-torsion if and only if p, is bounded as n -+ oo. Assertion (i) is now
P. Since an is also plainly the maximal abclian extension of 3, contained clear, because (27) and (28) show that p, = 6, for all n >
0. It is also
J. COATES INFINITE DESCENT ON ELIJPTIC CURVES 125
124
For example, it follows from Proposition 15 that Conjecture A is valid The remarkable result given in this section (Theorem 17) is due to
if the base field F coincides with the field of complex multiplication K . Bernadette Perrin-Riou [12]. Using quite different methods, P. Schneider
Although we do not give details here, two proofs of Proposition 15 are has subscqucntly established an analogous result for arbitrary abelian vari-
known. Firstly, when F(Ep) is abelian over IC, the p a d i c analogue of eties and for Zp-extensioils of the base field for which the abelian variety
Baker's theorem on linear forms in the padic logarith~nsof algebraic num- is ordinary. We simply state the result, and refer the reader to [12] for a
bers can be used to prove that S, = 0 for all n 2 0. Secondly, when detailed account of the proof.
F(Eto,) is abelian over K, a generalization of the techniques of [4] can also We begin by rapidIy describing the p a d i c analogue of the Nbron-Tate
be used to prove Conjecture A (but not the stronger assertion that 6, = 0 canonical height, which is attached to E and our Zp-extension F,/F. Take
for all n> 0). a generalized Weierstrass model for E' over F
We now derive on important consequence of Conjecture A.
T h e o r e m 16. Conjecture A implies that E(F,) modulo its torsion If v is a finite place of F , we write ord, for the order valuation of v ,
subgroup is a finitely generated abelian group.
normalized so that ord,(F,Z) = Z. We always suppose that E(F,) is
endowed with its natural u-adic topology. It is a well known and elementary
Proof. Let 11 denote E(F,) modulo its torsion subgroup, and put fact, due to Nkron and Tate, that there exists a unique function
B -=I I @ o ~ p Since
. Proposition 7 shows that H is a free abelian group, one
sees easily that, in order to prove that H is finitely generated, it suffices
126 J. COATES INFINITE DESCENT ON ELLIPTIC CURVES 127
which is continuous when Q is endowed with the real topology, and which the subgroup of finite index in E ( F ) consisting of those points which belong
satisfies the following two properties :- (i) the limit as P tends to 0 of to the kernel of reduction modulo v for each place v of F dividing P. We
the expression Xv(P) - o r d , ( t ( ~ ) ) exists for every local parameter t at 0; define
(ii) for all points P # 0, Q # 0 in E(Fv) with P Q # 0, we have hp : El ( F ) -+K p
by setting hp(0) = 0, and otherwise
such that hp(P) = (P,P)p for all P E E ( F ) , and whose behaviour with
where N F v l K denotes the local norm from Fv to Kg; (ii) for all a
P
# 0 in 0 respect to endomorphisms a E 0 is given by
and all P # 0 in E1,,(FV) with aP # 0, we have
w z i ,~ i ) ~
Vol F
P( ~ ( ~ =1 [E(F)
) : ~ 2or] ,
As in the case of the Ndron-Tate height, we c.an combine these local factors
to define a global height height function. Since the theory of complex where n~ denotes the &rank of E ( F ) , and 21,. . . , z,, is any maximal
multiplication shows that F necessarily contains the EIilbert class field of &linearly independent set in E ( F ) . Although we do not go into dctails here,
K, it follows that the norm from F to K of each finite place v of F is it is theoretically quite straightforward to calculate Volp(E(F)) oncc the
necessarily principal, say N F I K v = (7,)) where the generator rv is well Mordell-Weil group itself is known. The calculations made so far support
defined up to multiplication by a root of unity in K . Recall that E 1 ( F )is the conjecture that voly( E ( F ) ) is always non-zero.
128 J. COATES INFINITE DESCENT ON ELLIPTIC CURVES 129
We now explain the connection between this bilinear form ( , )p, and the Granted Theorem 17, it is an easy exercise to establish the following
theory of infinite descent, which was discovered by Bernadette Perrin-Riou result. Recall that n F dentes the 0-rank of E(F) modulo torsion.
(121. Let
iF: E1(F) 0p. c--+BF = 1im +
RF*") L e m m a 18. # 0;
(i) Coker OF is finite if and only if VO~~(E(F))
(ii) If Coker OF is finite, then
denote the natural inclusion in the exact sequence (20). On the other
hand, the composition of the natural inclusion of E ( F ) QPo Dp in Sk with
the injection of Sk into SF, given by restriction (see Theorem 9) gives
Finally, we shall also need the following important result, which is proven
a canonical injection of E ( F ) Dp in SF,. Dualizing this map, we get
in [12].
(cf. (29)) a canonical surjection
T h e o r e m 19 (B. Perrin-Riou [12]). If Conjecture A is valid, then the
map 4F of Theorem 17 i s an isomorphism.
The ideas underlying this section were first exploited by Artin and Tate
which is no longer, in general, surjective. If H is an extension field of F, we [15] in their work on the geometric analogue. It seems fair to say that
write E ( H ) ( p ) for the subgroup of the torsion subgroup of E ( H ) which is they lie a t the heart of the study of the arithmetic of elliptic curves by the
annihilated by all sufficiently large powers of 7 ~ .Define mp by the formula techniques of Iwasawa theory and padic analysis. We go a little further
than previous work, in that we have avoided imposing the hypothesis that
Ill&) is finite.
If G is a p-primary abelian group, Gdiv will denote the maximal divisible
subgroup of G.
where, as always, 3 = F(Eg).
L e m m a 20. (i) N ~ ( p ) d i v= m>(p)div; (ii) #(m',(~)/m&(~)div)=
T h e o r e m 17 (R. Perrirl-Riou 1121). There exists a canonical map #(mF(Y)/~F(~)div).[UI>l(P)
: UF(P)]; (iii) If WF denotes the maximal divis-
ible subgroup of S&, we have an exact sequence
(36) 0 -+ E ( F ) @loDp -+ WF + i v 0.
m ~ ( ~ ) d+
with the following property: if we define the map
Proof. (i) and (ii) follow from the fact that LII>(y)/IIIF(~)is finite by
Proposition 2. Assertion (iii) is obtained by taking divisible subgroups in
the exact sequence (24), and nothing that E ( F ) B0,!Ip is divisible.
by OF = jF o 4.v o iF, then, for all x E E ( F ) and y E EI(F), we have
As before, let A = Zp[[T]]be the ring of formal power series in T with
coeficieritv in Zp. If A is a finitely generated A-torsion A-module, the
130 J. COATES
INFINITE DESCENT ON ELLIPTIC CURVES 131
structure theory shows that there is a A-homomorphism, with finite kernel We now study the behaviour of the characteristic power series of SFoo
and cokcrnel, from A to a A-module of the form a t the point T = 0. Recall that nF denotes the &rank of E ( F ) modulo
torsion. We define tF to be the Zp-rank of the Pontrjagin dual of IIIF(p)di,
(thus I.UF(Y)is finite if and only if tF = 0). P u t
Throughout the rest of the paper, we make the following assumption:- f (T) = TTFg(T).
(40)
Assumption. Conjecture A is valid ( i e . the weak p-adic Leopoldt con- This is clear from the exact sequence (39) and (i) of Lemma 21, because the
jecture holds for 7
, = F(Ep,)). characteristic power series of WF is plainly TTF.
Taking I'-invariants of (39)) we obtain the long exact sequence
We now collect together in a single proposition the consequences of this
assumption, which will be used later.
We can simplify this sequence in two ways. Firstly, recalling that Theorem
Proposition 22. Conjecture A implies that (i) SF, is A-torsion,
9 shows that ( 3 ~is canonically
~ ) ~ isomorphic to S>, and nothing tlie exact
(ii) SF, has no non-zero finite A-submodulea, (iii) the map qbFfiheorem
sequence
17 induces an isomorphism qbF: BF 'i
mi, have the same rank as Zp-modules.
and (iv) I.UF(p)di, and
o-+N~-+s',--+wF-+o,
L e m m a 23. The following three assertions are equivalent:-(i) g(0) # 0, wF/vF.For this reason, we define the subgroup CFof eFby
(ii) Coker aF is finite, and (iii) a~ is injective. When these hold, we have
Proof. By lemma 20, g(0) .# 0 if and only if (Y)r is finite, and (42)
shows that the finiteness of (Y)r is equivalent to the finiteness of Coker a ~ ,
because NF is finite. Also g(0) f 0 if and only if yr is finite. I3ut yr is with Ker 7F = Ker PF. Also, we clearly have a commutative diagram
finite if and only if yr = 0 because Y has no finite non-zero I'-submodule.
The rest of Lemma 23 is clcar from Lemma 20 and the exact sequence (42).
whose restriction on the left to CF is the pairing { , )P? Of course, an This completes the proof.
affirmative answer t o this question would not only show that pp(llIF)is
finite, b u t would also prove that pp(ILIF) = 1.
We can now state our final result. Recall that yo denotes our fixed
topological generator of r. Let u = Xoo(70) be the image of yo under the Concluding Remarks
canonical character giving the action of on Ep,. We then define, for
s # 0 in Zp, 1. When t F = 0, the analogy of Theorem 24 with the refinement of
the Birch-Swinnerton-Dyer conjecture for the complex L-function L(qF, .3)
given by Gross in [8], is evident. As is explained in [8], our hypotheses t h a t
p # 2 and that p splits in K guarantees that the Tamagawa factors a t
the bad primes in the complex conjecture are prime to p, and so do not
specifically appear in the formula given in Theorem 24, which is necessarily
only valid up t o multiplication by a p a d i c unit.
The conjectural link of LI](E/F, 3) with p a d i c Lfunctions (see [4], [5]),
2. Allowing the possibility that tF > 0 (i.e. that ILIF(p) is infinite), it is
which we do not discuss in the present paper, provides the reason for
natural to ask whether tF is independent of the choice of the ordinary prime
introducing the pole term when Ep C E(F).
p. Certainly, a mystical belief in the close analogy between the behaviour
of both the co~nplexand p a d i c L-functions attached to EIF a t the points
T h e o r e m 24. Assume Conjecture A is valid, and that volp(E(F)) # 0. s = 1 suggests that tF should be independent of the ordinary prime p, and
Then (i) LI](E/F,3) has a zero at s = 1 of multiplicity 2 r~ = nF t ~ ; + that a modified form of the complex Birch-Swinnterton-Dyer conjecture
(ii) the multiplicity of this zero is exactly rF if and only if the canonical should hold even when t~ > 0. Some highly important recent work of R.
pairing { , )P given by (48) has a finite kernel on the right, or equivalently Greenberg [7] provides fragmentary theoretical evidence for this belief.
a trivial kernel on the left; (iii) If the multiplicity of this zero is exactly r F , 3. Although we do not go into details here (see [5]), a combination of
then Theorem 24 and the calculation of the values of the complex Lfunctions
L($:, k)(k = I, 2,. . .) can often be used to completely determine the arith-
metic of E relative to small ordinary primes p. We simply mention, without
proof, several examples, which it does not seem possible to treat by classical
methods. The calculations of the complex values L($:, k) used in the proof
of these results were made by N. Stephens.
where mp and pp(UIF) are given by (33) and (49), and T,~F denotes the
&le 2. Take F = K = Q ( & f ) , and E the elliptic curve defined
Grossencharacter of 6 over F .
over K by the equation y2 = x3 - x. It is classical that nK = 0, and one
can now show that m K ( p ) = 0 for p = 5,13,17,29,37,41,53.
Proof. (i) is simply assertion (40). (ii) follows from Lemma 23 and
the fact explained above that Ker CYF= Ker y ~ and , Coker aF is finite
Example 3. Take F = K = Q(-), and E the elliptic curve defined
if and only if Coker 7 p is finite. Finally, to establish (iii), one combines
over K by the equation y2 = x3 - 22. It is classical that nK = 1, and
Proposition 2, Lemma 3, Lemma 18, Lemma 23, a.nd the formula (47), and
one can now show that ILIK(p) = 0 for p = 5,17,29. Moreover, we have
simplifies the resulting expression, noting that
V O ~ ~ ~ ( E ( --
K )1) if p = 5, and VOI~,(E(K))-- p if p = 17,29.
J. COATES INFINITE DESCENT ON ELLIPTIC CURVES
Cassels, J., Arithmetic on curves of genus 1 (VIII), Crelle 217 Professor John Coates
(1965), 180-199. Mathkmatiques, Bat. 425
Coates, J., Arithmetic on elliptic curves with complex multiplica- Universitk de Paris-Sud
tion, Hermann Weyl Lectures, I.A.S. Princeton, 1979, t o appear. 91405 Orsay, Prance
Coates, J., Wiles, A., On the conjecture of Birch and Swinnerton-
Dyer, Invent. Math., 39 (1977), 223-251.
Coates, J., Wiles, A., On padic Lfunctions and elliptic units, J.
Australian Math. Soc., 26 (1978), 1-25.
Coates, J., Goldstein, C., Some remarks on the main conjecture for
elliptic curves with complex multiplication, to appear in American
J. Math.
Greenberg, R., On the structure of certain Galoi. groups, Invent.
Math., 47 (1978), 85-99.
Greenberg, R., On the Conjecture Birch and Swinnerton-Dyer, to
appear in Invent. Math.
Gross, B., On the conjecture of Birch and Swinnerton-Dyer for
elliplic curves with complex multiplication, Progress in Math. 26
(Birkhauser), 219-236.
Iwasawa, K., On Z1-extensions of algebraic number fields, Ann. of
Math., 98 (1973), 246-326.
Mazur, B., Rational points of abelian varieties with values in towers
of number fields, Invent. Math., 18 (1972)) 183-266.
Perrin-Riou, B., Groupe de Selmer d'une courbe elliptique B mul-
tiplication complexe, Compositio Math., 43 (1981), 387-417.
Perrin-Riou, B., Descente infinie et hauteur padique sur les courbes
elliptiques B multiplication complexe, Invent. Math., 70 (1983),
369-398.
Schneider, P., Iwasawa Lfunctions of varieties over algebraic num-
ber fields. A first approach., to appear in Invent. Math.
Schneider, P., padic height pairings I, Invent. Math., 69 (1982),
401-409.
Tate, J., On the conjecture of Birch and Swinnerton-Dyer and
a geometric analogue, Skminaire Bourbaki, No. 306, Fkvrier,
1966.
On the Ubiquity
of "Pathology" in Products
Nicholas M. Katz
Introduction
its DeRham-Witt complex. One knows (c.f. [Ill, 1, 21) t h a t WOjc calculates
the crystalline cohomology of X, i.e., one has a canonical isonlorphism
One also knows that H5,i,(X) is a finitely generated W-module, and that
for every n, the Wn-modles
are finitcly generated. However, there is no reason in general that for given
(i, j ) , the W-module
t h a t for E a supersingular elliptic curve over k, the W-module (i.e., i = 2, is polygonally Hodge-Witt if t h c maximal Newton-Hodgc decomposition
j=O,X=ExE) (c.f. [Ka]) of ( M I F ) (i.c., using all those break-points of the Newton polygon
H 2 ( E X E, WO) which lie o n the Hodge polygon) is a decomposition into F-crystals of the
following types a) or b):
is n o t finitely generated.
Following Illusie-Raynaud and Bloch-Gabber-Kato, we say t h a t a prop-
er smooth k-scheme X is "IIodge-Witt" if, for all (i,j), the W-module a) There is a singlc Hodge slope:
I I i ( X , \YO$) is finitely generated, and we say t h a t X is "ordinary" if
I. Reduction to a Question about Polygons L e m m a 1. Suppse that a n F-crystal ( M ,F) i s ezpressible as a direct
s u m of F-crystals,
Let ( M , F ) be a "a-F-crystal on k" (cf. [Ka]), i.c., M is a free, finitely ( M ,F ) = @(Mi, Fi).
generated W-module, and F is a a-linear endomorphism of M which in-
duces a n autornorphism of M @ Q. We say t h a t ( M , F ) is polygonally T h e n ( M , F ) is polygonally ordinary (respectively, polygonally Hodge- Witt)
ordinary if its Ncwton and Ilodgc polygons coincidc. We say t h a t ( M , F ) i f and only if each s u m m a n d (Mi, I;) is.
"PATHOI.OGYn IN PRODUCTS 143
142 N. M. KATZ
is an equality. The result now follows from the Kunneth formula in both
Hodge and DeRham cohomology, which gives
Therefore xM(S) lies in Endw(M) if and only if the individual nMi(S) do,
and if this is the case, then
The set of Hodge slopes (with ~nultiplicities)of a direct sum is simply the
union of the sets of IIodge slopes of the summands.
144 N. M. KATZ "PATHOLOGY" IN PRODUCTS 145
and the observation that So it suffices to show that ( M I @I M2, Fl @ Fz) is indecomposable for the
Newton-Hodge decomposition, i.e., that its Newton polygon lies strictly
above its IIodge polygon, excepb at the endpoints. (For having a t least
three distinct Hodge slopes, it will not be Hodge-Witt!)
and similarly for Y. Each of the (Mi, Pi),being indeco~nposableand not polygonaly ordinary,
Q.E.D. has its Newton polygon strictly above its Hodge polygon, except a t the
endpoints. Let us say that a non-zero F-crystal satisfis ( N > H) if its
Combining Lemma 2 with the key result of Illusie-Raynaud and Ekedahl, Newton polygon is strictly above its Hodge polygon, except a t the endpoints.
our theorem is a special case of the following theorem about F-crystals, In view of the above discussion, the polygon theorem results from the
F) and (Hrris(Y),F).
applied to (EI,',~,(X), more precise
Polygon T h e o r e m . Given two non-zero F-crystals (MI, Fl) and N> H T h e o r e m . The tensor product of two non-zero F-crystals which
(M2,1;i), their tensor product (MI @ M 2 ,Fl @I F2)is polygonally Hodge- both satisfy ( N > I-I) again satisfies ( N > H ) .
W i t t i f and oriy i f one of the (Mi, Fi) is polygonally ordinary, and the other
is polygonally Hodge- Witt.
111. Interlude: Generalities on Polygons of Hodge and Newton
Type
11. Further Reduction of the Polygon Theorem
We fix as basic data a collection of r 1 1 distinct non-negative real
In view of Lemma 2, we are immediately reduced to the case where both numbers, called "slopes,"
of the F-crystals (Mi,E;) are indecomposable as F-crystals.
If (MI,li;) is polygonally ordinary as well, thcn ( k being algebraically
closed), there is an integer j 1 0 such that
together with strictly positive integers called "multiplicities"
nl,. . .,n,.
The integer
and the assertion of the theorem is obvious in this case.
It remains to show that when both of ( M i ,I;;.) are indecomposable, as ~ = x n ~
F-crystals, and neither is polygonally ordinary, thcn their tensor product
is called the "rank." Equivalent to such data is the corresponding convex
is not Hodge-Witt. Since neither of the (Mi, F;) is polygonally ordinary,
each of them has a t least two distinct Hodge slopes. Therefore their tensor polygon in the (X,Y) plane (illustrated below with r = 3 distinct slopes).
product has a t least three dintinct Hodge slopes (for if (Mi, Fi)admits both For brevity, we will refer indifferently to either the data
a; and b; > ai as Hodge slopes, for i = 1,2, then their tensor product
+ + +
admits a1 a2, b2 az, bl b2 (as well as a1 + b2) as Hodge slopes, and by
assump tion
-+ +
a1 a2 < bl a2 < 61 62. + or to the corresponding convex polygon as "a polygon."
148 N. M. KATZ "PATIIOLOGYn IN PRODUCTS
In terms of t h a t data (A1,. . .,A,; n l , . . . ,n,), we have the explicit for- We summarize this in a diagram:
mulas
Dp(t) = C n i min(t, Xi)
i
BOp(t) = C ni max(0, t - Xi).
I
are strictly positive in the open interval ]a;, b;[, and zero elsewhere.
For i = 1,2, let ai be the smallest Newton slope of (Mi, Fi). Then
Given non-zero F-crystals ( M I , F1)and (M2, Fz), let us denote by IIPl because (Mi, Fi) satisfies ( N > H). Therefore each of the functions (for
and 11P2 their respective Hodge po ygons, and by NPl and NP2 their i = 1,2)
respective Newton polygons. Lct us denote by HP1e2 and Bpie2 the
IIodge and Newton polygons of (M1,Ii;)@ (M2,F2), and let us denote by
IIfio2and NPlB2 the Ilodge and Newton polygons of (Ml@M2, F1@F2). has gy a sum with positive integer multiplicities of Dirac delta functions
Then we have the expected co~npatibilities supported a t various points, including the points a;,, ai, bi. Uecause the
functions f i are non-negativd, we have
where
~ )O N P , ~ ~>
B O M P , ~ ~-( B ( ~0)
To I.R. Shafaravich
1
h(P) = ih.(P) + Or(1) for P E A(Q).
domain, so the problem is to estimate tliis volurnc. Ilermite's theorem will and is the square of the volu~neof a fundamental domain of the lattice
be recalled for the convenience of the reader in the last section. 4 Q ) l A ( Q ) t o r in R 8 4 Q ) .
The conjectured estimates will also tie in naturally with a conjecture of As usual, the discriniinar~tis given by
Marshall Hall for integral points, which we discuss in $2. I shall give some
numerical examples in $3.
Manin [Ma] gave a general discussion showing how the Birch-Swinnerton The terms rpfor plA are given by the integral of Iwl over A(Qp), where
Dyer conjecture and the Taniyama-Wcil conjecture that all elliptic curves the differential form w is the one associated with the global ~riinirnummodel.
over Q are modular give effective means to find a basis for the Mordell- As Tate pointed out, ap is an integer for each p. This property is all we
Weil group A(Q) and an estimate for the Shafarevich-Tate group. The need to know about a, for our purposes. Essentially.
Taniyama-Weil conjccture will play no role in the arguments of this paper,
and I shall propose a much more precise way of estimating the regulator
and the heights in a basis as in IIermite's theorem. Manin also attributes a
"gloomy joke" to Shafarcvich in the context of his Theorem 11.1. It is there- +
except that the equation y2 = x3 + ax b is not the equation of a minimal
fore a pleasure to dedicate the conjectures and this paper t o Shafarevich. model, and one has to introduce extra cocfi?cients, cf. ['Fa I ] where the
matter is explained in detail. For simplicity of notation, I shall continue to
write the equation in standard Wcierstrass form as above.
The period a, is given by the integral
1. Rational Points
We let (P,Q ) be the bilinear symmetric form associated with the Nkron-
Tate height, namely
Finally IA(Q)t,, 1 is the order of the (finite) torsion group.
(P,Q) = h ( P + Q) - h(P) - h(Q)-
We are now interested iri estimating all the terms to get a bourid for
Then IIIIlR, which will yield a bound for 12 since ILL11 is an integer.
1
)PI = and h(P) = - 1 ~)~. Since the aP are intrgers, they work for us in getting an upper bound
2 for R.
If r is the rank of A(Q), that is the dirnensiori over R of R @ A ( & ) , then Mazur [Ma] has shown that IA(Q)lorl is bounded by 16, and hence this
Birch-Swinnerton Dyer conjecture that the L-function LA(s) = L(s) has a torsion number has a liniitecl effect OIL the ticsired estimates.
zero of order r a t s = 1, and t h a t the coefIicient of (s - 1)' in the Taylor Concerning the r-th dcrivative of the L-function. I am indebted to
expansion is given by the formula H. Montgomery and Dxvid Rohrlich for instructive discussions on the
analytic number theory of more classicid zeta functions and Lseries. In
particular, for the Ricrriann zeta function cQ(s), the Ricmarln hypothesis
implies t h a t
On the right-hand side, IIII( is a positive integer, the order of the Shafare-
vicli-Tate group.
The determinant is taken with respect to any basis { P I , .. ., P , ) . Its where t(t) tends to 0 as t + oo. More precisely, Montgomery conjec-
absolute value is called the r e g u l a t o r , tures [Mo], formula (10) that one can take
with some constant c. Without the loglog t, Titchrnarsh already observed This leaves the real period n, to be estimated from below. The Nkron
that one cannot improve the exponent (log t)-'I2, see [Ti], Theorem 8.12. differcrltial of a mini~rialmodel will not difler too much from dxly. Let US
The Rie~nannhypothesis iniplies by Theorem 14.14 that compute roughly the integral
Note that if either a or b = 0, then H(A)'/'~ can be replaced by Theorem 4.2 (1) and the lower bound of Conjecture 2, which allows us to
In light of the Hall conjecture below, it should always be the case that divide by (10glA,~,l)~-' on both sides so that we end up with only one
factor of log N on the right hand side.
The above conjectures are phrased for the canonical height h. To apply
them to the ordinary height h, of the x-coordinate, we need an upper bound
In any case, H(A) is absolutely homogeneous of degree 12, in the sense 1
on Ih - i h , which will be discusscd in the next section.
that H(cA) = I c ~ ' ~ H ( A )where
, cA denotes the elliptic curve obtained On the other hand, observe that the lower bound of Conjecture 2 allows
by multiplying all the quantities a, b, x,y with the appropriate power of c, us to estimate from above. Indeed, referring to IIer~nite'sthcorem
corresponding to their weight. in $4, we have
The shape of the inequality in Conjecture 1 is analogous to the upper R =d e t ( ~= ) ~lull2.. . 1u,I2
bound for the regulator of a number field K by easy esti~natesof the residue
of the Dedckind zeta function a t s = 1, apparently first noted by Landau,
of the form
~KRK 5 c[ ~ : ~ dK)[K:Ql-l
l ~ z ~ ( ~ ~ ~
where dK is the absolute value of the discriminant. Siege1 [Si] more carefully B c:(logl~rni~ 1)'
determined precise and very good constants in this estimate. In [L 11 I made by Conjecture 2. Hcncc the upper bourid for lLUlR in Conjecture 1 yields
the following conjecture: an upper bound for lLUl of the same nature, except that we can cancel
(10gIAminlY.
. .
Conjecture 2. For all elliptic curves over the integers as above, one The above arguments do not seem t o give further insight in the ques-
has the lower bound for the canonical height: tion raised in [L 11, p. 92, concerning the ratios of successive minima, or
equivalently the lengths of elements in a n almost orthogonalized basis.
It is a good question to determine to what extent the factor cr(log N)'
for any rational point P which is not a torsion point. in Conjecture 1 must really be there. After extending Q to the field of
2-torsion points the standard proof of the Mordell-Weil theorem of course
bounds r in terms of the number of prime factors of N but also in terms
This conjecture was proved by Silverman [Si] when the j-invariant is
of the 2-rank of the class number or the p r a n k of the class numbcr of
an integer. Silverman [Sil 31 has also shown that the conjecture is best
ptorsion points, for any prime p. The question is whether this second part
possible. Taking into account the Hermite theorem, we may now give an
of the bound is, in fact, significant over the rational numbers.
upper bound for an appropriate basis of the Mordell-Weil group. I leave
Supposing r is like the number of distinct prime factors of N, one
out the integral factor lLUl on the left for simplicity.
knows that if N is a product of the first r primes, then r is asyn~ptotic
to log Nlloglog N. On the other hand, the average value of r is log log N.
Conjecture 3. There exists a basis {PI,.. .,P,) for A(Q) modulo tor-
sion, ordered by ascending height, such that:
2. Integral Points
T h e o r e m 2.1. Let y2 = z3+ a x + b be an elliptic curve with a, b E Z. because j = - 2 1 2 3 3 a 3 / ~and a, A are p-integral. Taking the sum we then
Let h be the canonical height. Then obtain a bound
The O(1) comes only from the archimedean v, there is no such constant
In particular, for v = vp with a prime number p. The theorem follows a t once. (Note: one
can get something slightly better, see the Remark in [L 11, p. 32.)
I h - -h,
I
5
1
- log H ( A )
6
+ O(1).
Theorem 2.1 suffices amply to apply the bound of Conjecture 3 to the
Proof. Let h, be the local canonical height, or in other words the Nkron ordinary height IL,. However, I want the estimate for Ih - ih,l to be
function at v, normalized as in Tate, cf. [L I] Chapter I, $7 and 58, and entirely in terms of A. For this purpose, among many others, one can use
Chapter 111, $4. Let h,,, be the local height of the x-coordinate, that is a conjecture of Marshall Hall [Hall].
Remark. In using the Ball conjecture, the "variables" a, b have fixed I shall give here his two largest solutions. I am indebted to J. Rrette for
coefficients, namely 4 and 27. Trivial changes of coordinates (and of A. using the computer to put Selmer's solutions in the Weierstrass form
by a bounded small factor) reduce such equations to the case when the
coefficients are equal to 1. For example we multiply the equation by 4233 Y2 = x3 + b, with b = - 2 4 3 3 ~ 2 .
and let u = -12a, t = 22c3b to get t2 - u3 = 4233A0.
Since we are interested in the height, it is best to clear denominators and
consider the homogeneous form
It is not completely clear how the Hall conjecture should extend to an
arbitrary equation
y2 = x3 + ax b +
to be solved in relatively prime integers (u, w, z). Then
with a, b E 2. It seems reasonable to expect:
x = u/z and y = w/z.
Conjecture 5. If (x, y) is a n integral point on this elliptic curve, then
The transformation is given by the formulas
u = 35,208,298,044,859,842,638,816,896,575,916
which has the solution x = t6 + 2t2 and y = t9 + 3t5 + i t .
z = 94,295,149,257,506,211,891,484, 409,025
(u', z') = 956,232,024,756,251, 670
3. Some Numerical Examples
There exist some tables in the literature giving bases for the Mordell-
Weil group, mostly when the rank is 1. Selmer [Se 11, [Se 21 found some For the conductor, we use the formula in Stephens [St], p. 125, which
solutions which a t first sight appear quite large. H e works with the curve for N f *2 mod 9 gives the conductor of the Hecke character as
in Fermat form
S. LANG CONJECTURED DIOPRANTINE ESTIMATES
N = 223"912 = 3,039,948 It should be noted that most generators in Selmer's tables, for instance,
log N - 15.2 are considerably smaller than the upper bound given by the conjecture.
H ( A ) ~ I ~ ~NN ~4.9( x - ~ lo3
) ~ ~ ~ Statistically, it would seem that they follow quite a different distribution,
h, - 80.4
namely
R << loglA1.
The estimate is again roughly the same as in the preceding example.
For the exponential height, H, = exp h,, this would mean that on the
average, one has
I owe the next example to Andrew Bremner, who considers the curve H z << /Atk
for some constant k. Even the two largest Selrner points have heights
compatible with such a better bound taking k = 2; while the Bremner
This is the "other" case with complex multiplication. Bremner parametrizes point fits with k = 3 or 4. Most other points in Selmer's tables, and all the
a generalor of the Mordcll-Wcil group in the form points in Cassels' early table [Ca] or Podsypanin [Po] fit with k = 1. Within
the range of such tables, it seems impossible to distinguish a polynomial
estimate as above from an exponential estimate (for the exponential height)
168 S. LANG CONJECTUILED DIOPIIANTINE ESTIMATES
as in Conjecture 3. It would be interesting to have a more precise statistical 4. The Hermite Theorem
analysis of the possibility of a polynomial estimate.
We note that if f ( x , y) is an irreducible homogeneous polynomial of This section is essentially an appendix, t o state the Hermite theorem
degree > 3 over Z then there exists a constant k such that for any integral concerning lattices in euclidean space.
solution ( 2 ,y) of the equation
T h e o r e m 4.1. Let L be a lattice i n a vector space V of dimension
r over R, with a positive definite quadratic form. Then there exists a n
orthogonal bases { u l , . . .,u,) of V, and a basis {el,. . .,e,) of L having the
following properties.
To I.R. Shafarevich
So NO(G,M ) = M G and if M is a torsion module, II1(G, M) = MG Proof. Starting with the exact sequence of sheaves on X:
arid EIi(G, M) = 0 for i > 1. It follows from ttic IIochschild-Serre spectral
sequence in flat cohornology that we have the exact sequence of finite groups
Here II:(X, F) denoks flat cohoniology. Let H'(x, F) denote Ctale cohornol-
ogy. Recall tliat IIi(X,G,) = II)(X, Gm) and I I i ( X ,p,) = $ ( X , Pn)
Taking inverse limits with respect t o n, we obtain:
if p y n . Let 1 be any prime nunibsr, including p. l.et H>(X,Tl(p)) (resp.
I ~ ; ( ~ , T ~ (=~ ) ) B j , ( ~ , ~ (resp.
~ n ) H ) ( X , p l n ) ) . Since inverse
limit is an exact functor on the category of finite groups, we have
and taking Galois cohomology, we obtain: let T: Z i - - + ZI be a Zl-linear m a p with characteristic polynomial P ( X ) .
A s s u m e that the minimal polvnomial M ( X ) of T contains X t o at m o s t
the first power, i.e. that T is "semisimple at 0". T h e n the order of the
torsion subgroup of the cokernel of T divides P*(O).In particular, for all
where the last equality is Lang's theorem. We have the commutative but Jinitely m a n y primes 1, the cokernel o f T is torsion-free.
diagram
Proof. If T @ Q1 is invertible, the cokernel of T is torsion, and its
order is equal t o the determinant of T, up to an 6-adic unit, and so divides
the constant term of the characteristic polynomial of T. If T @ Q1 is not
invertible, look a t the diagram
shows that H i ( X , PI) maps onto lZi(X, G,)[. Since the cohoniological
2. The Cohomology of Gm dimension of X at 1 is 5 2d +
1 (see [M2], Chapter VI, Corollary 1.4),
H i ( X , G,)l and hence I l i ( X , Gm)(l) = 0 for i > 2d+l. On the other hand
In this section we investigate the relationship between the behavior of we have (by [M2], Chapter VI, Remark 1.5 b) and c)) that H ; ( X , ~ ~= )0
our zeta-function Z ( X , T) as 2' + q-', and the &talecohomology of G,. +
for i > d 2. Therefore the Kumrncr sequence for flat cohomology shows
Let 1 be a variable prime different from p. We begin with the following +
t h a t Ili(X,G,)(p) = 0 for i > d 2, which is <
2d + 1 if d > I. (If
-
proposition: d = 0, ZZi(X,G,) is well-known to be zero for i > 0).
c) It follows from (1.5) that H'(x, T~(P))maps onto T ~ ( I I ~ X ,
G,,,)),
Proposition 2.1. a) H'(X, G,) is a torsion group for i # 1. and Z I ) ( X , T ~ ( ~maps) ) onto T~(IZ~(X,G,)). If i > 3, we have seen
b) Hi(X,G,) = 0 for i > 2d 1. + t h a t ffi(x, ~ ( p ) is) finite and hence ~;(Iz'(x, G,)), being torsion-free, is
c) H'(x, G,) is finite for i # 1,2,3. zero. Since fZi(X, G,)l is finite, H ~ ( xG,)(1), must be finite. Similarly,
d) Let the order of fZi(X, Gm)(l) for i 2 4 be denoted by r q , and let the fZi(X,G,)(p) is f nite.
partial Ruler characteristic .. be X ~ J .Then +
Looking now at (1.5) with i replaced by i 1, we conclude t h a t
structure thcory of such groups ([K], Thcorems 4 and 9), II'(X, Gm)(l) = [ T I ] , t h a t (iv) implies (iii). So we must show that (i), (ii) and (iii) imply
( Q ~ / z ~ ) ~@( 'C;,
~ ~wherc
) p(i, 1) is a non-ncgative integer and Ci is a finite (iv). I,et C ( X ) (resp. C ( X ) ) be the free abclian group generated by the
I-group. Now look at (1.3) with i = 4. This yields one-dimensional intrgral subscl~emrs(curves) on X (resp. X). As Tate
discussed in his Woods Hole talk [T2], we have a hornornorhisrn 4 from
0 -+ lirrl
t 1 1 3 ( x ,G,,)/ln 4 H?(x, fi(/i))3 2i(H4(X1
Gm)) 4 0. C(W) t o 1 1 ~ ~ - ~ '(~fjf(,~ ) @ ( ~ - ' )which
), we will abbreviate by Il"d-(m.
n
(Tate states only that his map lands in I!2d-2(X) @ Q1, but i t is easily
Since C: -- lim II"X, Gm)/ln, the proof of c) suffices t o show t h a t if X seen t h a t in fact the map 4 factors through I12d-2(X).)
t
We see t h a t 4 induces a map dl frorn C ( X ) @ Z l to ~ I ~ ~ - ~ (I'oincarE
W ) ~ .
n
has finite torsion, C: is trivial for almost all 1, arid so C 3 = x,c: is a duality induces a map 0 frorn I12d-2(X)G t o I I O ~ ~ ( I ~ ?;(p))", ~(X, zl)
finte group. and the natural map from NS(X) to I I 2 ( x , ~ 1 ) "induces a map $ from
~ I O I ~ ( I I ~~i)",( ~ , ~ 1 t)o JIO~I(NS(X) 8 Zl, ZI). Diagrammatically, we
In order to proceed further, we must (and do) assume henceforth t h a t z
have:
for a fixed prime 1 # p, the I-component of the cohomological Brauer group
112(X,G,,) is finite. We prove in fact t h e rollowing theorem:
Proof. Tate's proof in [TI] of the equivalence of (i), (ii), and (iii) works where I;i and Dl are finite groups. We see by looking a t t h e sequence (1.1)
perfectly well in this case. (See also [%I,
p.218). I t is also clear, as in and taking direct limits that we Iiave
ZETA-FUNCTIONS AT s=l 185
A' such that A injects into A' and there are isomorphisms
(i) IRl(Gm)ll = 1 for all but finitely m a n y 1.
(ii) If we define R1(Gm) to be f l , l p l ~ l ( ~ m ) ~then
;l,
h : At @ Zi -+ IIorn(B, Qi/Zi) compatible with the 41 '8.
where Cx = fpVX'(X,
Gm)-l, u is an integer, and x'(X,G,) = L e m m a 2.6. Let A be a Jnitely-generated Z [ l / p ] - m o d u l e . Let B be
a torsion abelian group S U C ~that B(p) = 0 and there exist stbrjections
dl: IIom(B, Ql/Z1) --+ A @ Zl with finite kernel3 K1 such that Kl = 0 for
all but finitely many I . T h e n there exists a finitely generated
Proof. We have seen in Proposition 2.1 that H1(X,Gm)(nonp) is a Z [ l / p ] - m o d u l eA' such that A' surjects onto A and there are isomorphisms
finite group for i > 3 and i = 0, that H 3 ( X , Gm)co~o,(nonp) is finite, 41: Hom(U, Q1/Zl) --+ A' @ Z1 compatible with the 41'3.
and that H 1 ( X ,G,n)(tor) is finite. We have seen in Theorem 2.3 that
l12(X,Gm)(nonp) is finite. Since Cx is a non-zero rational number, and Proof. Lemma 2.6 is straightforward, so we prove Lemma 2.5. We
since formula (2.4) is true after taking I-adic absolute values for each prime immediately reduce to the case when A and Ilom(R, Q1/Zl) are torsion-
1 # p, we must have IRl(Gm)ll = 1 for all but finitely many 1. The truth free. Let A = n l Z P ( A@ Zl). The hypotheses of the lemma imply that we
of (2.4) then follows formally. have an exact sequence
We still would like to interpret Rt(Gm) as an actual regulator, i.e., as
the determinant of a pairing of finitely-generated abelian groups. Since we
are ignoring p in this section, this is too much to ask for. However, we
can interpret Rt(G,) as the determinant of a pairing of finitely-generated
Z [l/p]-modules into Q. with C finite and 4 induced by the 41's. Let B* = H O ~ ( BQ/z(nonp)),
,
Referring back to diagram (2.2) in the proof of Theorem 2.2, we see that
and let A' = { x E B*: some non-zero multiple of x lies in A ) . It is
immediate that A' is a Z[l/p]-module and that we have an exact sequence
0 -+ A 4'
-+ A' 5 C, where 4' and r' are induced from 4 and r. (Use the
fact that A is pure in A.) In particular, we see that A' is finitely-generated.
We wish to show r' is surjective.
where det(Cj, Dj) was the determinant of the intersection pairing bet- By construction A/mA 1 A/mA for any integer m. Since the isornor-
ween curves and divisors on X . Since we know from Proposition 2.1 that
phism factors through A'/mAt, At/mA' maps onto it/mA, which maps
#II3(X, Gm)(l)coto,is 1 for all but finitely many 1, it follows that #El is 1
for all but finitely many 1. Thcorerri 2.4 then implies that #Dl and #El are
onto C I m C . If we let m = # C then C / m C --C and we see that T' is
surjective. It is then immediate that A' satisfies the condition of the lemma.
both equal to I for all but finitely many 1. Interpreting If 3(X, Gm)(nonp)
as the dual of a finitely-generated Z [lip]-module then follows from the next
Remark 2.7. It is now clear that the regulator formed by using
two lemmas.
the finitely-generated Z[l/p]-module dual to H 3 ( X , Gm)(nonp) agrees with
n r f n \
n (ti,).
L e m m a 2.5. Let A be a finitely-generated Z [ l / p ] - m o d u l e . Let B be
n torsion abelian group such that B ( p ) = 0 and there exist injections
Remurk 2.8. Note also that if x(X, G,) makes sense, then x l ( X , G m )
41: A @ Z1-+ IIorn(U, Q l / Z l ) with finite cokernels Cl such that C1 = 0 for differs from it by a power of p.
all but finitely m a n y 1. T h e n there exists a finitely generated Zjllp]-module
ZETA-FUNCTIONS AT s= 1
3. The Cohomology of G,
It follows from the IIochschild-Serre spectral sequence for p p that wc have On the other hand, by the main theorem of Milne's paper [MI] we have
let Pl(X,T) have deg B = B l ( X ) and reciprocal roots all . . . ,ag. By the
functional equation q 2 / a l , . . . , q 2 / a B are the reciprocal roots of P3(X, T).
Also, P l ( X , T ) has integral coefticients, so if cyi is a root so is E; = q/ai
by the Riemann hypothesis.
We have
p1(x,
q-l) = n ( 1 - q-lai) = n(1- a;') = na?'n(aj - 1)
= fq - B / 2 ~ l ( l ) .
Norman [19]. Both of these authors point out that, although the concept of 1. Local Splittings
biextensions is riot explicitly mentioned in the theories of p a d i c theta func-
tions of Mumford, and of Barsotti [2] (see also Cristante [6]) it is directly Let IC be a field cornplete with respect t o a place v which is either
related t o these theories (via the theorem of the cube). One might also try archimedean or discrete. If v is discrctc, let 0 = OK denote the ring of
to relate Nkron's approach to p-adic theta functions [16], [17], [la] directly v-integers in K , T a prime element in 0 and k = O/KO the residue field.
to biextensions. If AIK is an abclian variety over K , anti v is discrete, wc denote by A
Thirdly, the theory of p a d i c heights for elliptic curves of complex mul- (or sometimes: AIO) the N h o n model of A/K over O. If v is archirnedean,
tiplication (and p a prime of ordinary reduction) has been developed by we let A denote A / K .
Perrin-Riou [20] and Bernardi [4], using a p a d i c version of t h e sigma- In the non-archimedean case, A0 (or denotcs the connected com-
function. Here the padic sigma-function plays a role analogous t o that of ponent of zero in A, i.e., the open subgroupscheme of whose closed
the classical sigma-function in Nkron's theory [15] for archimedean primes. fiber A; is connected. If U10 is any scheme over 0, its closed fiber TJ X Ok
Nkron has also developed a theory of p-adic valued height pairings us- is denoted Uo.
ing his padic theta functions [18]. In the case of elliptic curves of com-
plex ~nultiplicatiorl,an explicit connection between Nkron's definition and
(1.1) O r d i n a r y a b e l i a n varieties.
8ernardi's has not yet been made (to our knowledge). What is the relation
A and Alfc are c;~llctlordinary if v is discrete, the characteristic of k
(if any) between Nkron's p a d i c height and ours?
is # 0, and the special Gber of A satisfies the following equivalent condi-
Since the explicit expression for the local terms of our canonical p
tions.
adic pairing involves the canonical p a d i c theta functions (of Mumford
and Barsotti), we would find it useful to have a practical algorithm for
computing these functions. In a subsequent paper we will discuss this issue (i) The formal completion A{ of A. a t the origin is of multiplicative
in the case of elliptic curves. type, i.e., is isomorphic to a product of copies of 6& over the algebraic
In this connection, one should also note that the beginnings of a (mod p)- closure of k.
valued theory of height for general elliptic curves (with ordinary reduction (ii) For p = char k, the connected component of the kernel of the
at p ) can be found in ['Ll]. Iiomornorphisrn p: A; --+ A: is the dual of an &ale group scheme over k.
Our construction of padic valued canonical heights requires ordinary (iii) A$ is an extension of an ordinary abelian variety by a torus TA.
reduction at the primes of ramificatiori of the chosen Zp-extension. Can
one find a generalization or replacement of our construction valid for all If L / K is a finite Geld extension arid A/K is ordinary, so is AII, and
Zp-extensions? For elliptic curves with complex multiplication, R. Gross has formation of A0 comrnutcs with the basc change of rings OK -+ 01,.
some ideas on this; see J. Oesterlk, Construction de hauteurs archimkdiennes If AIK is ordinary, then A/K has good reduction over k (equivalently:
et p-adiques suivant la mkthode de Bloch, p. 175-192, in Skminaire de AID is an abelian scliemt~)if an only if TA = 0.
Theorie des Nornbres (SCminaire Delange-Pisot-Poitou), Paris, 1980-81;
Progress in Math., Vol. 22, Birkhsuser Boston, Basel, Stuttgart, 1982. (1.2) E x p o n e n t s .
We are grateful t o M. Artin, L. Breen, J. Coates, L. Moret-Bailly, By the exponent of a finite abelian group G we mean the sniallest integer
P. Norman and B. I'errin-Riou for pleasant arid informative conversa- m > 0 such that mG = 0.
tions concerning algebraic spaces, biextensions, theta-functions, and p-adic In this paragraph, suppose that v is discrete. Let mA = m * / ~denote
heights. the exponent of Ao(k)/Ag(k). Now suppose that k is finite. Let TA denote
We also thank 0 . Gabber for providing us with significant help in work- the "rnaxirnal torus" in A0 which exists by, e.g., [SGA 31 exposk X N , Tlirn.
ing out $5. 1.1. Let nn = n * , ~dcnote the exponcnt of /l;(k)/TA(k).
We refcr to the numbers mA and nA as the exponents of A.
198 MAZUR AND TATE
(1.4) p-splittings
The exponents are sensitive to isogenous change of A.
As for their dependence on the basc field K , m n 1 ~ 1adrnits a finite Let I / , V, W, arid Y be abelian groups, X a biextentiori of (U, V) by W
upper bound for all finite unrarnified extensions K 1 / K , while n ~ l is~ t and p: W-+ Y a hornornorphisrn. A p-splitting of X is a map
independent of K' provided A is ordinary and K1/K is a finite totally
ramified extension. If we drop the assumption that A be ordinary, then
such that
nnlKt adrnits a finite upper bound for all finite totally ramified extensions
K1/K.
(i) $(w + +
x) = y(w) $(x) for w E W, x E X.
(1.3) B i e x t e n s i o n s and p a i r e d a b e l i a n varieties. (ii) For each u E U (resp. v E V) the restriction of to ,X ( r c s p X,)
For an introductiori to the concept of bicxte~ision,we suggcst reading 52 is a group honioniorphism.
and $3 of [12]. For a fullcr treatment of this notion, see expos& VII and
VIII of [SCA 711. A useful and pleasantly written introduction to this fuller (IIcre ,X (resp. X,) denotes the part of X above {u) X V (resp.
treatrrirnt may be found in the first 5 4 pages of $1 of [5]. U X {v)) and is a group extension of V (resp. U )by CV.) Note that we are
Lct A>K denote the dual of AjK arid EfK the canonical biextcnsion of expressing the action of W on X additively. We will continue to do so even
(AIK,AllK) by GmIK expressing the duality ([SCA 7 I] Exposk VII, 2.9). wheu W = Gm.
If v is archirriedean, let En dcnote t h r canonical biextensiori EfK. If v is
discrete, let EA (or ETo) dcnote the canonical biextension of (AO,A') by (1.5) C a n o n i c a l p-splittings.
Ilet Y be an abclian group and p: K * -+ Y a homomorphism.
GmlO, i.e., the unique such biexterision whose general fiber is EfK (whose
existence and uniqueness follow from [SGA 7 I] ICxposk VII, 7.lb).
If BIK is any abelian varicty, to give a bicxtension Elrc of (AIK,BIK) Theorem. There exists a canonical p-aplittingl
by 6,/1c is rquivalent to giving a K-honlomorphisrn A: BIK - - (and
EIK is the pullback of EfK by ( I , A)), or to giving a K-hornornorphism
in the following three cases:
A': -+ AIK (the dual of A). (1.5.1) v is archimedean and p(c) = 0 for c such that lclv = 1.
Again, if v is arcliirnedean, let E denote ElK, while if v is discrele, E (1.5.2) v is discrete, p is unramified (i.e., p ( ~ *= ) O), and Y is uniquely
(or E/o) will dcnote the pullback of E;b, viewed as biextension of (A0, B) divisible by mA.
by Gm/0. (1.5.3) v is discrete, Ic is finite. A i.9 ordinary, and Y is uniquely divisible
The abelim varieties AIK, BIK will be said to be paired, if a bicx- by mAmgnAn8.
tension EIK of (AIK, BIK) by GmIK (equivalently: a K-homornorphisrn If both (1.5.2) arid (15.3) Iiold, they yield the same $ J ~ .
A: BIK -+ is fixed. The: image of $, satisfies the following inclusion relations:
In what follows, we suppose AIK, BIK are paired abelian varieties over
I<, with EIK the bicxtension expressing the pairing. in case (1.5.1)
In all cases, archimedean or non-archimedean, the set E ( K ) of points
of E with coordinates in K is a set theoretical biextension of the groups in case (1.5.2)
(A(K), B ( K ) ) by G m ( K ) = K*. Our aim is to iritroduce canonical split-
tings of biextcnsions obtained from this one via various types of homomor- + 1
mAmenAne P(o*), in case (1.5.3).
phisms p: K * -4 Y.
Before beginning the proof of the theorem, we give some lemmas on On the other hand, it is easy to check that this formula defines a p-splitting
set-theoretical biextensions. Let U , V, W and X be as in (1.4), i.e., X a 1C, on all of X, if +,
is a p-splitting of XO.
biexter~sionof (U, V) by W. For integers rn and n we define a map
Another case of unique extendibility of p-splittings is given by
which for each ( u ,v) E U X V takes the fiber ,Xu of X over (u, v) to the (1.7) Lemma. Suppose W' is a subgroup of W and XI a subset of
fiber ,,Xnv. The map (m, 1) is defined as multiplication by m in the group X such that X' is a biextension of (U, V) by W'. Let p: W---+ Y be a
X, for each v E V, the map (1, n ) is defined as multiplication by n in the homomorphism and let = p I W'. A p'-splitting +' of W' extends uniquely
group ,X for each u E U, and finally, to a p-splitting 1C, of W.
defn
(m, n) = (m, 1) 0 (1, n ) = (1, n) 0 (m, 1); Let W = Ui(wi + W') be the expression of W as disjoint union of cosets
of W'. Then X = U(wi + X') is a disjoint union because this is true on
the commutativity of (1, n ) and (m, 1) results from the compatibility axiom each fiber over U X V. If x = w, +
x' E wi X' and + +
extends +', we
for the two laws of composition in a biextcnsion. We have the rules must have
$(XI = P( wi) + +'(x').
and On the other hand, it is easy to check that this formula defines a p-splitting
(m, n)(x + w) = (m, n)x + mn w for w E W. 11, of X, if 4' is a p'-splitting of XI.
If p: W+ Y is a homomorphism and + is a p-splitting, then (1.8) Proposition. Suppose 1J and V 'are compact topological abelian
groups and X a topological biextension of (U, V) by W, such that the projec-
tion pr: X + U X V has local sections. Then X has a unique continzious
In particular, if Y is uniquely divisible by m and n, then we have splitting 1C,: X ---t 08.
canonical p-splitting $,. For x E E(K) there is an integer v such that Let p, p' : K * -+ Y be homornorphismv and c, c' E End Y. Then
(mA,1)2 E ?r" + +
E(o); writing (mA,1)x = nu y, we have
( n ~~ , B ~ B E) Et(6)
Y
be a homomorphism. Then
If we arc simultaneously in case (1.5.2) and (1.5.3), then p o $0 = 0, hence
the $, we have just defined in case (1.5.3) does indeed coincide with that
defined for (1.5.2). This concludes the proof of 1.5, or rather its reduction
to our result (5.11, 5.12) on the existence of unique splittings of certain i.e., the diagram
formal group scheme biextensions. E(K) 5 E(L)
and by local class field theory, j determines a Zp-extension L/# whose vth
layer K,/K is the unique cyclic extension of degree p" such t h a l
(sum in E~(L)).
for all v.
In particular, if p~ = p for all T E G I then
Examples. If G = G,, then G , ( K ) is the kernel of the homomorphism
p: G,(K) = K *-+ Q,. Tlic group of universal p-norms A ( K ) for our
And if p = 0 0 NLIK, where 0: K *--+ Y , then this becomes abelian variety A is of finite index irl A ( K ) if either AID is ordinary of good
reduction or L / K is unramificd, ( [ l l ] 4.39; and if L I K is unrarnified, 4.2
and 4.3).
(1.11.5) Lemma. Let A([<) be of finite index i n A(K1. T h e n there is and consequently by (1.11.6.1), +E(K) is contained in pU-' p(K*) for all
a unique p-splitting of the bieztensions E ( K ) which takes E ( K ) t o zero. Y, hence is zero.
To prove the claim, let 0, = o(K,) be the ring of integers in K,.
Proof. An easy application of (1.G) and (1.7). We refer to the above p- Note that the NCron model of AIKY over OY is, under assumption (a) or
splitting (which exists when i ( is~ ) index in A(K)) as Schneider's
of finite (b), the base-change to 0, of A. Let m u = ~ A I K , , n u = ~ A I K , and
p-splitting. m', = mH/KUln', = ~ B I K , be the exponents of AIKY and UIKY.
By the theorem in (1.5),
(1 . l l . 6 ) Proposition. Suppose that either
(2.1) The s y m b o l [R, D , c]. (2.1.6) For each D and each a0 E (A - Supp D)(K) there is a K-morphism
In tlie next two paragraphs, K can be any field; AIK is an abelian variety
over K , A)K its dual, and LC = BA, cf. (1.3).
Consider the set T of triples (a, D , c) consisting of:
(i) A zero cycle of degree zero, n = zi
ni(ai), a; E A(lC), on AK, all such that
of whose components are points ui of A rational over K. g(a) = [(a)- (ao), D, 11-
(ii) A divisor D algebraically equivalent to zero on AIK, whose support
is disjoint from 8. The properties (2.1.1)-(2.1 .G), for K arid its algebraic extensions, charac-
(iii) An element c E K *.
terize the symbol [a, D , c]. Indeed, suppose [ and [ Iz are two such
symbols with those properties. Let their difference be
A triple (8, Dl c) E I' deterniines a point [a, D , c] E C (K ) in a well-known
mariner, and every point of E ( K ) is of this form. The symbol [a, D, c] obeys
the following rules.
This makes sense by (2.1.1). By (2.1.2), 6(a, D , c) is independent of c, and
(2.1.1) [a, D, c] lies over the point (a,b) E A ( K ) X B ( K ) , where by (2.1.5), it depends only on the class of 1). Choosing divisors D j in this
class such that none of them passes through zero, and whose supports have
defn
a = S(R) = C n;a; an empty intersection, we find using (2.1.6) t h a t there are morphisms
i
and hi: ( A - Supp Di)--+ 6,
defn
b = Cl(D) = Thc point in A1(K) representing
the class of the divisor D . such that
hi(.) = b((a) - (O), Di, 1).
such t h a t k ( u ) is finite havc t11c mcanings explained in thc beginning of 51; For each c E K * we liavc then
and similarly for the abelian varicty BlK.
Let Y be a commutative group, and suppose that we arc givcn a family
p = ( P v ) v E Vof honiomorphisrns p,: K: -+ Y , such t h a t pv(O:) = 0 for all
but a finitc number of v's, and such t h a t the "sum formula" CUE,, p,(c) = 0
+
bccause $,(xu 4- c ) = $,(xu) p,,(c) and Cup,(c) = 0. Thus the right side
of (3.1.1) is indcpcndrnt of the choice of x E E ( K ) ahovc ( a , b). Moreover,
holds for all c E K *. the map E ( K ) -+ A ( K )x 13(K) is surjective, because there arc local scctions
Dcfinc the topological ring AK as the restricted product n',,,
K , where ( E is a linc buridle on A X B, minus its 0-section). IIer~ce(3.1.1) defines a
a vcctor x = ( z v ) v E Vis in AK if x u E 0, for all but a firiitc number of map A ( K ) X B ( K ) -+ Y. The map so defined is biadditivc, because the &,
v E V - V,. Wc have a canonical homomorphism K -t AK . It is convenient are splittings.
to view a family p = (p,) as above, as a homomorphism
(3.2) Definition. If, in the situation of (3.1)) the p,-splitting $, is the
canonical p,-splitting of Theorem 1.3, for each v, thcn the pairing (3.1.1)
which annihilates the image of 0: for all but a finite number of v E V - v, is called the canonical p-pairing, and is denoted by ( , ) p .
as well a s the image of K *.
Suppose t h a t we are givcn for cach v E V a p,-splitting, $,, of E(K,), (3.3) Thus, thc conditions for thc canonical p-pairing
and t h a t $,(Ic(o,)) = 0 for all but a finitc number of v's:
(3.1) Lemma. L e t p = (P,) a n d let (4,) be as just described. T h e r e t o be defined for a family p = (p,) of homomorphisms p,: K: --,Y are
i s a unique pairing (3.3.1) For cach v E V,, we have p , ( c ) = 0 if lcl, = 1.
(3.3.2) Thcre is a finite subset S C V - V , (possibly empty) such that
pv(o;) = 0 for v @ S U V , and such t h a t A, is ordinary and Ic(v) finite for
s u c h t h a t if x E E ( K ) lies above (a, b) E A ( K ) X R ( K ) , t h e n v E S.
(3.3.3) S u m formula CvEV pv(x) = 0 for x E K*.
(3.3.4) Y is uniquely divisible by M N , whcre
for c, c' E End(Y), (I E A(K), b E B(K). for a E A(K), b E A(K), where the lcft-hand side refers to the canonical
ppairing A ( K ) X B ( K ) -r Y (coming from E) and the right-hand side
(3.4.2) Change of field refers to the canonical p-pairing B ( K ) X A(K) -+ Y (coming from 'E).
I,et a : K -+ I, be a homon~orphisrn of "global" ficlds such that I, is If A is symmetrically paired with itself (by a biextension E of (AlK, AIK)
of finiLc degree over K , and the choscn set of places Vr, for L is the full by GmlK such that there is an isomorphism of biextension E Z 'E) then
"inverse iinage" of the chosrn set of placcs VK for K. the canonical p-pairing
Dcnotr by the same lettcr u the induced rriappi~igA ;( -+ A; and also
the mappings iriduccd on groups of rational points.
Then for an admissiblc p: A; -+ Y we have
NLIK: L* -
for v E VK and w E Vr, "lying above" v, and with the global norm
K * . Assume that Y has no [L : K]-torsion. Then for a E
A(L) arid b E B ( K )
Then the canonical p-pairing
is defined. Note that if K is a number field, V the set of all places, and lclv
the normed absolute value a t v E V, then "dividing" the exact sequence
(3.5) E x a m p l e s of c a n o n i c a l p-pairings.
(3.5.1) The Ne'ron pairing: Suppose our places v have absolute values 0 + Re, -+ Wo --* (ideal group of K ) -+ 0
c ++ lclv satisfying the product formula vEVW
by the sequence
units of K ideals
0 -+ -+ K* +
Taking Y = IW and pv(c) = - loglcl, for each v we obtain a canonical roots of 1 roots of 1 of K
pairing
A ( K ) X B ( K ) + R. we obtain an exact sequence
be a real vector space with basis elements e, in one-one corresponde~lce A ( K ) X B(K) --* (ideal class group of K).
with the places v E V. Let
This pairing is not new; see [10], [23] for a function Geld version, and for
something in number fields, see Duncan Buell's paper Elliptic curves and
class groups oJ quadratic fields, J. London Math. Soc. (2), 15, (1977), 19-25.
and let Z denote the subgroup of clements of W of the form 2) Let us specialize (3.5.2) Lo the case of K a real quadratic field of class
x
vEV
loglcl,ev, c E K*.
number one, arid AIQ, BIQ paired abelian varieties over Q.
Then Gal(K/Q) operates on A(K), B ( K ) arid Y. Let the superscript
+ or - refer t o the maximal subgroup on which the nontrivial element of
(4.1) Let p: A
;( - Qp be a continuous admissible ho~nomorphisn~,
A, K ) so that the canonical pairing
(for
are finite in number).
If PGal is nontrivial, then p ~ cuts
the condition that p ~ factors:
~ l
~ outl a Zp-extension L / K defined by
(4.3) T h e d e t e r m i n a n t f o r m .
For this paragraph and the next, let U = A', and E = E ~ .
For a E A(K), b E B ( K ) we have:
By the Mordell-Weil theore~n,the groups A ( # ) and A f ( K ) arc finitely
(4.1.1) generated. They are of the same rank since A and A' are isogcnous.
(a, b),, E P-"- P(AK) C QP
Let the common rank be r and let (P,)l<il,, (resp. (Qj)lljSr) be a
where v is the maximum of the two numbers basis for A(K) (rcsp. A1(K)) mod torsion.
Then for admissible p the determinant
(4.1.2) Ma? (ordPmAe) and Max ( o r d p m ~ , m 4 n
, ~,ne,) defn
vfinrte vES
6(A1P) = detl<i,j<r(Pi, Qj)p
where S is as in (3.3.2). is defined, and is, up to sign, an invariant of A and p. Since ( , )p is
linear in p, the determinant 6(A) is a honiogencous form of degree r on
(4.2) Admissible Zp-extensions. the Qp-vector space V,,. A line in this space, and also the corresponding
Let Zp-extension is called singular (/0r A ) if b(A,p) = 0 for p in the line.
PGal: G K Qp
(4.4) Schneider's p-adic a n a l y t i c height.
be a continuous homomorphisms of the Galois group G K of an algebraic
Supposc, in this paragraph, that K is a number field and that A has
closure of K into the additive group of p a d i c numbers. Then by the
good, ordinary reduction a t all places v of K , of residual characteristic p.
reciprocity law, p ~ , l induces a continuous homomorphism
Thcr, Schneider [22] has defined a height pairing
using "local splittirlgs" defined by universal norms in the cyclotomic (4.5.8) P r o p o s i t i o n : Let w E Wo = Wr. These conditions are eguiv-
Zp-extension. Ilcfine the hornornorphisnl p,: GK -+ Q, cutting out the alent.
cyclotomic Zp-extensions by p,(a) = logp(u) where u = u(u) E Z i is defined
by f = <" for all ppower roots of unity I. r.
Elere logp is the p a d i c ( 1 ) The element w is in the image of the natural projection
logarithm.
From (1.11.6) and the definitio11 of Sclmeider's height pairing ( , ), t W7n + Wo.
lirn
we inirnediatcly have: m
Proof. I3y the functorial property (3.4.3) we may assume that B is the
dual of A and E = E*. For each n 2 0, let a n E A(Kn) @ Zp be such t h a t
224 MAZUR AND TATE IIEIGIIT PAIRINGS VIA BIEXTENSIONS
CASE 1. r a n k A ( K ) = 1.
wherc p, = p o N K , / ~ . Then the anti-cyclotomic canonical pairing ( , ) p a , K is totally degener-
By (4.l.L), (a,, P),, E & - p n ( A ; ( n ) ,where Y, is given in terms of the ate. What are the universal norms? If (P,P),= # 0 it follows that the
exponents for AIK,, , 1jIK, als in (4.1.2). By the same reasoning a8 in (1.11.7) anti-cyclotomic Zp-extension is the only Zp-extension of K, singular for A.
v,, admits a n upper bound (say N) of n , giving:
CASE 2. r a n k A ( K = 2.
Let & E A ( K ) be such t h a t P, Q generate A(K) mod torsion, and
for all n. U& = -Q for u # 1 E Gal(K/Q).
A has a largest ideal of definition I (which is the radical of any ideal of If GIs
is a (formally) smooth formal group sche~neover the formal scheme
definition). S such that the structural rnorphisms 9 -+ S induces an isomorphism on
If A is adic, the topological space Sprc:(A/I) (for I any ideal of definitior~) underlying topological spaces, we say that 9 is a formal group over S .
is independent of I, and is called the formal spectrum of A.
The formal spectrunl of A may be endowed with a canonical local adic- (5.3) Examples.
ringed space structure ( 1 , ox), denoted Spf(A) such t h a t there is a canonical Most of our examples of formal schemes can be obtained by completing
isomorphism l ' ( X , o X ) A and such that t h e functor A I-+ Spf(A) is a a scheme X along a closed subschcme Y.
fully faithful functor from the category of adic rings to the category of (5.3.1) Completion along the closed fiber.
local adic-ringed spaces. By definition, an a f i n e formal scheme is a local Let XIS be an S-scheme locally of finite type and set Xn = X x s Sn.
adic-ringed space isomorphic to Spf(A) for some adic A. Let Y = Xols, be the closed fiber.
The topological ring A is called the a f i n e coordinate ring of the afine Denote by 2 the completion of X along Y. Then 2 is a formal 3-
formal scheme Spf(A). schenle. In the notation of ([ICCA I] $10) it is given as an adic (Sn)-system
The category of formal schemes is the (fully faithful) subcategory of local of schemes (XnlsJn.
(adic) ringed spaces "modelled on" affine formal schemes. (Cf. [8], [EGA I] If X), is another S-scheme locally of finite type, we have a bijection
$10.)
Recall that o is a complete discrete valuation ring with uniformizer n .
:,et S = Spec 0, and S, =, Spec o/nn+lo for n > 0. In particular,
So = Spec(o/n~)= Speck where k is the residue field of 0. (5.3.2) Formal completion at zero.
We let 6 dcnote the adic ring o with no as ideal of dcfinition, and
Let XIs be a smooth cornrnutative group scheme of dimension d. Let
3 = Spf(6). eo denote the identity clement of the closed fiber Xols,. Form X f , the
Let XIS be an S-scheme and set Xn = X X s S,, viewed as Sn-scheme.
Let Y C Xo be a closed subscheme. completion of X aL the point eo. Then Xf is a formal group over the
formal scheme 3. Moreover, Xf is an affine forrnal scheme whose affine
If XY denotes the completion of X along Y, then XY is a formal 3-
coordinate ring is isomorphic t o a power series ring in d variables over 6
scheme. Let R be a local 0-algebra separated and complete for the n-adic
with the maximal ideal as ideal of dcfinition.
topology.
Let [l be R viewed as adic local &algebra with ideal of definition n h .
Then Spf h is an afine formal scherne and it makes sense t o consider (5.3.3) Formul completion along subtori.
x'(R), the R-valued points of the formal $-scheme XY. This set may be Let XIS be a smooth (commutative) group scherne, and To C Xo a torus
viewed in a natural way as a subset of X(R), the set of R-valued points over k, contained in the closed fiber. Let Xt denote the formal group scheme
of the S-scheme X. Which subset? Since R is local, one can reduce over 3 obtained by completil~gX along To. Sirice To is an ifline k-group
this question to the case where X is afiinc, where it is easily resolved. sche~ne,a straightforward application of Serre's criterion (also compare [8]
Specifically, if x E X ( H ) , denote by x X s So E Xo(Ro) its specialization to V thm. 2.5) shows that X t is an afline forrnal group scheme over 3.
the closed fiber. Then There are natural homomorphisms of formal group schemes
Let ks be a separable algebraic closure of k, and G = Gal(ks/k). Let where ( X n ) t is the formal completion of Xn along To and Yf, is the formal
R be a local 'artinian ring' with k as residue field. If T is a torus over R, completion a t zero in Xn/Tn.
let Tolk denote its closed fiber. The category of tori over R is equivalent Since (Xn)t is canonically (Xt),, passage to the limit gives a sequence
to the category of tori over k, the equivalence being given by passage t o of affine formal group schemes over S:
closed fiber T H To; the latter category is anti-equivalent to the category
of free abelian groups of finite rank endowed with continuous G-module
structures, the anti-equivalence being given by &,,,-duality (cf. [SGA 31,
exp. VIII).
Consequently, given Tolk, for each n >
0, there is a torus Tnlsn given
The sequence is exact in the sense that the rnorphism X t Y f is
formally srnooLh, with kernel 5?. The affine coordinate ring of the 3-formal
-
up t o canonical isomorphisni, such that T, Xs,, So = To. group Y f is a power series ring over 0 in u variables where u is the
The system of tori ( T ~ ~ ~-, )has
, , ~tlle property that T, X sn S, T, codimension of To in Xo.
for n 2 rn, and determines a formal group scheme f which we refer to
(5.5.4) Split tori.
as the formal torus over 3 determined by Tolk. If To is a split torus over k, then the affine coordinate ring of Xt is
isomorphic to an 0-algebra of the form:
(5.5) T h e g e o m e t r y of t o r i c completions.
Let XIS be a commutative smooth group scheme over S, Tolk a torus,
and
PO: To--+Xo
where p is the dimension of To. Since To is split Pic(To) = 0, and hence
a closed immersion of klgroup schemes. Let ?',j. = ( T , , ~ , , ) , ~be
~ the Pic(Xt) = 0 ([SGA 21 exp. XI prop. 1.1, plus the fact that To is afine).
formal torus over 3 determined by TOlk as discussed in (5.4).
For each n 2 0,there is a unique closed immersion (5.6) Riextensions of f o r m a l g r o u p s c h e m e s over 3.
Let A, B , C be formal group schemes over 9 and E a biextension of (A, 8 )
by C over 3. Thus S is a formal 3-scheme which is a C-torsor over A X j. 8,
of &-group schemes extending po (SGA 3 exp. VIII thm. 5.1). locally trivial for the Zariski topology and, moreover, E is endowed with
two group-extensions structures
Invoking [SGA 31 exp VI, we may form the quotient group scheme Xn/Tn.
The sequence
is exact in the sense that p, is faithfully flat (it is, in fact, smooth) with where Ae = A X i B viewed as formal group scheme over 8, and E ( s )
kernel T,. denotes E as 8-formal scheme via its natural projection to B, etc.
The sequence (5.5.1) induces a sequence: The group-extensions ( A ) , (tB) are required to be co~npaLiblewith the
C-torsor structure of h', and with each other (cf. [SGA 7 I] exp. VII).
4 ~ h notation
e is not totally misleading. If X O / G is an abelian variety, there is, in fact,
3 ~ h notation
e is not misleading. This ? is, in fact, the completion along the closed fiber an abelian variety over S whose formal completion at the zero part of the closed fiber
of a torus T over S (determined uniquely up to canonical isornorphism), but this is
is yf. See ([SGA7 I] exp. IX 57). Howcver, we make n o use of this fact.
irrelevant to us.
HISIGHT PAIRINGS VIA MEXTENSIONS 23 1
230 MAZUR AND TATE
a sheaf in T (and conscquently a formal algebraic space over S ) endowed It then follows from the general fact ([SGA 7 I] exp. VIII 1.5.2) that
with the structure of biextension.
A key step in the proof of (5.10.3) is:
(5.10.4) P r o p o s i t i o n . B I E X T ~ ( ? ( U ) ,Vt; 6,) i s equivalent t o the punc- To show that Biext\(?(U), Vt; 6,) vanishes, we note that (5.10.2) repre-
tual category. sents an exact sequence of sheaves in the topos T and hence our problem
is reduced to the following two vanishing statements:
We prepare for its proof.
(5.10.5) P r o p o s i t i o n . A n y formal group algebraic space over 9 is a
formal group scheme over S.
Proof. Let B be a formal group algebraic space over 5, and, keeping t o But (5.10.7) follows immediately from (5.10.6) and the argument of
the notation of loc. cit., let B1 denote its first truncation (loc. cit. V llefn. proposition 3.4 of. [SGA 7 I], ext. VIII. To see (5.10.8), consider an exact
2.4). Then B1 is an algebraic space over So and it inherits an algebraic sequence of formal group schemes over S.
space-group sLructure from that of 8 . By Murre's theorem ([I], Theorem
4.1, [14]), B1 is a group scheme over So. By (loc. cit. V Theorem 2.5) B is
a formal group scheme.
Since Z f ( k ) = 0, and & -+ Zf is formally ktale, there is a unique lifting
(5.10.1) Corollary. Let G I , C2 be formal group schemes over 9 (com- 7: Zf --+ & sending the k-valued point of Z f to zero in &(k). It is immediate
mutative, as always). T h e n a n y element e i n E X ~ \ ( G ~ , G
is ~representable
) t h a t 7 provides a splitting of (5.10.9).
by a n extension 0 -+ G 2 -+ & -+ G1 -t 0 in the category of formal group Since the "Cartier dual," M = I I ~ ~ ~ ( ? ( u6,) ) , is ktale, one easily
schemes over 5. ) , 6,) = 0. The proposition then follows from
sees that ~ i c x t $ ( ? ( ~ Vt;
Now form (5.8).
is an exact sequence of formal 5-group schemes, then (e) splits. But, for and a symmetrical reduction of Vt t o Z f establishes our proposition.
each n >0, (en) = (e) X S Sn may be seen to be an exact sequence
of group schemes over Sn,and splits by ([SGA 7 I]exp. VIII, Prop. 3.3.1). (5.11) C a n o n i c a l trivializations.
Moreover, a splitting of (e,) determines uniquely a compatible splitting Let U, V, T(U),-,, T(V)o be as in (5.10). Thus Uls, Vls are smooth group
of (en) for a11 n. Thus fi = 0. schemes with co~inectedfibers.
234 M A Z U R AND TATE HEICIIT PAIRINGS VIA IIIEXTENSIONS 235
References
Finally, since IIoms(Zf, M) = 0, we see that l4] D. Bernardi, l l a u t e u r p-adique s u r les courbes elliptiques.
Sdminaire de Thhorie des Nornbres, l'aris 1979-80, SCrriiriaire
~ i e x t : ( ~ Z~f,; 6,) = 0 Delange-Pisot-Poitou. Progrcss in Math. series vol. 12. Birkhauscr,
I3ostori-Bascl-Stuttgart. (1981) 1-14.
as well and (5.11.1) follows from (5.8). L. Breeri, h n c t i o n s thc^ta ct the'or2me d u cube, preprint of the
[51
Laboratoire associC au C.N.R.S. No 305, Universitd. de Itcnnes I,
(5.11.4) Paraphrase of part of the above proof, using cocycles. 1982.
From the fact that M is a projective lirnil of locally constant finite [61 V. Cristaritc, T h e t a functions a n d Barsotti-?'ate groups, Anriali
(formal) group schemes over 3, and that N = 0, the reader may verify della Scuola Norniale Superiore di Pisa, Serie IV, 7 (1980) 181-
a: Y f X Z f -
that if E is a biextension of ( y f , Z f ) by 6,) then there is a section
E such that the 2-cocyclc (5.6.2) 1~1
215.
M. Harris, S y s t e m a t i c growth of Mordell- W e i l groups of abelian
varieties in t o w e r s of n u m b e r fields, Inv. Math. 51 (1979) 123-
141.
is trivial. From the relations satisfied by &, cp, (cf. [12], 52 (a), (b), (c))
it follows that p, may be viewed a s an 3-morphisms from Z f X to zf
236 MAZUR AND TATE HEIGHT PAIRINGS VIA BIEXTRNSIONS 237
D. Knutson, Algebraic Spaces, Lecture Notes in Math. 203, [EGA I] A. Grothendicck, dle'ments de ge'ome'trie alge'brigue. Publications
Springer Berlin-Heidelberg-New York, 1971. Mathkmatiques, I. H. E. S. 4 Paris (1961).
P . F. Kurchanov, Elliptic curves of infinite rank over I'-extensions, [SGA 21 A. Grothendieck, Cohomologie locale des faisceaux cohe'rents et
Math. USSR Sbornik, 19, 320-324 (1973). Thebre'mes de LeJachetz locaux et globaux, North-Holland Publish-
J. Manin, The Tate height of points on an abelian variety. Its ing Company - Amsterdam (1968).
variants and applications. Izv. Akad. Nauk SSSIl Ser. Mat. 28 [SGA 31 M. Demazure and A. Grothendieck et al, Shminaire de Gkomktrie
(1964), 1363-1390. (AMS Translations 59 (1966) 82-110). Algkbrique du I3ois Marie 1962/64. Sche'mas en Groupes 11. Lecture
B. Mazur, Rational points of abelian varieties with values in towers Notes in Mathematics. 152, springer, I3erlin-Heidelb&g-~ewYork
of number fields, Inv. Math. 18 (1972) 183-266. (1970).
D. Mumford, Biextensions of formal groups, in the Proceedings of [SGA 7 I]A ~;othendieck et all Skrninaire de Gkometrie Algkbrique du
the Bombay Colloquium on Algebraic Geometry, Tata Institute of Bois Marie. 1967169. Groupes de Monodromie en Ge'ome'trie
Fundamental Research Studies in Mathematics 4, London, Oxford Alge'brique. Lecture Notes in Mathematics. 288, Springer, Berlin-
University Press 1968. Heidelberg-New York (1972).
D. Mumford, A remark on Mordell's conjecture, Amer. J. Math.
87 (1965) 1007-1016.
J. P. Murre, On contravariant functors from the category of pre- Received October 13, 1982
schemes over a field into the category of abelian groups, Pub.
Math. I. 11. E. S. no 23, 1964. Supported by NSF
A. Nkron, Quasi-fonctions et hauteurs sur les varikte's abe'liennes,
Annals of Math. 82, no 2, (1965) 249-331. Professor Barry Mazur
A. Nkron, Hauteurs et fonctions thtta. Rend. Sci. Mat. Milano Department of Mathematics
46 (1976) 111-135. Harvard University
A. Nkron, Fonctions thtta p-adiques, Symposia Mathematica, 24 Cambridge, Massachusetts 02138
(1981) 315-345.
A. Nkron, Fonctions th2ta p-adiques et hauteurs p-adiques, pp. Professor John Tate
149 -174, in Skminaire de theorie des nombres (Skminaire Delange- Department of Mathematics
Pisot-Poitou), Paris 1980-81, Progress in Math., Vol22, Birkhauser Harvard University
Boston-Basel-Stuttgart (1982). Cambridge, Massachusetts 02138
P . Norman, Theta Functions. Handwritten manuscript.
B. Pcrrin-Riou, Descente infinie et hauteur p-adique sur les courbes
elliptiques c i multiplication complexe. To appear in Inv. Math.
M. ltoscnblum, in collaboration with S. Abramov, The Birch-
Swinnerton-Dyer conjecture mod p. Handwritten ma~iuscript.
P. Schneider, p-adic height pairings I, to appear in Inv. Math.
J. Tate, Variation of the canonical height of a point depending on
a parameter. To appear in Amer. J. Math.
Ju. G. Zarhin, Ne'ron Pairing and Quasicharacters, Isv. Akad.
Nauk. SSSR Ser. Mat. Tom 36 (1972) No. 3, 497-509. (Math.
USSR Izvestija, Vol. 6 (1972) No. 3, 401-503.)
The Action of an Automorphism of C
On a Shinlura Variety
and its Special Points
J. S. Milne
111 121 13orovoi shows Lhat the analogue of (0.1) for non-corinect,cd Shimura
varieties irnplies the existence of c:inoriical models for all Shirnura varieties.
Wc adapt his r r ~ r t l ~ o dtos sl~ow,in scc1,ion 4, 5, and 6, that (0.1) implics
Langlands's conjccturr for all conncctcd Shirnura varieties.
TIIE ACTION OF AN AUTOMOltPIIISM OF C 241
In the final section we review the main consequences of this result: pair ( + G I+Xi-)also satisfying (C), a special point 'h E +X+,and an
1,anglxntls's conjecture i n its original form; the existence of canonical models iso~norphisnl$, = (g H +g): G ( A f ) - - t r C ( A f ) . (See also [l4]; in general,
in the sense of Delignc; l,nnglandsls conjccture describing the action of we shall use the definitions of [l4] and [15] rather than [8].)
complex corljugatior~ on a Shirnura variety with a real canonical model;
the existence of canonical models in the sense of Shimura. An cxpository T h e o r e m 1.1. Assume that G is simply-connected; then, with the
account of this, and related n~aterial,can be found in [ l l ] . above notationa, there ezists an isomorphism
I am indebted to P. Deligne for several valuable conversations and,
especially, for suggestions that led to the elirniriatiorl of a hypot,hesis on the
congruence nature of a.rit,flmetic subgroups in the statement of t . 1 main ~
theorem. such that
i into ' i . These are the objects in (1.1). The map 'h: S -t T% 'G is that Proposition 2.1. Let (GIX+) and (G', XI+) satisfy (C), and let
4:
whose associated cocharacter is TP,,, and " X + is the ' ~ ~ ~ ( ~ ) ~ - c o ~ ~ j u g a c ~ GA, -+ G'AI be a n isomorphism such that $(G(Q)+) = Cf(Q)+. For
class containing 'h . a n y finite e'tale morphism p : Mo(G,X+)-t Mo(G', XI+)compatible with
One other fact we shall need concerns the class 7 of ('GI ' i ) in H1(Q, T) 4, there exists a n element g E G(Q)+ and a n isomorphism 4,: G 3 G'
(equal to the image of P(T,p h ) under such that p = M0(+,) o g. I n particular, p is a n isomorphism.
The existence of 4 shows that the image of 7 in H ~ ( Q , , T )is zero, for all
finite 1; its image in H 1 ( R , T ) is represented by ~ ~ ( - - 1 ) / ~ ( - - l[14,
) 3.141.
Remark 1.5. Theorem 1.1 (in fact, Conjecture C O ) is proved in
((51, [Is]) for Shimura varieties of abelian type. Since we shall need to make
use of this result for groups of type A, we outline the main steps in its proof. satisfy the same conditions ay p and 4,and p o y[h] = [h']. It therefore
For pairs ( G , X + ) with G the symplectic group and X+ the Siegel upper sulfices to prove the following proposition.
half-space, (1.1) is shown in [15, 7.171 to be a consequence of a statement
about abelian varieties of CM-type. This statement is proved in [5]. Let Proposition 2.2. In addition t o the hypotheses of (.2.l), suppose there
G be of type A, and suppose that G is almost simple over Q. Then G can +
exist h E X+ and h' E X'+ such that p([h]) = [h']. T h e n is defined over
be embedded into a symplcctic group ([4, 2.3.10]), and the following easy Q and cp = Mo($).
lemma can be applied.
Proof. There exists a unique isomorphism @: X + q X ' + lifting all cpr,,r:
Lemma 1.6. Let ( G , X + ) satisfy (C), and let be a reductive sub-
r \ X+-+ I" \ X'+ and sending h to h'. Let @, be the map
group of Gad. Suppose that some h X+ factors through pRI and let a H @ o a o @ - I : Aut(X+) + Aut(Xt+). For any
X& be the ~ " ~ ( ~ ) + - c o n j class u ~ a containing
c~ the composite h' of h with
-
H -+ zad. A s s u m e that pad satisfies (C3) and let U be the s i m p l y 'cotr-
nected covering group of g a d . T h e n (11,~ &satisfiesj (C), and there is a n @,(a)induces an automorphisn~of Mo(G',X'+); in particular, it lies in
embedding M"(H, x;) L* M O ( G ,X + ) compatible with I I ( A f ) c-+ G(Af) the cornrnensur;~,bilitygroup of any I" E x ( C ' ) and therefore belongs to
under which [h'] I-+ [h]. If h is special, so also is h', and i f (1.1) holds for GfU"(Q)(see [I, Thrn. 21). Consider a q t G(Q)+, and write q, and q~
( G , X t ) and h , t h e n it does also for (H,x&)and h'. for its images iri G O ~ ( Q ) and
+ G(Q)T. Then q, and qf define the same
automorphisnl of M o ( G , X + ) , and so @,(q,) and +(qf) define the same
$2. Morphisms of Shimura Varieties automorphisnl of Mo(G', XI+). They therefore have the same image in
Gfad(Q)+A(relGI) = Gr(Q);+ Gfnd(Q)+. Therefore +(qf) E G f ( Q + )
A morphism cp: Mo(G,X+)--+ MO(G',X'+) will be said to be finite and
@(a)+
(and @,(q,) = +(qf) in G " ~ ( Q ) + ) . As G(Q)+ is Zariski dense in G, we
ktale if, for any r' E C ( G f ) , there exists a I' E C ( G ) such that co~icludethat 4 is defined over Q . Write 4, for 4 regarded as a Q-rational
map, and consider
Example 2.5. The last corollary applies to the Shirnura varieties defined
by simply connected groups without factors of type A,, n 2 8 (see (3.8)
below). For these groups G(Q); = G ( A f ) and so
We begin by recalling a theorem of Kazhdan. Let X + be a Hermitian
symmetric domain, so that the identity component G of A u t ( X + ) is a
product of connected non-compact simple real Lie groups. For I' an arith-
metic subgroup of G, I' \ X+ carries a unique structure of an algebraic
The propositions can also be used to conipute the automorphism groups variety, and so T(I' \ X + ) is defined, T E Aut(C).
of non-connected Shirnura varieties.
246 J. S. MILNE TtlE ACTION O F AN AUTOMORPHISM OF C 247
T h e o r e m 3.2 ( K a z h d a n ) . (a) The universal covering space XI+ of Choose X: + r(Tlg \ X + ) 9 3 its to make thc- following diagram commute
T(I'\ X + ) is a Hermitian symmetric domain. with t h e upper arrows, and choose Lj to rrlake it commute with the lower
(b) Let G' be the identity component of Aut(X1+), and identify the arrows:
fundamental group I" of r ( r \ X+) with a wbgroup of G'; then 1" is a XT
id
+
5 x:
lattice in G'.
(c) We can assume that K is the group used in the construction of r o . Proof. Margulis's theorem gives us a pair (Gl, 4s) satisfying (a) and
such that 4s is surjective with a compact kernel. We can suppose that I's
It is then clear that 1'' = l'o n K / % ( Q ) . 'L'hc second q u a l i t y follows from
the Grst and (a). is irreducible. Then GI is almost sirnple over Q and so cannot be of type
A. Therefore Gl1 does not have any compact factors and $1 111ust be an
isomorphism.
Later we s l d l show that T'" is contained in thc identity component G'
Let A be a subgroup of ro,s of finite index, and let AA =
of A u t ( ~ : ) , but for the prcsrut wc define I': = I'on GI.
We now fix an integral structure for C arid define, for any firiile set S of ((fi) E I J r E s r ( G ~OG,)
, I
fi(X) E Q1 fi(X) = f11(1)1 all 111' E S1X E A).
finite primes, G(A$) = n,,, G ( Q )x n,,, G(Zl). Let Toss = I.~~C(A!);
Thcn, for all sufficientlysmall A, AA is independent of A and Spec& = GI.
then %'o,snI': can he regarded as a subgroup of G b ZG'X G1. nleS This shows the uniqueness.
When we enlarge S, to S' say, then G1 does not change and
$ J ~ ,= ~ 4s.
G ~ We~therefore get a map
L e m m a 3.5. The group I'& is a n irreducible lattice i n G',.
4 s : Gls
properties:
-
L e m m a 3.6. There exists a n algebraic group G1 over Q and a m a p
cb, where Gls = G 1 X~ nlES
Gl,lr having the following
and
Proof. Let 7 E I'o, and let 7 E A u t ( ~ f X
as an element of h u t ( x f ) X G ( A f ) / % ( ~ ) For
maps Fo,s into ro,s.
.
) G l ( A f ) map to 7 regarded
large enough S, 7 E I ' o , ~ ,
We have proved (3.1) with G replaced by Gad. An argument of Borovoi L e m m a 3.10. There exist fundamental maximal tori Ti C GI,
(see 5.2a below; it is not necessary to assume there that G is simply i = 1 , . . .,k, and elements 7i
= Ti(Q) such that 7 = 7 1 . . . 7k.
connected) shows that the map GY,
-+ G$ defined by $J identifies G i d
with an inner form of Gad. Therefore $ J f : GA, --+ GIAt has the same Proof. Let U be the set of g E G l ( R ) such that the centralizer of
property, and so +f JZ(G) is defined over Q, i.e., $Jf identifies Z 5 Z(G) g is a compact maximal torus. Then, the usual argument using the Lie
with Z1 % Z(G1). algebra, shows that U is open in Gl(R). Moreover, U generates Gl(R). Let
From (3.4) we know that I'o n Z ( A f ) / Z ( Q ) = 1. Therefore any ele- 7 = 7;. . . ytk with 7: E U . According to the real approximation theorem,
ment g of GI(&)/%(&) can be written uniquely as g = 7 zg, 7 E ro, the ~ e Gt l ( Q ) n U is dense in U. We can thcrcfore choose 7i E G1(Q) n [J,
TIIE ACTION OF AN AUTOMORPFIISM OF C
252 J. S. MILNE
Remark 4.2. The condition (b) irnplies that TI, and any subtorus, is a
We therefore have a finite ktale map product of one-dimensional tori, and therefore satisfies the Hasse principle
for H1.
As T i is split, we can write
which is G(Af)-cquivariant when we identify Gal with GIAr by means of
4f. When we apply r-' to this map, and repeat the whole of the above Lie G i = Lie T i @ (Lie Gi),
construction for ( G I ,x:)
and T-l, we obtain maps aER
is an isomorphis~rland, on applying r-', we get that is defined over F , and we let H', be the corresponding connected subgroup
of GI. Write 11, = RcsFIQ H', and Z, = Z ( H , ) .
254 J. S. MILNE TIIE ACTION OF AN AUTOMOILPlIISM O F C 255
P r o p o s i t i o n 4.3 (Borovoi, OniYEic). Z(G) = na Za, (0 E Rntc). maps )I 3 t h and )1 ;--t = p t $ p- induce a C-linear, equivariant
isornorphism p+ 3 ttl,(sec [4, 1.2.141).
Proof. From the formula [p,p] = k, valid whenver a = k @ jl is the (b) It follows from [4, 1.2.71 that X'+= X+. Sincc
Cartan decomposition of a non-compact real Lie algebra, it follows t h a t
~nt+ c ~ n t >
c R . Thus
h and h' define thc same IIodgc filtration on Lie(G), but this implies that
they are equal [4, p. 2541.
and the proposition follows by applying ResFIQ,
P r o p o s i t i o n 5.2 (Borovoi (21). Let (G, X + ) , (G', X''), and T , p,
In this section, we use the rricthods of norovoi [2] to deducc (1.1) from Proof. (a) Let p ( ~ [ h ]= ) gt[h'], h' E XI+,g' E C ( A f ) (see the Appendix).
(3.1). For any t E ~ ( T ( Q ) ) ,t[h] = [h], and so $(t)g'[ht] = g'[hl], i.e.,
Let ( G , X + ) satisfy (C),
. . and consider ( T I h) C ( G , X + ) . For any
(gtP1$(t)gt ) [h'] = [h'].
q E T(Q), q[h] = [h], and so there is a representation ph of T on the tangent
space th t o M O ( G , X + )a t [h]. This implies that 9'-'$(t)gt E Gt(Q). Let g = $J-'(~'-'); then
$ o c&g o i: TA/ + G ~ maps I T ( Q ) into C t ( Q ) . As T ( Q ) is Zariski dense
Lemma 5.1 (cf. [2, 3.2, 3.31). (a) ph -
@ a , a E R(Gc, To), in T , this means lhat $ o a d g o i is defined over Q ; we denole it by i'.
(a,P h ) = 1. (b) Let h' be as in (a); then phr = 7ptL bccause (cp o g ) ( ~ [ h ]= ) [h'] and
(b) Suppose that ( C , X t + ) also satisfies (C), and that p o g is l'(Q)-equivxriant. Thercfore (b) follows from (5.lb).
(l",h') c (G, XI+); (c) As $ o & g is an isomorphisni ( G ,i)*, 2 (GI, iP)Af, it is clear that
such that Adp(z) acts trivially on a z on P+, and as z-I on P-. Remark 5.3. If in (5.2) we replace cp by cp o g and $ by 4 o u d ( g ) , then
Thus h,C jl' if and only if a(p(z)) = z , i.e., ( a l p ) = 1. The canonical p and $ are still compatible, b u t now p ( ~ [ h ]=
) [h'] itrlrl $ o i = it.
256 J. S. MILNE THE ACTION OF AN AUTOMORPHISM OF C
i
C o r o l l a r y 5.4. Consider ( T ,h) -+ ( G , X+),and assume that G is and taking ~ [ h , ]t o [ + h a ]The
. map
simply-connected and that 'I' 2'/Z(G) satisfies the ZIasse principle for
II'. Let ( T , 'h) i l , 'x+)
( ' G , be as in $1. Then there exist isomorphisms
We shall now need to consider two special points h and h' for a given Consider the diagram
( G , X + ) . To distinguish the objects 'GI 'X+,p,, . . . constrl~ctedrelative
t o h from the similar objects constructed relative t o h', we shall write the
former 'lhG, 'phX+, p,,h,. . . .
Gal(L/Q)
Let P ( r l h) and P(r,h') be t h e elements of ( G a d ( ~ L ) / C a " ~ ) )
corresponding
-
to h and h' respectively (see (1.4)). The image of P(T, h) and
P(r, h') in H 1 ( Q , Cad) are trivial a t the finite primes and are equal a t the
infinite primes (see (15, p. 315-3161). As Gad satisfies the Ilasse principal
for 111([9, VII. 61) this shows t h a t P(T,h) and c ( r , h') have the same image
in H1(Q, G ~ and ~ therefore
) B ~ F ( Th1)P(r,
, h)-' lies in G " ~ ( A J ) / G " ~ ( Q ) .
Moreover, there is an isomorphism f : 'yhG 4 ' P ~ ' Gsuch t h a t
The upper square commutes because of (6.1). The outside of the diagram
commutes becausc both maps send [h] to ['h'] = [f o 'h] and the action of
T ( ~ to
) thal of Thus the lower triangle commutes.
'lhtg.
L e m m a 6.5. If (6.3) is true for the pairs (h, h') and (h', h"), then it is
also true for the pair (h, h").
Proof. This is immediate, since d(r; h", h') 0 4(r; h', h) = q5(r; h", h).
set of compact roots in R(Gc, Tc), which determines K,. The centre Z of T = U' sornc r , where U is the unique one-dimensional non-split F-torus
IC, is 1-dirne~lsional(loc. tit,.), and h dcfines an isomorphism S / G , 5 Z; split by L, and therefore I '(I,/F, 1') x ( F X/ N L X )'. T h e penultimate
it is thcrcfore determined up to sign. assertion shows that c is represented by a family (ct, . . . ,c,) of totally
positive elements of F X . After adjoining JcT to P', i = 1 , . . . , r , we can
We now complete the proof of (6.3). As usual, we can assume that G assume c = 1. Thcn c, = t-l . a t some t E T(L) and so, after replacing /3
is almost simple over Q and therefore that G = ItesFlg G I , where F is with Pt-', we can assume it lies in G l ( F ) . Regard P as an element of G(Q).
totally real and G1 is absolutely almost simple. Let ( T , h ) C (G, X + ) and Lemma (6.5) and the first part of the proof show that we need only prove
(TI, h') c ( G , X + ) , and lct TI, T i C G I be such t h a t RcsFIQ TI = T and t h e theorem for &P o h and h'. This means t h a t we can assume t h a t h and
ResFlq T i = T'. There exists a CM-field L splitting both I; and T:. After h' factor through the same torus. But then they must be equal because,
replacing G with G, = ResFtlQ where F' is the maxinial totally in the context of (6.6), h-' does not lie in the same connected component
real subfield of L, and using (6.4), we can assume that L is a quadratic as h.
extension of F. As and T:,L are split, there exists a p E G1(L) such Finally, suppose l(wvo) # 0, say wvo = sawlo with l ( w ~ , ) < l(wvo)
t h a t PT1P-' = Ti. For each real prime v: F C-+ R of F, choose an and s, thc reflection corresponding to the root a. If a is compact a t v,
extension (also denoted by v) of v to L and write Zf, for B g F , , R , any then s, lifts to 7, E N ( F v ) (see [15, p. 3081) arid we can replace yVowith
yuo7, . Then w , , ~is replaced with w',~ and we can apply the induction
-1
F-group 11. As TI,, arid Ti,, are compact, there cxisls a 7, E G,(R) such
that yv2;,v7;1 = Ti,,. Let c, = p-' . up, where o generates Gal(L/F). hypothesis. Supposc therefore that a is not compact a t v and define
Then c, E N(L), where N is the normalizer of T, and so it defines a class H L C G I , 11: > TI, Zf; of type A,, as in $4. Let ZZ, be the derived
c E H1(Z,/F, N). AS 7;' . v(P) E N ( C ) and group of Ilb, and let T, = 11, n 1'. Then T, -- U ,which implies t h a t
N'(L/F, T,) + $, II1(Lv/Fv,T,) is surjective. Choose c, E I l l ( L / F , T,)
mapping to (c,) where c, = 1 for v f v, and cvo = 6(s,) where S is the
boundary map
where L denotes complex conjugation, we see that c maps to 1 in
l l ( l , v / ~ wN,) . Let w, be the image of 7,' v(P) in W(L,), wherc
W = N I T . The image w of c, in W(L) is L because v(w) = w;' . LW, and N, = Norm(%). Then c, maps t o 1 in $N'(L,/F,,N,). T h e Hasse
L acts trivially on W(Z,,) (see [15, p. 3071). Thus c, E T(L) and principle therefore shows that c, splits in II1(L/F, H a ) : (c,), = g-' . og,
g E H,(L). L i f t s, to n, E N,(L); then v,((c,),) = n z l . m , and so
gn;' E H,(R). Since we know the theorem for ltesFlq H a , (6.4) allows
us to replace (T, h) with (&g o T, &(gn;') o h). This replaces ,8 with
The following diagram is useful: /3g-', yvO with rvon,g-l, 7, with v # v,, wvo with gs,'wvog-l =
g ~ ' , ~ g - ' , and w, wilh gwvy-l, v # v,. Thus C l ( w , ) is di~niriished,and
we can apply the induction liypothesis.
$7. Conclusions
We now prove (6.3) by induction on 1 = ~ , l ( w , ) , where l(w,) is the We have shown that Conjecture C0 of [15, p. 340-3411 is true for ( G , X + )
length of w, as an element of W(C). Suppose first t h a t 1 = 0. Then whenever G is simply connected. As is remarked in [15, 9.61, this implies
'
7,' . v(P) E 7'(Lv) and so c maps to 1 in ZI (Lv/Fv, T ) for all v. Note that the general case.
THE ACTION OF AN AUTOMOItPIIISM OF C 263
262 J. S. MILNE
Theorem 7.2. Canonical models (in the sense of (4, 2.2.51) exist for Such a (G, X+) definrs a connected Shimura variety M O ( G , X + ) . The
all Shimura varieties. topology r(G) on gad(^) is that for which Ihc images of the congruence
sulr)groups of C(Q) form a fundamental system of neighbourhoods of the
identity, and
Proof. This is a consequence of (7.1) (see (15, $71).
MO(G,x+) = lip r \ X+
Theorem 7.3. The conjecture of Langlands describing the action of where the limit is over the set C ( G ) of torsiori-free arithmetic subgroups of
complex conjugation o n a Shimzira variety having a real canonical model G a d ( Q ) that are open relative to the topology T(G). For h E E X +[h] , and
[8, p. 2341 is true. [hIr denote the iniages of h in M '(G, X + ) and r. M o ( G , X + ) I' \ X+.
,4ny LY E G U d ( Q ) +acts on M O ( G ,X + )by transport of structure: a[hjr =
Proof. This again follows from (7.1) [a o I L ] , ( ~ ) . Any g E GI&); acts aa follows: Let r E C(G)and let K
be a compact open subgroup of G ( A f ) such that r contains the image of
Theorem 7.4. The m a i n theorems of [IS], viz. (4.6) and (4.9), are K G ( Q ) + ; then g E q K some q E G ( Q ) + , and g[h]r df = [adq0 hlBrQ-l.
true for all Shimura varieties. These actions combine to give an action of G(Q)T X Gad(&)+ (semi-
direct product for the obvious action of C a d ( Q ) + on G(Q)T). The map
Proof. They are proved i11 [13] under the assumption that G is classical q H (q,uJq-l) identifies G(Q)+ with a normal subgroup of the product,
and thc, canonical model exists, but the first assunlption is only used to and the quotient
simplify the p o o f of [13, 1.31, and we can instead deduce this theorem
from Proposition 2.1 above.
continues to act on M o ( G , X + ) . The completion of Gad(Q)+ for the topol-
Remark 7.5. Theorem 7.4 gives a definitive answer to the question of
ogy r(G), Gad(Q)+A(relG), is equal to C(Q);,G(g)+C""Q)+, and this
Slhnura 119, p. 3471.
ideritificat,ion is compatible with the actions of the groups on Mo(G,X + )
(see [4, 2.1.6.21).
Remark 7.6. For Shiniura varieties of Abelian type, (7.1), (7.2), and
Any x E M O ( G , X + )can be written x = g[h] for some g E G ( Q ) I and
(7.3) were first proved in ([5], [15]), [4], arid [12] respectively.
h t X + . For suppose xr = [h]r.;then, for any 1'1 c ,'I Xr, =; 71.1 [ h ]some
-yrl E F; let 7 = limrl,l y r l , and let 7 = g * a;then x = ? [ h ] = g[a(h)].
If G is simply connected, then G(Q)+ = G(Af); moreover
Appendix
We say that (G, X+) satisfies (C) if G is a semi-simple group over Q and
X+ is a G(R)+-conjugacy class of maps S -4 G$ for which the following
if K is a compact open subgroup of G ( A f ) containing Z(Q), and r is the
image of K n C ( Q ) in Gad(Q)+.
hold:
TI1E ACTION OF AN AUTOMORPIIISM OF C 265
To I.R. Shafarevich
Introduction
then get a variety over Fp which is on the one hand a moduli space for T h e o r e m 1.2. Let X o / S p e c k be a n ordinary K3 surface. T h e m a p
polarized K3 surfaces of dcgree d in ch;~ractcristicp and on t h r other hand
the reduction of the canonical rrlodrl of a Shimura variety. Langlands has (130. classes of liftings X/ Spec A )
conjectured a purely group theoretical (or linear algebraic) description of -+ {Iso. classes of liftings G / Spec A )
the set of Fppoints of the reduction of a Shimura variety [9]. In the present
case it then makes sense t o use the term "periods" for the elements of this defined by
set and t o view Langlands' conjecture as a conjectural Torclli theorem for X / Spec A I-, $x/ Spec A
polarized K3 surfaces over Fp. One instance t h a t indicates t h a t Lhis makes
is a functorial isomorphism.
sense is the suprrsingular case. Here Langlands' conjecture predicts t h a t a
supersingular K3 surface is uniquely determined by its crystalline cohomol-
ogy. This was conjectured by Ogus in [ll]arid recently proved by him [12] Proof. [lo] theorem 1.3.
as a consequcncr of Rudakov's and Shafarcvich's results on degenerations
of K3 surfaces in characteristic p [14]. Since hcight one groups arc rigid, there is precisely one lifting C; of
B T $ ~to Spcc A. Si~~iilarly
dtale groups arc rigid so there is precisely one
lifting G? of $goto Spec A. It follows t h a t if G is any lifting of $xo t o
Spec A we have an extension
1. Review of the Canonical Lifting and the Kuga-Satake-
Deligne Abelian Variety
In this section we recall some of the results of our previous paper [lo] liflirlg the extension
and of Deligne's paper [4].
Let k be a perfect Geld of characteristic p.
Definition 1.1. A K 3 surface X o / k is said t o be ordinary if the In articular we can consider the lifting of $x0 defining the trivial
following equivalent conditions are satisfied. extension G = G i X (:$. 13y the theorem there exists a unique lifting
XCan,,/SpecA such that $xc=,,,= G X G gI. Now let A = Wn and put
Xn = X,,,,w, then we get a proper flat formal schcrne {Xn)/ Spf W.
(i) The formal Brauer group B T $ ~has height 1.
(ii) The frobenius F: H2(OxO)4 1I2(Ox0) is bijective.
T h e o r e m 1.3. The formal scheme {Xn)/Spf W i~ algebraizable and
(iii) The EIoclge polygon and the Newton polygon of H&,s(XO/W) coin-
dejines a I(3 surface XcUn/SpccW . Furthermore Xcanhas the property
cide.
that any line bundle o n Xo lifts uniquely to X,,,.
Let A be a n artinian local ring with residue field k and let X/ SpecA be
Proof. [lo] proposition 1.6.
a lifting of the K3 surface Xo/ Speck.
By Artin-Mazur[2] the enlarged formal Brauer group $xAdefines a
pdivisible group on Spec A lifting $xo / Spec k. Next recall from [4] the following facts: Let (X, L ) / Spcc V be a polarized
In [lo] we proved the followirig theorem.
K3 surface where V is a discrete valuation ring with residue field k a finite
extension of Fp and fraction field K a finite extension o f Q p
Choose an isomorphism K rr C and base extend X t o a K3 surface
x, / Spcc C .
270 N. 0. NYGAARD THE TORELLI TI3EORb;M FOR K3 SURFACES
We let p2(Xc,Z(1)) denote the primitive part of the cohomology, Le., It follows from theorem 1.6 t h a t we have isomorphisms
W)Y-
(i) Ahas complex multiplications by the even Clifford algebra where a) is an isomorphism of rational Ilodge rtructures and b) is an
C = c + P ~ (Z(1))
x ~ ,of the bilinear form o n p2(xc,~ ( 1 ) ) . isomorphism of Galois modules.
The canonical lifLing of A0 is characterized by the fact that the frobcrlius
(ii) H1(Ac, Z) is isomorphic to c + P ~ (~x ( 1~ , a right C-module,
) )as
lifts to A,,,. It follows that we get an isogeny U A t ~End(Acun,c)
~ ~ @Q
and left multipliction defines a n isomorphism of algebras
and hence by c) above an isogeny U A E End(&) @ Q. Since the frobenius
on A. cornmutes with the action of C it follows that U A c 13ndc(Ac) @ Q.
The isogeny oa induces an vutornorphisrrl of H 1 ( A o , Q ) compatil>le with
the IIodge structure and the right C-niodule structure hence an elerrlent
(iii) p is a n isomorphism of f o d g - structures. U A E ~ n d c ( H ' ( ~ Q)).
c,
We define an algebra autoniorphisrn oxo of C+l'2 (XCan,c,Q(1)) by
(iv) Tensoring with Zl and identifying Zl-cohomology with l-adic e'tale
cohomoloyy p induce.? an isomorphism of e'tale sheaves o n Spec K t
where f : XKI-+ Spec K t and y: A--+ Spec K t are the structure maps.
(v) A has good reduction. compaLible with the cup-product and the Hodge structure and such that
Fx0 is identified with the geometric frobcnius urtder the canonical isolnor-
Assume now that (Xo, Lo) is a polarized ordinary K3 surface over Fq and phism
consider the canonical 1ifLirig Xcan/Spec W with the unique polarization f p2(Xcun,c, Q(1)) @ QI P:t(x~,
Q((1))-
lifting Lo. Applying the above Ltieorern to (Xcan,L) we get an abelian
variety A over a finite extension of Qp.
no, i n T h e o r e m 2.1. Let (Xo, Lo) and (Xb, Lb) be polarized ordinary K 3
T h e o r e m 1.6. A is isogeneous t o the canonical lifting A,,, of
surfaces over Fq. Assume that 4: M(Xh)--1 M(AYo)is a n isomorphism
the sense of ordinary abelian varieties.
272 N . 0. NYGAARD THIS 'I'OREL1,I TtIrCOREM FOR K 3 SUILl'ACES 273
is bijective.
TIIE TORELIJ TIIISOREM FOR K 3 SURFACES
References
[I] Artin, M., Supersingular K3 surfaces. Ann. Sc. kc. Norm. Sup. Lie
and the fact that p is an iso~norphismof IIodge structures implies that skrie, t. 7, 543-568, (1974).
[2] Artin, M., Mazur, B., Formal groups arising from algebraic
varieties. Ann. Sc. EC. Norm. Sup. 4e &riel t. 10, 87-132, (1977).
[3] Berthelot, P., Cohomologie cristalline des schemas des caracte'r-
is compatible with the 1Iodgc structures. It follows from [lo] lemma 3.2 istique p > 0. Lecture Notes in Math. no. 407, Springer Verlag.
that we have injective maps of Hodge structures Berlin, Heidelberg, New York, (1974).
[4] Deligne, P., La conjecture de W e d pour les surfaces Kt?. Inv. Math.
15, 206-226, (1972).
I~cligne,P., Vurie'te's abe'liennes ordinaires sur u n corps fini. Inv. Real Points on Shirnura Curves
Math. 8, 238 -213, (1968).
Uclignc, P., Illusie, I,., Gristaux ordinaires et coordone'es canoniques.
In Surfaces Algcbrique, Serninaire Orsay, 1976-78. Lecture Notes A. P. Ogg
in Math. no. 868, Springer Verlag. Berlin, iIeidclberg, New York,
(1981).
Katz,. N.,. Serre- Tate local m o d u l i of ordinary abelian varieties.
In Surfaces Algcbrique, Serninaire Orsay, 1976-78. 1,ccture Notes
in Math. no. 868, Springer Verlag. Berlin, IIcidclberg, New York, Let 13 be a quaterriion algebra over Q , i.e., a central simple alrgbra of
(1981). dimension 4 over Q. We assume t h a t FI is indefinite, aqd fix an identification
Kuga, M., Satake, I., Abelian varieties associated t o polarized K3 of I?, = B @ R with iM2(R). T h e discriminant L) of I1 is thcn the product
surfaces. Math. Ann. 169, 239-242, (1967). of a n w e n number of distinct primes. T h e completion Elp = B Jj Q,, is a
Langlands, R., S o m e c o n t e m p o r a r y problems w i t h origins in the skew field if p I I) and is isomorphic to M2(Q,) if p y D , and D = 1 if and
J u g e n d t r a u m (FIilbert 's problem 12). In MatliernaLical dcvelopntents only if 13 is isomorphic to M 2 ( Q ) , i.t'., B is not a skew field.
arising from IIiltwrt problems, S y ~ n p in . Pure Math. vol. XXVIII, T,et cr H at denot,e the canonical i n v o l u t i o n of B; ttlus (ap)' = /3'a1, and
part 2. American Math. Soc. Providence, Rhode Island, (1976). +
the trace s ( a ) = a at and the n o r m n ( a ) = a d are in Q for any a E B.
Nygaard, N., T h e Tate conjecture for ordinary K3 surfaces over If R oi onc of its corr~plclionsis isomorphic t o a rriatrix algebra M 2 ( K ) ,
fin it^ fields. 'I'o appear in Inv. Math.
Ogus, A, Stipersingular K3 crystals. Journeks dc GEometrid Algd-
then (::)I = (yc a n d the trace and norm correspond t o the trace
and dctcrrninant of a two-by-two matrix. An e1cmrr:t a of E) is an integer
brique. Rennes 1978, Asthrisque no. 64, 3-86, (1978).
if s ( a ) w~rdn(n)arc in %. An order in B is a subring 0 of B, consistJirig of
Ogus, A., A crystalline l'orelli t h e o r e m for supersingular K3 sur-
integrrs and of rank 4 over Z . T h e corresponding local orders are written
faces. This volume.
Op -- O @ Z p , a n d we have 0 = I1 np Op.
Sh;tfarevich, I., Piatetski-Shapiro, I. A l'orelli t h e o r e m for surfaces
Givcn an integer 11' 2 1 which is relatively prime to I), an Eichler order
of type K3. Math. U.S.S.R. Izvestija 5 , 547-588, (1971).
of levd I' is an order 0 such t h a t the local order Op is maximal if p /Y F,
Shafarevich, I., Iludakov, A., Degeneration of K3 surfaces over
and if p / F ,there is an isomorphism of TIp onto Mz(Op) carrying Op onto
fields of finite characteristic. Prcprint.
Tate, J., E n d o m o r p h i s m s of abelian varieties over finite fields. Inv. {(:, :) : a, b, c , d E Zp . Thus an Biehler order of level 1 is a m a x i ~ n a l
Math. no. 2, 134-144, (1966). order. All ordcrs in th'is paper will be Eichlcr orders. Eichler has shown
Murnford, D., Matsusaka, T w o f u n d a m e n t a l t h e o r e m s o n d e f o r m a -
tiona of polarized algebraic varieties. Amer. Jour. of Malh. 86, [3,41:
(1964).
1) There is only one Eichler order of a given level F, u p to conjugation,
i.c., tile general one is a0a-', where a E B X .
2) 0 contains a unit of norm -1.
Reccivcd August 10, 1982
T h e group O ( ' ) of units of norm 1 in 0 is thus of index 2 in 0 X . It is a
Professor Nicls 0 . Nygaard
discrete subgroup of SL(2,R ) a n d so s c l s on the upper half-plane g). Since
Department of Mathematics
O ( ' ) depends only on D and F, up to conjugation, we have a well-tieli~~cd
University of Chicago
Rientann surface 0 ( ' ) \ 2). If D > 1, this Riernann surface is compact
Chicago, lllirwiu 60637
278 A. P. OGG REAL POINTS ON SIIIMUItA CUItVES 279
The basis results on the arithmetic of quaternions which are used in this +
and el = a r e n , so a e l = ne2 and el ae2 = se2, as desired. This works
paper can be found in the papers of Eichler or in the recent lecture notes (
if 7 = (el e2) = a&--by ) z
y-cz y E 0 2 , i.e., if we can find x, y E Zp with
[la] of VignCras. 111 Z or Zp we write a I 6 if a divides 6, a 11 6 if a divides 6 +
cx2 (d - a)xy - by2 E Z F . This is possible unless p divides c, d - a , and
exactly, i.e., if a divides 6 and a is relatively prime t o b/a, and a - b if a 6, in which case a is not primitive.)
and 6 are associates.
It is a pleasure to thank Y. Ihara for several helpful conversations. Theorem 1 (Eichler). Let h(ll) be the cla.93 number of R . Then
(t)
i
Remark. 1) Here we use the Fact that the class number h ( 0 ) is 1; in 2 (if = 1)
general, when 11 is not ncccssarily indefinite, thc left sidc would be a sum
over h ( 0 ) classes. The result is not stated in exactly this form in [5], but b) If p f 11 I" (say f - pk), then vp = pk (if (f) = 0).
the proof is the same.
2 ) Let Ill be the principal order in L, i.e., the order consisting of all
O (if ( t )= -1)
integers, and let Rf = Z -k f K l be thc ortlcr of conductor f . Their class
-- pZk
numbers arc connectctl by Ucdckiritl's formula
(if r,. )
d) If ppF f I then up - (if j' p2k+')'
Proof (sketch). Parts i) and ii) are very fa~niliar,so let us assume that
p2 I F and Op '= : a , b, c , d E Zp . Then Tip = Zp[a], wlicre we
where (k) (4+
is the Legendre symbol, if I, is a field, and oll~crwise = 1.
assume t h a t 3 =. s ( a ) and n = n ( a ) satisfy
In that last case, say I, = Q + Q = {(: :) : a , d E Q), so Rl = z z
has four units k l , *c, where 1 = ( o1 0'), r = (A
and h ( R I ) = 1; then n=O and f 11 3 1
i f f 2 3 n - p 2 a n p
iff 2' < n - f 2 and f 1 3 - 1.
3) The generalized Legendre symbol for the order R = Rf is
Thus we take n as small as possible. If up 2 1, the11 we can write
Rp = Zp[P] with
( ' )
=
( 1, so Fln(p)ln. Conversely, if F I n , then
is primitive in 0, with the desired norm and trace. Thus up 2 1 if
and only if F I n, so up = 0 exactly in t h e cases listed in the statement of
The forrriulas for the local f;xtors up(R, 0 ) are then: the theorem, and we may assume that F ( n frorn now on.
i) If p / D, then up = 1 - (f).
ii) 11 F, then up = 1 +
If p
iii) Suppose p2 I F. and ae2 = 0 = a l e l , so a -
(;F, :), or p )/d and a - (i:). Thus up = 2
in this case, in agreement with a).
a) If (pf)' I F, then up =
I
Suppose p f from now on. Then p 3, so if a = (ic is primitiveI :)
(otherwise). in 0, with norm n and trace Y, then p divides both a and d and so p
282 A. P. OGG REAL POINTS ON SIIIMURA CURVES 283
cannot divide both b and c. If p [b, let do be a standard representative The genus y(m) of s(~
=)~ / ( w ( m ) )is related to the genus g of S by
of d modulo P. Then 7 =
-
y)
(d2do E 0: and 7-'a7 = Fco dol ) -
--
tdO -
a(do) a. Conversely, any do with do(3 - do) 0 (rnodF) leads t o a
solution rw(do), since P I n. Similarly, if p I b but p j'c, we find that a - where e(m) is the number of fixed points of w(m) on S. Let P E S be
a fixed point. Suppose that P is represented by z E B. (Fixed cusps
(7 .".).
-
o(ao) = with bo divisible by p and a0 a standard representative occur only when m = 4; cf. [12, Prop. 31.) Then p(z) = 7(z), where
of a modulo k'. Thus up is the number of solutions x (mod F ) of ~ ( -3x) r 7 E o('). We replace p by 7 p and assume that p(z) = z; we assume also
O(mod F) plus the nunlber of solutions y (mod F ) of y(y - 3) that s(p) > 0, replacing p by -p if necessary. Then Q(p) is a11 imaginary
n(modpF).
-
-
In case a), for example, where n = 0 and f (1 3, and (pf)' I F, say f quadratic number-field. Since p' generates the same ideal 1 = l', we have
pk, the first equation has as solutions z r 0 , s (mod 1~'/pk),which is 2pk p' = cp, where t E o(') n Q(p). In general, t = -1 so p 2 = -m; the
solutions (mod P'), and the second has as solutions y 0, 3 (mod Flpk-l), + +
other possibilities are E = 1 i = 1 ~ 4 if , m = 2, and E = 1 - (3, if
or 2pk-' solution (mod F). The other cases are similar.
-
m = 3; here and in the following cn denotes a primitive n-th root of 1.
The order R = Q(p) n O contains Z[p]; if it is larger, ttien we claim t h a t
m +
3 (mod 4) and R = Z[(1 p)/2]. (Assume that p 2 = -m and let
2. The Atkin-Lehner Group.
As always, let 0 be an Eichler order of level F in the quaternion algebra
I? of discrirninant D. As shown by Eichler, the group of (nori-zero fractional
+
--
a = ( a bp)/2 E 0, where a, b, E Q . Then a = s(n) E Z, and -bm =
+
s ( a p ) E s(l(m)) C m . Z, by (I), so b E Z. Since n ( a ) = (a2 mb2)/4 E Z,
we have a h (mod 2) if m = 3 (mod 4), and otherwise a and b are even,
using the fact that p is primitive in 0, since it generates the ideal I(m), in
two-sided) ideals of O is as follows. We have the obvious ideals xO for x E the case m 0 (mod 4)).
0 X . In addition, if m 11 D F , we have an ideal I = I(m), non-obvious if Thus, t o z E @ representing a fixed point p E 5 of w(nz), we have
m # 1, with I2 = mO. To define it, it is enough to describe the local associated two optimal ernbeddings of R into 0 , corresponding to p and p'
conipletion Ip at any prime p. We take Ip = Op if p m. If p I m (with trace 2 0). Consider an equivalent embedding, defincd by 7p7-',
and p I Dl then I , is the ideal of all non-units in 0,. If pk 11 m arid F, where 7 E O X . If n(7) = 1, then 7/~7-' fixes 7(z), which also represents P.
and we identify Op with
($ i)Op - 4($A).
{(p:c
Note that
I
t ) : a, b, c, d t Zp , then we identify 1, with
Suppose then that n(7) = -1. Then 7p7-' futes 7 ( ~E) @, which represents
the complex conjugate point P E S, according to the real structure of S
defined in [17], and discussed more fully in the next section. If P = P,
i.e., (changing the notation) if Z = 7(z), where 7 E O with n(7) = -1,
then z and hence z is fixed by 7p7-', so TI"T-' = p'. (We do not have
(1) s(l(m)) c m z . rp7-' = p, for then 7 would be a unit in R arid have norm I .) Cor~versely,
The quotient group of all ideals modulo the obvious ideals is generated
suppose that p' = 7p7-', where 7 E O X . Then p' Fies 7(z), SO 7(z) = z
by the I ( m ) and YO is isomorphic to C i , where r is the number of prime
or Z; actually 7(z) = Z, i.e., P = P, since if 7(z) = z , then 7 E O(p)
factors of Dli' and Cz is the group of order 2.
We can write I ( m ) = pO = 011, where p E 0 has norm m, using
arid 7p7-I = p and not p'. Thus I' = P if and only if /I - p', and the
number of fixed points of w(m) on S is the nurnber of inequivalent optimal
Eiclder's results that any ideal (left, right, or two-sided) of 0 is principal
embeddings of R into 0, such that p 0 = I(m). Now any two such p are
and O has a unit of norm -1. Then O = pO Xp-ll since 0 = iiOp-l,
locally equivalent at any prime p dividing m (by Theorem 2 and its proof).
and also o(') = pO(l)p-'. Hence p defines an autornorphism w(m) of
By Theorem 1, then, we have (expect in the case D = 1 and m = 4)
S with w(m)' = 1, called the Atkin-Cchner involution a..sociated to m;
w(m) # 1 if m f 1. These involutions from the Atkin-Lehner group
PI W = {w(m) : m 11 D F ) -- C;.
284 A. 1'. OGG REAL POINTS ON SIIIMURA CUIlVES 285
anticommuting elements ,f3,7 with P2 = m and 72 = i l l so the (reduced) i) If -p = 7P7-', wh.ere 72 = -1, t h e n one e n d o f X p = pm(Zp)is
discriminant D F of 0 divides the discriminant 4m of this module; thus a real E - p o i n t , and every real C-point arises i n this way.
m 11 D F I 4m. Suppose that 72 = -1. Then 7 is an elliptic element of ii) If -0 = 7p7-', where 72 = 1 (necessarily D = I), t h e n one end
o('), of order 4, with a unique fixed point zo in B. Since B7 = -7P, we see of X p is a fixed point of w(m), and every real fixed point of w(m) arises i n
that P(zo) is also fixed by 7, and in the lower half-plane (since n(P) < O), this way.
so p(zo) = z0. Thus zo E Zp and p,,(zo) is a real image of an elliptic fixed iii) T h e two ends of Xp are of opposite type (as i n i) and ii)) if and
point of order 2, or a real "E-point" for short. Note that pl = is also only if R = 0 n Q(p) contains a unit of n o r m -1.
in U(m), and zo E Zp,. The sets Zp and Zp, meet at a right angle at zo
+
(say because pl = (1 + -y)-'P(l + ?), and 1 7 acts as a rotation through Proof of iii). If -p = 7/3r-1 = 71~r11, then 71 = 7p, where
90), so Xp and Xp, meet a t Po = cpm(zo)a t an angle of 180, as they
should. It may happen that P -
PI, in which case X p is a circle, i.e., a p E 0 and p commutes with p, i.e., p E R x , and conversely. The type
changes only if n(p) = -1.
component of s ( ~ ) ( R ) ;we discuss this possibility later. Conversely, let zo
be an elliptic fixed point of order 2 mapping to a real point Po on ~ ( " 1 .
Now suppose that Xp, meets Xp, in a non-cusp, where Pi E U(m) and
Then zo E zp, where P E U(m), and 20 is fmed by 7 E 0, where 72 = -1.
P2 is not equivalent to +PI; we can suppose that there is a z E Jf) with
Then fi7(zo) = f o ,so PI = /3-y has trace 0 (as shown early in this section),
i.e., /3 and 7 anticommute and we are in the situation just discussed. Thus z E ;5p, f l Zp,. Then /32 = PIT, where 7 E O X and n(-y) = I , and
7(z) = z. Thus z is an elliptic fixed point of order e = 2 or 3, and we can
-0 = 7P7-1, with 72 = -1, if arid only if an end of Xp is an E-point. suppose that 7 is of order 2e = 4 or 6. We have p2 = p l y = 7',81,since
The number of elliptic fixed points of order 2 on S is v(Z[i],0). If this
= -Pi If e = 3, then 7/317-1 = = --Ply = -P2, contrary to
number is # 0, t,hen 4 1
F, and the number is either 2' (if 2 D F ) or 1
27-1 (if 2 (1 DF), where r is the number of prime factors of D F . In either the assumption. Thus e = 2, so we are in the situation i) of Propositiori 2:
case, the E-points form a single orbit under W, with (w(2)) as stabilizer
P r o p o s i t i o n 3. If the set8 Xp, a n d Xp, meet i n a non-cusp P, d
when 2 11 D F . In particular, they all look the same, on S or on ~ ( " 1 .
6
Consider now the second case; /37 = -7P, with 72 = 1, possible only
PI I f ~ t P 2 ,t h e n P is a real E - p o i n t .
if D = 1. Then n(7) = -1 (otherwise 7 = r'), so 7' = -7. Then
a = P 7 defines w(m) and a2 = -m, so a has a fmed point z E B. Since a
We have seen that real E-points can occur only if m I D F I 2m, and of
aniiconlrnutes with /3 and with 7, it fixed P(z) and 7(z), which are therefore course ~ ( m > ) 0 and v(Z[i], 0) > 0. I,et us show next that these cases
z, so z E Zp n 2,. Thus X p and X7 = pm(Z7) meet at a real fixed point do occur, a t least if m is not a square (automatic if D > 1). Suppose
first that D F is odd; then rn = Dl!' and m r 1 (1nod4), since D is the
p,(z) of iu(m). Conversely, let P = p(z) E S be a real fixed point of w ( m ) ,
product of an even number of primes, each r 3 (mod 4), and p r 1 (mod 4)
not a cusp; we can assume that a(z) = z and P(z) = z, where a defines
if p I F (we are assurriing that i E 0 ) . P u t Rl = Z[(l + f i ) / 2 ] and
w(m) and p E U(m). We assume that m > 3 and hence a2 = -m. (If
m = 2 or 3, then F divides 8 or 12 and S = Xo(F)has genus 0, and the R2 = Z [ f i ] . We realize Ll in standard form as
real locus of any quotient of S is a single circle.) Write a = p 7 , where
7 E 0 and n(7) = -1; then 7(z) = 2 and s(7) = 0, so -a! = a' =
7'P' = 7P = -P7, and we are back in the situation just discussed. This
proves the first two parts of: ( A prime p ramifies in R if and only if p I m and (f) = -1, i.e., p I Dl
P r o p o s i t i o n 2. Let PE U(m). The possibilities for P - -P are as
so this is the quaternion algebra of discriminant D.) Put /3 =
( -+)
0
follows. and 7 = A). Then 0 = {(,; :,) :a, b E RI is an order in l3 of
288 A. P. OGG ItEAI, POINTS ON SIIIMURA CURVES 289
with 7 c (i o,) and /3 rt (" A). ) Thus we d o have real 6-points in this
n0
case. Furthermore, Q(/'?) = 111, while PI
Q ( p l ) n 0 -- R 2 , so [j and arc c c r t a i ~ ~ lnyo t equivalent. Finally, we we have w h a t we want. Note t h a t in either case we have
claim t h a t if 1) > 1 anti we arc in this s i t u a t i o ~ ~thcn
, t h e component of
s("')(R) contairling these sets is
Ilet us treat first t h e case m = 2t = D F . Here we take /3 = (1 + p)7 =
7(1 - p), so P2 = m , p2 = -1, and P p = -pP. Thus there is a real
E-point a t a n end of X p , a n d we need to know whether p p is equivalent
+
t o /3 or not. Now (p l)-l/'?(p + +
1) = 7(p 1) = y ( l - p)p = Pp, so the
general solution a E U X of spa-' = P p is (p + 1 ) a = x + yP E Q(P).
+
Here a E 0 if and only if x y P is a11 e l e ~ n e n of t 12 = 0 n Q ( P ) = Z [PI
of norm 2, i.e., p - p p if a n d only if x 2 - m y 2 = &2 is solvable with
i.e., there are only two E-points on t h e componrnt. This is truc because x, y E Z. If this is not solvable, a n d D > 1, thcn we have #(m) = u(m)/2
t h e stability gronp of the con~poncrltin ~ / ( w ( r n )is) an elcrr~enti~ry abclian - ~ X p with
as in thc previous case. It i t is solvable, thcn we h a ~ e . 2 ~circles
2-group iicting freely a n d transitively on the IS-;)oirlts on the component,
a n d so i t is of order 2. Thus t h e nurnlwr of c o r n p r ~ c r l t sor S ( ~ ) ( ~isI )again
/3- -P, plus circles Xp with P ,)'-P, so #(m) = (v(rn) -t 27-2)/2, with
u(m) == h ( 4 m ) = h ( Z [ f i ] ) .
#(m) = v(m)/2, as in (9), although the reason I t x i changed; iri this case Suppose now t h a t m = t a n d take P = 7 ; as noted above, we have
we have zTp2 special components as just discussed, plus circles Xp where R = 0 n &(P) = Z[P]. We have only t o decide whether /3 and p p are
P+ - P.
Thus we arc reduced t o t h e case D F = 2 t , where t is odd, and
m = t or 2t. If p I t , then p = 3 (mod 4) if p I D and p 1 (1r1od4) if
p I F. We assume t h a t t is n o t a square (automatic if 1) > 1). 'l'hen
- +
equivalent or not. Again (P l)--'p(p
+
+ I ) = Pp, so we ask again whether
therc is a n a E 0 with (p 1 ) a = x -typ, i.e., whether x 2 - m y 2 = f 2
is solvable with x, y E Z or not.
Thus we have proved:
we a realize B a ( ) :6 , ( ) (The odd ramified prirnes
Theorem 3. Let D > 1 a n d m 11 D F , where m i.s not a .square
a r e those dividing t and r 3 (mod 4), i.e., those dividing D.) T h e order ( s ( ~ ) ( R ) is empty if m is a q u r e ) . Let u ( m ) be the number deJined in
--
0 = I(;,:) : a , 6 E ~ [ ( f i ] } has d i s c r i r n i n a ~ i~tt a n d the desired comple- Proposition 1. Then the number # ( m ) of components of s ( ~ ' ) ( R ) is v(m)/2,
unless u(m) > 0 , i E 0 , D F = 2t with t odd, m = t or 2t, a n d
tion 5, = 0, a t any odd prime p; t h e proof is the same as above. Write x2 - m y 2 = +2 is solvable with x, y E Z , in which case #(m) =
Ji 0 0 1
+
(u(m) 2T-2)/2, where r is the number of distinct prime factors of DF.
our Eichler ordcr will contain a
with index 2. If 2 1 D, so 02 is to
be the maximal order, then we j r ~ s tadd in (1 + p)(l + ~ ) / 2 ,which has
-
Example. Consider t h e case F = 1, D = 2p, m = p (a prime). T h e n
trace I and norrr~( 1 - t)/2 E Z. This also works - when 2 1 F, as follows. S(P)(R)is empty if and only if fi 6 O, i.e., p --
1(1nod8). If p 5(n1od a),
W r i k t = b2 + c Z , which is possible, and map 0 2 into the standnrtl order then #(p) = u(p)/2 = h(p). If p 3(mocl /I), then i E 0 a n d x 2 -py2 = ~ t 2
290 A. P. OGG REAL POINTS ON SHIMURA CURVES 29 1
is solvable, so #(p) = (1 + h(4p))/2. The fixed points of w(2) on S are as the cusp 0, so YOo consists of these two arcs and contains just two cusps.
follows: Thus S(R) has 2r-1 components, a single orbit under W, with stabilizer
group (w(F)), when F is odd (arid > 1). O n each component, w(F) is a
i)
ii)
two points in Q(i), if p
two points in Q(-),
--
if p -
3 (mod4);
5 , 7 nod 8).
"reflection," with two hxed points.
Let us assume from now on t h a t the genus of S is > 0; otherwise S ( R )
consists of one circle.
Let P be even (and > 2). We have connected 0 to oo to 112, and hence
(Cf. Shimura [10] for the rationality.) The images in S(P) are then also 0 to 2 / F , applying w(F); we express this by writing ( F / 2 , 1 , F, 2), the
rational ovcr Q, so we see t h a t s(P)(Q) is non-empty (with at least two cusp l / y being listed by its denominator y. If 2 11 F, then w(2) sends F
elements when p z 7 (mod 8)) if s(P)(R) is. to F / 2 and 1 t o 2 (cf. [12]), so we have connected up 2 and F / 2 , and
Y w contains four cusps. Thus we have in this case 2r-2 components, each
containing four cusps, and forming a single orbit under W. The stabilizer
4. Real Points when D = 1. of a component is .(w(2), w(F)), with w(2) a rotation and w(F) and w(F/2)
reflections. Suppose now that 4 1 F. Then the matrix a = (: 'i2),
In this section, S = X o ( F ) is the ordinary modular curve of level F ;
nornlalizes 0(') and so defines an involution of S, defined over Q, fming
B = M 2 ( Q ) and 0 = . Let p: g- S be t h e the cusp m ( y = F') and interchanging the cusps y = 1 and y = 2. If 8 1 F,
then the cusp y = F / 2 is fixed by a, so F / 2 and 2 are connected and YW
natural map, where denotes t h e upper half-plane pj together with cusps. cor~tainsfour cusps. Thus we have 2'-' components, a single orbit under
We recall the description (cf.[ll]) of the cusps of S. For each positive divisor W, with (w(F)) as stabilizer, when 8 1 F. Suppose finally that 4 11 F, say
y of F we have p(t) cusps where t = (y, F l y ) and x is taken modulo li' = 4F', where F' is odd (and > I). Then a sends the cusp y = 2F' t o
t (and prime t o t). These cusps are conjugate ovcr Q, and their field of the cusp y = F', so we have (2F1, 1, F , 2, F'); w(4) sends (1, F ) to (4, F'), so
rationality is Q(ct), ct = e2*ilt, so they are real (or rational) only if t = 1 F' is connected to 4, and w(F) sends (2, F') to (2F1, 4), closing the circle.
or 2. Thus we have rational cusps with t = 1, forming a single orbit Thus we have 2r-2 components, a single orbit under W with stabilizer
under W ; cf. [12] for the action of W on the cusps. If 4 ( F, then t h e (w(4), w(F)), with each component containing six cusps, when 4 11 F. In
rational cusps with t = 2 also form a single orbit under W, consisting of this last case, w(F) and w(4) act as reflections, the fixed points of w(4)
2' elements if 8 1 F, and of 2r-1 elements (stabilized by w(2)) if 4 11 F. being cusps (with t = 2), while w(F1) is a rotation. Thus:
,.
The set of real points of S is the union of the sets YE = p(ZE), where ZE
is the geodesic arc E(Z) = f in and 6 E 0 satisfies S(E)= 0 and n(6) = Theorem 4. Let S = Xo(F), F > 4. Then the components of S ( R )
-1, or E E U(1). Thus YE is an arc in S(R), beginning and ending a t a cusp form a single orbit under W, and are described by the followiny table:
and passing through no other cusps. We do not care whether p: ZE-4 YE is
one-one or not, although t h a t is the case when F > 1 for the "fundamental Case #Components
arc" (cf. [9])Z(l 0 ) = {iy : y
0 -1
>
01, which lies in a fundamental domain
2%F
27-1
#Cusps per component
2
stabilizer in W
(w(F))
for r o ( F ) = o('). 2 j/ F
2'-2 4 (w(F), w(2))
Let us determine the component Y w of S ( R ) containing the cusp 41 P 27-2 6
connecting m t o O = w(F)(oo), so it is fixed (w(F),w(4))
oo = 1/F. It contains Y 27-1 4
(0-1)) 81F (wp'))
by w(F). Taking E = (i 2'))
the set ZEis the line Re(z) = -112, connect-
In all cases, w ( F ) acts on a component as a reflection, the two fixed points
ing m t o -112. If F is odd (and > 1, say), then -112 is the same =
being non-cusps. I f 2 11 F, then w(F/2) is cr reflection (with non-cusps as
292 A. P. OGG ILKAL POINTS O N SIIIMURA CURVES 293
fixed p i n t s ) and w(2) is a rotation, and if 4 11 F, then w(F/4) is a rotation We suppose t h a t m # 2 , 4 from now on.
a n d w(4) is a reflection with cusps (with t = 2) as fixed points. Let us suppose next t h a t m is rrot ;t square, in which case the situation
is sorncwhat similar to t h a t when D > 1. Accordirrg to Tllcorern I, w(m)
has real fixed points only if m = F or F / 2 , in whirl1 cases ~ ( ~ ) / ( w ( m ) )
Now let m I/ P, m > 1, and let pm: $)- s ( ~ be )t h e natural map. cor~sist~s
of 2'-' or 2'-2 half-circles.
Then s ( ~ ) ( R )consists of the old part ~ ( ~ ) / ( w ( r nplus
) ) t h e new part: 'raking now m = E' or F / 2 , let us d c t e r m i ~ l rt h e componerlt X w
of S('")(R) containing t h r cusp m. It contains X,,going from co t o
a fixcd point of w(rn), where it meets Xp; here E U(l), P E fJ(m),
anti cp = -PC. Specifically, if m = I;' we can take c = (h
p .= ( mO O') J
{ [$!{I
(if a2 - mb2 = -1) need a formula; using the action of W, we may assume t h a t y = 1 or 2
(13) #(m) = A + 8)/2 (if a 2 - mb2 = I), with no rsseritial loss of generality.
Given an elen~entof G, we represent it by x E Z, coprime t o 2n. We can
where A is t h r riurr~berof cornporierits of S(R) if the fixed cusps of c r (above) solve cx = -1 + a n , with c, a E Z and d I c, getting an clement 0 of U(m),
are conr~ectctlto oo there, and otherwise A = 0, arid B = 2r-2 if 2 / F , with b = -41 + an). One end is (-1 + a n ) / n c = x / n = P ( x , I ) and the
i E 0, a n d s2- m y 2 == f2 is solvable, and otherwise B = 0. +
other is (1 an)/nc. If n is ever1 (this is the easirst case), then this fraction
The remaining case where w(m) stabilizes a component of S ( R ) is is already in lowest terms, so (1 + a n ) / n c = (1 + a n ) / ( n . d . c l d ) = P(zl, d),
m = F / 4 . In this case S(R)/(w(m)) consists of 2r-2 circles, by Theorem where x' = c/d = l / d x E G. Thus P ( l / d x , d ) is the only neighbor of
4, and there are no real E-poirits, so we have P ( x , I), and more generally P ( l / d x , d l y ) is the only neighbor of P ( x , y),
when n is even. If d > 1 (and 2 1 n), t h r n each new component contains
exactly two cusps and we arc done, so suppose t h a t d = 1; write now
I'(x) = ll(x, 1). NOWari exception to the general rule occurs only when
In the rernainirlg cases ( m f 2,4, F , F / 2 , F / 4 ) , S(R)/(w(m)) consists
296 A. P. OGG REAL POINTS ON SflIMURA CURVES
x = x-' in G; then the corresponding co~nponentmay be a loop X p , component, with no cusp equal t o one of its neighbors, looks like
with only one cusp, or it may be conncctcd t o the old part. P'or exaniplc,
p= )A:( 1 0
fixes P ( 1 ) and anticon~mutcswith t = ( o -,), which f i e s oo, (16). . . ,P ( x , y), I'(l/xd, dly), P ( x d / l , y), P ( 4 / x d , dly), P ( x d l l 6 , y), ...
so P(1) is on t h e component Xm containing m. Also, PI = (n2c;n) I - 9 and its length is 2k, i.r., it contains 2k new cusps, where Ic is t h e order of
1-na/2 -l+n/2 4 i r i G. 15xccptions can occur only if d = I; then P(x) is cqual to one of
l i e s P(1) and anticon~mutcswith r l = (7t12+ +n1,2), which fixes
its neighbors if x 2 = 1 or 4x2 = 1 in G. If so, then P(x) is connected t o
1 / ( 2 + n ) , i.c., the cusp 112, which is connected t o m on the old part. Thus,
a n E-point or t o a Gxed point of *w(rn). Now I'(1) is connected to oo, by
in either case, P ( x ) is the only IWW cusp on its cornponcnt when x 2 = 1 in
t h e same proof as above (this proof did not nrcd n t o be even), and oo is
G , so we have:
connectcd t o P(1/2), since P = 2 -(=)/'
( )
~ixcs(I + n ) / 2 n = p ( 1 / 2 )
Theorem 6.1. Let I' = dm, where m = n 2 , with n even, n 2 4 a n d anticornrnutes with t = (A I:), which fixes co. Thus the component
(and m 11 F'). Xw contains
solutions f x of
this, write n = c2 +e:, where (c, e l ) = 1 and (say) c is odd, c l is even; then
1 = a c l + a l c for s u i h b l e a , a1 E Z. P u t x = a c - a l c l . Then xc+cl = a n 2
+
and - x c l + c = a l n 2 . P u t b = a1 - a x , bl = a a l x . Then bc = 1 - a 2 n 2
Remark. Then
in U(m). They anticommute, since t = DPlnv2 -
- (d
-n-a('+z2)
is in
-z )
U(-1). Thus Zo meets Zp, a t t h e fixed point of t, so X p meets Xp, in an
23-point, and we claim that t h e cusps a t t h e other ends are P ( x ) arid P(x/2).
(if 8 1 n )
(if 2 11 n a n d x 2 z -1 (mod n ) is not solvable)
+
To see this, write n = (u2 v2)/2, c = UV, cl = (u2 - v2)/2, whcre u a n d
+ + +
v arc coprime odd integers. T h r n 1 a n = c a l a ( n cl) = a l u v a u 2 , +
-
(otherwise).
+ +
Suppose n odd, n >_ 3, from now on; we retain the notation of the
beginning of t h e paragraph preceding Theorem 6.1. A neighbor of P ( x , 1)
is (1 + a n ) / n c . This fraction must be p u t into lowest terms; note t h a t this is P(x). Similarly,
v(alv + a u ) -
a n d (1 a n ) / c = ( a l v au)/v in lowest terms; since
+
(nc, 1 a n ) is a power of 2. If d is odd (i.e., if F is odd), we can choose
c t o be either odd or even; taking c odd we have (1 3- a r r ) / ( n . d . c l d ) =
P ( l / x d , d), a n d taking c even we have P(1/4xd, d). Thus (applying w(y)),
t h e two neighbors of l'(x, y) are P(l l x d , d/y) and P ( l / l z d , d l y ) . A general
298 A. P. OGG REAL POINTS ON SI-IIMURA CURVES 299
+ +
in lowest terms, and ((u -- v)/2)(a(u - v)/2 a l ( u v)/2) 3 x/2 (mod n), This is in general of length 4k, where k is the order of 4 in G . Exceptions
so we have P(x/2). can occur (with a cusp equal to one of its neighbors) only if d = 2, and
Thus, whenever x2 = 1 in G, an old cusp or an E-point lies between 4x2 = 1 in G, so P ( x , 1) resp. P(x, 2) is equal to its neighbor P(1/4x, 1)
P ( z ) and P(x/2). Now let 1 be the order of 2 in G. If 1 is odd, then resp. P(1/4x,2). Note that P ( x , 1) and P(x, 2) are interchanged by w(2).
-
the component of P ( x ) has length 1 (contains 1 new cusps), where x2 = 1 By Theorem 4, a component of S(R)/(w(m)) is a half-circle whose ends are
in G. In fact, it contains exactly the new cusps P ( x . 2i), while ordinary fued by w(m) and interchanged by w(2), and any E-point (present only
components have length 21 = 2k, as wr have seen. If 1 = 2k is even, then if each prime dividing n is 1 (mod4)) is fixed by w(2). Thus, given
ordinary components have order 2k (k is the order of 4 iri G); so do the 4x2 = 1 in G, and d = 2, so P ( x , 1) is connected to a Fixed point of w(m)
special components, since P(x/2) is on the component i f x2 = 1 in G, and or an E-point, this fixed point or E-point is in turn connected to the image
again our component contains exactly the new cusps P ( x . 2i). Thus: P(x, 2) of P ( x , 1) under w(2). The component in question thus contains all
P ( x .4', 1) and all P(2x . 4', 2), and is fixed by w(2), so it contains all cusps
Theorem 6.2. Let F = dm be odd, where m = n 2 , n > 3. Let k P ( x .2', 1) and P ( x . 2i, 2), and no other cusps. Thus a special component
resp. 1 be the order of 4 resp. 2 in G = ( Z / n Z ) X /(+l). has length 21, where 1 is the order of 2 in G. If 1 = 2k is even, this is the
same as the length 4k of ordinary components, while if 1 = k is odd, then
If d > 1, then each new component of s ( ~ ) ( R )has length 2k, so special components have length 2k while ordinary components have length
i)
4k. Thus:
- -
#(m) = p(n)/4k.
We have cx = -1 +arb as above, and can take c E 0 or 2 (mod 4) as we
iii) I j d = 2 and 1 = k is odd, and A is the number of solutions of
wish. 'l'akirig c 2 (mod I), we find that (1 + an)/nc = 1'(1/2xd, d/2), and
taking c O(mod 4), we get I'(1/4zd, (1). Thus the neighbors of P ( x , 1) arc x2 = 1 in G, then s ( ~ ) ( R ) has A special components of length 2k, plus
P(1/2xd, d/2) and 1'(1/4xd, d). Applying w(y) and ~ ( 2 ~where ) , y 11 d, y ordinary components of length 4k, and
odd, we get in general
. . .,P(1/4xdl d/y), P ( x , y), P(1/2xd, d/2y), . . .
. . . ,Y(l/4xd, d/2y), P ( x , 2y), P(1/2xd, d/y), . . . Suppose next that 4 11 d. By the usual argument, we find that the neigh-
bors of P ( z , 1) are P(1/2zd, d/2) and P(1/4xd, d) and more generally P ( x , y)
and hence
is betwen P(1/22d, d/2y) and P(1/4xd, dly), for an odd exact divisor y of d.
Applying w(4), we have P(x, 4y) between P(1/2xd, d/2y) and 1'(1/4xd, d / 4 ~ ) ,
A. P . OGG REAL POIN'I'S ON SI-IIMURA CURVES 301
so a new component is obtained by dividing S by ttic hyperelliptic involution v has points raional
over R or Q or not. If D = 1, then the first pitrt was done in [12]-actually,
this is the hardrst case, and the second questior~does 1101arise, since S has
rational cusps; wc assume that D > 1 frorn now on.
Thc methods for determining whether S is hyperelliptic or not are well
in general of length 6, hut of length 3 if d = 4 and 16x2 = 1 in G , and known (cf. [12] for the case 11 = 1 and [IOj I'or the case F = I), so we give
only then. As t o the old part, since S ( R ) consists of 2r-2 circles, with w(m) here only an outline of the proof and a statcmerit of the results. It turns
acting bs a rotation on each cornponcrit if d = 4 and carrying one circle out that the hyperclliptic involution v is always in W, so whether S/(v) has
onto another if d > 4, we have 2'-"circles if d = 4 and 27-3 if d > 4. real poirit,s or not is contained in the results of $3.
Thus: If S is hyprrclliptic, and F' I F ,then Sn,,,*ris also L'subhypcrclliptic," i.e.,
either hypc~rcllipticor of genus < 1 (since it has a function of degree 2 if S
Theorem 6.4. Let F = dm, where 4 11 d and m = n 2 ( n 2 3). does). A hyperclliptic curve does not admit Ci iis a group of autornorphisms
(cf. [I'L], Proposition I), so in our case a t most three primes divide DP',
and lias length 4 as do all riew components (using the action of W). Thus:
if S is hyperelliptic. The formula for the genus is
Theorem 6.5. Let F = dm, where 8 1 d and rn = n 2 ( n 2 3). Then
#(m) = 2'-2 +
28-2cp(n), where s is the number of odd prime factors of d.
In particular, if S is hypcrelliptic we get (by (21) and (23)) noted in the example a t the cnd of 53, w(2) has four fixed points, two over
Q(i)and two over Q ( G ) , so it is not possible that Ci acts as a group
of rational autornorphisms; this elimirlatcs D = 142. Another argu~rlentis
found in Michon [10], p. 224, and a proof that the cases D = 133, 177 do
In particular, if F = 1 we can take 1 = 2 or 3 (since S has genus 0 if not arise; another way to eliminate these lwo values is to find a few more
D = 6), so D = pq ( p and q prime, p < q ) with ( p - l)(q - 1) <120. Then points on the curve besides the (g - 1) = 8 supersingl~larpoints, so there
are more t h a t ten points over F(4). At any rate, we have the first part of:
certainly D < 300 and we can use Table 5 (pp. 135-141) in 121, as follows.
In this table, for D = pq(p < q), the last column gives t h e dimensions of
S + + , S '-, S -+, in order, where S is the space of new (A la Atkin- Theorem 7. There are exactly twenty-four values of D for which
Lehner) cusp forms of weight 2 for h ( D ) and, e.g., S+- is the subspace S = So,, is hyperelliptic; i n each case the hyperelliptic involution v is
on which w(p) = 1 and w(q) = -1. Now it has been known for a long in W . These values, together with the genus g of S and the m for which
time (since Eichler's work on the trace formula and its applications), t h a t v = w ( m , ), are:
for any D the space S is essentially the same as the space of holomorphic
differentials on S, i.e., the two spaces have the same dimension and the
actions or the respective IIecke algebras are equivalent. This led Kazhdan
and Maaur to surmise that the Jacobian J ( S ) must be Q-isogenous to the
new part of the Jacobian Jo(D) of X O ( D )which
, was proved immediately by
Ribet [13]. The actions of the two Alkin-1,ehner groups on S are "opposite,"
i.e., if the Dirichlet series correspondi~~g
to a nor~nalizednewform is
For three of these values, namely D = 57,82,93, the curve S is not hyper-
then the operator w ( ~ on) new forms for r o ( D ) has cigenvalue -ap = T I ,
elliptic over R; for the other twenty-one values, S is hyperelliptic over Q.
while the operator w ( ~on) J ( S ) has eigenvalue a p ;cf. Atkin-Lehner [I] and
Eicl~lcr[GI, respectively. Thus, for us, the last column of Table 5 should
be read backwards; for exan~ple,when D = pq, the genus of S / W is the Proof of the last part. Since O is a maximal order, there are real poi~lts
dinlension of S--. Checking through the table, we find twenty-four values on s(") if and only if fi E R. This is always the case if m = D and we
D = 26,. . .,206 for which S is hypcrclliptic, with hypcrelliptic involution find that fi B exactly in the three cases listed; for example
@ B
v E W; these valucs are listed in Theorern 7 below. For example, for when L) = 57 since the prime 3 splits in Q(m) but ramifies n B.
D = 206 = 2 . 103, the last column in Table 5 reads 0 , 5 , 4 , 0 , so S has For the remaining twmty-one valurs, we can show that S is hyperelliptic
-
genus 9 and v = w(206), while for D = 82, it is 1,0,2,0, so g = 3 and over Q by finding in cach case suitable "points of complex multiplication"
v = w(41). Instead of using the table, we could of course compute the which map to rational points on ~ ( ~ For 1 . any value of D ,the construction
genera in each case by formulas (22), (3), (4). To show t h a t there are no is as follows. Let 1 be 1, 2, or a prime 3 (mod 4), so I, = Q(Q) has
further values of D for which S is hyperelliptic, i.e., to show that S is discriminant -4, -8, or -1 and odd cl:lss number h . We assume that each
hyperelliptic but v f i n W is not possible, we need check only those S for prime p dividing D does not split in L, so L B. To be specific, we can
which cach w(m), m # 1, has a t most four fixed points, i.e., g ( m ) = 19/21 or realize Ll as
+
[(g 1)/2], by a result of Schoenebcrg (cr. [12], p.452). Checking th-ough
'l'able 5, only D = 133,142, 177 need a closer look. For D = 142, as we
304 A. I-'. OGG REAL POINTS ON SFIIMURA CURVES
and take 0 to be a maximal ordcr containing the ring of all such clcrnents
with u , b intcgral in ~ ( f i ) . (Clrarly B is indefinite, so i t is enough to
check that an odd prime ra~nifiesin 13 if and only if it divides D). Then 0
contains anticommuting elcrnents p = (0,A) and p = ( fi 0
E U(m),
so the point P E S defined by the fixed point of p in @ maps to a real
point of s ( ~by) the
, results of $3. Shimura has shown (cf. [16], p 5 8 ) that
the exact field of rationality of P is thc class field C ( L ) of the imaginary
quadratic numbcr-field L. Thus P is of degree h ovcr I,, and its image
on s(")is of degree h over Q, so we have a rational point on s(') if we The curve S is not hyperelliptic over R for exactly six o j these values,
can embed L in 11 for one of thc nine values 1 = 1,2,3,7,11,19, 43, G7,163 namely (Dl F ) = (6,17), (10, 13), (14,3), (15,4),(21,2),(26, 3).
for which h = 1 . This can always be done if 11 = pq < 2227, ay one
checks; on particular, the twenty hyperclliptic curves S with v = w(D) are This leaves thirteen curves S which are hyperelliptic over R. For the
hyperclliptic over Q. nine with m = DF, we find rational points on s ( ~ coming
) from complex
The only other case is D = 58. In this case S is of genus 2 and hence multiplication by fl,where 1 is a prime with h(-1) = 1, 1 r 3 (mod4),
autonmtically hyprrelliptic over its field of definition Q, since t h r r r is a
positive cmonical divisor (of degree 2) defined over Q. Also, we have
and );( is 1 if p I F and is not 1 if p I D. We can take 1 as follows:
already noted in the example a t thc end of $3 that S(P) has rational points
if it has real points in the case L) =. 2p.
This method will not work in two of the remaining four cases. The A. Ogg, Rational points on certain elliptic modular curves. Proc.
method depcrids on finding anticomrnuting p and P in 0 with p2 = -1 and Symposia Pure Math. 24 (1973), 221-231.
,B E U ( m ) . In the case D = 10, F = 19, m = 38, the discriminant of O A. Ogg, IIyperclliptic modular curves. Bull. Soc. Math. France
would then divide 4.1.38, so 1 -- 5, which is not allowed (sincc h(-1) = 1). 102 (1974)) 449-462.
Thus we cannot prove that Slo,lgand similarly S14,5is hyperelliplic over K. Ribet, Sur les variktks abkliennes A multiplications rCelles. C.
Q by this method and must leave the problem until another occasion. For R. Acad. Sci. Paris 291 (1980), Skrie A, 121-123.
s15,,
we use 1 = 7, B = {(-:, i) : a , b ) ,( T - k )
~ ( f i )= G. Shimura, On the zeta-functions of the algebraic curves unifor-
mized by certain automorphic functions. J. Math. Soc. Japan 13
and p = (-,0 o).
1
At the prime 2, we can map p onto (141 1 and p onto (1961), 275-331.
G. Shimura, On the theory of automorphic functions. Ann. of
p ( 1 2 (note that flE Z2). Thus SI5,,is hyperelliptic over Q, as Math. 70 (1959), 101-144.
is S39,2,by the same proof.
Except for two unresolved cases, then, the curves S which are hyperel- G. Shirnura, Construction of class fields and zeta functions of al-
liptic over R are hyperelliptic over Q. gebraic curves. Ann. of Math. 85 (1967), 58-159.
G. ~ h i n i u r a ,On the real points of an arithmetic quotient of a
bounded symmetric domain. Math. Ann. 215 (1975), 135-164.
M. F. Vignkras, Arithrnktique des Algkbres de Quaternions. Lecture
References Notes in Matllernatics 800. Berlin, Heidelberg, New York: Springer,
1980.
A. 0 . L. Atkin and J. Lchner, IIecke operators on ro(m). Math.
Ann. 185 (1970), 134-160.
B. Birch, W. Kuyk (ed.), Modular functins of one variable IV. Received February 26, 1982
Leclure Notes in Mathematics 476. Berlin, IIeidelberg, New York:
Springer, 1975. Professor Andrew P. Ogg
M. F~ichler,~ b e die
r Einheiten der Divisionsalgebren. Math. Ann. Department of Mathematics
114 (1937), 635-654. University of California
M. Eiclllcr, ~ b e rdie Idealklassenzahl hyperkomplexer Systeme. Berkeley, California 04720
Math. %. 43 (1938), 481-494.
M. Eichlcr, Zur Zahlcntheorie der Quaternionen-Algebren. J. Reine
Angew. Math. 195 (1955)) 127-151.
M. Eichler, ~ b e die
r Darstellbarkcit von Modulformen durch Theta-
reihen. J. lteine Angew. Math. 195 (1955)) 156-171.
Y. Ihara, Congruence relations and Shimura curves. Proc. Symposia
Pure Math. 33(1979), part 2, 291-311.
13. Mazur, Rational isogenies of prime degree. Invent. Math. 44
(1978), 129-162.
B. Mazur and 11. P. F. Swinnerton-Dyer, Arithmetic of Weil curves.
Invent. Math. 25 (1974), 1-61.
J. -F. Michon, Courbes de Shimura hyperelliptiques. Bull. SOC.
Math. France 109 (1981), 217-225.
Special Automorphic Forms on PGSp4
I. I. Piatetski-Shapiro
To I.R. Shafnreuich
linguistic corrections. -
2) To any nontrivial character ?C, of k p we can attach the Weil repre-
It is known that this function has moderate growth and hence the integral We will tlcr~ote( , ) the symmetric form on X which is preserved by
J'GSp4. We will denote by ( , ) the skew syrnmctric form on a two-
dimensional space Y preserved by SL2. This way we define a dual reductive
pair G, SL2. In fact, it defines a liltirig from to G. z2
Assume thal the automorphic form p(g) is a lift, i.e., it car1 be written
in the form ,-
makes sense, where f 2 is any cuspidal automorphic form.
Assume thal f 2 E ~ 2 where
, 7r2 is a cuspidal autornorphic representation;
then the set of functions of the form (0.5) is an automorphic representation
PI, [lo]. where n(h) is a cuspidal autornorphic function on x 2 ( A ) . Then [3]
Now we describe a dual reductive pair, which is important for our
purposes.
Consider the space
where W(h,$p, a ) = SK\a
$ ( - ~ ~ ) u ( ( A f ) hda
) and ZT = (y2, -T) is the
X I
= {T E M4(k) T J is skew sym~nctricarid tr(T) = 0). special point in %I which depends only on T. N is the unipotent subgroup
of SL2, which preserves yz. Since any 6 E O;.(A) preserves z ~ we , have
X is a 5-dimensional space over k. We get a representation of GSp4 on X
by
g: T--t g-ITg. This shows that any form which comes as a lifting is special.
L e m m a 1.1. Let (n, V) have the [J-property; the space of the linear pk (mo) = lo. Now define
functionals o n V satisfying
is at most one-dimensional. where so is a. sufficiently large open compact subgroup of S. This vo satisfies
the properties of the lemma.
Proof. Consider the group R = O;.S. Let /I be any character 0 $ . Let
1 be a linear functional satisfying Remark. If vo E V satisfies (1.4), thcn for any T with det T = 0 we
have lT(v0) = 0. In order to prove this, if sufFices to verify that a set of T
such that l ~ ( v 0 #
) 0 co~isistsof T of the form m ~ o m - l ,m E sup X(m).
It is known that the space of linear functionals satifying (1.1) is at most This last set is closed and does not contain any T such that det T = 0.
one dimensional 1.11. III our case p = 1. Let ly- be a linear functional such that (T-nondegenerate)
We describe a generalized Kirillov model. Let 1' be a non-degenerate
symrnetric matrix of order two. Ilct (x, V) be an irreducible smooth repre-
sentatiorl of G' (k a local ~lonarchimedc:mGeld). We say that the integral We denote by 1 y (the unique up to a scalar multiple) functional IT such
that
lT(x(6)v) = lT(v), Vv E V, V s E O$. (1.7)
Let 1'"': L;--+ C be any functional such that linv(n(6)x) = linv(x),
exists if it stabilizes for large open con~pactsubgroups of 5'. Let 7;,. . . , T2 Vx E Li, V6 E 0;;. We denote by 7r the natural action of OTi on I,,. Denote
be representatives of classes M-equivitler~tsy~nmctricmatrices. P u t I
by Ly = (1 L; linV(l)= 0 ) .
Denote Li = V / K and
functions
V - L; the natural projection. Introduce the
will be an invariant functional for some T. It is easy t o check that any
invariant functional has the form (1.8).
d ( m ) = cpi(n(m)v). (1.2)
L e m m a 1.3. If ( n , V ) is an irreducible smooth representaion, which
does not satisfy the U-property, then there exists a vector vo E V, vo f 0
Proof. If (n, V) does not possess thc U-property then there exists io
rp$(m) = X(m) and rp',o(m) = 0, i # io. ( l a 4 )
such that L!,, is nontrivial. Using Lemma 1.1 we can construct vo # O such
that p,o(m) E Ly, Vm, Vi. Thus vo satisfies the assumptions of our lemma.
Assume now t h a t (T, V) is an irreducible preunitary representation satis- Lernma 1.5. Assume that P = MS and Q = FN contain the same
fying the U-property. Uorel su6group B of G. Then for any nondegenerate symmetric T except
for a subset of a smaller dimension,
Lemma 1.4. Under the appropriate choice of the measure d m on
OSi \ M, we have
where BI = B n M .
Proof. Consider the completion V of V . According to the result of the Theorem 1.1.
(1) If (n, V) i.9 a preunitary ~ m o o t hirreducible representation with the
appendix, we have the following spectral decomposition with respect to the
U-property, then for any Q-invariant subspace Vl, we have
unitary operators ~ ( s ) s, E S
(2) If (TI, VI), (nz, Vz) are two such reprcsentations, satisfying T ~ J ~ ~ T
The integration is with respect to all nondegenerate symmetric matrices. as unitary representations, then z ~ 2 .
(Recall that any character of S has the form $ ~ ~ ( =
s ) $(tr T S ) where 1' is
a symmetric matrix of order 2.) The meaning of (1.11) is t h a t there exists Proof. Let Vl be a Q-invariant subspace of V. Denote by
a map that attaches to each v E a measurable function f v ( T ) E VT such
that
( ' " 1 ~ )= / I ~ ( T ) \ ~ ~ T J
First we show that Fi is a n M invariant set. Indeed, let mo E Ti artd
where d T = dp,(T) is a spectral measure, corresponding to r / S . m l E M. We show that mom1 E El. There exists GOsuch that p$(fio) #
Let v E V be a srnooth vector. Then there exists a n open compact 0, arid there exists G l , close to m l , such t h a t f i l = r72; 16 k o b J b E 131,
subgroup Mo C M which stabilizes v. f v ( T ) is preserved by Mo ;w well. 6 E O$,. (Imnrna 1.5). We gct that G o G l E E;, which implics mom1 E Ri.
Let 7'1 be a nondegencrate symmetric matrix of order 2. Then its orbit l h i s shows that E; = 0;; \ M. Using the same argument as in the proof
r ,
under Mo is an open neighborhood of TI. This irnplies that fv(T) is of Lernma 1.2, one can show t h a t for any mo E Ei,,3vo E Vl such that
locally constant on the set of the nondegenerate T's. According to our r z ( m ) = 1, m E U(mo) and p:O(m) = 0, if i # io or m 6 U(mo). Since
h', = 0;: \ M , Vi, such that cp' is nontrivial, we have (see 1.10) that the
construction, f,(T) defines a map V + VT which commutes with the
natural action of S. (On VT, S acts according to the character +r.) Since
T has the U-property, all the spaces VT should be one-dimensional, and 0$
acts trivially on O$.. Thus we can identify f v ( T ) with one of the function
set of such functions is dense in V .
Now we prove the second statement of our theorem. Let :I V1
be a Q-isomorphism, which exists according to our assumptiorl. Put B1 =
- V2
M n Q. It is clear that if v E Vl is invariant with respect to a conlpact
SPECIAL AUTOMORPIIIC FORMS O N PGSp,l 319
open subgroup U of B1, then p t ( m ) is locally cor~stant.We show that if This implies (see Theorem 1.1) t h a t (n, V), viewed as a representation of Q,
p t ( m l ) = 0, then p k V ( m l ) = 0 (for v as above). Indeed, p t ( m l ) is a linear can he written as i n d g r , where T is an irreducible unitary representation
functional which is uniquely defimd by its transfornlatio~llaw with respect of D ,on which Z acts according to a nontrivial character 4. IIence, ?- is
to S. of the form o @ w+, where Q is a unitary representation of x2. Consider
It is obvious that is nontrivial for x l iff the corresponding map for x2 the Weil lifting 0(o,$). 0 ( u , 4 ) has the U-property, and its restriction t o
is nontrivial. Denote by S'O the subset Q is isomorphic to o @ w+ [3]. The rigidity theorem (Theorem 1.1) can be
proved in the case when we assume a priori t h a t only one representation is
{ v E V1 Ip>(m) is locally constant with compact support and unitary. This implies that n = O(u,+).
p;(m) = 0 for i f i o ) . We consider now the archimedean case. The idea of the proof is similar
to the nonarchimedean case. It is easy to see that if k = R, then there
Similarly we introduce S:. It is clear that a. sends S y t o SF. Since a are two equivalence classes of nondegenerate symmetric matrices of order
is a linear map preserving zero and commutir~gwith right translations by 2. The first class consists of definite matrices and the second consists of
elernents of B1, a. is a multiplication by a constant on S?. Ilcnce, on Sy , indefinite niatriceu. In the case k = C, there is only one equivalence class
a commutes with the action of M . Since @Si is dense in and since of nondegenerate symmetric matrices.
M and Q generate G, X commutes with the action of G. This proves our Let (T, V ) be a smooth, preunitary, irreducible admissible infinite dimen-
theorem. sional representation of G. Let us recall that this means that there exists
in V an invariant herrnitiari form. We let I1 be the completion of V with
Lemma 1.6. If Z C G i s a n y unipotent subgroup a n d ( a , V ) is an respect to this form. Then V consists of all the vectors in I1 lying in
irreducible infinite dimensional unitary representation of G, t h e n i t has n o the domain of definition for all the operators of the enveloping algebra of
vector i n V which is invariant with respzct t o Z . G. It is easy to see that V is a nuclear space. Denote by V' the space
of continuous linear functionals on V. Then we have a rigged (equipped)
Proof. It follows from the Howc-Moore theorem that any matrix co- space:
efficient of an infinite dimensional unitary irreducible representation tends v C H C V'. (1.12)
to zero [7]. For the definition of a rigged space see [6].
P u t Q = FN where F is the reductive part and N the unipotcnt radical Let T be a symmetric matrix of order 2. Denote by LT the space of all
of Q. Denote by i! the center of N. Let D be the centralizer of Z in Q . It continuous linear furictionals on V satisfying
is well known t h a t any representation T of L) on which Z acts according to
a given nontrivial character 4 is of the form o @ w+, where w+ is the \Veil
representation of = E2. N which corresponds to the character and +
o is an irreducible unitary representation of n2.
r is irreducible (unitary) and denote by V$ the subspace { v E V I lT(v) = O,VIT E LT).
ifl o is irreducible (unitary). Let pT be the natural projection from V onto VT = V$ \ V. P u t
where M iis the stabilizer of 2;. Take a finite place p l # po. Thcn take the non-zcro vector u l E npl
In t11e appendix it is proved t h a t except for the case k = C arid n the which was defined in Lemma 1.2 (see the remark after the lem~na). Let
Weil representation - the contribution of the degenerate terms is zero. u = @upE n be a nor)-zero vector sucll t h a t u p , = U O , up, = U I . Lel J(g)
We now explain how t o prove a lcmma similar to Lemma 1.2. In order be the automorphic function in n which corresponds to u. We will show
t o prove it, wc 11avc to use the same argument as in Lemma 1.2 and also t h a t under our assumptions
the fact2 t h a t the Lie operators act locally on the runctioris pl(m).
This
implies that any smooth function with compact support on M , \ M defines
some vector iri V . From the definition of u l , it follows t h a t
Now theorems 1.1, 1.2 can be proved exactly as in the nonarchimedean Vp E PA VT, d e t T = 0.
) 0
f ~ ( p=
case.
If det T f 0, then since f(y) is a special automorphic form, fT(P) defines
-
an invariant linear furlctional on n. Using the property of uo wc obtain
thai, JT(p) = 0 for p E PA and invertible T. I t is well known t h a t Pk \ PA
$2 Special Automorphic Forms is a dense subset of G k\ GA. Hence, from (2.1) it follows that: f 0, a
contradiction.
In this section we will prove t h a t any cuspidal special automorphic form
on (2 is a lifting from x2 ('r11~0r~m 2.2). It seenis t h a t thc assurnption C o r o l l a r y . Let 7i be a preunitary irreducible special automorphic rep-
tliat the autorr~orphicform should bc cuspidal is not necessary. Wc need it resentation. Then xlQA is irreducible as a unitary representation.
sincc the lifting of noricuspidd autornorphic forms from z2 was not yet
investigaled. Proof. This follows from Theorem 2.1 and Theorem 1.1.
Lct r be a preunitary irreducible spccial autornorphic representation.
T h e o r e m 2.1. Let a = @ r p be an irreducible automorphic special Let f E n. P u t
r
representation; then each component np has the U-property.
irreducible representation of DA with a given restriction to the group ZA Proof. We first prove the following lemma. Let k # C.
( 2 is the center of D) can be uniquely written in the form a @ w+, where o
is an irreducible representation of x 2 ( A ) and w+ is the Weil representation Lemma 1. Let (T,H)be a n irreducible unitary representation of D
of DA which corresponds to the character 4. It is easy to see that if T is s ~ i c hthat Z acts nontrivially. Consider the spectral decomposition of I f with
autornorphic, then so is a . It remains to prove that a is cuspidal with a respect to S . T h e n the spectral measure of the aet of degenerate characters
missing character +. We first see that of S i.9 zero.
12
-
I S-symmetric , X corresponds to
automorphic representations and we also know that their restrictions to
Q A are of the form ~ n d g ; , where T = a @ W+ [3]. Using Theorern 1.1 we
obtain that n g O(a, +). Since the multiplicity one theorem is true for such
and Y corresponds to ( (ii)).
",
Consider the spectral decomposition of
wd with respect to S. The spectral measure is concentrated on matrices
representations, we obtain t h a t n = B(a, +) [3].
of' the form T = ( $1 and we can take as a spectral measure just
the staridard ~ ~ e b e s g umeasure
e dp. For the representation a the spectral
Appendix measure is concentrated on matrices of t b forni 1. = (: 0) where a is in
a union of cosets module ( K * ) 2 ,and we can take the spectral measure to be
The aim of this appendix is t o prove the following theorem, which is due d a . Hence the spectral decomposition of T with respect to S is concentrated
t o R. Howe. on matrices of the form (O0 a 0 1 p
) +(p p) =
, .
(k p1i; a) wit11 measure
Theorem. Let (n, I ) be a n infinite dimensional irreducible unitary dpd,. The degenerate matrices are those with a = 0. Clearly,
..
the spectral
representation of G = PGSp4(k), where k i s a n y local field. I n the case measure of the set of such matrices equals zero.
k = C we exclude the W e i l representation. Consider the spectral decom-
position with respect t o S : Lemma 2. Let k # C and ( 7 , H ) be a unitary representation of D o n
which Z acts according t o a nontrivial character; then the spectral measure
of the set of degenerate characters of S is zero.
Proof. This follows from the fact that H is a direct sum (maybe con-
where T is the symmetric 2 X 2 m a t r i z which defines the character
tinuous) of irreducible unitary representations satisfying the assumptions
&(S) = +(tr TS). T h e n the spectral measure of the set of all degenerate
of Idernma 1. Wc now prove our theorem for the case k # C . Consider the
matrices equals zero.
324 I. I. PIATETSKI-SEIAPIRO SPECIAL AUTOMORPIJIC FORMS ON PGSpq
Professor I. I. E'iatetski-Shapiro
Department of Mathematics
Yale University
By construction H 0 is invariant with respect t o Q. We have t h a t H 0 is
Ihx 2155 Yale Station
invariant with respect to the Lcvi subgroup M of P, since M transforms
New IIaven, Connecticut 06520
degenerate characters of S to degenerate characters of S. Thus 15' is
G-invariant and hence H = IIO. It. I-Iowe proved that if the spectrum of an
infinite dimensional irreducible unitary representation of G is degenerate,
then H is the Weil representation [9].
Sous-vari6t6s d'une vari6t6 ab6lienne
et points de torsion
M. Raynaud
To Z.R. Shafarevich
Soicrit A une variktk abkliennc dkfinie sur le corps des nornbres complexes,
T le sous-groupe de torsion de A et X un sous-schkma fer~nkintkgre de A.
locale (cf. 5.3.1) permet alors d'cn dkduire que X est le translatk d'une 1.2. Soit R un anneau de valuation discrkte complet, de corps des
sous-variktk abklienne de A. fractions K de caracthristique sbro el dc corps rksiducl k algkbriquement
Dans le cas gknkral, le Frobcnius F de A, ne se relkve pas en un en- clos de caractkristique p > 0. O n suppose quc pR est I'iddal maximal de R
dornorphisme de A, et on doit remplacer A par son extension vectorielle (i.e., 11 cst abolunicnt non ramifik) et pour n >0, on pose R, = Ii/p7'+'R.
universelle forrnelle &. Cette modification crke quelques complications plus Pour tout n > 0, soit An un 12,-schbma abdlien tel que
techniques que thkoriques. Tout d'abord f n'est plus un schkma en groupes
rnais un schkrnn formel en groupes et sa fibre gknkrique EK est un groupe
analytique rigide. Ensuite I'cndomorphisme 4 agissant sur l'algkbre de Lie
de CK fait intervcmir plusieurs pcntes; on doit skparer la contribution de et soit A = lim A, le R-schkma abklicn formel corrcspondant. Si A: est le
-+
chacune d'elles ct traiter spkcialement la pente 0. R,-schkma abklien dual tle A,, nous dirons que A' = lirn A: est le schkma
Comme application, on montre que la torsion situke sur une courbe non --+
abklien formel dual dc A. Par exemple, dans le cas algkbrisable, A et A'
elliptique de A, est finie, bornke uniformkment aprks translation, du rnoins provicn~ientpar conlplktion le long de leurs fibres fermkcs d'un Il-schkrna
si X n'est pas contenue dans une surface abklienne de A. abklien A et de son dual A'.
Pour tout n 2 0, l'analogue de (1) fournit une suite exacte de
R,-sclibmas cn groupcs lisses:
1. Extensions vectorielles universelles.
cients d a m k. L'analogue de (2) fournit un isornorphisme: Consid6rons sur Ic k-schhrna abdlien A, le rnorphisme de Froheniuu ab-
s o h E': A,--+& et le ~norphismcde IG-obenius rclatif Flk: A. - / i ~ l ) , de
sorte que 1'011 a F = uk 0 Flk. Par lc cllangemc~ltde base OR, on dkduit
cle A (rcsp. &(A)) dcs R-schhmas forrncls en groupcs A(') et &(A)(')et par
d'oh l'on dkduit un isornorphisme canoniquc: transport de struct,urc par I'autornorphisme U R , on a &(A)(')= &(A(')).
D'nprks 1.3, au k-niorphisrne Flk: A, 4 /iL1), corrcspcnd un rnorphisme
de R-schkmas forrnels en groupcs &(l;jk): &(A)---+ & ( A ) ( ' ) ,que nous notcrons
Soit ~riairttcnaritB = linl Bn un second R - s c h h a nbdicn formel, B' +lR. l3n composarit 4/Ii avec l'isornorphisrne absolu gn: &(A)(')+ &(A),
---t
n on ot)ticnt un endomorphisme absolu, un-linkaire 4: &(A)-+ &(A). Au
son dual el &(a) son extension vcrtoriellc forrnellc univcrselle. niorphisme de Frobenius relatif F'lk: A, -+ AL'),
correspond par dualitk
Soit u,: A, -+ 13, un k-morpltismc de sch6ntiw abkliens. Alors, d'aprks le rnorphisrne du Vrrschiebung V/k: A~')--+A',. I1 en r6sultc que si on
([7] chap. 11. $1) pour tolit n 2 0, il existc un rnorphisrric dc Rn-sch6mas identifie Lie &(A) B IIE,,,(A',) par T, Lie 4 s'identifie au Verschiebung V,
en groupes: 0;'-lir16aire, de la cohomologie cristnlline.
En(uo): E(&) -+ E(Bn),
caractkris6 par Ics propriktbs suivantes:
1.5. Dans la suite, Ic schkma abdlier~ formcl A 6tant Furk, on pose
E,(uo)rst Ic k-rnorphisrnc canonique provcnant du caractkre univer- & = &(A), 2, = V ( A ) et on note K : l + A l'application dc paxiage au
i) quotient qui intervient dans (4). On a donc une suite exacte dc I{-sch6rnas
sel des extcnsioris E(A,) et E(B,).
forrnels cn groupes:
ii) 13n+l(uo) se rGduit en lL(u,) modulo pn+', et donc les En(uo)
dL:fiuissent url rr~orphisrne&(u,): &(A)-+ & ( B ) dc R-schdmas forrnels en
groupcs.
iii) Le diagramnlc suivnnt est cornmutatif:
qui par r6duction modulo k, fournit une suite exactc de k-groupcs algk-
lie f (u,) briques
Lie&(&) -----4 LieE(B)
1.4. Notons uk: Spcc(k)- Spec(k), I'isomorpl~isrnedkfini par le Frobe- (lo1) 0 -+ LIpL -t M/pM + A/pA -t 0.
nius de k qui envoie a E k sur U P . On note UR I'autorrtorphisme de
Spec(R) = Spec(MT(k))qui relbve uk ct O K la restriction de nil B Spec(K).
Pour tout R-schdnia (ou sch6ma formel) X on note x(')1c 12-schkrna (OU 1 1.6. 11 rksulte de 1.3 que & ne ddpcnd pas du choix du R-schkma abdlien
sclihma formel) ddduit de X par le clti~ngerncrltde base O J ~ . 011a donc un forrncl A qui relkve /Io. Plus prdcisi.rncnt, si est ,111 autre R-sch6ma
isornorphisrnc i~bsolux(')-+X not6 encore OH. ahdicn formel qui relkve A,, on a unc suite exact,? dc J2-schkm:1 forlnelu en
M. RAYNAUD Points de torsion 333
groupes, analogue de (9): 2.3. Iteprenons leu notations du numkro prkckdent. A la suite exacte (9)
de R - s c h h a s formels en groupes lisses correspond, par passage B la fibre
gknkrique, une suite exacte de K-groupes rigides arialytiques lisses:
qui s'identifie A (10) par rkduction modulo p. Le groupe formel vectoriel (12) 0 -+ VK -+ &K -+ AK -+ 0.
ti slidentifie par l'application logarithme B un sous-R-module de La suite exacte d'algkbres de IAe, associke A (12) se ddduit de (9') par
M = Lie&. tensorisation avec K. En particulicr, nous noterons H I'algkbre de Lie de
De plas, la correspondarlce: &K identifike A HETi,(Ab) @ R K par T A K, de sorte que l'on a la suite
exacte:
ktablit une bijection entre: L'endornorphisnie absolu, on-linkaire 4 de & induit un endomorphisme
- d'une part les R-rnodulcs E , facteur direct de M, tels que = L/pL arc-lindaire de l'espace rigide &K e t not6 encore 4; Lie (4) induit le Verschie-
(cornme sous-k-vectorirls de M/pM) bung V sur 11.
- d'autrc part, les R-schdrnas abkliens formels 3 qui rclbvent A,.
suitc (z,), n >0, d'klCrrlcrlts dc G tels clue px,+l = x,. Soit pnCle noyau Notons que la ~riultiplicationpar pn dans X est Ctalc sur la fibre gknkrique
de la multiplication par p" d a m G et soit ]im t
,,,G la. lirnite projective XK, en particulicr pnX(R) est lini et donc on a la suite exacte:
"p,n"
du s y s t h e projectif:
Nous dirons que les Clkr~ientsde n pnX(R) sont les e'le'ments inde'/iniment
n
Par passage & la limite sur lcs suites exactes: p-divisible3 de X(R).
Notons Tpm X(11) (rcsp. Tp- X(k)) la torsion p-primaire pdivisible de
X(R) (resp. X(k)).
on obtient la suite exacte: Lemme 3.2.2. L'application n pnX(R) -+ X(k) a un noyau fini
n
(resp. est injective) si et seulement si l'application qm
X(R) -+ Tpm X(k)
est surjective (resp. bijective).
1 12 1
O-t pnX(k)-+l@X(k)-r X(k)
3.2. Soit R comme dans 1.2 et soit ,T un R-schCma formel en groupes "p,n" '(P"
conimutatif e t lisse, B fibre ferrnke X, = XIi @ k connexe
niontre que 1e noyau de l'application n pnX(R) -+ X(k) est isomorphe
n
Lemme 3.2.1. Les applications: au conoyau de la flkclie lirn ,nX(R) --+ lim ,nX(k). Cette dernikre
e t
"p,n" "p,n"
lim
t X(R) -) lim
c I(&) --+ Iim
t X(k) application se d6duit tle I'applicatiori TpmX(R) -+ Tpm X(k) par application
I'P" " P" ''~" du fo~icteurI I o I I ~ ( Q ~ / .),
~ !d'oh
~ , le le~nme.
dkduites dea applications canoniques X(R) -+ X(Rn) -+ X(k), sont bijec- Ezemples 3.2.3. Supposorls que X soit un schCma abklien forrricl. Alors
tive~. X,(k) cst pdivisible, donc n pnX(k) = X(k).
n
Notons Xpoo lc R-groupe p-divisiblc construit sur les noyaux dcs niulti-
En effet, on a X(R) = lim
t-
X(Rn) et doric, par associativitk des li~nites plications par pn dam X. On a une suite cxactc de R-groupcs p-divisibles:
n
projcctives, l i i X (R) = l@ am
c I(&). Par suite, il s u l t de montrer
"P" "P"
que pour tout entier n 2 0, l'application lim
e X(Rn+,) 4 lim
c X(Rn) oh (Xp-)inf cst connexe et (Xpm)et est ktale isomorphe B ( Q ~ / Z , ) ~oh
, h
"P" "P"
cst le p-rang de X,. La suite exacte (16) est scindCe en rCducLion modulo p
est bijective. Or comme X est lisse, l1a.pplication X(IZn+l) -+ X ( R ~ est
) et il rCsulte de 3.2.2. que llnpplic,ztion n pnX(R) --r X(k) cst bijective si
surjective; par ailleurs son noyau est anrrulC par p, ktant n
Points de t,orsion 337
336 M.RAYNAUD
I1 sufiit de montrer quc ) induit I'identitk sur lim &(R)et d'aprks 3.2.1. il
e t seulcrrier~tsi (16) est scindde. Ce sera lc cas si h = 0, ou bien si X, est t
D'aprks 3.2.1, il sufit de voir que l'application lim &(k)4 lim ff (k) est
bijective, ce qui rksulte de la suite exacte:
'7, '7,
Or F induit l'identitk sur A,(k) et on a dkj& remarquk que I'application
lim E,(k) -t linl A,(k) ktait bijective.
e C
('P" <'P"
dkduik dc (10) et du fait que V ( k ) est annul6 par p.
Remarques 3.3.2. i) Pour la dkrnonstration du t l r k o r h e annonck clans 3.5. Soicnt R comme dans 1.2 et A un R-schkma abklien de fibre
l'introduction, lcs seuls points indkfiniment p-divisiblcs de A(R) que nous gknkrique Ak.
auroris & considdrcr sont c e w de torsion.
ii) On peuL rnontrrr que la torsion p-primuire ir:dffinirnent p-divisible T h k o r e m e 3.5.1. Soit XK un sous-sche'ma fermk intkgre de AK con-
de & ( [ I ? ) w t nulle, de sorte q ~ l'application
c t- & ( H ) -r
a,: linl &(R) est tenant u n ensemble Zariski dense de points inde'jiniment p-divisible3 de
injective. "P" A ( K ) = 411). O n suppose que X n'est stable par aucune translation n o n
nulle de Arc. Alors il existe une partie E l de x,
Zariski dense duns X K ,
3.4. Considkro~lsl'tmdornorphisme absolu 4 dc & et rappelons que dont l'image dans A(k) est u n seul point.
4 = on o (din) (1.4). Notons que ) ieduit 11n endo~norphisn~e du groupe
& ( R ) . Rn r b t , soit .c t & ( R ) ,vu conime une section de & nu-dessus de Corollaire 3.5.2. Soit X K u n sous-sche'ma ferme' intkgre de AK con-
S p e c ( 1 l ) Cornposant x avcc c$/* on obtient une section de &I1) au-dcssus tenant u n ensemble Zariski dense xde points indljiniment p-divisibles de
de Spec(ll), puis on transporte cettc section en une section dr & nu-dessus A(k). O n suppose de plus que l'une des conditions suivantes est rkaliske:
de Spec(ll) par le el~angenrentdc basc (oa)-l. On a donc 4 0 s = #(x)oon.
i) eat forme' de points de torsion.
ii) L'application n pnA(R)--, A ( K ) a u n noyau jini (cf. 3.2.2).
n
Les klkments indkfiniment p-divbihles de & ( R ) La dkrnonst.ration du thkorkme fera l'objet des trois numkros suivants.
P r o p o s i t i o n 3.4.1.
Montror~scomrncnt le t h k o r h c entraine le corollaire.
sont jixe's par 4.
Points de torsion 339
cela rdsulte par excrnplc de 3.2.2) ct assurent quc Its fibres de I'application
X" C ( b ) sont finies, donc X'f( cst h i et par suitc rkduit B un point
ration nel. 4.2. Soit Y l'adhkrence rigide dans d'une famille A de poirlts indk-
finirncnt p-divisibles de & ( R ) .Les points de Asont fix& par q5 (3.4.1)) donc Y
est stablc par 4. On supposc dc plus clue Y t,st un espacc rigide irrkductible.
4. Etude rigide analytique. Soit d sa dirncnsion. Commc Y est rkduit et K dc caract6ristique 0,
Y est alors lisse sur K en dchors d'un fermk rigide Z f Y ;I'ouvert
cornpldrncntsirc U est le plus grand ouverl de Y oii le faisceau des formes
D;ms les nurn6ros 4 ct 5 on reprend Ics hypothkses et notations de 1.5 difKrentielles de degrk 1, fly, est localement libre de rang d.
et 1.6. Rappelons (2.3) que I'on a not6 11I'algkbre de l i e de EK et que q5 induit
4.1. Soit A unc partie de & ( R =
) tK[K). sur H lc Verschiebung V issu de la cohornologie cristallirle de A:. Soit
I1 que:
342 M. RAYNAUD Points de torsion
associke 11 (9). I'ar tensorisation avec K on obtient (12') et par rdduction 6.4. Reprenons la suibe exacte d'alghbres de Lie:
modulo p, on obtient
En elkt, supposons qu'il existc X > 0, tel quc Tu n TA I I / , soit non nul,
donc localcn~rncntlibre dc rang nx > 0. Il'aprks 5.4.1, pour tout y E A,
il cxiste uri sous-12-module librc lly,X de Hi, de rang n x , stablc par V, tcl
montre xY(R) est une partie de A(&) qui se spkcialise en z, E A,(k), en
+
que y exp(FIy,x) soit contenu dans Y. Conlrne X > 0, il rdsulte de 6.3.1
particulier = ~ ( hse)spkcialise en z,, d'oh le thkor&xne3.5.1.
que Hy.x n'est pas contenu dans Lie VK, donc Ny,x = (I,ie.rr)(Iiy,x)est
un sous-R-module non nu1 de L i e n K . I1 en rdsulte que ~ X ~ ( N engendre
,,~)
+
une sous-varidtk abdlienne non nulle By dam A K . Commc x(y) e ~ p ( N ~ , ~ )
est contenu dans X ( K ) = Xalg(K), +
x(y) By est contcriu dans Xal,. 7. Complkment sur la ddmonstration de Bogomolov.
L'rnsemble des ~ ( y )y, E A cst Zariski dense dans Xalget doric (6.1.1),
XaIgest stat)le par les trarislat,ions d'une sous-variktk abdlienne non nulle 0, A une
7.1. Soient c un corps algkbriquernent clos de caractkristiq~~e
B de AK, en contradiction avec lcs hypothkscs faites sur Xaly variktk abblienrle d6Gnie sur c, X u n sous-schdma fermk inlhgrc de A qui
348 M. RAYNAUD Points de torsion 349
n'est stable par aucunc translation non nullc de A. Notons Z le fcrmk de sous-variktk abflicnnc de A (cf. (21 p. 703). Vu la dklinition de U, y: est
+
X adhkrcnce des sous-schkrr~asde X de la forrne a U , oh a E A(c) e t B de dimension 0, done Y ct Ua,b sont de dimension 0.
est unc sous-vari6td abklicnnc non nulle de A. D'aprbs 6.1.1, on a Z # X. Prouvons maintenant 7.1.1. Soil L un sous-corps de c, de type fini sur
Soit U = X - 2. Q, tels que A, X et Z soient dkfinis sur L. Notons la cl6turc algkbrique
Illknonck suivant, prkcise le rksultat dc Bogomolov e t nous allona le dc L dans c et C le groupe de Galois de sur L. D'aprks ([2], cor 2 p. 703))
dkmontrer par les m h e s techniques: il existe un nonibre entier 1 > 1, e t g E G, tel que si x E A@) cst d'ordre
une puissance dc p, alors g x = 12. On ktend g en un Lautoniorphisme
P r o p o s i t i o n 7.1.1. Soit p un nombre premier. Alors il existe un entier not6 encore g dc c.
N > 0 tel que, Va E A(c), la torsion p-primaire contenue dana U + a est Si alors a E A(c) et si u + a est un point d'ordre une puissance de p de
finie, de cardinal 5 N. + + +
U a , on a g(u + a ) = 1(u + a ) e t d'antre part g(u a ) = gu gu E fJ g a , +
donc 11 E Ua,pa et on applique 7.1.2.
Soit 1 un enticr > 1 e t notons 1~ la multiplication par 1 dans A. Pour
tout couple ( a ,b) .de points de A(c), nous allons const,ruire par rkcurrence
un certain sous-schfma fcrmf UF~ de U. I'renons U$ = U et supposons
-
8. Points de torsion d'une sous-varidtd.
avoir d6fini UL?', sous-schk~nafermP de U. Soit u$) lladhkrencc de u$)
dans X ct d6linissons UF;+')par la formule: 8.1. Dans ce numkro, nous dfmontrons le rksultat knonck dans I1introduc-
tion. Soicnt donc A une variktk abklicnne dkfinic sur un corps algkbrique-
ment clos c dc caractkristiquc 0, T le sous-groupc dc torsion dc A(c) e t X
un sous-schkma fcrrnk intkgrc de A tel que T n X soit Zariski dense dans
X. I1 nous faut rnontrer que X est le translatk d'urlc sous-variktk abklienne
de A.
L n );(I: forn~entune suite dfcroissante de sous-schkmas fermfs de U, donc
stationnaire, dc valeur UaVbpour m > 0. On a lA(Ua,,
+
(m+l) a ) C + a+
b
et donc lA(Ua,b a ) C va,b+ b et clairernent, Ua,b est le plus grand sous-
Quittc & passer au quotient par un sous-groupe de A, on peut supposer
que X n'est stable par aucune translation non nulle dc A, ce qui nous permet
d'introduire le fermk strict % dc X (cf. 7.1) et l'ouvert complkmentaire
s c h h a fermd dc U posskdant cette propriktk. U = X - %. Nous devons alors nlontrer que X cst un point de A.
Par dcs rkductions klkmcntaircs dktaillkes dans ([!I], 10.2 et 10.3), on se
L e m m e 7.1.2. Le schima Ua,b est fini et de'pend de faqon constructible rarnkne au cas oh A est un El-schkma abklien (Mini sur un anncau R d u
de (a, b ) . E n particulier, le cardinal de Ua,b est borne' par un entier N type d h i t dans 1.2, de caractfristiquc rksiduellc p > 2 e t oh X et U sont
indipendant de (a, b). des sous-schkmas de la fibre gknkrique AK de A. Soient R une cl6ture
8 algkbriquc dc K ct G = Gal ( R I K ) .
Le caractkre constructible de Ua,b est irnrnkdiat sur la dkfinition, une fois Prockdant cornme dans ([!I], li), on choisit unc dkcomposition
notk que si (a, b) varie d a m un fcrmk intkgre Z de A X A, alors pour rn
-
fixk, la formation de u:?) commute au passage aux fibres au-dessus d'un
ouvert non vide de 2. 11 rcstc i voir quc fJaTb est fini. Soit x E A(c) tel que du G-module T(R), oG T 1 ( R ) = T1(IC) est la torsion rationnelle, pdivisible
+ +
(1 - 1)x = a - b et soit Y = Ua,b a x. Alors lA(Y) c y. Si Y1 est (somme directe dc la torsion d'ordrc premier i p ct de la torsion p p r i ~ n a i r e
la rkunion dcs composantes irrkductibles de Y dc dimension maximum, on pdivisible, ratio~lnellle sur K ) et oh T" est un supplkn~entaire(non
) F. I1 cxiste alors une cornposante irrkductible Y: de Y'
a aussi l ~ ( Y l C canonique) de TI dans T. Ritppclons ([!I], 5.2.2) que I'action de G sur T"
e t un enticr n > 0, tcls quc l i ( Y i ) = F:. Donc Y:eut le translatk d'une
--I
cst forte dans le scns suivant: VM cntier > 0, 3n entier > 0, tcl que si
E
350 M. RAYNAUU Points de torsion 35 1
x E ll"(K)est d'ordrc p' avcc r > n, alors l'orbitc de x sous G a un l e r cas: il existe une courbr: clliptique E de A qui laisso S stable par
cardinal > M. txanslation, auquel cas S est une surface dliptiqi~cde fibre I;, de base
Par aillcurs, d'aprcs 7.1.1, la torsi011 p-primaire conknue d a m U a, + une courbe non elliptique C de U = AlE. Cornme C ne contient qu'un
a E A ( K ) est finie, borrlde par iln entier N indkpendant dc a. nombre fini de poinls de torsion, Ics poi~llsde torsion de S sont situks sur
un nornbrc fini de courbes de genre 1, E;, i E 1. Pour x variable dans X ( c ) ,
les interscctions Ei n ( X - x) ont un cardinal bornk, d'oh le rksultat.
8.2. Soit z un point de torsion de U(R). On a x = x' -t x", avec t
x' E T 1 ( K )et x" E T1'(1~).Alors x" cst un point de t,orsiorl p-primaire de 2 k m e cas: Les translations qui laissent S stable sont en nombre fini.
U - x'. Comme U e t x' sont ddfirlis sur I<, toute l'orbite de x" sous G cst Alors S ne conticnt qu'un nonlbre fini de courbes de genre 1, E,, i E I
contenue tians U - 2'. Puisquc cettc orbite a au plus N blCrnents, il existe (6.1.1) et si Z est la rkunion des Ei, I'ouvert U = S -. Z ne cor~tientqu'un
n, indkpcndant de x" tcl que l'ordre de XI' divise pn. nombre fini de points de torsion et on conclut cornme dans le ler cas.
Soit X' I'image de x dans A par la multiplication par pn. Alors, tout
point dc T n U(K)a urle image dans X' qui est contenue dans T' n X 1 ( K ) . R e m a r q u e 9.2. . La proposition 9.1 laisse ouvert le cas oil S est une sur-
Cornme T n U btait Zariski dense dans U , T 1 n X ' est Zariski dense dans XI. I face abklienne. Rappclo~istoutcfois que pour prcsque tout nontbre premier
Mais T' est constituk dc points de A ( K ) = A(IZ) indbfinirnent p-divisibles. p, la torsion prcrnikre A p, contcnue dans X + a , est borr~keindkpcndamrnent
I1 rbsulte alors dc 3.5.2 que XI est le translat6 ci'une sous-variktk abdienne t de a cornme il rdsulte de ((91, 6.3.1).
de A; par suite il en cst de rni.rne de X ct donc X est rbduit A un point.
L '~llusionsabout it, however, he had none. "Un pays oil, quand on parle, on est pendun
is how he once called if, (speaking Lo t l ~ cQueen Dowager of Prussia after his arrival in
Berlin in 1741), acrording to an anrrtlote preserved by CondorceL; cf. L. d u Pasquicr,
Leonard I h k r et de8 a m q l'aris, 1927, pp. 40-41.
f
f
EULER AND THE JACOUIANS OF I~:I,LIPTICCURVES 355
Notations bcing as above, let k be the ground field; for us it may be any
ficld of characteristic other than 2 or 3; for Euler it was mostly the field of
be such a form; its invariants, in the sense of clavsical invariant theory, are real numbers, occasionally the field of cornplex numbers, and just possibly
given by (though Euler never says so) the field of rational numbers if he had the
possible application to diophantine equations at the back of his mind. P u t
K = k(P), where P is a point of I'. For M = (z, u) on C, p u t z = p(M) as
before, and u = $(M); also p u t y = p ( P - M), v = $ ( P - M). For a given
P u t F ( z ) = F(x, 1) and call C the curve of genus 1 defined by u 2 = F(x). P these arc rational functions on C, the former being defined over k and
One can show t h a t the jacobia~lJ ( C ) of C is the curve r given by t h e latter over K . As x, y are functions of ordrr 2 on C , there is between
them a relation @(x, y) = 0, where @ is a polynomial of degree 2 in each
one of the variables x, y, with coefficients in K. Changing M into P - M
exchanges (z, u) with (y, v); therefore @ is symmetric in z and y. Changing
and t h a t there is a rational mapping of C into r, given by M,Y into - M , -P does not change x and y; therefore changing P into
-1' can a t most multiply with a scalar; by choosing the sralar factor in
@ properly, we may thus assurrle that all its coclficients are polynomials in
r = v(P).
Clearly the two roots of the equation @(x,Y) = 0 are
where g(X,Y), h ( X ,Y ) are the two covariants of t h e form F ( X , Y), ay given y = p ( P - M) = p ( M - P), yt = p ( - P - M ) = P(P + M).
by the classical theory. To this I added in 1954 (loc. cit.) that r and
C may be regarded as the two components of a cornrnutativc algebraic As we have u" F(x), v2 = F(y), and as the mappings M --r M f P
group G(C), with r = J ( C ) as a subgroup of index 2, arid with the leiwe w = ir~variant,whilc the mappings M -+ fY - M change it into
point a t infinity on r as the neutral element. The group-law bcing written -w, one may thus, with Eulrr, interpret the above results by saying that
additively, it induces the usual group-law oq the "Weicrstrass cubic" r, and @(z,y) = 0 is an integral of e i t h r one of the diITercntia1 equations
also mappings ( M , P ) -t M + P of C X f into C and ( M , N ) -+ M + N
of C X C into r. The mapping (4) is then none else than M -+ 2 M .
One defines a function p on G(C) by putting p ( M ) = x for M = (x, u)
on C, and p ( P ) = 6 for P = ( & q ) on f ; we have then - M = (x,-u), or, more accuratcly perhaps, of the equation
(el
-P = -T/), and irl particular p ( - M ) = cp(M), p ( - P ) -- p ( P ) .
I'ut w = $; this is the diUcrentia1 of t h e first kind on C . For any
given point P on f , the mapping M -+ M + P is an autornorpliisrn of
C which leaves w invariant; conversely all such a~~tornorphisms are of that
form. Similarly, tlic autornorphisms of C which change w into -w are the
' it is the "gcnrral integral1', sinrr it contains the arbitrary constant
mappings M -+ P - M. Thus, in an obvious scnsc, the points F of r t = g ( P ) . This is how 1Suler viewed the matter, from the time ( a t the end
may be said to parametrize these two farnilies of autornorphisms of C. As I of 1751) when I~agnano'sProduzioni Matpmatiche drew his attention upon
found in 1954, explicit formulas for all these mappings can easily be derived t h r equation
from IIerlnitels results in [2].
Hati I but known it, I could have found all those forrnulas in a paper of
1765 by Euler ([l(b)]), where I was surprised t o discovcr them not long ago.
This is to be explained here. and others of the mrnc kind.
356 A. WEIL EULER AND T H E JACOBIANS OF ELLIPTIC CURVES 357
Having in fact discovered that the differential equation (5) has the general Thus, for @(x,y) = 0, we have
integral
+ +
x2 y2 c2x2y2= c2 + 2 x y 4 ' ,
also, put O = 4([ + c). Then the last equation in (8) gives the point P = oo (the point a t infinity on r)and therefore the value oo
for 0, while the other sign will then give P' = 2A and thus a finite value
p = 64(4t3 - it - j), for 8. Therefore P, P' correspond respectively to the values of O given by
['
For P = ([,q), P' = (6') ql), we have then [ = $ - f , = $ - $. As
to 7, q', they can now be obtained by using either one of the forrnulzm (9)
In order for (7) to determine a transformation (x, u) -+ (y, v) which changes as it applies to the present case; q will be obtained by substituting a, m , b,
% into ,, we must take a = -p. Replacing then q by pq, we can rewrite
dy
for x, u, y in the first one of these formulas, using in the coefficients of Q
(7) as follows: and R the value of O found above; for q' one has first to substitute 6' for O
in those coclficients, then a, -m, b for x, u, y. This completes the solution
found iri 1765 by Euler for a problem first formulated in 1954.
6 = ------ [H(.,
(b - u)'
b) *d m ]
April 23, 1982
where H is the polynomial
Professor Andrk Weil
lf (a, b) = A + R(a + b) + i C ( a 2 + b2) + Dab(a + b) + 13a2b2 School of Mathematics
Institute for Advanced Study
Princeton, New Jersey 08540
([l(b)],p. 343). Clearly these values of O must correspond to the points
P = l3 - A, P I = r l + R on r.
In order to detcrnrine the signs belonging to
tlresc two points, one need only observe that one sign must give for Ll -- A