Analyti c Numbe r Theor y
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053
America n Mathematica l Societ y
Colloquiu m Publication s
Volum e 53
Analyti c Numbe r Theor y
Henry k Iwanie c
Emmanue l Kowalsk i
America n Mathematica l Societ y
Providence , Rhod e Islan d
Editorial Boar d
Susan J . Priedlander , Chai r
Yuri Mani n
Peter Sarna k
2000 Mathematics Subject Classification. P r i m a r y H F x x , H L x x , H M x x , H N x x , 1 1 T23 ,
11T24, 1 1 R42 .
For a d d i t i o n a l informatio n a n d u p d a t e s o n t h i s book , visi t
www.ams.org/bookpages/coll-53
Library o f Congres s Cataloging-in-Publicatio n D a t a
Iwaniec, Henryk .
Analytic numbe r theor y / Henry k Iwaniec , Emmanue l Kowalski .
p. cm . — (Colloquiu m publications , ISS N 0065-925 8 ; v. 53 )
Includes bibliographica l reference s an d index .
ISBN 0-821 8-3633- 1 (acid-fre e paper )
1. Numbe r theory . I . Kowalski , Emmanuel , 1 969 - II . Title . III . Colloquiu m publication s
(American Mathematica l Society ) ; v. 53 .
QA241.I85 200 4
512.7'"3-dc22 2004045081
C o p y i n g an d reprinting . Individua l reader s o f thi s publication , an d nonprofi t librarie s
acting fo r them , ar e permitte d t o mak e fai r us e o f th e material , suc h a s t o cop y a chapte r fo r us e
in teachin g o r research . Permissio n i s grante d t o quot e brie f passage s fro m thi s publicatio n i n
reviews, provide d th e customar y acknowledgmen t o f th e sourc e i s given .
Republication, systemati c copying , o r multipl e reproductio n o f an y materia l i n thi s publicatio n
is permitte d onl y unde r licens e fro m th e America n Mathematica l Society . Request s fo r suc h
permission shoul d b e addresse d t o th e Acquisition s Department , America n Mathematica l Society ,
201 Charle s Street , Providence , Rhod e Islan d 02904-2294 , USA . Request s ca n als o b e mad e b y
e-mail t o reprint-permissionQams.org .
© 200 4 b y th e America n Mathematica l Society . Al l right s reserved .
The America n Mathematica l Societ y retain s al l right s
except thos e grante d t o th e Unite d State s Government .
Printed i n th e Unite d State s o f America .
@ Th e pape r use d i n thi s boo k i s acid-fre e an d fall s withi n th e guideline s
established t o ensur e permanenc e an d durability .
Visit th e AM S hom e pag e a t h t t p : //www. ams. o r g/
10 9 8 7 6 5 4 1
3 2 51 41 31 21 1 1 0
Contents
Preface x i
Introduction
Chapter 1 . Arithmeti c Function s 9
§1.1. Notatio n an d definition s 9
§1.2. Generatin g serie s 0
§1.3. Dirichle t convolutio n 2
§1.4. Example s 3
§1.5.
1 Arithmeti c function s o n averag e 9
§1.6. Sum s o f multiplicativ e function s 2 3
§1.7. Distributio n o f additiv e function s 2 8
Chapter 2 . Elementar y Theor y o f Prim e Number s 3 1
§2.1. Th e Prim e Numbe r Theore m 3 1
§2.2. Tchebyshe v metho d 3 2
§2.3. Prime s i n arithmeti c progression s 3 4
§2.4. Reflection s o n elementar y proof s o f th e Prim e Numbe r Theore m 3 8
Chapter 3 . Character s 4 3
§3.1. Introductio n 4 3
§3.2. Dirichle t character s 4 4
§3.3. Primitiv e character s 4 5
§3.4. Gaus s sum s 4 7
§3.5. Rea l character s 4 9
§3.6. Th e quarti c residu e symbo l 5 3
§3.7. Th e Jacobi-Dirichle t an d th e Jacobi-Kubot a symbol s 5 5
§3.8. Heck e character s 5 6
Chapter 4 . Summatio n Formula s 6 5
§4.1. Introductio n 6 5
§4.2. Th e Euler-Maclauri n formul a 6 6
§4.3. Th e Poisso n summatio n formul a 6 9
§4.4. Summatio n formula s fo r th e bal l 7 1
§4.5. Summatio n formula s fo r th e hyperbol a 7 4
§4.6. Functiona l equation s o f Dirichle t L-function s 8 4
§4.A. Appendix : Fourie r integral s an d serie s 86
Chapter 5 . Classica l Analyti c Theor y o f L- functions 9 3
§5.1. Definition s an d preliminarie s 9 3
v
vi CONTENT S
§5.2. Approximation s t o L-function s 9 7
1§5.3. Countin g zero s o f L-function s 0 1
§5.4. Th e zero-fre e regio n 0 5
§5.5. Explici t formul a 0 8
1
1§5.6. Th e prim e numbe r theore m 0
§5.7.1
1Th e Gran d Rieman n Hypothesi s 3
1
1§5.8. Simpl e consequence s o f GR H 7
§5.9. Th e Rieman n zet a functio n an d Dirichle1
1
t L-function s 9
§5.10. L-function s o f numbe r fields 1 2 5
§5.11. Classica
1 l automorphi c L-function s 3 1
§5.12.1Genera l automorphi c L-function s 3 6
§5.13. Arti n L-function s 4 1
1
§5.14. L-function s o f varietie s 4 5
1§5.A. Appendix : comple x analysi s 4 9
Chapter
1 6 . Elementar y Siev e Method s 5 3
§6.1. Siev e problem s 5 3
1 §6.2. Exclusion-inclusio n schem e 5 4
1 §6.3. Estimation s o f V+(z), V~(z) 5 7
§6.4. Fundamenta
1 l Lemm a o f siev e theor y 5 8
§6.5. Th e A 2-Sieve 6 0
§6.6. Estimat e fo r th e mai
1 n ter m o f th e A 2-sieve 6 4
§6.7. Estimate s fo r th e remainde r ter1 m i n th e A 2-sieve 6 5
2
§6.8.
1 Selecte d application s o f A -sieve 6 6
Chapter 7 . Bilinea 1r Form s an d th e Larg e Siev e 6 9
§7.1. Genera l principle s o f estimatin
1 g doubl e sum s 6 9
§7.2.
1 Bilinea r form s wit h exponential s 7 1
1§7.3. Introductio n t o th e larg e siev e 7 4
§7.4.
1 Additiv e larg e siev e inequalitie s 7 5
§7.5. Multiplicativ
1 e larg e siev e inequalit y 7 9
§7.4. Application s o f th e larg e siev e t o 1
sievin g problem s 8 0
§7.6. Panoram a o f1 th e larg e siev e inequalitie s 8 3
§7.7. Larg e siev1e inequalitie s fo r cus p form s 8 6
§7.8.
1 Orthogonalit y o f ellipti c curve s 9 2
1§7.9. Powe r moment s o f L-function s 9 4
Chapter 8 . Exponentia l Sum s 9 7
§8.1. Introductio n 9 7
§8.2. Weyl' s metho d 9 8
§8.3. Va n de r Corpu t metho d 20 4
1 §8.4. Discussio n o f exponen t pair s 2 3
1 §8.5. Vinogradov' s metho d 2 6
Chapter 9 . Th e Dirichle t Polynomial s 22 9
§9.1. Introductio n 22 9
§9.2. Th e integra l mean-valu e estimate s 23 0
§9.3. Th e discret e mean-valu e estimate s 23 2
§9.4. Larg e value s o f Dirichle t polynomial s 23 5
§9.5. Dirichle t polynomial s wit h character s 23 8
TABLE O F CONTENT S vi i
§9.6. Th e reflectio n metho d 24 3
§9.7. Larg e value s o f D(s, x) 24 6
Chapter 1 0 . Zer o Densit y Estimate s 24 9
§10.1. Introductio n 24 9
§10.2. Zero-detectin g polynomial s 25 0
§10.3. Breakin g th e zero-densit y conjectur e 25 4
§10.4. Gran d zero-densit y theore m 25 6
§10.5. Th e gap s betwee n prime s 26 4
Chapter 1 1 . Sum s ove r Finit e Field s 26 9
§11.1. Introductio n 26 9
§11.2. Finit e field s 26 9
§11.3. Exponentia l sum s 27 2
§11.4. Th e Hasse-Davenpor t relatio n 27 4
§11.5. Th e zet a functio n fo r Kloosterma n sum s 27 8
§11.6. Stepanov' s metho d fo r hyperellipti c curve s 28 1
§11.7. Proo f o f Weil' s boun d fo r Kloosterma n sum s 28 7
§11.8. Th e Rieman n Hypothesi s fo r ellipti c curve s ove r finit e field s 29 0
§11.9. Geometr y o f ellipti c curve s 29 1
§11.10. Th e loca l zet a functio n o f ellipti c curve s 29 7
§11.11. Surve y o f furthe r results : a cohomologica l prime r 30 0
§11.12. Comment s 3 3
Chapter 1 2 . Characte r Sum s 3 7
§12.1. Introductio n 3 7
§12.2. Completin g method s 3 8
1 §12.3. Complet e characte r sum s 3 9
§12.4. Shor t characte r sum s 32 4
§12.5. Ver y shor t characte r sum s t o highl y composit e modulu s 330
§12.6. Character s t o powerfu l modulu s 33 5
Chapter 1 3 . Sum s ove r Prime s 33 7
§13.1. Genera l principle s 33 7
§13.2. A varian t o f Vinogradov' s metho d 340
§13.3. Linnik' s identit y 34 2
§13.4. Vaughan' s identit y 34 4
§13.5. Exponentia l sum s ove r prime s 34 5
§13.6. Bac k t o th e siev e 34 8
Chapter 1 4 . Holomorphi c Modula r Form s 35 3
§14.1. Quotient s o f th e uppe r half-plan e an d modula r form s 35 3
§14.2. Eisenstei n an d Poincar e serie s 35 7
§14.3. Thet a function s 36 1
§14.4. Modula r form s associate d t o ellipti c curve s 36 3
§14.5. Heck e L-function s 36 8
§14.6. Heck e operator s an d automorphi c L- functions 37 0
§14.7. Primitiv e form s an d specia l basi s 37 2
§14.8. Twistin g modula r form s 37 6
§14.9. Estimate s fo r th e Fourie r coefficient s o f cus p form s 37 8
viii C O N T E N T S
§14.10. Average s o f Fourie r coefficient s 38 0
Chapter 1 5 . Spectra l Theor y o f Automorphi c Form s 38 3
§15.1. Motivatio n an d geometri c preliminarie s 38 3
§15.2. Th e laplacia n o n H 38 5
§15.3. Automorphi c function s an d form s 38 6
§15.4. Th e continuou s spectru m 38 7
§15.5. Th e discret e spectru m 38 9
§15.6. Spectra l decompositio n an d automorphi c kernel s 39 1
§15.7. Th e Selber g trac e formul a 39 3
§15.8. Hyperboli c lattic e poin t problem s 39 8
§15.9. Distributio n o f lengt h o f close d geodesie s an d clas s number s 40 1
Chapter 1 6 . Sum s o f Kloosterma n Sum s 40 3
§16.1. Introductio n 40 3
§16.2. Fourie r expansio n o f Poincar e serie s 40 4
§16.3. Th e projectio n o f Poincar e serie s o n Maas s form s 40 6
§16.4. Kuznetsov' s formula s 40 6
§16.5. Estimate1 s fo r th e Fourie r coefficient s 4 3
§16.6. Estimate s fo r sum1 s o f Kloosterman sum s 4 5
Chapter 1 7 . Prime s1 i n Arithmeti c Progression s 4 9
§17.1. Introductio n 4 9
§17.2. Bilinea r form s i n arithmeti c progression s 42 1
§17.3. Proo f o f th e Bombieri-Vinogrado v Theore m 42 3
§17.4. Proo f o f th e Barban-Davenport-Halbersta m Theore m 42 4
Chapter 1 8 . Th e Leas t Prim e i n a n Arithmeti c Progressio n 42 7
§18.1. Introductio n 42 7
§18.2. Th e log-fre e zero-densit y theore m 42 9
§18.3. Th e exceptiona l zer o repulsio n 43 4
§18.4. Proo f o f Linnik' s Theore m 43 9
Chapter 1 9 . Th e Goldbac h Proble m 44 3
§19.1. Introductio n 44 3
§19.2. Incomplet e A-function s 44 5
§19.3. A ternar y additiv e proble m wit h A b 44 6
§19.4. Proo f o f Vinogradov' s thre e prime s theore m 44 7
Chapter 20 . Th e Circl e Metho d 44 9
§20.1. Th e partitio n numbe r 44 9
§20.2. Diophantin e equation s 45 6
§20.3. Th e circl e metho d afte r Kloosterma n 46 7
§20.4. Representation s b y quadrati c form s 47 2
§20.5. Anothe r decompositio n o f the delta-symbo l 48 1
Chapter 21 . Equidistributio n 48 7
§21.1. Weyl' s criterio n 48 7
§21.2. Selecte d equidistributio n result s 48 8
§21.3. Root s o f quadrati c congruence s 49 4
§21.4. Linea r an d bilinea r form s i n quadrati c root s 49 6
TABLE O F CONTENT S i x
§21.5. A Poincar e serie s fo r quadrati c root s 49 8
§21.6. Estimatio n o f th e Poincar e serie s 50 1
Chapter 22 . Imaginar y Quadrati c Field s 50 3
§22.1. Binar y quadrati c form s 50 3
§22.2. Th e clas s grou p 50 8
1 §22.3. Th e clas s grou p L-function s 5 1
1 §22.4. Th e clas s numbe r problem s 5 7
§22.5. Splittin g prime s i n Q(y/D) 52 0
§22.6. Estimation s fo r derivative s L ^ ( 1 , X D ) 52 3
Chapter 23 . Effectiv e Bound s fo r th e Clas s Numbe r 52 9
§23.1. Landau' s plo t o f automorphi c L-function s 52 9
§23.2. A partitio n o f A ^ ( \ ) 53 1
§23.3. Estimatio n o f S 3 an d S 2 53 3
§23.4. Evaluatio n o f 5 i 53 4
§23.5. A n asymptoti c formul a fo r A^)(I ) 53 6
§23.6. A lowe r boun d fo r th e clas s numbe r 53 8
§23.7. Concludin g note s 54 0
§23.A Th e Gross-Zagie r L-functio n vanishe s t o orde r 3 54 1
Chapter 24 . Th e Critica l Zero s o f th e Rieman n Ze t a Functio n 54 7
§24.1. A lowe r boun d fo r N 0(T) 54 7
§24.2. A positiv e proportio n o f critica l zero s 55 0
Chapter 25 . Th e Spacin g o f th e Zero s o f th e Rieman n Ze t a-Function 56 3
§25.1. Introductio n 56 3
§25.2. Th e pai r correlatio n o f zero s 56 4
§25.3. Th e n-leve l correlatio n functio n fo r consecutiv e spacin g 57 0
§25.4. Low-lyin g zero s o f L-function s 57 2
Chapter 26 . Centra l Value s o f L-function s 57 7
§26.1. Introductio n 57 7
§26.2. Principl e o f th e proo f o f Theore m 26. 2 58 0
§26.3. Formula s fo r th e firs t an d th e secon d momen t 58 2
§26.4. Optimizin g th e mollifie r 58 9
§26.5. Proo f o f Theore m 26. 2 59 5
Bibliography 59 9
Index 611
This page intentionally left blank
PREFACE
This boo k show s th e scop e o f analyti c numbe r theor y bot h i n classica l an d
modern directions . Ther e ar e no division lines ; in fact ou r inten t i s to demonstrate ,
particularly fo r newcomers , th e fascinatin g countles s interrelations . O f course , ou r
picture o f analyti c numbe r theor y i s by n o mean s complete , bu t w e tried t o fram e
the materia l int o a portrai t o f a reasonabl e size , ye t providin g a self-containe d
presentation.
We were writing thi s book i n a period o f time durin g an d afte r teachin g course s
and working with graduate students in Rutgers University, Bordeau x University an d
Courant Institute . W e than k thes e institution s fo r providin g condition s fo r bot h
of u s t o wor k together . W e share d idea s o n wha t thi s boo k shoul d b e abou t wit h
many of our colleagues, who gave us critical suggestions. Amon g them we would like
to mentio n Etienn e Fouvry , Joh n Friedlander , Philipp e Miche l an d Pete r Sarnak .
During a lon g proces s o f typin g an d preparatio n o f thi s boo k fo r publication , w e
received stimulatin g encouragemen t an d technica l advic e fro m Serge i Gelfand , fo r
all o f hi s hel p w e express ou r gratitude . Caro l Hame r helpe d t o polis h som e o f ou r
English phrase s whil e he r littl e boy s trie d t o destro y th e Te X files withou t success .
We thank the m al l fo r th e output .
Henryk Iwanie c
Emmanuel Kowalsk i
15 December, 200 3
xi
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/01
INTRODUCTION
Analytic Numbe r Theor y distinguishe s itsel f b y th e variet y o f tool s i t employ s
to establis h results , man y o f whic h belon g t o th e mai n stream s o f arithmetic . I t
is no t par t o f analysi s no r o f an y particula r disciplin e o f mathematics , howeve r i t
does interac t indee d wit h variou s fields. Therefor e everybod y seem s t o vie w th e
subject differently . Thi s vas t diversit y o f concept s o f analyti c numbe r theor y i s it s
great attraction . Ou r desir e i n thi s boo k i s t o exhibi t th e wealt h an d prospect s
of th e theory , it s charmin g theorem s an d powerfu l techniques . Howeve r i t i s no t
our primar y objectiv e t o giv e proof s o f th e stronges t results , althoug h i n man y
cases w e com e quit e clos e t o th e bes t possibl e ones . Rathe r w e favo r a reasonabl e
balance betwee n clarity , completenes s an d generality . Th e boo k wa s conceive d
with graduat e student s i n min d s o th e reade r will ofte n find tha t ou r emphasi s i s
on reasonin g throughou t th e arguments . O f cours e ou r presentatio n i s subjective ,
and i n retrospec t ma y los e it s meaning . Certainl y w e d o no t alway s follo w th e
original line s of discovery, bu t occasionall y w e do draw brie f historica l perspectives .
Leonard Eule r mus t ge t credi t fo r th e first us e o f analytica l argument s fo r th e
purpose o f studyin g propertie s o f integers , specificall y b y constructin g generatin g
power series . Euler' s proo f o f th e infinit y o f prim e number s make s us e o f th e
divergence o f th e zet a functio n an d th e correspondin g produc t ove r primes , whic h
is name d afte r him . Thi s wa s th e beginnin g o f analyti c numbe r theory . Nex t
came P . G . L . Dirichle t whos e creatio n o f the theor y o f L- functions fo r characters ,
resulting i n th e proo f o f th e infinit y o f prime s i n arithmeti c progressions , make s
him th e tru e fathe r o f analyti c numbe r theory . Pro m thes e earl y day s t o moder n
times th e distributio n o f prim e number s constitute s th e cor e o f th e subject . Thi s
will b e apparen t i n th e cours e o f ou r book . Th e first tw o chapter s cove r question s
of prime s u p t o th e elementar y method s o f P . Tchebychev .
Chapter 3 provides definition s an d basi c propertie s o f Dirichle t character s an d
the Gaus s sums . Character s o n ideal s o f imaginar y quadrati c fields ar e als o intro -
duced, no t onl y becaus e the y pla y a supportin g rol e i n subsequen t chapter s bu t t o
show a bi t o f analytic numbe r theor y beyon d th e traditiona l domai n o f the rationa l
integers a s well ; ther e wil l b e othe r example s throughou t th e book , fo r instance ,
elliptic curves .
Poisson summatio n fo r numbe r theor y i s wha t a ca r i s fo r peopl e i n moder n
communities - i t transport s thing s t o othe r place s an d i t take s yo u bac k hom e
when applie d nex t tim e - on e canno t liv e without it . Chapte r 4 presents a classica l
account o f this basic technology. Man y reader s do realize now, others will figure out
later, tha t w e ar e alread y talkin g abou t idea s o f modula r forms . Bu t w e continu e
our consideration s alon g traditiona l line s (bot h classica l an d mor e recen t ones )
before th e concep t o f modularit y take s th e leadin g position .
1
2 INTRODUCTION
The celebrate d memoi r o f B . Rieman n o n th e zet a functio n i s embedde d i n
the contex t o f abstrac t L-function s i n Chapte r 5 . I t i s no t ou r styl e t o conside r
things i n term s mor e genera l tha n necessary , s o definin g a clas s o f L-function s
which suits minimum requirement s o f our forthcomin g application s wa s not withou t
difficulty an d hesitation . I n thi s wa y we could conve y to dedicate d researcher s tha t
generalizations ar e no t alway s straightforward . Fo r instance , t o establis h th e zero -
free region for L-functions o f degree > 1 one cannot rel y on the same principles as for
the Dirichlet L-functions . Th e key ingredient i s the Rankin-Selberg convolution. O n
the othe r hand , th e proble m o f exceptiona l zer o i s resolve d fo r man y automorphi c
L-functions o f degree > 1 (not withou t cleve r constructions ) whil e i t remain s ope n
for th e L-function s wit h rea l characters . Furthermor e i n Chapte r 5 a messag e i s
sent tha t a bette r lif e exist s i n th e worl d o f automorphi c form s tha n i n th e zo o of
degree on e L-functions .
Analytic number theor y does not mea n non-elementary. Th e first autho r recall s
that hi s first seriou s encounte r wit h analyti c numbe r theor y starte d b y readin g th e
lovely boo k o f A.O . Gelfon d an d Yu.V . Linnik , "Elementar y Method s o f Analyti c
Number Theory" . Whe n a n ambitiou s beginne r start s fro m ther e her/hi s lov e o f
the subjec t i s seale d forever . Tr y i t yourself ! On e i s instantl y capture d b y siev e
methods. I n thi s boo k w e do no t hav e spac e t o giv e justice t o thi s marvelou s idea ,
nevertheless Chapte r 6 should suffic e fo r basi c applications .
Next come s the "Larg e Sieve" , which is not a sieve but a name fo r othe r things .
Yes, i t di d originat e fro m a shor t pape r b y Linni k o n a siev e problem , bu t i t too k
time t o recogniz e th e tru e natur e o f thes e ideas . I n Chapte r 7 we revea l ou r view -
point an d th e crucia l attribute s (spectra l completeness , orthogonality) , the n w e
demonstrate th e amazin g powe r o f th e larg e siev e o n selecte d ol d an d ne w prob -
lems. Othe r feature s o f the large sieve are scrutinized showin g the good an d th e ba d
sides. Fo r example , th e approac h usin g th e dualit y principl e i s fruitful fo r harmon -
ics o f degre e on e (characters ) whil e producin g poo r result s fo r harmonic s o f large r
degree (lik e fo r exampl e th e eigenvalue s o f Heck e operators). Th e controvers y ove r
the prope r plac e o f th e larg e siev e i s academic . Simpl y speakin g th e larg e siev e
inequalities ar e part s o f bilinea r form s theory .
Estimates fo r exponentia l sum s ar e th e first tool s whic h deepl y penetrat e th e
problems o f analyti c numbe r theor y beyon d natura l structures . Thes e canno t b e
grasped b y harmoni c analysi s alone . Se e what cleve r us e peopl e mad e o f th e prop -
erty tha t a shifte d interva l i s anothe r interval , tha t addin g a n intege r t o a se t o f
integers yield s agai n a se t o f integer s (sorry , prime s ar e no t preserved!) . W e chal -
lenge algebraist s t o fin d a structura l explanatio n o f th e powe r o f suc h arguments !
They shoul d rea d Chapte r 8 t o fin d wha t H . Wey l buil t ou t o f thes e observa -
tions. Va n de r Corpu t an d Vinogrado v ar e als o th e mai n figures fro m th e earl y
stages o f tha t discipline . A lo t o f wor k an d talkin g wen t int o ou r presentatio n o f
Vinogradov's method , becaus e i t i s not quit e correctl y explaine d i n numerou s pub -
lications. A t som e poin t Vinogrado v depart s fro m th e Wey l differencin g proces s
and treat s multi-dimensiona l exponentia l sum s a s bilinea r form s (thi s i s th e wa y
we think o f i t anyway) .
The nex t tw o chapters sho w more recent technolog y which was developed t o re-
place the unprove n Rieman n hypothesi s i n applications t o th e distributio n o f prime
numbers. W e ar e talkin g abou t estimate s fo r th e numbe r o f zero s o f L-function s i n
vertical strip s whic h ar e positivel y distance d fro m th e critica l line . Hopefull y i n a
future on e wil l sa y w e wer e wastin g tim e o n studyin g th e empt y set . Grea t idea s
INTRODUCTION 3
are camouflage d ther e i n argument s o f enormou s complexity , s o thi s migh t no t b e
enjoyable fo r everyon e a t first . Howeve r i f you thin k th e Rieman n hypothesi s i s not
provable i n you r lifetime , pleas e rea d an d admir e thes e unconditiona l substitutes .
Special mentio n goe s t o Hug h Montgomery , Marti n Huxle y an d Matt i Jutil a fo r
the mos t origina l contributions .
Although w e are primar y intereste d i n rationa l integer s on e ca n lear n an d ben -
efit a lo t fro m arithmeti c o f other fields . No t onl y fro m th e numbe r field s o r p-adi c
fields, bu t indirectl y from th e fields of finite characteristi c a s well. Particularl y fruit -
ful ar e th e method s o f exponentia l sum s ove r th e finit e fields . I n Chapte r 1 1 we
prove (amon g othe r things ) th e Rieman n hypothesi s fo r specia l curve s whic h yield s
the celebrated estimat e of Weil for Kloosterma n sums . Th e Kloosterma n sum s hav e
been employe d t o solve various problems o f analytic numbe r theor y fro m th e begin -
ning o f thei r creatio n i n 1 926 . W e als o mentio n briefl y th e stat e o f th e knowledg e
of exponentia l an d characte r sum s ove r algebrai c varieties . Application s o f thes e
are harde r t o make , ye t ther e i s a handfu l o f example s i n th e literature . I t wa s
a painfu l decisio n t o exclud e al l bu t th e simples t fro m presentatio n i n thi s book .
Otherwise t o d o ful l justic e fo r thes e highl y sophisticate d idea s w e woul d hav e t o
choose th e mos t complicate d applicatio n fo r whic h w e hav e n o room . I t suffice s t o
say that a preparation o f a given problem o f analytic numbe r theor y t o a n estimat e
for characte r su m ove r varietie s ca n b e th e state-of-the-ar t i n it s ow n right , neve r
mind tha t th e final argumen t i s powered by the outside forces o f algebraic geometry .
Dirichlet character s ar e alread y discusse d i n Chapte r 3 and w e return t o the m
in Chapte r 1 2 to trea t ver y shor t characte r sums . Agai n on e mus t b e inventiv e t o
break limit s o f natura l structures . Burges s theore m i s a fin e example .
Sums over primes ar e treated i n the next chapter . Whe n Vinogrado v succeede d
in estimating sums over primes of additive characters, which he needed for a solution
to th e ternar y Goldbac h proble m i n conjunctio n wit h th e circl e method , i t wa s a
shocking result . Befor e hi m th e Gran d Rieman n Hypothesi s coul d d o th e job , bu t
keep in mind that th e Riemann hypothesi s is still not established . Th e original ideas
of Vinogradov were borrowed fro m combinatoria l sieve and were rather complicated .
Recently develope d identitie s offe r muc h simple r treatment s o f mor e genera l sum s
over primes . A s the y shar e th e sam e fundamenta l principl e (reducin g th e su m t o
bilinear forms ) th e result s ar e prett y muc h th e same , s o th e choic e o f th e metho d
is a matte r o f tast e an d technica l convenience . T o captur e th e ke y element s i n
Chapter 1 3 we develop mor e tha n on e identity .
A popular criterio n for analyti c number theor y is that comple x variable analysi s
is bein g used . Perhap s i t i s bette r t o sa y harmoni c analysis , sinc e th e actio n o f
the latte r i s mor e profound . Fo r a lon g time , analyti c numbe r theor y flourishe d
exclusively from abelia n harmonic analysis, that i s to say from th e Fourier transfor m
in IR n. Ther e i s stil l a grea t potentia l i n thi s classica l analysi s t o b e explored .
However muc h stronger fertilizer s bega n to act o n the soi l of analytic number theor y
in recen t times . Thes e ar e automorphi c functions . O f course , modula r form s hav e
been driving algebraic aspect s of number theor y muc h longer , bu t i n a limited scop e
(confined t o holomorphic forms) . Ne w resources of automorphic theor y are found i n
the spectra l analysis , th e foundatio n o f which wa s le d b y H . Maas s an d A . Selber g
at th e turn o f the 1 940' s (real-analyti c cus p forms, Eisenstei n series , trace formula) .
In simpl e term s a non-abelia n harmoni c analysi s foun d it s rol e i n analyti c numbe r
theory. Trul y effectiv e expansio n o f spectra l method s int o analyti c numbe r theor y
began abou t twent y fiv e year s ago , changin g th e fac e o f eithe r subjec t irrevocably .
4 INTRODUCTION
This boo k barel y addresse s th e fascinatin g issue s o f th e ne w directio n throughou t
Chapters 1 4 , 1 5 , an d 1 6 . Ou r feature d applicatio n i s t o estimatio n fo r sum s o f
Kloosterman sums . Thi s i s a goo d choic e (i f n o mor e ca n b e accommodated) ,
because th e reade r ca n appreciat e th e ne w tool s b y comparin g wit h th e earlie r
results derive d i n Chapte r 1 1 b y algebrai c considerations . Anothe r applicatio n o f
the spectra l theor y o f automorphi c form s t o a n arithmetica l questio n i s presente d
in Chapte r 21 , tha t i s t o th e equidistributio n o f root s o f quadrati c congruence s
of prim e moduli . Th e spectra l theor y continue s t o gro w extensively , s o i t woul d
be prematur e t o wra p i t u p her e o r i n an y othe r book . Fo r furthe r readin g w e
recommend [1 3] , [Sa3] .
Although th e spectra l method s o f automorphi c form s predominat e curren t re -
search i n analyti c numbe r theory , th e traditiona l problem s continu e gettin g ou r
attention wit h respectfu l intensit y throughou t th e remainin g chapters . Grea t trea -
sures o f th e subjec t mustn' t b e burie d i n th e past . Firs t o f al l a newcome r shoul d
learn th e storie s o f prime s i n arithmeti c progression s t o larg e moduli . I n Chapte r
17 she/h e wil l find ho w E . Bombier i an d A.I . Vinogrado v bypasse d th e Rieman n
hypothesis t o establis h (b y th e larg e siev e an d othe r means ) unconditiona l result s
with application s a s goo d a s on e ca n ge t fro m th e R H itself . O f cours e ou r argu -
ments ar e no t identica l wit h th e origina l one s (o f 1 965 ) sinc e w e take advantag e o f
later simplification , i n grea t measur e du e t o P.X . Gallagher .
Chapter 1 8 goes further bac k to 1 94 4 when Linnik gave an extraordinary boun d
for th e leas t prim e i n a n arithmeti c progression . Fo r a lon g tim e thi s boun d wa s
considered a s th e mos t difficul t theore m i n analyti c numbe r theory . An d yes , i t i s
still har d b y today' s standards , an d on e ca n stil l lear n a lo t fro m th e technolog y
applied! Face d with the obstacle of the exceptional zero, Linnik brings the repulsio n
effect (h e calls it Deuring-Heilbronn phenomenon ) t o a new level; amazingly enoug h
he turn s th e proble m t o hi s advantage ! Thi s i s a fascinatin g developmen t i n th e
history o f analyti c numbe r theor y whic h w e recommen d on e shoul d maste r fo r a
better understandin g o f the statu s o f th e exceptiona l zer o today .
Once upo n a tim e th e famou s Goldbac h proble m wa s wort h a millio n dolla r
prize award . Fo r application s th e proble m (representation s o f eve n integer s b y th e
sums o f two primes ) ha s n o grea t merit , bu t a s a n intellectua l challeng e on e woul d
be prou d t o crac k it . Probabl y somethin g ne w abou t prim e number s woul d b e
revealed then . Rea d Chapte r 1 9 to improv e you r chances .
Chapter 2 0 i s serious . Her e analyti c method s stor m th e domai n o f diophan -
tine equations , whic h fro m ancien t Greek s wa s exclusively a busines s o f arithmetic .
Started b y Hard y - Ramanujan , continue d b y Hard y - Littlewoo d an d develope d
substantially furthe r b y Kloosterman, th e circl e method use s orthogonality o f addi -
tive characters to detect equations , no t onl y to solve algebraic equation s bu t a larg e
class of additive problem s ove r special integers a s well. Th e toughes t ar e the binar y
additive problems. The y are not completely solved by the Kloosterman method, bu t
at leas t w e get a very reliable picture o f what th e true asymptoti c fo r th e numbe r o f
solutions shoul d be . Kloosterma n sum s whic h w e covered i n the precedin g chapter s
are instrumenta l i n th e circl e methods . After classica l idea s w e propos e a mor e
direct varian t whic h i n principl e shoul d produc e th e sam e results , howeve r withou t
employing Kloosterma n sums . On e shoul d rea d Chapte r 2 0 wit h a n ope n mind ,
separate technica l (stil l attractive ) element s fro m conceptua l device s t o see clearl y
INTRODUCTION 5
the connection s wit h modular forms . Certainl y Kloosterma n an d Rademache r wer e
aware o f thes e intrinsi c connections , whil e the y ar e overlooke d b y som e specialist s
in th e circl e method .
Equidistribution problem s fo r sequence s o f specia l integers , lattic e point s i n
various domains , solutio n t o diophantin e equations , etc , constitut e a heav y indus -
try over the analyti c number theory. W e regret ther e is no space to run this industr y
in full capacit y i n the book . Th e boo k of M.N. Huxle y [Hu4 ] treat s onl y the lattic e
point problems , howeve r quit e deeply . I n Chapte r 2 1 we are dealin g wit h th e prob -
lem o f distributio n o f roots o f a quadrati c equatio n reduce d modul o prime . A s th e
prime modulu s tend s t o infinit y w e sho w tha t th e root s ar e uniforml y distributed .
The argument s includ e almos t everythin g tha t w e develope d i n th e boo k s o far ,
thus showin g tha t th e industr y i s robust .
Because o f failur e o f th e uniqu e factorizatio n o f algebrai c integers , th e arith -
metic of number fields is not a s easy as for rationa l numbers an d sometime s perplex -
ing. Th e complexity i s measured b y the order o f the ideal class group. Naturall y th e
case of imaginary quadrati c fields receive d th e first an d th e mos t attentio n becaus e
units do not interfere . W e do know that th e class number grow s to infinity (s o there
is only a finite numbe r o f imaginary quadrati c fields with a fixed class number), bu t
the seriou s issu e i s t o estimat e th e clas s numbe r effectively . Chapte r 2 2 describe s
the proble m thoroughl y an d prepare s th e groun d fo r th e advance s i n Chapte r 23 .
The effective lowe r bound fo r the class number (du e to D. Goldfeld) ma y not appea r
strong fo r demandin g researchers , ye t i t i s deep wit h respec t t o result s take n fro m
other sources . Firs t o f all it use s the Gross-Zagie r formul a fo r //-function s o f ellipti c
curves a t th e centra l point . W e d o provid e a substantia l overvie w o f th e involve d
arguments fro m ellipti c curves , althoug h thes e ar e mor e geometri c tha n analytic .
The analyti c argument s themselve s ar e quit e delicate . Actuall y the y cam e first,
the L-function s o f ellipti c curve s bein g supplementary . Indee d w e worke d ou t a n
effective lowe r boun d fo r th e clas s number whic h depend s o n th e orde r o f vanishin g
of general L-function s o f degre e two , suspectin g tha t th e requirement s ar e satisfie d
by quit e a fe w o f them .
In Chapte r 2 4 we prove a very classical result o f Selberg that a positive propor -
tion o f zero s o f th e Rieman n zet a functio n lie s o n th e critica l line . Thi s i s a goo d
place t o lear n abou t th e mollificatio n technique s ( a kin d o f smoothing) , whic h i s
used i n man y work s toda y an d wil l reappea r i n Chapte r 26 .
Assuming th e Rieman n hypothesis , H.L . Montgomery reveale d i n 1 97 4 that th e
distribution o f zero s o f £(s ) follow s th e behavio r o f eigenvalue s o f certai n "ensem -
bles" o f unitar y matrices . Mor e recentl y physicist s joine d th e tea m o f worker s i n
number theory , creatin g a ne w excitemen t an d hop e fo r finding a pat h t o a proo f
of the Rieman n hypothesis . Thi s i s the mai n objectiv e o f the so-calle d rando m ma -
trix theory , on e o f th e mos t popula r subjec t an d drivin g force s o f curren t analyti c
number theory . I t offer s reliabl e model s fo r predictin g th e behavio r o f arithmetica l
quantities whic h fo r a lon g tim e wer e shroude d i n mystery . Th e consistenc y o f
the rando m matri x theor y wit h th e harmon y o f integer s stil l seem s quit e surpris -
ing. Whateve r th e futur e o f this enterpris e wil l be , du e t o th e curren t cooperation ,
analysis i s closer t o arithmeti c tha n eve r before . A subject o f suc h magnitud e can -
not b e full y presente d i n a shor t space . Therefor e i n Chapte r 2 5 w e stic k t o th e
original them e o f th e correlatio n o f zero s o f £(s ) an d it s variation s o n th e zero s o f
6 INTRODUCTION
families o f automorphi c L-function s whic h ar e nea r th e centra l point . W e leave i t
for th e reader t o judge whethe r th e idea s o f random matri x theor y ar e realistic fo r
launching a n attac k o n the Rieman n Hypothesis .
In recen t investigation s th e centra l value s o f L-function s appea r i n a variet y
of formula s wit h vanishin g o r non-vanishin g assumptions . Tak e fo r exampl e [IS2 ]
where a n effectiv e lowe r boun d fo r th e clas s numbe r o f imaginar y quadrati c fields
is derive d essentiall y fro m th e non-vanishin g o f centra l value s o f familie s o f L -
functions, t o th e contrar y o f the vanishin g requirement s i n the previou s investiga -
tions. Anothe r exampl e i s the formul a o f T. Watso n [Wa ] by mean s o f which th e
quantum-ergodicity conjectur e (tha t i s the equidistributio n o f Maas s cus p forms )
is reduced t o a subconvexity boun d fo r certai n L-function s o f degree four . W e con-
sider i n detai l on e non-vanishin g statemen t whic h ha s application s t o arithmeti c
geometry.
We hope thi s boo k will show the picture o f analytic numbe r theor y i n plenty of
colors. Howeve r we must sa y that a lot of significant topic s are left out . Missin g are
the dispersio n method , th e amplificatio n metho d (se e [M2]) , som e analyti c tech -
niques fro m diophantin e approximation s an d transcendence . Moreove r probabilit y
arguments ar e barel y exposed , an d w e didn' t touc h ergodi c theor y either , whos e
impact o n number theor y ha s been fel t strongl y i n the last years .
We als o try to show som e detail s o f the powerful theorie s whic h ar e developin g
as th e mos t usefu l ne w tools fo r analyti c numbe r theory , i n particula r th e theor y
of higher-degree automorphi c form s an d their L-functions , an d algebraic geometry ;
young researcher s i n particular shoul d b e encouraged t o develop expertis e i n thes e
subjects. I t i s certain tha t spectacula r application s hav e onl y begu n an d more wil l
be ope n t o thos e wh o understan d bot h sides . Dually , arithmeti c geometr y an d
algebraic numbe r theor y als o giv e an d promis e a wealt h o f ne w questions, o r new
aspects o f old ones, wher e th e skills an d technique s o f analytic numbe r theor y wil l
be tested to the utmost. Hopefull y the y will bring rich rewards to those who will try
to com e to these ope n fields... W e barely mentio n som e questions relate d t o ellipti c
curves bu t w e believe tha t ther e i s muc h mor e t o discover . Th e dee p conjecture s
of Lan g an d Trotte r [LT ] ar e alread y quit e popular , an d a fe w othe r challengin g
problems ma y be foun d i n [Kol] .
The exercise s insid e eac h sectio n serv e a dua l purpose , som e ar e t o improv e
the reader' s skill , th e other s serv e a s additiona l informatio n abou t th e subject .
Historical remark s ar e brief , t o giv e som e orientatio n i n th e developmen t o f th e
matter, rathe r tha n t o credi t exhaustivel y th e inventors . Th e only advic e w e offe r
to new researchers is read! read ! read ! man y papers with complete proofs. Knowin g
a resul t i n analytic numbe r theor y i s only the first ste p to liking it; more importan t
and rewardin g i s to understan d th e argument s o f its proof . Ou r viewpoin t i s tha t
making mathematic s shoul d no t b e rate d lik e breakin g spor t records . Sometime s
the stronges t resul t i s boring whil e a slightl y weake r on e generates grea t pleasure .
Formal prerequisite s fo r muc h o f the boo k ar e rather slight , no t goin g beyon d
differential calculus , comple x analysi s an d integration, especiall y Fourie r serie s and
integrals. I t i s more importan t fo r the reader t o have or acquire a good understand -
ing o f how to manipulat e inequalitie s an d not simpl e identities .
INTRODUCTION 7
In late r chapter s automorphi c form s becom e important . W e have include d tw o
survey chapters, ye t w e expect tha t man y readers will have already som e knowledg e
of this importan t topic , o r wil l stud y i t independently .
In som e section s (fo r instanc e Section s 5.1 3 an d 5.1 4) , whic h ar e intende d a s
convenient reference s fo r certai n fact s an d result s which ar e hard t o locate in prope r
form i n th e literature , w e assum e tha t th e reade r ha s som e familiarit y wit h othe r
topics, suc h a s representation s o f group s an d algebrai c geometry .
Sections o f this boo k wer e writte n ove r a perio d o f time , therefor e reader s wil l
notice a sligh t chang e o f styl e an d repetition . W e thin k tha t a smal l redundanc y
is helpfu l fo r readin g lon g arguments . Occasionall y th e sam e objec t i s introduce d
again i n a differen t chapte r i n loca l terminolog y whic h shoul d b e mor e familia r i n
a particula r context . W e believ e thi s flexibilit y i s justified fo r comfort , eve n a t th e
expense o f losin g uniqueness .
Our notation s ar e mostl y standard . Bu t sinc e inequalitie s wit h unspecifie d
constants ar e the lifeblood o f analytic number theory , an d sinc e there ar e sometime s
controversies o n thi s subject , w e spel l ou t th e meanin g o f th e variou s compariso n
symbols 0 ( ) , o() , ~ , x o r <C . Mos t important , w e us e Landau' s / = 0(g) an d
Vinogradov's / < C g a s synonyms ; thu s f(x) < C g(x) fo r x G X (wher e X mus t b e
specified eithe r explicitl y o r implicitly ) mean s tha t \f(x)\ < Cg(x) fo r al l x G X
and som e constan t C ^ 0 . An y valu e o f C fo r whic h thi s hold s i s called a n implie d
constant. Sinc e a constan t i s mos t ofte n a functio n lookin g fo r a variable , th e
"implied constant " wil l sometimes depen d o n other parameters , whic h we explicitly
mention a t th e mos t importan t point s (bu t sometime s i t i s clea r fro m context) . I f
there i s no othe r dependency , w e speak o f "absolut e constants" . Thi s usag e mean s
that ou r <9() , for instance , i s not th e sam e a s tha t o f Landa u o r Bourbaki . W e us e
/ x g t o mea n tha t bot h relation s / < C g an d g <C / hold , o f cours e wit h possibl y
different implie d constants .
However / = o(g) fo r x —> XQ means tha t fo r an y e > 0 ther e exist s som e
(unspecified) neighborhoo d U e o f x$ suc h tha t \f(x)\ ^ eg(x) fo r x G U £. The n
h ~ g mean s h = g + o(g). Thos e ar e th e sam e a s i n Landa u o r Bourbaki .
Among th e fe w notatio n whic h ma y b e unfamilia r t o a beginner , w e mentio n
that p k\\m, wher e p i s prim e an d k a n integer , mean s tha t p k divide s m exactl y
(i.e. p kJrl doe s no t divid e m) . Th e integra l par t [x] is th e intege r n suc h tha t
n< x < n+ 1 .
We sometime s us e th e notatio n Yl* t o denot e sum s restricte d t o a subse t o f
"primitive" objects , whic h wil l b e indicate d i n eac h case , an d ^ t o denot e a su m
restricted t o squarefre e numbers .
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/02
CHAPTER 1
ARITHMETIC FUNCTION S
1.1. Notatio n an d definitions .
Throughout thi s boo k Z , Q, R, C denot e th e set s o f integers, rationals , rea l an d
complex number s respectively . Th e additio n o f complex number s make s C a group ,
and Z c Q c R c C i s th e natura l inclusio n o f subgroups . Th e Lebesqu e measur e
dx o n th e additiv e grou p R i s th e invarian t measur e (wit h respec t t o additiv e
translations). Th e subset s Q * C R* C C * of the non-zer o number s ar e closed unde r
multiplication, the y ar e considere d a s multiplicativ e groups . I n R * w e distinguis h
the subgrou p o f positiv e rea l number s R + whic h i s endowe d wit h th e invarian t
measure x~ ldx (wit h respec t t o multiplicativ e translations) .
The positiv e integer s ar e calle d natura l numbers . Th e se t o f natura l number s
N = { 1 , 2 , 3 , . . . } contain s th e se t o f prime s P = {2,3 , 5, 7 , . .. }. Thes e ar e funda -
mental element s o f arithmeti c sinc e ever y natura l numbe r factor s uniquel y (u p t o
a permutation ) int o power s o f distinc t primes . Th e distributio n o f prime s i s one of
basic problem s o f analyti c numbe r theory . Prim e number s ar e ofte n denote d b y p.
First o f all, to cultivate analyti c number theor y on e must acquir e a considerabl e
skill fo r operatin g wit h arithmeti c functions . W e begi n wit h a fe w elementar y
considerations.
A comple x value d functio n / define d o n N is calle d a n arithmeti c function . I n
some context s a n arithmeti c functio n / : N - > C i s bette r see n a s th e sequenc e
A = (a n ) wit h a n = f(n). Ver y ofte n i n suc h contex t A i s supporte d o n number s
of primar y interest , an d a n i s jus t a multiplicity , o r som e sor t o f weight , whic h
is introduce d whe n countin g suc h numbers . A differen t jo b i s assigne d t o certai n
functions / : N — > C (suc h a s Dirichle t character s o r Heck e eigenvalues ) whic h
we cal l "arithmeti c harmonics. " Thes e pla y a n instrumenta l rol e i n analyzin g th e
primary sequenc e A = (a n). Essentiall y th e arithmeti c harmonic s ar e employe d
with th e primar y sequenc e A = (a n) t o produc e a famil y o f twiste d sequence s
Af = (a nf(n)). Th e twiste d sequence s wit h appropriat e harmonic s ar e capabl e o f
selecting a special subsequence of A whic h is our target (thin k of Dirichlet character s
being employe d t o detec t prime s i n arithmeti c progressions) .
Thanks t o th e additiv e an d multiplicativ e structure s o f integers th e tw o impor -
tant classe s o f arithmeti c function s ar e distinguished . A functio n / : N— > C i s a n
additive functio n i f i t satisfie s
(1.1) f(mn) = f(m) + f(n)
for m,n relativel y prime . I f thi s propert y hold s fo r al l ra, n , the n / i s sai d t o b e
completely additive . Fo r exampl e f{n) = log n i s completel y additive . Similarly , a
9
10 1. ARITHMETI C FUNCTION S
function / : N—» C i s multiplicative i f i t satisfie s
(1.2) f(mn) = f(m)f(n)
for ra,n relativel y prime , an d / i s completel y multiplicativ e i f (1 .2 ) hold s fo r al l
ra,n. Fo r exampl e f(n) = n~ s wit h 5 G C i s completel y multiplicative . Obviously ,
the additiv e an d th e multiplicativ e function s ar e determine d b y thei r value s a t
prime powers . W e hav e / ( l ) = 0 i f / i s additive , / ( l ) = l i f / i s multiplicativ e no t
identically zero .
1.2. Generatin g series .
To an arithmeti c functio n / w e shall attac h th e tw o most natura l infinit e serie s
(1.3) E f(z) = £/(n)z" ,
n
(1.4) D f(s) = ^/(n)n" s,
n
where z, s ar e comple x variable s (i n th e cas e o f th e powe r serie s Ef(z) w e ma y
also includ e n = 0 i f /(0 ) i s defined) . Thes e ar e calle d th e generatin g series , o r
functions, o f / . Othe r kind s o f generatin g function s ca n b e attractiv e a s wel l t o
capture distinc t propertie s o f / . Fo r man y arithmeti c function s th e correspondin g
generating serie s converge s absolutel y i n a smal l domain , an d i t i s a n interestin g
question ho w fa r th e serie s ca n b e analyticall y continued ? I f th e generatin g se -
ries extend s beyon d th e rang e o f absolut e convergence , the n thi s propert y usuall y
manifests som e grou p structur e i n th e coefficient s f(n) ("random " serie s canno t
be continued) . Fo r exampl e th e analyti c continuatio n o f Arti n L- functions i s inti -
mately relate d t o th e reciprocit y law s i n numbe r fields (i n th e abelia n cas e a t an y
rate), th e analyti c continuatio n o f Hasse-Wei l L- functions i s a majo r ste p toward s
the modularit y o f th e correspondin g ellipti c curve s (ove r Q) , an d s o on .
The generating power series were introduced b y L. Euler (1 707-1 783 ) fo r study -
ing specia l additiv e problem s (whe n doin g s o h e wa s th e first t o mi x analysi s wit h
arithmetic). Le t Ef(z) an d E g(z) b e th e serie s fo r / an d g. The n th e produc t
n
(1.5) E f(z)Eg(z) = J2 Hn)z
n
yields th e powe r serie s fo r th e functio n h give n b y (th e additiv e convolution )
(1.6) h(n) = J2 fW9(™)-
£-\-m=n
Applying Cauchy' s theore m on e expresse s th e coefficien t h(n) b y th e contou r inte -
gral
1 n
(1.7) h(n) = ^- f E f(z)Eg(z)z' ' dz.
J\z\=r
Hence, give n reasonabl e analyti c propertie s o f Ef(z)E g(z) on e ca n deduc e a goo d
estimate fo r h(n), o r even a n asymptoti c formul a a s n tend s t o oo. O f course, Eule r
did no t kno w Cauchy' s theorem , s o hi s idea s wer e limite d t o th e powe r serie s fo r
which on e has a secondary expression , givin g a n exac t formul a fo r f{n) rathe r tha n
1. A R I T H M E T I C F U N C T I O N 11
an approximatio n (stil l no t a tautology). W e give three cut e examples . Firs t i s th e
identity
oo oo
$>" = II(1 + *2m) = ( 1 -*r 1
10
which i s nothin g bu t a n analyti c statemen t o f th e uniquenes s o f binar y expansio n
of natura l numbers . Her e i s anothe r identity :
oo oo
X>(n)* = II( 1 -* ro )~ 1
n
10
where p(ri) i s the partition function . Les s obvious is the following formul a o f Jacobi:
oo oo
(1.8) £ V * = JJ(1 - z m){\ - z m+1 /2f.
1
— oo
About ninet y year s ag o th e integra l representatio n (1 .7 ) fo r th e additiv e equatio n
(1.6) gav e birth t o th e circl e method. Thes e idea s wil l be develope d i n Chapte r 20 .
By changin g th e variabl e z int o
2niz
(1.9) e(z) = e
the powe r serie s (1 .3 ) i s see n a s a Fourie r series , whic h i s a commo n practic e i n
modern analyti c numbe r theory .
The generatin g serie s Df(s) i s calle d th e Dirichle t serie s afte r bein g use d i n
Dirichlet's fundamenta l work s o n prime s i n arithmeti c progressions , thoug h th e
special cas e
oo
S
(1.10) C(*)=£n-
1
was alread y considere d b y Euler .
The Dirichle t serie s i s particularl y attractiv e fo r multiplicativ e function s (bu t
not limite d t o thi s case) . For , i f / i s multiplicative, the n
(l.n) D f(s) = n(i+ f(p)p~s+f(p2)p~2s+•••)
p
provided th e involve d serie s i n prim e power s an d th e produc t ove r prime s (calle d
the Eule r product ) converg e absolutely . I n particular , w e hav e
(i.i2) C(* ) = II(i-p-T 1
V
if R e (s) > 1 . Thi s representatio n o f £(s ) a s a n Eule r produc t i s a n expressio n i n
analytic languag e o f the uniqu e factorization o f natural number s int o distinct prim e
powers.
12 1. ARITHMETI C FUNCTION S
1.3. Dirichle t convolution .
Assuming tha t th e Dirichle t serie s Df(s),D g(s) converg e absolutel y i t follow s
that th e product Df(s)D g(s) i s also give n b y a Dirichle t series , namel y w e have
oo
s
Df(s)Dg(s) = J2 Kn)n~
1
with th e coefficient s
(1.13) Mn ) = £/(d)ffQ) -
d\n
The arithmeti c functio n h defined b y (1 .1 3 ) (neve r min d th e convergenc e o f gener -
ating series ) i s called th e Dirichle t (o r multiplicative) convolutio n o f / an d g, an d
it i s denoted b y f *g.
The se t of all arithmetic function s wit h th e usual additio n + an d the operatio n
• i s a commutative ring . Th e function 5 : N -> C whose Dirichlet serie s is Ds(s) = 1
is the uni t elemen t o f this ring , i.e.,
J 1 i f n= 1 ,
(1.14) S(n)
~ \ 0 ifn>l .
The functio n ((s) define d i n R e (s) > 1 by (1 .1 0 ) i s calle d th e Rieman n zet a
function (afte r hi s semina l memoi r [Rie]) , it s Dirichle t coefficient s mak e th e con -
stant functio n l(n ) = 1 for al l n ^ 1 . B y virtu e o f (1 .1 2 ) th e invers e o f £(s ) ha s
also a Dirichle t serie s expansion , namel y
(1.15) ' -TT 1^ \_^M» s
0
C(s) 11V psj ^m '
P
say, wit h coefficient s
(1.16) n(m) = |
(-l)r if m = pi . .. pr wit h pi ,.. . , p r distinc t
0 otherwise.
The functio n fi(m) wa s introduced i n 1 83 2 by A. F. Mobius , an d it bear s hi s nam e
ever since . Moreover , b y th e Eule r produc t (1 .1 2 ) th e logarith m o f ((s) ha s th e
Dirichlet serie s expansio n
(i.i7) iogC0 0= £ £ r y •is
e=i P
Recall tha t th e Dirichle t convolutio n * correspond s t o th e multiplicatio n o f th e
generating Dirichle t series , therefor e th e identit y £(s ) • (~ 1 (s) = 1 reads a s
1i fm— 1 ,
(1.18) a(m ) = 5>(<9 = {
0 ifra>l .
Using thi s formul a on e obtains th e followin g
1. A R I T H M E T I C F U N C T I O N 13
MOBIUS INVERSION . For any / , g : N — > C the following two relations are
equivalent:
(1-19) g(n) = J2f(d),
d\n
(1.20) /(» ) = 5>(d)sQ)-
d\n
REMARK. T O be accurate wit h th e history th e inversion formula s i n the above
form wer e state d onl y i n 1 85 7 by R . Dedekind . Th e origina l versio n state d b y
Mobius wa s somewhat different , i t amount s t o sayin g tha t fo r an y rea l variabl e
functions F , G : [1, x]— > C the following tw o relations ar e equivalent:
(1.21) G{x) = Y. F (xh),
n^x
(1.22) F(x) = ^2 v(rn)G(x/m).
m^x
The Mobiu s functio n i s multiplicative . I f / , g ar e multiplicative, the n s o are
/ • g and / • g. I f g is multiplicative, the n
d
(1.23) E ^ ) ^ d ) = ll(1-5(P))-
d\n p\n
This produc t admit s a probabilisti c interpretation . Viewin g g(p) as a probabilit y
of som e independen t event s whic h ma y occur a t p on e can think o f (1 .23 ) a s the
probability tha t non e o f the events associate d wit h prim e divisor s o f n occur .
1.4. Examples .
Now w e give a larg e sampl e o f arithmeti c function s whic h on e encounters i n
analytic numbe r theory . Man y othe r function s wil l be introduced i n due course.
The diviso r functio n r(n) i s the number o f positive divisor s o f n, so we have
oo
y
(1.24) e(s) = £r(n)n-°.
1
More generall y Tk(n) denote s th e number o f representations o f n a s the product of
k natura l numbers , s o its Dirichlet serie s i s ( k(s). Explicitl y
1
ai
(1-25) ^)=( t-l )--(art-ll) tt'-tf-rf'-
For an y v £ C we define o v{n) b y
oo
s
(1.26) C ( s ) C ( s - ^ ) = 5 > » n - ,
1
so it is the sum of powers o f divisors
(1.27) <7,(n )= ^ < r.
d\n
14 1. ARITHMETI C FUNCTION S
We hav e th e Ramanuja n formul a
oo
(1.28) t(s)C(s - a)C(s - P)t(s - a - P)C 1 (2s _ a - /3 ) = ] T ^(n)ap(n)n'\
l
In particular ,
oo
T n 2n S
(1.29) (\s)C\2s) = J2 () ~-
1
One ca n recognize th e abov e Ramanuja n serie s a s a special cas e of the Rankin -
Selberg convolutio n L- functions attache d t o modula r form s (se e Sectio n 5.1 ) . Di -
viding (1 .29 ) b y £(s ) on e get s
oo
13
(1.30) C (s)C (2s)^J£T(n>~^
1
hence multiplyin g b y ((2s)
oo
<3(s) = £ ( £ T{m>))n-°.
2
1d m=n
This i s th e symmetri c squar e L-functio n associate d wit h ( 2(s). Dividin g agai n b y
£(s) on e get s
oo
12 - 2Uj(n)n S
(1.31) C (s)C (2s) = J2 ~
l
where u(ri) denote s th e numbe r o f distinc t prim e factor s o f n. Nex t w e ge t
oo
n n S
(1.32) ((s)CH2s) = J2\^ )\ ~ -
1
Note tha t |/i(n) | = ^ 2(n) i s th e characteristi c functio n o f squarefre e numbers . B y
(1.32) w e hav e
2
(1.33) „ (n) = £>(<*) •
d2\n
Inverting (1 .32 ) w e obtain anothe r Dirichle t series ,
oo
C\s)C('2s)='£Hn)n-s
1
with coefficient s
(1.34) A(n ) = (-l) n < n >
where fi(n ) denote s th e tota l numbe r o f prime factor s o f n (counte d wit h multiplic -
ity). Thu s A(n) , which i s called th e Liouvill e function , i s completel y multiplicativ e
1. A R I T H M E T I C F U N C T I O N 15
and i t agrees wit h ji(n) on squarefree numbers . Th e Euler functio n <p(n) i s defined
by the series
(1-35) fcl) =gv,(n)n-.,
so it is also give n by
(i.36) * > ( » ) = » n ( i - £ ) = » E ^ -
p\n d\n
One ma y think o f ip(ri)/n a s th e probabilit y tha t a randoml y chose n intege r m
is coprim e wit h n. I n other words , (p(n) represent s th e number o f residue classe s
(a mo d n) wit h (a,n) — 1 .
Very importan t arithmeti c function s emerg e b y differentiation. W e begin wit h
oo
-C'(«) = ^ ( l o g n ) n - s ,
1
which give s us the logarithm functio n
(1.37) L ( n ) = lo g n.
Since L(n) i s additive, i t follow s tha t L • (/ * g) = (L • / ) * g -h / * (L • g). Thi s
says tha t th e multiplication b y L is a derivation i n the Dirichlet rin g o f arithmeti c
functions.
Next th e function A(n ) ca n be defined a s coefficients i n the Dirichlet serie s for
the logarithmi c derivativ e — (log£(s)y = — C'(s)£(s)_1, so
(1.38) - ^ ( S ) = ^A(n)n" s.
1^
By th e Euler produc t (1 .1 2 ) on e computes tha t
/ 1 o n , K( v f log p i f n = pa, a ^ 1 ,
(1-39) A(n ) =< ^
(^ 0 otherwise .
Thus A(n ) i s supported o n prime powers . On e can read (1 .38 ) a s A = / i * L . Mor e
explicitly,
(1.40) A(n ) = J2 ^ lo ^=-T, V{d) log d.
d\n d\n
By Mobiu s inversio n w e obtain L — 1 • A , or equivalently,
(1.41) log n = 5^A(d) .
The functio n A(n ) wa s know n an d effectively use d b y P. Tchebyshev, nevertheles s
it i s called th e von Mangoldt function . Her e i s another identit y whic h i s useful i n
elementary prim e numbe r theory :
(1.42) A - = / X * ( L - T ) .
16 1. A R I T H M E T I C FUNCTION S
The vo n Mangold t functio n A ha s bee n generalize d t o th e highe r vo n Mangold t
function o f an y degre e k ^ 0 by A k = \i * Lk, i.e. ,
(1.43) A fc(n) = ][>(<* ) (log ^
d\n
(notice tha t A o = 6). Thi s satisfie s th e recurrenc e
(1.44) Afc+ i = LA k + A • A k
from whic h i t follow s b y inductio n tha t A k(n) i s supporte d o n number s havin g a t
most k distinc t prim e factors , i.e. , A k(n) = 0 i f uo{n) > k. Moreover , w e hav e
k
(1.45) 0^A k(n)^(logn) ,
the lowe r boun d followin g b y inductio n fro m (1 .44 ) an d th e uppe r boun d followin g
from th e formul a L k = 1 * A k, o r explicitly ,
fc
(1.46) ( l o g n ) =^Afe(d),
d\n
which come s fro m (1 .43 ) b y Mobiu s inversion . O f course , A ^ i s no t multiplicative ,
but w e hav e th e followin g hand y formul a (se e (1 .44) )
Ak(mn) = 5 Z ( * ) ^i(m )^-i( n )' i f
( m > n ) = 1-
EXERCISE 1 . Fo r an y intege r k ^ 0 an d a rea l numbe r x > 0 , defin e
(1.47) A fc(n,aO = $>(d)(log^)* .
d\n
Note tha t A k(n,x) depend s onl y o n th e squarefre e kerne l o f n. Prov e tha t
k
'k\ A A ,_ nN )/ , l_0x\g ~i
Ak(n,x) = J2 ( ! ) ^ ( D
Then, usin g A . < LJ~ lAk, deriv e tha t fo r n ^ x
(1.48) A fc(n, x) f ^ J < Afc(n ) < A fc(n, x).
Therefore A k(n,x) i s supporte d o n integer s n havin g a t mos t k distinc t prim e
divisors, an d i t satisfie s 0 < A k(n,x) ^ (logx) fe i f n ^ x .
Next sho w tha t i f (ra , n) = 1 , the n
Ak(mn,x) = ^2 [ •) Aj(n)A k-j\jn, - J .
Hence deriv e tha t fo r n — p^1 . . . p% r wit h p i , . . . , P2 distinc t prime s
k r
(1.49) A k(n,x)^r\ ( ) (logx)*- (logpi)... (logp r ).
1. A R I T H M E T I C F U N C T I O N 17
One ca n associat e a Mobiu s an d vo n Mangold t functio n t o an y multiplicativ e
function. I f Df(s) i s the generatin g Dirichle t serie s fo r / whic h ha s Eule r product ,
then /jbf and A / ar e define d a s coefficient s i n
— = £„,(„,)„,-,
(L50)
w. )^
Hence f*A.f = f-L an d A j = jjLf*(f-L). Clearl y A j i s supported o n prime powers .
If / i s completel y multiplicative , the n fjtf(n) — fJ>(n)f(ri) an d A/(n ) = A(n)/(n) .
There i s a variet y o f trul y interestin g arithmeti c functions . Th e theor y o f
modular form s i s a basi c sourc e fo r multiplicativ e functions . A s a simpl e mode l w e
pick up the functio n r(n) whic h is the number o f solutions t o a 2 + b 2 — n i n integer s
a, b. Th e generatin g Dirichle t serie s fo r r(n) i s equa l t o 4(#(s) , wher e
C*(s) = J2(Na)-°
a
is the zet a functio n o f th e imaginar y quadrati c field K = Q ( \ / ~ l ) whil e th e facto r
4 i s th e numbe r o f unit s i n K. Her e a run s ove r non-zer o integra l ideal s o f K. Al l
these ar e principa l ideal s generate d b y th e Gaussia n integer s a = a + bi G Z[>/—T] ,
a ^ O , an d i f a = (a) , the n Na = a 2 + b 2. Hence , indee d
4 s
Cx(s) = ^ r ( n ) n - .
Clearly \r{n) i s multiplicative . Th e prim e number s whic h ar e represente d a s th e
sum o f tw o square s ar e characterize d i n a beautifu l theore m o f Fermat . I t i s eas y
to se e tha t n o prim e p = — l(mod 4 ) ca n b e s o writte n an d Ferma t prove d tha t al l
other prime s ca n be (uniquel y u p to the sig n an d orde r o f a, b). I n modern languag e
this theorem o f Fermat i s just th e factorization la w in the Gaussia n domai n Z[\/—l] ;
it assert s tha t p — 2 is squar e o f a prim e ideal , p = l(mo d 4 ) split s int o a produc t
of tw o distinc t (comple x conjugate ) prim e ideals , p = — l(mod 4 ) remain s prime .
Using th e factorizatio n la w on e ca n sho w b y verificatio n o f loca l factor s tha t
where % 4 is th e non-trivia l characte r t o modulu s 4 (w e hav e X4( n) = si n ^) an d
L(s, X4 ) i s the associate d Dirichle t L- function
L{S,XA) = ^2x4(n)n~
Hence
(1.51) r(n) = 4^ X4 (d).
d\n
18 1. A R I T H M E T I C FUNCTION S
Besides th e Dirichle t serie s there i s great interes t i n studying th e Fourie r serie s
for r(n) du e t o th e additiv e natur e o f th e equatio n a 2 + b2 = n. W e hav e
oo
^2r(n)e(nz) = 0 2(z)
o
(recall (1 .9 ) tha t e(z) = e27rlz), wher e 0(z) i s the thet a functio n
oo
2
(1.52) 6{z) = ^e{n z).
— OO
This serie s converge s absolutel y fo r I m (z) > 0. Mor e generally , lettin g rk(n) b e
the numbe r o f representation s o f n a s th e su m o f k square s w e hav e
oo
J2rk(n)e(nz) = 0 k(z).
o
On one hand, th e theta function 0(z) i s recognized in analytic number theor y fo r
its abilit y t o selec t squares . O n th e othe r hand , 0{z) i s usabl e du e t o th e followin g
transformation rule :
(1.53) ef^±l\ =v( Cld)(cz + d)h(z)
\ (sZ I (J/ J
which hold s fo r an y z with I m (z) > 0 an d an y integer s a, 6, c, d wit h ad — be = 1,
c = 0 (mod 4) , wher e i/(c,d) depend s onl y o n c , d. W e hav e v 2(c,d) = x±{d), so
z/(c, d) takes onl y fou r value s ± 1 , ±z (se e (3.42 ) fo r th e exac t description) . I n othe r
words, 0(z) i s a modular for m o f weight \ wit h multiplie r v(c,d) whil e 0 2(z) i s a
modular for m o f weight on e an d characte r % 4 on r 0 (4) . I n additio n t o th e modula r
relations (1 .53 ) th e thet a functio n satisfie s th e followin g involutio n equation :
<-) *(S) - G)4»(§)-
Note tha t 0(z) doe s no t vanis h b y virtu e o f th e Jacob i produc t (1 .8) .
More generall y on e associate s a theta functio n wit h an y positiv e definit e qua -
dratic for m i n severa l variable s an d correspondin g harmoni c polynomial s (se e Sec -
tion 1 4.3) .
Yet othe r type s o f arithmetic function s ar e give n b y exponentia l sum s ove r
Z/raZ o r (Z/mZ)* . Th e mos t importan t o f thes e fo r analyti c numbe r theor y ar e
Gauss sums , Ramanuja n sum s an d Kloosterma n sums . W e mentio n her e onl y th e
quadratic Gaus s su m (fo r genera l Gaus s sums , se e Sectio n 3.4 )
G
<™»= E • ( £ ) •
a;(mod m)
They wer e evaluate d exactl y b y Gaus s a s a consequence o f th e Quadrati c Reci -
procity La w (se e Theore m 3.3) :
1 4- ?m
(1.55) G(m) = y -y V ^ -
1. A R I T H M E T I C F U N C T I O N 19
The Kloosterman sums 5(a, b; c) encompass Ramanujan sums . The y are define d
by
/, -„• > „ ,, N v ^ * /ax + bx\
(1.56) S(a,b;c)= ^ e (— J
x (mo d c)
for integer s a, b and c ^ 1 . Th e symbo l ^ * restrict s th e su m t o x coprim e wit h c
and x denotes th e multiplicativ e invers e o f x modul o c , i.e . xx = 1 (mo d c) . Not e
that S(a, b] c) i s real, an d ha s th e followin g symmetries :
(1.57) S(a,6;c ) = S(6,a;c) ,
(1.58) S(aa' , 6; c) = S(a, 6a'; c) if (a', c) = 1 .
It als o ha s multiplicativ e propertie s (b y Chines e Remainde r Theorem )
(1.59) S(a, b; cd) = S(ac, be; d)S(ad, bd; c) if (c, d) - 1 .
Kloosterman sum s wil l mak e man y appearance s i n this book . The y ar e crucia l to
modern analyti c numbe r theor y throug h link s with spectra l theor y o f automorphi c
forms. Thoug h n o "simpler" formul a exist s (comparabl e t o (1.55) fo r quadratic
Gauss sums) , a sharp boun d fo r individua l Kloosterma n sum s is known, du e to A.
Weil:
(1.60) |5(a,6;c) | ^r(c)(a,6,c) 1 /2Vc.
We will prove thi s in Chapter 1 1 (se e Corollar y 1 1 .1 2 ) an d us e it many times .
When b = 0 (or n — 0 , se e (1 .57)) , th e Kloosterma n su m S(n, 0 ; m) i s called a
Ramanujan sum . I t is also sometimes denote d c^n). Fo r thes e ther e i s an explici t
formula, se e (3.2 ) an d (3.3) .
1.5. Arithmeti c function s o n average.
In th e cours e o f analytic numbe r theor y th e average s o f arithmeti c function s o f
various kind s appea r al l ove r th e place . Th e firs t tas k i s to establish estimate s fo r
finite sum s o f typ e
(1.61) M f(x) = "£ /(») •
In thi s sectio n w e sho w a few elementar y ideas .
The first commo n too l i s summation b y parts , whic h i s the formul a
Mfg{x) = Y. /( n M n ) = Mf(xM*) ~ f Mf(t)g'(t)dt
n^.x
and it s obviou s variants . Thi s formul a ofte n allow s u s to evaluate A4f g(x) i f Mf
is known an d g is smoot h an d doesn' t oscillate .
If / i s defined o n [1 , x], monotonic an d continuous , the n Mf(x) i s well approx -
imated b y a corresponding integral , precisel y
(1.62) M f(x) = j* f(y)dy + 0(\f(x)\ + |/(l)|).
20 1. ARITHMETI C FUNCTION S
For example , w e ge t
fe
(1.63) ^T(logn) = xP fe (logx) + 0((loga;) fc )
where Pk(X) i s a polynomia l o f degre e k
fe
Pk(X) = £ ( - l ) -^,
and
lo
(1.64) 5Z( § ~)k = *'* + 0((logx)*).
If /(y ) i s continuousl y decreasin g t o zero , on e ca n d o bette r tha n (1 .62) . Indeed ,
putting
(1.65) [x] = max{ ^ G Z : < ( x }
(the integra l par t o f x) an d
(1.66) {x} = x - [x]
(the fractiona l par t o f x) th e averag e o f / i s given b y th e Stieltje s integra l
Mf(x) = JX f{y)d[y] = [ f(y)dy - / * f(y)d{y}.
Hence
(1.67) M f(x) = J'f(y)dy + lf + 0(f(x))
where th e constan t j t i s given b y
OO /"O O
/
f(y)d{y} = /(l) + J {y}df(y).
For example , w e ge t
(»•«) E ^ - ^T<'»^)' +1 + <-U*»» + ° ( ! = £)
where 7 & is called th e Stieltje s constant . Fo r k = 0 we get th e Eule r constan t
OO
-2,
(1.69) 7 = l - / {y}y-Ms/"zdy = 0.57721 5 . . .
/
Similarly on e ca n evaluat e average s o f smooth function s i n severa l variable s b y
repeated application s o f th e integra l approximatio n method . I n thi s wa y on e ca n
derive Gaus s formul a fo r th e lattic e point s i n a circl e
(1.70) ] T r(n) = 7TX + 0(yfx).
1. ARITHMETI C FUNCTIO N 21
Actually, Gauss ' argumen t wa s geometrical i n nature, h e derive d (1 .70 ) b y packin g
the circl e wit h a unit squar e (whic h amount s to the sam e thin g a s the integration) .
In th e sam e wa y on e obtain s
rfc n
(1.71) Yl ( ) = ^+ OO^)
where pk = 7r / C / / 2 /r(| + 1 ) is the volum e of the ^-dimensiona l uni t ball .
Most interestin g arithmeti c function s / ar e no t define d o n all rea l number s in
which cas e it makes n o sens e to approximate Mf(x) b y th e integra l (1 .67) . Yet , if
/ i s given by a convolution o f two functions on e of which i s smooth o n real number s
and th e othe r vanishe s rapidl y o n natural numbers , i t is still possibl e t o evaluate
Mf(x) b y mean s o f integrals an d convergen t series . Specifically , i f / = g*h, wher e
h is monotonic an d bounded , w e arrang e M.f(x) a s
Mf(x)=^29(m)Mh(~),
m^x
py
and, afte r replacin g Mhiy) b y / h(t)dt + 0(1), w e ge t
Jo
m^.x m^.x
Estimating th e partia l su m ove r y < m ^ x triviall y w e ge t
(1.72) M f{x) = [ X F(y)dy + 0(M ]gl(x))
Jo
where
(-3) ™ _ £*£>*(£).
If w e assume tha t th e serie s Y^Qi 1 71 )1 71 '1 converge s absolutely , the n thi s formul a is
a sligh t modificatio n o f a theorem o f A. Wintner . A s a n example w e appl y (1 .72 )
for f(m) — (/?(n)n -1 whic h i s given b y (1 .36 ) gettin g
(••74 ) £»fi!UJL + <,<*„.
n^x
For th e divisor functio n r(n) th e sum M T{x) ha s geometric interpretation ,
namely it represents the number o f lattice points (m , n) G NxN unde r th e hyperbol a
ran ^ x. I n this cas e ou r genera l formul a (1 .72 ) yield s
n^.x n^x
showing tha t th e averag e valu e o f r(n) is log n. I n order t o improve th e erro r ter m
in th e diviso r proble m Dirichle t applie d a simple, ye t powerfu l tric k wit h switchin g
divisors. Firs t b y symmetry o f the equatio n m\rri2 = n we figure tha t
MT(x) = 2 £ g ] - [ ^ ] 2 .
m^y/x
22 1. ARITHMETI C FUNCTION S
Here w e hav e fewe r term s o f summatio n tha n i n th e straightforwar d arrangement .
This tric k i s called th e hyperbol a method . No w dropping th e integra l par t symbol s
we mak e a n erro r 0(y/x), an d i t follow s b y (1 .69 ) tha t
T
(1.75) Yl (n) = x lo
§ x + ( 2 ^ ~ l )x + 0{y/x).
EXERCISE 2 . Prov e b y Dirichlet' s metho d tha t
l
] T r k(n) = xP k(logx) + 0{x ~^)
n^.x
where P k i s a polynomia l o f degre e fc — 1 (Pk(x) i s th e residu e o f ((s) kxss~1 a t
8 = 1 ).
Dirichlet's hyperbol a metho d work s nicel y fo r th e lattic e point s i n a bal l o f
dimension fc ^ 4 . Lagrang e prove d tha t ever y natura l numbe r ca n b e represente d
as the su m o f four squares , i.e. , r±{n) > 0, and Jacob i establishe d th e exac t formul a
for th e numbe r o f representation s
r 4 (n) = 8( 2 + ( - l ) » ) £ d.
d\n,d od d
Hence w e deriv e
^r 4 (n) = 8 ^ (2 + ( - I D£ d
n^.x m^.x dm^.x,d od d
^2
« I > + <-ir>(^ + < >)
m^.x
oo
= 2x 2 ] T ( 2 + ( - l ) m ) m " 2 + 0 ( x logs) .
l
= 3((2)x 2 + 0(x\ogx) = -(irx) 2 + 0(x\ogx).
This resul t extend s easil y fo r an y fc ^ 4 (writ e r ^ a s th e additiv e convolutio n o f
r4 an d r&_4 , appl y th e abov e resul t fo r r^ an d execut e th e summatio n ove r th e
remaining fc — 4 squares b y integration )
(1.76) ] T r fc (n) = ^ 0-+ O f r * " 1 logs) .
Notice tha t thi s improve s th e formul a (1 .71 ) whic h wa s obtaine d b y th e metho d o f
packing wit h a uni t square . Th e exponen t | — 1 in (1 .76 ) i s best possibl e becaus e
the individua l term s o f summatio n ca n b e a s larg e a s th e erro r ter m (apar t fro m
logrr), indeed fo r fc = 4 we have r±(n) ^ 1 6 n if n i s odd b y th e Jacob i formula . Th e
only case s o f th e lattic e poin t proble m fo r a bal l whic h ar e no t ye t solve d (i.e. , th e
best possibl e erro r term s ar e no t ye t established ) ar e fo r th e circl e (f c = 2 ) an d th e
sphere (f c = 3) . W e shal l addres s thes e fascinatin g problem s o n othe r occasions .
1. ARITHMETI C FUNCTIO N 23
EXERCISE 3 . Prov e b y th e hyperbol a metho d tha t
Y^ r(n 2 + 1) = - z l o gx + 0(x).
n^x
1.6. Sum s o f multiplicativ e functions .
Throughout / i s a multiplicative function . Sinc e / i s determine d b y it s value s
at prim e power s it is possible t o estimat e Mf(x) i n term s o f th e loca l sum s
oo
(1-77) * P (/) = £/&>•>"" •
First w e give simple uppe r bound s whe n / i s non-negative. Th e followin g estimate s
need n o explanatio n
(1.78) M f(x) ^xJ2 fWn- 1 ^x]J a p(f).
One ca n d o bette r i f /i s non-decreasing o n prim e powers . I n thi s cas e h = \x * / i s
non-negative, indee d h(p u) = f(jp v) — / ( p ^ - 1 ) ^ 0 if v > 1 . Writin g / = 1 • h w e
get
(1.79) M f(x) = Y, h(m) [—1 < x J2 fe M™1_
m^x m^x
P^X P^z%
Here i s a crude bu t usefu l applicatio n o f (1.79) fo r f(m) = Tfc(ra)^ . W e hav e
f(p) = k l an d
where c = c(k, £) is a positive constant . W e hav e als o th e elementar y boun d
n(i+^)«io g x.
p^x
for an y x ^ 2 (se e (2.1 5)) . Henc e (1 .79 ) yield s
1
(1.80) Y^ninY^xilogxf-
n^x
where th e implie d constan t depend s o n k,£. Thi s i s a crude bound , bu t o f the
correct orde r o f magnitude . Notic e tha t i t implies tha t
£
(1.81) r k(n)^n
for an y e > 0 , th e implie d constan t dependin g o n e and k. We shal l b e usin g thes e
bounds fo r th e diviso r functio n r^(n ) ofte n withou t mention .
24 1. ARITHMETI C FUNCTION S
If on e take s th e averag e of f(n) ove r squar e fre e numbers , o r what amount s to
the sam e thin g on e assume s / i s supported o n squarefree numbers , the n th e abov e
arguments yiel d
(1.82) M f{x) ^ n l 1 /(P)-I^
P'
p^x
provided f(p) > 1 for all p. W e ma y requir e f(p) ^ 1 to hold tru e fo r primes onl y
in a certain set . The n th e resul t is
(i.83) *,<*>< * n (»+*?) n ( ' + ^ ) -
p^x P^x
/(P)<1 /(P)> 1
Here i s another elementar y approach . W e onl y assum e tha t / i s non-negative
and completel y sub-multiplicative , i.e. , f(mri) ^ / ( r a ) / ( n ) fo r all ra,n> 1 . More-
over, suppos e tha t f(p) i s bounded o n average,
(1.84) ^2 f(m)A(m) < ex
m^x
for an y x ^ 2 , where c ^ 1 is a constant (se e the Tchebyshe v boun d (2.1 2)) . The n
we ge t
^ / ( n ) l o g n = ^2 f{nm)A(m) ^ e x ^ / ( n ) n - 1 .
n^a: nm^ix n^x
Hence by partial summatio n
3CX y^M f(n) ^ 3C(JX yr / , /(p ) ^
M
(1-85) A*,(* )^£ ^n <^ lor^
g x IAI1'(! + )
^ lo gx ^
n^n
x P ^ lo g x - 1 •^x \ p /
where
-f(ny
To illustrat e thi s resul t w e tak e th e multiplicative functio n f(n) = 6(n ) whic h
is the characteristi c functio n o f numbers representabl e a s a sum of two squares. We
have f(p) = 0 if p = — 1 (mod 4 ) and f(p) = 1 otherwise. I n this cas e (se e (2.30) )
n(i+ M )«(io g ^
p^x
for an y x ^ 2 . Hence on e deduce s b y (1.85) tha t
(1.86) B(x) = \{n^x :n = a2 + b2}\ < C x ( l o g x ) " i
Here ever y numbe r whic h admit s a representatio n a s the sum of two square s i s
counted once . Fo r thi s reaso n th e upper boun d (1 .86 ) i s slightly smalle r tha n the
area of the circl e a 2 -ffr 2 ^ x (se e (1 .70)) . Usin g analyti c method s E . Landa u [Lai ]
established th e asymptotic formul a
(1.87) B(x) = Y^ b (n) ~ CxQogx)-* ,
1. ARITHMETI C FUNCTIO N 25
as x —> co , wher e C is a positiv e constant . Landau' s formul a als o follow s b y a n
elementary metho d wit h C given b y (1 .1 02) .
Very differently , stil l elementarily , th e formul a (1 .87 ) i s proved i n [18] by ap-
plying a half-dimensional sieve . Th e siev e metho d ha s th e advantag e o f producing
results o f great uniformity . Fo r example , on e get s
B{x + y)-B{x)<(l + e)B(y)
for an y e > 0 provided y i s sufficiently larg e i n terms o f e only , wher e x i s an y
positive number . I f x is very larg e compared t o ?/ , then analyti c argument s produc e
rather poo r result s fo r B(x -\-y) — B{x).
Next w e present a n elementar y metho d whic h yield s an asymptoti c formul a fo r
Mf(x) fo r a large clas s o f multiplicative functions . Thi s require s som e regularit y
in th e distributio n o f / a t prime powers . Lettin g A / b e th e vo n Mangold t functio n
associated wit h / (se e (1 .50) ) w e assum e tha t
(1.88) ] T A/(n) = /dog x + O(l )
n^x
where K is a constant. Thi s conditio n mean s f(p) i s about np~ x o n average wherea s
it wa s K previously . Fo r many multiplicativ e function s i n practice (1 .88 ) ca n be
established b y elementary method s (thi s conditio n i s an analogue o f the Merten s
formula (2.1 4) , an d it is weaker tha n th e Prim e Numbe r Theorem) . Wit h som e
applications i n mind w e allo w f(p) t o be slightly negative . Precisel y th e metho d
works if (1.88) hold s wit h K, > — \. I n addition t o this w e nee d a crude estimat e
(1.89) £ | / ( n ) | « ( l o ga; )K
n^x
The metho d begin s wit h evaluatio n o f the logarithmicall y smoothe d su m
(1.90) Y, fW l0 § - = f M f{y)y-ldy.
n^x
Since / • L — f * Ay w e derive b y (1 .88 ) tha t
X ) / ( n ) l o g n = £ / ( d) £ A/(m )
n^.x d^.x m^.x/d
= £/(d)(«log^+0(l)).
d^.x
Estimating th e su m o f error term s b y mean s o f (1.89) w e ge t
lo
(K + 1 ) Y, f( n) § n = *>Mf{x) logz + 0 ( ( l o g x ) N ) .
n^x
Inserting thi s int o (1 .90 ) w e show tha t th e differenc e
1
(1.91) A(x) = M f(x) log x - (K + 1 ) / M f(y)y~ dy
is relativel y small , namel y
(1.92) A ( z ) < ( l o g x ) K
26 . ARITHMETI C FUNCTION S
Next w e divide (1 .91 ) b y x(\ogx)~ K~2 an d integrat e gettin g
[X A(y)y-\logy)- K'2dy = f M f{y)y-\\ogy)-^dy
J2 J2
- ( « + 1) J* y-^logy)-*- 2^ Mf^u-'duyy.
Changing th e orde r o f integratio n i n th e multipl e integra l w e ge t
J' M f(y)y-^hgy)-K1- dy + J 1
X
[(logx)-K1 1
- y-(logu)- K
- ]Mf(u)u- du.
The first integra l cancel s ou t wit h th e secon d par t o f th e las t one , s o w e ar e lef t
with
JX A(y)y- 1 (\ogy)-K-2dy=(\ogx)-^-1 J X
Mfi^u-'du.
Combining thi s wit h (1 .91 ) w e arriv e a t th e followin g identity :
Mf(x) = (logs) * jX -AiyMlogy)-"- 1 + A^Xlogx)" 1 .
Since th e abov e integra l converge s absolutely , w e ca n exten d it s rang e t o infinit y
getting th e identit y
K
(1.93) M f{x) = {c f+rf(x)}(logx)
where cj i s a constan t give n b y th e definit e integra l
/•OO
A
(1.94) Cf =
~J ( ^ ( l o g 2 / ) ~1
"-
and rf(x) i s a functio n give n b y
/•OO
(1.95) r f(x) = / (A(y ) - A(x))d(logy) — \
This tend s t o zer o a s x —> oo , indee d b y (1 .92 )
w K 1
(1.96) r / ( a : ) < (logx) - " .
However, th e constan t (1 .94 ) doe s no t loo k natural .
Now havin g establishe d th e asymptoti c formul a (1 .93 ) wit h th e erro r ter m
(1.96) w e shall us e these result s t o deriv e anothe r expressio n fo r cj b y a n appea l t o
the zet a functio n fo r / ,
oo
1
The series converges absolutely fo r s > 0 by virtue of (1 .89) . W e compute b y partia l
summation tha t
oo />o o
/
y-sdMf(y) = -J M f(y)dy-
S
/•OO pOO
£
= - / Mfie^de-** = - {c f + 0(t- )}tKde~st
Jo Jo
= {c f + 0(s £)}s-KT(K + l)
1. ARITHMETI C FUNCTIO N 27
as s — » 0+. Comparin g thi s wit h th e ^-t h powe r o f th e Rieman n zet a functio n w e
get
C(8 + l)-"D f{8)~Cfr{K+l).
On th e othe r hand , w e have th e Eule r produc t
as+iyKDf(S) = rj(i - p-'-'r ( £ fW)p- va)
p \i/= 0/
which converge s absolutel y fo r s > 0 and i t ha s limi t a s s — > 0 by virtu e o f (1 .88) .
Hence th e constan t c / i s equal t o
(1^) C /=^nH)Wb) + /(P 2 )+---).
We hav e establishe d th e followin g formul a (i f tt ^ 0 , thi s i s du e t o E . Wirsin g
[Wil]).
THEOREM 1 .1 . Suppose f is a multiplicative function which satisfies (1 .88)
and (1 .89) with n > — \. Then
1
(1.98) ] T f(n) = C/(log*r + Odlogx^- )
n^.x
where Cf is a constant given by (1 .97).
We illustrate th e applicabilit y o f this formula b y solving a simple sieve problem .
Let P b e a se t o f primes. Ou r questio n i s how man y number s n ^ x hav e n o prim e
divisors i n VI Le t / b e th e completel y multiplicativ e functio n wit h
/ ( p ) = f i if peP,
\ 0 otherwise .
Then
9(n) = U(l-f(p))
p\n
is th e characteristi c functio n o f number s i n question . Therefor e w e ar e askin g t o
evaluate
S(V,x) = ^g(n).
Opening th e convolutio n g = fif * 1 we obtai n
S(V,x) = Y,Kd)f(d)[^}=xY,Kd)^-+0(M f(x)).
d^x d^x
To procee d furthe r w e ar e goin g t o assum e tha t th e proportio n o f prime s i n V i s
less tha n th e proportio n o f thos e no t i n V. Mor e precisel y w e nee d th e conditio n
(1.99) V J f(p)
-^- log p - Adog x + O(l )
*-? P
p^x
with a constan t K < \. The n w e derive Mf(x) < C x(\ogx)K~1 b y (1 .85 ) an d
1
J2n(d)^-=c(V)(\ogxr" + 0((\ogxy- )
d^.x
28 1. ARITHMETI C FUNCTION S
by (1 .98) , wher e c(V) i s a constant dependin g o n the set V,
Gathering th e above result s w e conclude
COROLLARY 1 .2 . Let V be a set of primes of density K < \ (that is (1 .99)
holds). Then the number of integers 1 ^ n < x having no prime divisors in V
satisfies
(1.101) S(V,x) = c(V)x(logx)-K{l + 0((logx) 2 "" 1 )}
where c(V) is the constant given by (1 .1 00).
EXERCISE 4 . Appl y (1 .98 ) t o derive Landau's formul a (1 .87 ) wit h the constant
v
p=-l(mo d4 ) ^
1.7. Distributio n o f additiv e functions .
Throughout / i s an additive function , s o / i s determined b y its values at prim e
powers, precisel y
f(n) = ]T /(p").
pa\\n
Hence th e average o f / i s given by
X X
Mf(x) = £ f(P a) ( - +1
pa-^.x i^ _
Dropping th e integral par t symbol s w e arrive at
(1.103) M f(x) = xE(x) + 0 ( x b ( x ) ) ,
where
1
a a
(1.104) E(x)=Y,f(p )p- (l-p- )
a
p ^.x
and
a 2
(1.105) D\x) = £ \f( P )\ p~a.
a
p ^x
Actually th e error ter m i n (1 .1 03 ) i s ^ |/(p Q )| whic h w e estimated b y x*D(x) b y
applying Cauchy' s inequalit y an d tha t p a ^ x. Th e resul t reveal s tha t E(x) i s
approximately equa l t o the mean valu e o f /.
Similarly w e evaluate th e average o f f2. W e have
pa q$ n^x
1. ARITHMETI C FUNCTIO N 29
where p, q run ove r primes . I f p ^ q, the inne r su m i s equal t o
"X 1 X X X
a
.p qPl
-
.pa+1 3
qf .
-
.p^q^1.
+ pa+lq(3+l pcxqt • H ) ( ' - ; ) + <*>•
and fo r p = q we mus t hav e a = / ? gettin g
,CK+1
FHW
Hence w e derive tha t
M2;(x) = x £ J2 f(P a)f(lP)P~aQ~0^ ~ P_1)(l " T 1 )
+ xp£^ r r/
a
2
( P V ( I - P _1 ) + O (E
a
El/(P a)/(^)l)-
P ?<? ^ ^ ^
Here w e dro p th e conditio n p ^ q an d estimat e th e additiona l term s wit h p — q
using \f{p a)f{pP)\ ^ \f(p«)\ 2 + \f{jf)\ 2. W e arriv e a t
(1.106) MP(x) = xE 2(x) + 0(xD 2
(x)).
This resul t say s tha t E (x) i s approximatel y equa l t o th e mea n valu e o f f 2.
2
Both approximation s (1 .1 03 ) an d (1 .1 06 ) sugges t tha t th e individua l value s
f(n) fo r n ^ x shoul d b e reasonabl y wel l approximate d b y E(x). Indee d thi s i s
true i n th e followin g sense :
THEOREM 1 .3 . For any additive function f : N—» C we have
(1.107) Yl l/( n) - E ^\2 ^ CXD2
^
n^x
where c is an absolute constant.
P R O O F . W e ca n assum e tha t / i s rea l sinc e th e rea l an d th e imaginar y part s
of / ar e additiv e an d the y ca n b e treate d separately . W e can als o assum e tha t x i s
a positiv e integer . Firs t w e evaluate th e varianc e
V{x) = E ( / ( n ) " x- xMf[x))2 = M P(x) - x^Mfe).
n^x
By (1 .1 03 ) an d (1 .1 06 ) w e ge t
2
V{x) < x*\E{x)\D{x) + xD 2(x) « x D (s).
l
Then replacin g x~ Mf(x) b y E(x) w e mak e a n admissibl e error , an d (1 .1 07 ) i s
established.
•
Theorem 1 . 3 i s du e t o P . Tura n [Till ] an d J . Kubiliu s [Kubl] . Th e expecte d
value E(x), i f on e prefers , ca n b e reduce d t o
(1.108) A(x) = J2f^P~ 1
p^x
30 1. ARITHMETI C FUNCTION S
at th e expens e o f th e constan t c i n (1 .1 07) . Ou r argument s abov e yiel d c = 4 i f
x i s sufficientl y large , however , J . Kubiliu s [Kub2 ] showe d (afte r havin g receive d
suggestions from H.L . Montgomery) tha t (1 .1 07 ) holds with c = c(x) ~ | a s x— > oo.
He and other s als o showe d tha t th e | canno t b e replace d b y a smaller constan t (cf .
A. Hildebran d [Hil]) .
As a n exampl e w e choose th e additiv e functio n cj(n) whic h count s th e numbe r
of distinc t prim e divisor s o f n. Therefor e uu(p a) = 1 , E{x) = lo g log x + O(l ) an d
D2(x) — log log x - f 0(1 ) b y (2.1 5) , s o th e Turan-Kubiliu s theore m give s
(1.109) Y^(u;(ri ) — log log x)2 < C x log log x.
Hence i t follow s tha t u(n) ~ lo g log n fo r almos t al l n ( a theore m o f Hard y an d
Ramanujan). Precisel y w e hav e
COROLLARY 1 .4 . For any x ^ 3 and z ^ l ,
2
(1.110) \{n ^ x : \u;(n) - l og lo g a; | > z(loglogx)^} | < z~ x
where the implied constant is absolute.
Furthermore on e ca n as k i f a give n additiv e functio n / ca n b e renormalize d b y
A(x) an d som e B(x) s o that th e frequencie s
have a limi t fo r an y z G R a s x tend s t o oo. Thi s questio n i s addresse d i n th e
probabilistic theor y o f numbers . W e shal l onl y giv e a glimps e o f centra l results .
The ver y firs t on e wa s establishe d b y P . Erdo s an d M . Ka c [EK] .
THEOREM 1 .5 . Let f(n) be an additive function with — 1 ^ f(p a) — f(p) ^ 1
such that
1
(1.112) B{x) = Y,f\p)P~ ^™
as x —> > oo. Then v(x\z) ~ G(z) for any z G R , where G(z) is the Gaussian
distribution function
(1.118) G ^'WJ-, -<'"dt.
In particular , i t follow s fro m th e Erdos-Ka c theore m tha t fo r an y rea l z
\{n ^ x : uj(n) — log log x < z(loglog#)2} | ~ G(z)x
as x tend s t o oo . Man y othe r arithmeti c function s (no t necessaril y additive ) follo w
the Gaussia n limitin g distributio n law . Fo r example , H . Halbersta m [Hal ] prove d
that th e numbe r o f prim e divisor s o f shifte d prime s satisf y
(1.114) \{p^ x :u(p+l) - l o g lo g x < 2:(loglogx)i} | ~ G(Z)TT(X)
for an y fixed z G R a s x —> oc .
http://dx.doi.org/10.1090/coll/053/03
CHAPTER 2
ELEMENTARY THEOR Y
OF PRIM E NUMBER S
2.1. Th e Prim e Numbe r Theorem .
It wa s observe d nearl y 20 0 year s ag o b y Legendr e an d Gaus s (independently )
that th e densit y o f primes p ^ x i s (logo:) -1 , precisel y the y postulate d th e followin g
P R I M E NUMBE R T H E O R E M . Let ir(x) denote the number of primes p < x. As
x —> oc ; we have
x
(2.1) ir(x)
log x
Gauss observe d tha t a n eve n bette r approximatio n t o TT(X) i s give n b y th e
singular integra l
(2.2) Li(x) = f^ — r(l--) f ^ + l o g l o g x+ 7 .
Jo log y J ± V yJ logy
For x > 1 this satisfie s th e asymptoti c expansio n
f m
(2.3) Li(a O= j ^ - | ] T &(\ogx)- + 0((logxr )\-
Some people use £i(x) — Li(x ) —Li(2) in place of Li(x) fo r x ^ 2 . Later , Tchebyshe v
realized i t i s simpler t o coun t prime s p wit h th e weigh t logp , s o he investigate d th e
sum
p^x
rather tha n n(x). I t i s stil l mor e convenien t t o evaluat e th e averag e o f th e vo n
Mangoldt functio n
(2.4) ip{x) = ] T A(n) .
n^x
Note tha t (2.1 ) i s equivalen t t o eac h o f th e asymptoti c formula s
(2.5) 9{x) ~ x , ip(x) ~ x
by partial summatio n an d trivia l estimatio n o f the contributio n o f n = p a ^ x wit h
a^ 2.
Although A is give n b y th e convolutio n (1 .40 ) th e formul a (1 .72 ) fall s shor t o f
yielding (2.5 ) becaus e th e erro r ter m exceed s th e mai n term . Applyin g Dirichlet' s
trick o f switchin g divisor s (th e hyperbol a method ) on e ca n reduc e th e numbe r o f
31
32 2. ELEMENTAR Y THEOR Y O F PRIM E NUMBER S
error terms , bu t i t doe s not complet e th e work . Thi s onl y lead s t o anothe r proble m
of estimating sum s o f the Mobiu s function . Indeed , on e ca n sho w b y th e hyperbol a
method tha t th e Prim e Numbe r Theore m i s equivalen t t o th e estimat e
m =
(2.6) M(x) = 2\. M ) °{ x)-> a s x ^ oo .
To deriv e (2.5 ) fro m (2.6 ) w e conside r
A(x) = ^ ( l o g r c - r(n) + 27) .
n^x
Subtracting (1 .75 ) fro m (1 .63 ) fo r k = 1 we ge t
(2.7) A(x) < y/x.
Now applyin g (1 .42 ) w e ge t
il>(x) - x + 2 7 = Y, Y, v(d)(logk- r{k) + 2 7)
dk^.x
(2.8) = ^ ( l o g f c _ r ( f c ) + 2 7 ) M ( | ) + J2 l^(d)(A(~)-A(K))
1
k^K d^xK-
for an y 1 ^ K ^ x. Th e last su m is 0(xK~ 1 ^2) b y virtue of (2.7 ) while for an y fixed
K th e precedin g su m i s o(x) b y th e hypothesi s (2.6) . Sinc e K ca n b e arbitraril y
large, thi s show s tha t (2.6 ) implie s (2.5) . T o establis h th e convers e w e us e th e
identity
V^ /i(n ) log — = M(x) lo g x - f Y ^ Y ^ fjb(m)A(n)
n^x mn^x
(1 \ ' C 1
which follow s fro m f - J = — — -. Th e lef t sid e i s triviall y 0(x) (se e (1 .64 ) fo r
k = 1 ) , so
M(x)logx = - ^ /i(ra)V>(— J +0(x).
Applying (2.1 8 ) w e arriv e a t
M(x)logx = Y, M H ( ^ -V>(^) ) +0(X).
m^.x
Now usin g th e hypothesi s (2.5 ) thi s equatio n implie s (2.6) .
2.2. Tchebyshe v method .
The firs t significan t attemp t fo r provin g th e Prim e Numbe r Theore m wa s mad e
by Tchebyshe v i n 1 848-1 852 . Retrospectivel y on e ma y sa y tha t hi s approac h i s
based essentiall y o n th e produc t formul a
N = l[p
P
meaning tha t th e archimedea n valuatio n equal s th e produc t o f p-adi c valuations .
This obviou s formula i s particularly fruitfu l fo r th e factoria l number s J V = 1 • 2 • • • n.
2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S 33
Like Tchebyshev w e present th e arguments i n a more friendly analyti c format. Firs t
we hav e
(2.9) L(x) = 2_] log^ = xlog x — x + O(logx) .
n^.x
On th e othe r hand , usin g (1 .41 ) w e ge t
(2.io) ^ ) = EA wS = E ^ ©-
d^x m^.x
Hence evaluatin g L(x) — 2 L ( | ) i n tw o way s on e get s
(2.11) ^ ) - v { f ) + ^ ( | ) - ^ ( | ) + - - -= zlog 2 + 0(logz) .
Using th e monotonicit y o f ip(y) on e derive s fro m (2.1 1 ) th e firs t Tchebyshe v esti -
mates
(2.12) xlog 2 + <3(logx ) < \l>(x) < xlog4 + <3(logx) .
Tchebyshev improve d thes e estimate s severa l time s usin g mor e involve d linea r
combinations o f L ( ^ ) . Fo r exampl e le t u s conside r ij)f(x) = L(x ) — L ( |) — L ( |) —
L(f) + L ( § ) . B y (2.9 )
ipf(x) = ax + O (log x)
where a = \ lo g 2 + § log 3 + \ lo g 5 - ^ lo g 30 = 0.921 2.. . O n th e othe r hand ,
</>/(*) = £A(n)/Q
n^x
where
x~ X1 "X X
ran ra n ra n
L2 J L3 - L5 30 J
+
Since 1 — ^ — | — | + ^ = 0 , th e functio n f(x) i s periodi c o f perio d 30 . On e ca n
check tha t f(x) take s onl y tw o value s 0, 1 an d f(x) = 1 if 1 ^ x < 6. Therefor e
Hence on e derive s
(2.13) ax + O(logx ) < ip(x) < f a x + O(logx) .
Since th e differenc e betwee n th e uppe r an d th e lowe r bound s i n (2.1 3 ) i s quit e
small, on e ca n deduc e fro m (2.1 3 ) tha t an y dyadi c interva l [x , 2x] contain s prime s
provided x i s sufficientl y larg e (actuall y Tchebyshe v prove d thi s fo r al l x ^ 1 ,
which wa s postulated b y Bertrand) . Tchebyshe v faile d t o prov e th e Prim e Numbe r
Theorem, however , h e succeede d t o sho w tha t i f li m ip(x)/x exists , the n thi s limi t
x—>oo
is equal to one (thi s follows instantl y fro m (2.1 1 ) because 1 — § +1_ \~\ = log2) .
Interesting result s wer e derive d fro m Tchebyshe v work s b y Mertens . Firs t
approximating [x/d] i n (2.1 0 ) b y x/d on e get s
(2.14) £ ^ M = i o g x+ 0 (l).
n^.x
34 2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S
Next applyin g partia l summatio n on e derive s fro m (2.1 4 ) tha t
(2.15) V - = lo g log x + p + 0 ( - 4
^—' p \ log x J
p^x
where ft i s a constant . Finall y usin g log( l — p _ 1 ) = — p~l + 0(p~ 2), w e hav e th e
Mertens formul a
< 216 » n(i-; /
) = iS{i + o(i)}-
p^x x v
'
Mertens showe d tha t th e exponen t 7 i n (2.1 6 ) coincide s wit h th e Eule r constan t
(1.69).
Similarly, usin g S = 1 */x i n plac e o f L = 1 * A, on e derive s a handfu l o f result s
for th e Mobiu s function . Firs t fo r an y x ^ 1 one get s
in plac e o f (2.1 0) . Henc e approximatin g [x/m] b y x/m on e deduce s tha t
/i(m)|
(Z-iS) E 771 o.
2.3. Prime s i n arithmeti c progressions .
Every natura l numbe r n = — l(mod 3 ) ha s a prim e diviso r p = — 1 (mo d 3) .
Hence there ar e infinitel y man y prime s p = — l(mod 3) . I n th e sam e wa y one show s
that fo r q — 4 , 5 there ar e infinitel y man y prime s p = — l(mod q).
Let q be an y intege r > 1 . Conside r th e cyclotomi c polynomia l o f degre e <p(q),
*,(*)= I I (x-eQ)eZ[x]
a ( m o d q)
(a,q) = l
One ca n sho w (quit e easil y b y usin g elementar y propertie s o f th e multiplicativ e
group (Z/gZ)* ) tha t ever y prim e diviso r o f $ q(n) coprim e wit h q satisfie s p = 1
(mod q). I f q is prime thi s propert y i s very simple . Indee d i n thi s cas e w e hav e
xq — 1 q l
<&q(x) - _ = 1 + x + .. . + x ~.
Hence i f p divide s $ q(x), the n x q = 1 (mo d p) , an d comparin g wit h th e littl e
Fermat theore m x p~1 = 1 (mo d p) w e deduc e tha t q divide s p — 1 . No w arguin g
like Eucli d w e ca n deduc e tha t ther e ar e infinitel y man y prime s p = 1 (mo d q).
Naturally ever y clas s a (mod q) wit h (a,q) = 1 shoul d hav e infinitel y man y
primes. I t wa s fo r th e proo f o f thi s theore m tha t L . Dirichle t (1 837 ) invente d
multiplicative character s x(mo d q) (se e Chapte r 3) . T o ever y suc h characte r h e
associated th e Dirichle t serie s
(2.19) L(s, X) = J2x(n)r
2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S 35
As i n th e cas e o f th e zet a functio n th e Dirichle t L-functio n ha s th e Eule r produc t
1a
(2.20) L(s,x) = Y[(l-x(p)p- )-
V
which converge s absolutel y fo r s > 1 . Henc e L(s,x ) doe s no t vanis h i f s > 1 .
Taking logarithm s w e get
(2.21) logL(s, X) = 5 > ( p ) p - s + 0(l) .
P
Hence b y th e orthogonalit y o f character s (se e Sectio n 3.2 )
P
(2.22) Yl ~S = Jq) ^ x(a)logL(s, X) + 0(l),
m
p=a(mod q) x( ° d q)
where th e erro r ter m 0(1 ) i s bounde d i n term s o f s uniforml y fo r s > 1 . Fo r th e
principal characte r w e ca n relat e L(s,xo ) t o th e zet a functio n
L(3,xo)=as)H(l-p-s).
p\q
Since
c(s) = ^ > -s = y x -dx + o(i) = j -i + o(i),
we infer tha t th e principa l characte r contribute s t o (2.22 )
(2.23) _ ^ logL(a>Xo) = _^ l o g _l_ + 0 ( 1 ).
Next w e conside r x 7 ^ Xo- Sinc e x i s periodi c o f perio d q an d th e sum s ove r an y
complete se t o f residu e classe s vanish , i t follow s tha t th e characte r su m ove r an y
segment i s bounded, precisel y
| E X(n) <q.
Therefore th e serie s (2.1 9 ) converge s fo r s > 0 an d |L(s,x) | < q, b y partia l sum -
mation. I f w e kno w tha t
(2.24) £ ( , X ) ^ 0 ,
then logL(s , x) ha s a finite limi t a s s — > 1 + . Henc e w e can deduc e tha t fo r s > 1 ,
(2.25) £ P-'-^Th+OW-
p=a(mod q)
In this way Dirichlet prove d there are infinitely man y primes in any primitive residu e
class. Actuall y (2.25 ) show s tha t prim e number s ar e equidistribute d (i n a certai n
sense) amon g th e primitiv e residu e classe s a (mod q).
It remain s t o sho w th e non-vanishin g o f L(l , x) fo r ever y x 7 ^ Xo5 which i s th e
heart o f th e matter . W e d o no t follo w exactl y th e origina l line s o f Dirichle t bu t
36 2. ELEMENTAR Y THEOR Y O F PRIM E NUMBER S
rather mi x his ideas wit h mor e familia r transformation s o f Tchebyshev. W e begin
with
] T Xi^n' 1 lo g n = Y, X(n)n- 1 ^ A(d )
n^.x n^.x d\n
= J2x(d)A(d)d-1 ^ x H " " 1
d<x ra<^§
Y/x(d)A(d)d-1(L(l,x)+0(d/x))
d^.x
L(l,X)^x(d)A(d)d-1+0(l)
d^x
where th e error ter m 0(1 ) is bounded i n terms o f x (but not of q). Th e left sid e is
also bounded (b y Abel's criterion ) becaus e th e average o f x(n) ove r an y segment i s
bounded. Therefor e w e proved tha t
(2-26) I > ( ^ « 1
d^.x
provided L ( l , x ) ^ 0. Similarl y w e obtain
A(n)_^xW_E/i((i)log,
log,+ X : x ( n ) ^ = E ^ E ^ ) l o g J
n ^ x n^.x d\n d\n
d<x m^^
L(l, X )^Mrf)^log^+0(l).
Hence, i f L ( l , x) = 0, we deduce tha t
(2.27) ^ x ( n ) ^= _ i o g x + o(l).
n-Cx
In view of the Mertens formul a (2.1 4 ) th e above conditiona l formul a ca n be written
as
£ ( i + x(p))^f«i.
p^x
This shows that \(p) = —1 for almost al l primes. I n other words, if L(l, x) vanishes,
then th e characte r x ( n ) on squarefree number s mimic s th e Mobiu s functio n /x(n) ,
which i s no t likel y t o b e tru e (bot h function s ar e multiplicativ e bu t fi(n) i s not
periodic). I n any case, w e have show n rigorousl y tha t
(2.28) J2x(n)^=6 A(«)
x\ogx + 0(l)
n^.x
2. ELEMENTAR Y THEOR Y O F PRIM E NUMBER S 37
an
where 6 X =— 1,0 accordin g t o L ( l , x) = 0 or L ( l , x) / 0 if x ¥" Xo > d Sx = 1 ii
X = ^0 . Summin g (2.28 ) ove r al l characters w e ge t
V
n^x ^ ^ y x(mo d g )
n=l(mod q)
Hence ^2 y5x ^ 0 . Thi s show s tha t ther e i s at most on e character X ¥" Xo with
L ( l , x ) = 0 . I f suc h a n exceptional characte r exists , i t mus t b e real becaus e
L ( l , x ) = 0 implies L(l,x ) = 0 (th e complex conjugatio n i s a continuou s auto -
morphism o f C).
The non-vanishin g o f L ( l , x) fo r a real characte r \ ¥" Xo requires a differen t
argument. T o this en d we consider
where
(2.29) T(n )X) = $>(d).
d\n
Note tha t
T(^X) = n ( x +xw+• • •+x(pn) > o
p«||n
2
for al l n, and r(ra ,x) ^ 1 - Therefor e
On th e other hand , openin g th e convolution (2.29 ) w e evaluate T(x) in the same
fashion a s Dirichlet deal t wit h th e divisor proble m (1 .75 ) (th e hyperbola method) .
We obtai n
mn^x
= ] T xO)m~ ^ ] T n"^ + ^ n" ^ ^ x H ^ ^
J+ +0
- E ^>m-*{j(s) ' ((7) J )} +0 (i:»-**-')
raSC-y/a; n<y/x
= ±L(1 , X)^+0(1 ).
Comparing bot h estimate s w e obtain log x < L(l, x )^ + 0(1), whenc e L(l , x) ^ 0
by lettin g x tend t o oo.
T H E O R E M 2. 1 (DIRICHLET) . For every non-principal character \ (mo d q) we
have L ( l , x ) ^ 0.
By Theore m 2. 1 we complete th e proof o f (2.25). Moreover , w e proved tha t
(2.26) hold s fo r any x ^ Xo - Summin g ove r character s w e deduce
38 2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S
THEOREM 2.2 . For any a with (a,q) = 1 we have
A(n) log x
KHJ
n<x ^
n=a(mod q)
where the error term 0(1 ) depends only on q.
REMARK. Th e origina l proo f o f Dirichlet tha t L(l , x) ^ 0 for a real characte r
X is a simpl e consequenc e o f his Class Numbe r Formula : le t K = Q ( A / D ) b e a
quadratic fiel d wit h D = 0, 1 (mo d 4 ) so D is the discriminant o f K. Ther e i s an
associated rea l primitiv e characte r x suc h tha t
(K(s)=Y,(Na)-s = ((s)L(s, X)
a
and ever y primitiv e rea l characte r modul o q ^ 1 arise s i n thi s wa y from K —
l)q). Dirichle t showe d tha t
/ 2nh
"^i f x ( - i) = - i ,
(2-31) L(l,x) = {
^ p ifx(-i ) = i,
where h is the class numbe r o f K, w is the number o f units i n the ring o f integers
of K i f x(—1 ) = —1 , an d e i s the fundamental uni t o f K i f x(~~l ) — 1- Henc e
L(l,x) > -75 - (Se e (22.59) fo r odd characters).
In Chapter s 5 , 22, 23, we develop lowe r bound s fo r 1,(1, x) using technique s of
increasing sophistication. Se e also Section 1 5. 9 for some results about clas s number s
of rea l quadrati c fields .
2.4. Reflection s o n elementary proof s o f the prim e numbe r theorem .
The Prim e Numbe r Theore m wa s prove d i n 1896 independently b y Hadamard
and d e la Valle e Poussi n b y analyti c methods . Th e first elementar y proof s wer e
found abou t fift y year s late r b y Erdos an d Selberg. A t the heart o f their proof s lie s
the formul a (du e t o Selberg)
2
(2.32) V^(logp) + ]T £(logp)(log<7 ) = 2xlog: r + O(x).
p^x PQ^x
To prove thi s w e begin by
2
^ log n = x(log2 x - 2 log x + 2) + 0(log 2 x)
rvCx
= rc(log x) (J2 fc-1 ) ~ & + ( 7 + 2) log k] + 0(log 2 x).
rC<^.0C rZ^^OC
2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S 3 9
where 7 is the Eule r constant . Henc e opening the convolutio n A 2 = /x*L 2 w e derive
J2 A 2 (n) = J2 J2 ^ m ) l o ^ 2 n
l^x m\l
J2 J2 ^m ) [^ + (7 + 2) log ^] + O(z)
l^x m\l
J2 - {S(n) logx + A(n) ) + ^ ( 7<J(n) + ( 7 + 2)A(n) ) + 0(x)
n^x n ^
= xlog x H-xlog x + 0{x) = 2xlog x + 0(x)
by Mertens ' formul a (2.1 4 ) an d Tchebyshev' s uppe r boun d (2.1 2 ) (recal l (1 .1 8)) .
EXERCISE. Deriv e from (2.32 ) by induction on k ^ 2 (use the recurrence (1 .47) )
that
(2.33) Y J^k{n) = kx{\ogx)k-l{l + o(^-)}.
n^x
Using Selberg' s formul a (2.32 ) Postniko v an d Romano v [PR ] derive d th e ele -
gant inequalit y
(2.34) | M ( x ) | l o g x < ^|M(-)|+0(xloglog3x) .
n^.x
Compare thi s wit h th e secon d par t o f (2.1 7) . No w th e Prim e Numbe r Theore m
can b e derive d i n th e for m ip(x) ~ x fro m (2.32) , o r i n th e for m M(x) = o(x) fro m
(2.34) b y elementary , howeve r lengthy , tauberia n typ e arguments .
E. Bombier i [Bol ] an d E . Wirsin g [Wi2 ] refine d th e Erdos-Selber g metho d t o
show
A
(2.35) i/;(x) = x + 0(x(\ogx)- )
for an y A > 0 , th e implie d constan t dependin g o n A. Alon g th e sam e line s H .
Diamond an d J . Steini g [DS ] succeede d i n showin g tha t
2
(2.36) \rp(x)-x\ <rrexp|-(logx)^(loglogx)" j
for al l x ^ e e . Th e argument s o f Erdos-Selberg ar e truly elementar y (n o comple x
numbers ar e harme d i n th e cours e o f th e proof ) nevertheles s w e express ou r mixe d
feelings, becaus e th e earlie r analyti c method s base d o n contou r integratio n insid e
the zero-fre e regio n o f th e Rieman n zet a functio n ar e considerabl y simple r an d th e
results ar e superio r (se e Section s 5. 4 an d 5.6) . I n 1 89 9 d e l a Valle e Poussi n prove d
quite easil y tha t
(2.37) I/J(X) = x + 0(x exp(-c^/\ogx))
where c > 0 and th e implie d constan t ar e absolute . Th e subsequen t improvement s
of th e PN T cam e fro m sharpe r uppe r bound s fo r £(s ) nea r th e lin e R e (s) — 1 ,
40 2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S
particularly b y Vinogradov' s estimat e fo r exponentia l sum s (se e Chapte r 8) . How -
ever, w e woul d lik e t o sa y tha t Vinogradov' s metho d o f exponentia l sum s shoul d
not b e regarde d a s a ne w techniqu e fo r non-vanishin g o f the zet a function . I n fact ,
all the know n method s o f converting th e uppe r bound s int o th e lowe r bounds , fro m
which t o dra w a zero-fre e region , remai n i n principle th e sam e on e a s originate d b y
Hadamard an d d e l a Vallee Poussin . Yet , som e variations ar e mor e sensitiv e t o th e
input tha n th e others , givin g close r connection s betwee n estimate s an d zeros . Fo r
example th e Borel-Caratheodor y theore m i n comple x functio n theor y doe s th e jo b
very well . Th e bes t erro r ter m i n (2.37 ) know n toda y i s due t o Korobo v [Kor ] an d
Vinogradov [VI ] (se e Corollar y 8.31 ) .
The asymptoti c formul a I/J(X) ~ x i s equivalen t t o th e non-vanishin g o f ((s)
on th e lin e R e (s) = 1 , an d th e latte r ca n b e establishe d relativel y easil y withou t
appealing t o analyti c propertie s o f £(s ) .
Here is how one can arrang e th e analyti c idea s for a proof o f the Prim e Numbe r
Theorem i n the for m (2.35 ) whic h a broad-minded reade r ma y stil l find elementary .
We avoi d comple x variabl e analysis , althoug h fo r clarit y w e d o no t hesitat e t o
employ £(s ) a t comple x number s s = a + it, bu t onl y wit h a > 1 wher e th e
Dirichlet serie s and th e Euler produc t converg e absolutely. W e begin by considerin g
the functio n G(s) = (—l) k(l/((s))(k\ Thi s i s give n b y th e serie s
(2.38) G (s)= £^M (iogm)*.
m
Prom her e w e go to th e finite su m
(2.39) F(x) = ] T /i(m)(logm) fc log —
m^.x
where th e facto r log(rr/m ) play s a tas k o f smoothin g a t th e cut-of f point . Thi s
particular cuttin g i s produced b y mean s o f th e integra l formul a
l o g + y = ^~ / y s
s~2ds
where a > 1 , givin g
s
(2.40) F(x) = ^- f x G{s)s~2ds.
2m J
For th e purpos e o f estimatin g G(s) w e consider (*( s) — (s ~ 1 )C( 5)- Not e tha t
(fe) (fc) ik l)
1
(2.41) ( /C(*)) = (s - l)(l/C*(*)) + k(l/C(s)) -.
Next w e appea l t o th e formul a fro m differentia l calculu s
JJ
a 1 +2a2+---=k L Z J J
where a i , a 2 , . . . ru n ove r non-negativ e integers . Thi s formul a fo r /($ ) = C*( 5)
reduces the problem t o uppe r bound s fo r th e derivative s o f C*( s) a n d a lower boun d
2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S 41
for |C*( 5)|- Firs t w e derive b y partia l summatio n tha t fo r R e (s) > 1 ,
/'OO I I
(-l)eCW(s) = ]Tn- s(logn/ + y rr- s
(logx)^ + o(^(logX)' +1
)
= y°°x- s(loga;)^ + o ( (l + ^)(logX/ +1
).
Taking X = 2|s | w e ge t
(2.43) (-l) e
CW(s) = ^ • A+ 0((log2|s|)^ 1 ).
(s-iy
Hence
W
(2.44) (C*(«)) = ( * - 1 )C (0(«) + iC ( ' _1 ) («) « M(log2| a |)'
+1
.
Next w e infe r fro m th e Eule r produc t b y positivit y tha t
i< ^ ( i - K i + ^ + p - ^ V7 )
= JJ ( l + (3 + 2plt + 2p~u + p2it + p~ 2it
)p-^
p
xC 3 (a)|C(a + ii)| 4|C(cr + 2zO| 2
where th e implie d constan t i s absolut e ( a simila r formul a i s als o presen t i n th e
arguments o f Hadamar d an d d e l a Valle e Poussin , se e Theore m 5.26) . Henc e an d
by (2.43 ) w e get th e lowe r boun d
(2.45) | C » | » ( < r - l ) ? H ( l o g 2 | S |)-l
From (2.44 ) an d (2.45 ) w e derive usin g (2.42 ) tha t
(1/C*(s))(fe) < ( a - l)-t( fe 1
+ )| s |-1 (log2| S |) K
where K and th e implie d constan t depen d onl y o n k (i n fact , K = (5A ; + l)/ 2 come s
out b y th e abov e arguments , bu t i t doe s no t matte r fo r application) . Henc e b y
(2.41)
G(s)<(cr-l)-t(fe+1)(log2|s|)K.
Inserting thi s boun d t o (2.40 ) an d integratin g triviall y w e ge t
F(x)<^xa(a-l)-^k+1\
Finally takin g a = 1 + ( l o g x ) - 1 w e conclud e th e followin g estimat e fo r th e su m
(2.39)
1fc+
(2.46) F(x)<rr(logx)4( ).
Note tha t thi s i s non-trivia l i f k > 3 .
Now it i s a completely elementar y exercis e to derive the PN T fro m (2.46) . Firs t
we d o th e su m
fc
(2.47) ff(x) = £>(m)(logm) .
42 2. E L E M E N T A R Y T H E O R Y O F P R I M E N U M B E R S
which i s obtained b y differencin g F(x) a s follow s
fc
F(x + y)- F{x) = # ( x ) lo g -—^- + ] T /i(m)(logm) lo g
m
xKm^x+y
= [ff( a;) + 0(y(logx) fc )llog^^.
LJ x X
Hence
H(x)<^y{\ogx)k + y- 1 x2(\ogx)i{k+l) =2x(\ogx) k A
~
by choosing y = x(lo g x)~A wit h ^4 = (k— 3)/8. Finall y we get by partial summatio n
A
(2.48) ] ^ /x(m ) < x(logx)~
where th e implie d constan t depend s onl y o n k. Sinc e k i s arbitrar y s o i s A. Thi s
proves th e PN T fo r th e Mobiu s function . Henc e th e PN T fo r th e vo n Mangold t
function follow s b y th e formul a (2.8) .
http://dx.doi.org/10.1090/coll/053/04
CHAPTER 3
CHARACTERS
3.1. Introduction .
The character s o n residu e classes , bot h additiv e an d multiplicative , pla y in -
strumental part s i n analyti c numbe r theory . Othe r character s suc h a s th e Heck e
characters o f a number field, o r the character s o f a finite field ar e als o indispensabl e
in th e moder n theory . W e shal l introduc e thes e i n du e course . However , t o avoi d
redundancy w e start her e b y givin g basic definition s i n a general contex t o f a finite
abelian group . (Character s fo r non-abelia n group s occu r also , bu t mor e naturall y
as Galoi s group s an d ar e bes t see n fro m th e automorphi c perspectiv e i n analyti c
number theory) .
:
Let G b e a finite abelia n group . A homomorphis m x G —) > C* i s calle d a
na s
character o f G. Therefor e (i n th e multiplicativ e notation ) \ th e propertie s
X(xy) = x(x)x(y) for 1a 1
x,y eG,
x(l) = 1 and x( x)m — 1? where m = \G\ is the orde r o f G , therefor e x( x) i s a ro
°t
of unity .
The character s o f G for m a grou p G wit h multiplicatio n give n b y
(XiX2){x) = Xi(x)X2{x) fo r al l x G G.
G i s called th e dua l group , it s identit y elemen t i s the trivia l characte r
Xo{%) — 1 fo r al l x G G.
Throughout x denote s th e comple x conjugate , henc e als o th e inverse . I f G i s cyclic
of orde r m an d g i s a generato r o f G , the n ever y characte r o f G i s o f typ e
Xa{x) = e 2™y'm, iix = gy
for som e fixed residu e clas s a(mo d m) . Thes e ar e distinc t characters , therefor e G
is als o cycli c o f orde r m , s o G i s isomorphic t o G . Th e isomorphis m G ~ G ca n b e
established fo r an y finite abelia n grou p b y writing G a s the direc t produc t o f cycli c
groups. Ther e i s a canonica l isomorphis m betwee n G an d G give n b y x F- > X where
Hx) = X(x) fo r al l x € G.
43
44 3. CHARACTER S
The raison d'etre o f these characters is found i n the following orthogonality relation s
5>(*) = {| d | if * =1'
which allow s u s t o detec t th e grou p identit y element .
Suppose d divide s th e orde r o f G . A n elemen t g G G i s sai d t o hav e exponen t
d i f g d i s th e identity . Th e orde r o f g i s it s smalles t exponent . Th e character s o f
exponent d ar e exactl y thes e whic h ar e trivia l o n th e subgrou p
Gd = {x d : x G G} ,
therefore the y ma y b e viewe d a s characters o f the facto r grou p G/G d. Th e orthog -
onality relation s becom e
[G : Gd] i f y G Gd ,
2^ XW I n0 i f y\t».dnd
0 G d.
This wil l allo w u s t o detec t th e d-th. powers o f G .
3.2. Dirichle t characters .
First w e consider th e character s o n th e additiv e grou p Z/ra Z o f residue classe s
modulo m. The y ar e give n b y
where e(z) = e 27Tlz. Thi s formul a make s a n additiv e characte r a functio n o n Z ,
periodic o f perio d m. Th e orthogonalit y propert y become s
CM
mi f n = 0(mo d ra),
E
a(mod m)
0 otherwise .
We shal l cal l ^ a (n) = e(an/m) primitiv e i f (a , ra) = 1 . Addin g al l th e additiv e
primitive character s w e obtain th e Ramanuja n su m
(3.1) S(n,0;m ) = c m(n)= ^ * e ( ^fan\ y
a(mod m )
(see (1 .56)) . Recal l tha t throughou t thi s boo k Y^ restrict s th e summatio n t o
"primitive" elements , i n th e abov e cas e amon g additiv e character s modul o ra. On e
shows b y Mobiu s inversio n tha t
(3.2) c m{n)= ] T d/x(^) .
d\(m,n)
Hence
3. C H A R A C T E R S 45
In particular ,
(3.4) c m(n) = /i(ra ) i f (ra , n) = 1 .
In general ,
(3.5) \cm{n)\ < (ra,n) .
Next w e conside r character s o n th e multiplicativ e grou p o f residu e classe s
a(mod ra) wit h (a , ra) = 1,
X : (Z/raZ)* -> C.
As wit h th e additive characters , w e wish t o consider x a s a functio n o n Z; we d o
so b y setting x( n) = 0 whenever n i s not prime t o ra. Thi s make s x a Dirichle t
character, a function o n Z, periodic modul o ra and completely multiplicative . The
corresponding extensio n o f the trivial characte r x o (mod ra) i s called th e principal
character t o modulu s ra. Th e group (Z/raZ) * an d its dual hav e (/?(ra ) elements .
The orthogonalit y relation s becom e
ip(m) if x = Xo,
E X(a ) = {0 otherwise ,
a ( mnod
o d ra)
(/?(ra) i f a = l(modra) ,
E *(« ) = {0 otherwise
%(mod ra)
,
If ra = vfi\Vfi2 wit h (mi,771 2 ) = 1 , then (Z/mZ) * ~ (Z/raiZ) * x (Z/TTT^Z)* ,
so ever y multiplicativ e characte r x ( m ° d m ) i s a produc t X1 X 2 o f multiplicativ e
characters xi(nio d m\) an d X2(mod 777,2).
The Dirichle t serie s associate d t o Dirichlet character s ar e of paramount impor -
tance in analytic numbe r theory . The y ar e called Dirichle t //-function s an d denoted
L(s,x), o r L(x, s) depending o n emphasis:
(3.6) L(s, x) = E *( n ) n ~ S = II (! - X{P)P- S)-1 ;
the series and the Euler produc t ar e absolutely convergen t fo r Re (s) > 1. As is well
known, thes e function s ca n be analytically continue d t o C. Se e Section 4. 6 for a
proof. Man y othe r analyti c properties , application s an d generalizations o f Dirichlet
//-functions will be considered throughou t th e book.
Note tha t b y total multiplicativity , (1 .1 5) , (1 .38 ) giv e
1 L'
— — - - ^ / x ( n ) x ( n ) n - s , ~j;(s,x) = ^ A(n)x(n)n~ 5
for R e (s) > 1.
3.3. Primitiv e characters .
Associated t o eac h characte r x > m additio n t o it s modulu s 777 , is a natura l
number ra*, its conductor. Th e conductor i s the smallest diviso r o f ra such tha t x
may be written a s x = XoX * where x o is the principal characte r t o modulus ra and
X* is a characte r t o modulu s 777* . Fo r some character s th e conductor i s equal t o
the modulus . Suc h character s ar e called primitive . I n the above factorizatio n x * i s
46 3. CHARACTER S
a primitiv e characte r uniquel y determine d b y \, W e shal l sa y tha t \ i s induced b y
X* or tha t x * induce s x - W e hav e
X(a) = x*(a ) i f (a,m ) = 1 .
The numbe r o f primitiv e character s t o modulu s m i s give n b y
i
(3.7) ^ H = " » n ( -|)n(i-J)a-2
p\\m p \m
The proo f i s a simpl e exercis e i n Mobiu s inversion . Indeed , w e hav e (p = 1 * <£*,
whence ip* = \i * ip, i.e.,
giving (3.7 ) b y multiplicativity . I n th e sam e wa y usin g th e orthogonalit y o f char -
acters w e infe r a mor e genera l resul t
if
(3.8) J2* *(«) = £ ^Hl) (a<m) = L
x(mod m) d\(a—l,m)
Incidentally, th e formul a (3.7 ) show s tha t th e primitiv e character s %(mo d m) exis t
precisely whe n m ^ 2 (mod 4) .
The primitiv e character s ar e pleasan t t o dea l with . Fo r example , w e hav e a
convenient formul a (whic h i s not vali d fo r % not primitive )
1(3-9) £ X (ac+b) = \X{b) 'I™ 1 ;*
K} m
, ,, I 0i f rafa.
Indeed, lettin g S b e th e abov e su m w e obtai n x( l + mix)S — S fo r an y x , wher e
mi = m(a , ra)/(a 2, m) . I f 5 7 ^ 0, this yields X(1 +TOI:E ) — 1 for an y x, whic h implie s
that x i s periodi c o f perio d mi . Sinc e x i s primitive , w e mus t hav e m\ = m , i.e. ,
m I a. Thu s S = 0 if m \ a, an d (3.9 ) i s obviou s otherwise .
The Dirichle t character s o f exponen t tw o ar e jus t th e rea l character s (real -
valued). I n a numbe r o f way s the y pla y a specia l role , particularl y the y ar e fun -
damental i n th e theor y o f quadrati c forms . Belo w w e giv e a complet e lis t o f rea l
primitive characters . I f m — p i s a n od d prime , the n ther e exist s exactl y on e
real primitiv e characte r o f conducto r p , namel y th e quadrati c residu e symbo l (th e
Legendre symbol )
s
This ca n b e define d b y sayin g tha t 1 + Xp( n) * t n e numbe r o f solution s x(mo d p)
to x 2 = n(mo d p).
For m — 4 we hav e exactl y on e primitiv e characte r define d b y
X 4 (n) = ( - l ) ^ i if2fn .
If m — 8, w e hav e tw o primitiv e character s
Xs(n) = ( - l ) * ( n - 1 ) ( n + 1 ) i f 2 f n ,
X4Xs(n) = ( - l ) * ( " - 1 ) ( r e + 5 ) H2\n.
3. CHARACTER S 47
If m i s prim e power , ther e ar e n o rea l primitiv e character s o f conducto r m othe r
than X4?X8 5X4X8 an d \ v . Ever y rea l primitiv e characte r x ( m ° d m) i s obtaine d a s
the product o f the above ones. Therefor e th e conductor o f a real primitive characte r
is a numbe r o f typ e 1 , /c, 4/c, Sk wher e k i s a positiv e od d an d squarefre e integer .
3.4. Gaus s sums .
Let u s com e bac k fo r a momen t t o a genera l finite abelia n grou p G. Th e
characters o f G for m a complete orthogona l syste m s o that an y functio n / : G—» C
has th e Fourie r expansio n
ipeG
with coefficient s
</,</>> = £ / ( ^ ( s ) -
geG
EXERCISE 1 Prov e tha t
(DU E T O D E D E K I N D ) .
TJ </,</>> = det(f(gh- 1 )).
In th e cas e o f function s o n residu e classes , an d o n a finite field, bot h additiv e
and multiplicativ e character s ar e presen t an d i t i s ofte n necessar y t o transfor m
the Fourie r expansio n i n on e o f thes e system s t o th e other . I n doin g s o w e en -
counter Gaus s sum s (x^) f° r an y pai r o f multiplicativ e an d additiv e character s
X, ip respectively. W e shal l presen t th e Gaus s sum s fo r finite fields i n du e time .
Now we consider Gaus s sum s associate d wit h character s o n residu e classes , sa y
modulo ra. Fo r an y multiplicativ e characte r x ( m ° d ra) w e pu t
(3-10) r ( X)= £ X(b)e(~).
b(mod ra)
Multiplying (3.1 0 ) b y x( a) an d summin g ove r x w e derive b y orthogonalit y
(3.11) e ( £ )= ^ £ x(a)r( X) if(a,m )= l .
x(mod m)
This i s th e Fourie r expansio n o f additiv e character s i n term s o f th e multiplicativ e
ones. Similarl y
6
(3.12) X(a)r(x)= £ *( ) e ( ^ ) '^(a,m) = l
b(mod m)
which give s th e Fourie r expansio n o f x i n term s o f additiv e character s provide d
r(x) 7 ^ 0 . Not e tha t th e conditio n (a , ra) = 1 i n (3.1 2 ) ca n b e droppe d i f x i s
primitive becaus e bot h side s vanis h i f (a , ra) ^ 1 (apply (3.9)) .
48 3. CHARACTER S
LEMMA 3.1 . Suppose the character x modulo m is induced by the primitive
character x * modulo m*. Then
(3.i3) ^>=<0**(^K) -
If x (mo d ra) is primitive, then
(3.14) |r( X )| = yM-
PROOF. W e hav e
e
r(x)= E * X » e ( ^ ) = E ^ ) x * WE * » ( ^ ) -
o(mod m ) d\m a(mo d *jj-)
The innermos t su m vanishe s unles s d = m/m* giving (3.1 3) . T o prov e (3.1 4 ) w e
write
2 'a — 6>
"(X)| = E E xWxWe(^ ) 771
a,b(mod m )
a(mod 7n ) 6(mo d ra)
The inner su m is the Ramanuja n su m (whic h is also the Gaus s sum for th e principa l
character). Insertin g (3.2 ) w e ge t
|r(X)|2 = E ^ ( 7) £ X(0)
-
d\m a(mo d ra)
aEl(mod d)
If x i s primitive o f conducto r ra, the n th e las t su m vanishe s unles s d — m (appl y
(3.9)), givin g (3.1 4) . D
One ca n se e b y Lemm a 3. 1 tha t r(x ) doe s no t vanis h exactl y whe n m/m* i s
squarefree an d prim e t o ra*. In thi s cas e (3.1 2 ) become s a true formul a fo r x( a )-
We now conside r th e mor e genera l su m
r(x,^a)= E x(b)Mb)
6(mod ra)
where ip a{b) = e(ab/m) i s an additiv e character .
LEMMA 3.2 . Let x modulo m be a non-principal character induced by the prim-
itive character x * modulo m* . Let a ^ 1 . We have
T(X, ^O) = r(x) Yl dx*{a/d)fi(m/dm*),
d\(a,m/m*)
in particular,
if (a , ra) =1 , i.e. if ipa i> s a primitive additive character.
Note tha t r(x ) = x ( - l ) r ( x ) > henc e i f x (m ° d ra) is primitive, the n (3.1 4 ) ca n
be writte n as
(3.15) T(X)T(X ) = x(-l)m .
3. CHARACTER S 4 9
For an y character s xi(mo d mi ) an d X2(mo d 7722 ) with (mi , 7722) = 1 we hav e
(3.16) r(xiX2) = Xi(^2)X2(mi)r(xi)r(x2) .
This multiplicatio n rul e fail s i f the modul i ar e no t relativel y prime . Fo r character s
to th e sam e modulu s th e correctio n facto r i s th e Jacob i su m
(3-17) J{Xi,X2)= Y, Xi(a)X2(l-a) .
a(mod m)
Precisely i f xi > X2 are tw o character s modul o m suc h tha t X1 X 2 is primitive, the n
(3.18) r(xi)r( X 2) = J(xu X2MX1 X2) .
Hence i f al l Xi > X25 X1X2 are primitiv e o f modulu s m , the n
(3.19) |«/(xi,X2) | = Vm" .
If x(mo d m ) i s primitive, the n
(3.20) J ( x , x ) = x("l)Mm) .
3.5. Rea l characters .
For a primitiv e characte r x ( m ° d m ) w e kno w tha t |r(x) | = y/in, however ,
the determinatio n o f th e argumen t o f r(x ) i s a difficul t problem . Usin g Deligne' s
estimate fo r multipl e Kloosterma n sum s on e ca n sho w tha t r(x)/ 'y/rfi ar e asymp -
totically equidistribute d o n th e uni t circl e a s x ( m ° d m ) range s ove r al l primitiv e
characters an d m tend s t o infinit y ove r prime s (se e Chapte r 21 ) .
In th e cas e o f rea l character s r(x ) wer e evaluate d completel y b y Gauss . I t i s
easy t o se e tha t T(XA) = 2i , T(XS) — 2 A/2 an d r(x4Xs ) — 2y/2i.
T H E O R E M 3. 3 ( G A U S S ) . Form odd squarefree
(3.21) r(x ) = emVm~
where
mod4
(3.22) e rn = {1 ^ m
= l
( )>
171
\ i if m = — 1 (mo d 4) .
We shal l giv e a n analyti c proo f o f (3.21 ) . Bu t firs t w e evaluat e othe r Gaus s
sums o f typ e
(3.23) G(m)= £ e(£ )
n(mod ra)
for an y intege r m ^ 1 . I f m i s even w e derive b y shiftin g n t o n + | tha t G(m) =
imG(m), becaus e ^ ( n + f ) 2 = * £ + f (mo d 1 ) . Henc e
(3.24) G(m )= 0,i f m = 2(mo d 4) .
Next w e show tha t
3
(3.25) G(m ) = mG(m), i f m ^ 2(mo d 4) .
This follow s b y splittin g th e summatio n i n G(m 3) int o n = a + 6m 2 (mod m 3 ) i f
2 { m o r n = a- f 6^-(mo d m 3 ) i f 4 | m. No w we ar e read y t o prov e
50 3. C H A R A C T E R S
T H E O R E M 3. 4 (DIRICHLET) . For any m e N ,
1— 4- ? m
(3.26) G(m) = - ^ — ^ .
PROOF. W e hav e
o2>
c
G(m) = 2 £ Gr) + 01( ) -
0<n<-^
In th e segmen t ^ < n < y w e chang e n int o [y ] — n = y — { y } — n . Sinc e
^ ( [ f ] - ^ ) 2 = ^ ( ^ + { f } ) 2 - f + [ f ] - " = ^ ^ + { f } ) 2 - f ( m o d l ) , w e obtai n
<*»>-* E <£)+*- - E .(fit±^l£) +o(.,.
Here, an d a s before , th e erro r ter m 0(1 ) account s fo r th e term s whic h ar e no t
covered i n th e displaye d summatio n (a t mos t tw o o f them) . B y Lemm a 8. 8 eac h
of th e tw o shor t sum s abov e i s approximate d (u p t o a bounde d erro r term ) b y th e
integral
00
rm/4 2 .f 1 / T2 \ + 7
/ e( — )dx= / el— )<f a + 0 ( l ) = ^ >M+ 0(l) .
Jo Vr ™/ J o V m '4
Hence addin g u p thes e result s w e ge t
G(m) = \±^-V^ + 0(1 ).
1+2 i
It remain s t o sho w tha t th e las t erro r ter m 0(1 ) vanishes . Thi s follow s b y iterate d
application o f (3.25 ) i f m ^ 2(mo d 4) ; otherwis e (3.26 ) i s obviou s becaus e bot h
sides vanish . •
We generaliz e G(m) b y settin g
<-> o - E •(£ )
n(mod m )
for an y a wit h (a,m) = 1 . I n particular , G ( —) = G(m). I f ra = 771 1 777 2 wit h
(mi,777,2) = 1 , we obtai n
(3.28) G ( _ £ _ ) = G ( ^ W ^ )
by writin g 7 7 = 77-1 777, 2 + 772777 1 with 77 1 an d 71 2 rangin g modul o m\ an d 777 2 re-
spectively. Thi s formul a reduce s th e proble m o f evaluatin g generalize d Gaus s su m
G(a/777) to tha t o f a prim e powe r modulus . I f 77 7 = p i s a n od d prim e w e can writ e
o(f)- E K)Mf )
6(mod p)
since 1 + ( | ) i s th e numbe r o f solution s t o n 2 = 6(mo d p). Therefor e
<»•»> °(? ) - E $«(£ ) - ( X )" (? M
£>(mod p )
3. CHARACTER S 51
by changin g th e variabl e an d usin g (3.26) . Nex t fo r distinc t od d prime s p , q w e
derive b y (3.26),(3.28),(3.29 ) tha t
^VH- 0 (i)- C (|M|)-(f)(J)^VH.
Note als o tha t
E l£
(3.30) e pq=epeq(-l) ^ ^.
Combining thes e formula s w e deduc e
THEOREM 3. 5 (QUADRATI C RECIPROCIT Y L A W ) . For any odd primes p ^q
we have
1
1 5 5
<>•»> (D^-t- )" " -
Since G(-l/p) = G(l/p) i t follow s fro m (3.29 ) tha t fo r od d p
(3-32) ( ^ ) = e 2= (_i)^ =X4 (p).
EXERCISE 2 . Prov e tha t fo r od d p,
(3-33) ( ? ) = (-i)t( P2-i) = X8 (p).
Now (3.21 ) follow s b y r(x) = G(p) = e py/p i f m = p i s od d prime , an d i n
general b y th e multiplicativit y o n usin g (3.1 6),(3.30 ) an d (3.31 ) .
Note w e have prove d tha t
^ \m) \m) ~ ^ \m)
x(mod m) x(mo d m)
if m i s od d squarefree . I f m i s od d bu t no t squarefree , th e lef t sid e vanishe s whil e
the righ t sid e doe s not .
For convenienc e w e extend th e Legendr e symbo l (^ ) t o p — 2 by settin g
(§H; :;*!:
Then fo r an y b > 0 we defin e
<»*> (i)-n(D' -
Pv\\b F
If b is odd, thi s symbo l ( | ) wa s introduce d b y Jacobi .
52 3. C H A R A C T E R S
EXERCISE 3 . Deriv e fro m (3.31 ) tha t fo r an y od d integer s a, b relatively prim e
(3.36 ) (i)(s)-H)-"W W
Here (x^y)^ i s the Hilber t symbo l define d fo r xy ^ 0 by
f -1i f x < 0 and y < 0,
(3.37 ) (*,y)oo = i ^ •
(^ 1 otherwise .
Notice tha t 2{x, y)^ — 1 4- sign x - f sig n ? / — sign #?/.
EXERCISE 4 . Prov e tha t fo r an y a,r a wit h (2a , m) = 1 ,
(3.38) G(£ ) = f-Wv^.
;c
m
Finally w e exten d th e symbo l (f ) t o al l integer s a , b except fo r a — b = 0 . I f
ab ^ 0, w e se t
< 339 > © -(H)'"-"-
Then w e se t
and
(3.41) ( | ) = 0 if (a, 6 ) ^ 1.
For 6 ^ 0 thi s symbo l wa s introduce d b y Shimur a [Shi ] i n th e contex t o f
modular form s o f half-integra l weight . H e showe d tha t (1 .53 ) hold s wit h
(3.42) i / ( c , d ) = e d Q.
EXERCISE 5 . Chec k th e consistenc y o f th e abov e settings . Prov e tha t fo r an y
6 ^ 1 th e ma p a h->- ( |) i s a Dirichle t characte r modul o b. Prov e tha t fo r an y a ^ O
the ma p b i-> ( | ) i s a Dirichle t characte r o f conducto r a * | 4a.
The abov e extensio n o f ( | ) wil l b e calle d th e Jacob i symbol .
Any intege r A / 0 wit h A = 0, l(mod 4 ) i s calle d a discriminant . I f A = 1
or A i s th e discriminan t o f a quadrati c field, the n A i s sai d t o b e fundamental .
Any discriminan t ca n b e writte n uniquel y a s A — e 2D wit h D fundamenta l an d
e ^ 1 . A prim e discriminan t i s a fundamental discriminan t havin g exactl y on e
prime factor , thu s i t i s a numbe r o f typ e —4 , —8, 8 an d X<±(P)V with P > 2. Ever y
fundamental discriminan t factor s int o prim e discriminants .
To an y discriminan t A on e associate s th e Kronecke r symbo l ( — J whic h i s
defined fo r an y c ^ 0 by mean s o f th e Jacob i symbo l a s follow s
(3.43 ) ^ ^ ^
cJK \ A j \ b
3. CHARACTER S 53
where c = 2 vb wit h b odd. Not e tha t ( f ) K = ( f ) = Xs(A) , explicitl y
'1 i f A = l(mod8) ,
(3-44) | ' | j . { - 1 if A = 5(mod8) ,
0 if A = 0(mod4) .
We als o exten d th e definitio n o f th e Kronecke r symbo l fo r c = 0 by settin g
A\f 1 i fA = l ,
V^ 7 xl 0 otherwise .
Therefore (^) K i s defined fo r al l integer s c and A ^ 0 , A E 0 , l(mod 4) .
We shal l dro p th e subscrip t K i n th e Kronecke r symbo l notatio n an d explai n
in an y relevan t cas e tha t w e ar e dealin g wit h th e Kronecke r symbol . Remembe r
the Kronecke r symbo l i s no t define d fo r A' s othe r tha n discriminants . Whe n th e
symbol ( —) appear s withou t comment s i t stand s fo r th e Jacob i symbol .
EXERCISE 6 . Prov e tha t fo r a fundamental discriminan t A th e Kronecke r sym -
bol (^ ) i s a primitiv e characte r o f conducto r |A| .
3.6. Th e quarti c residu e symbol .
Next w e construc t certai n character s o f orde r four . Thes e ar e associate d wit h
Q(i), th e imaginar y quadrati c fiel d o f discriminan t D — —4. Th e rin g o f integer s
of Q(i) i s
Z[i] = {z = a + bi:a,beZ}
and th e grou p o f unit s o f Z[i] is { l , i , i 2 , i 3 } . Th e irreducibl e element s o f Z[i] ar e
(up t o th e units) ; 1 + i wit h N(l - f i) = 2 , th e rationa l prime s q = — l(mod 4 ) wit h
Nq = q 2 an d th e comple x number s TT = a + bi wit h
(3.46) NTT = TTT T = a 2 + b 2 = p = l(mo d 4) .
Note tha t TT an d 7 f are coprime , thes e ar e calle d Gaussia n primes . Ever y elemen t
of Z[i] different fro m zer o factor s uniquel y (u p t o th e unit s an d permutations ) int o
powers o f irreducibl e elements .
For ever y Gaussia n prim e TT w e define a ma p (th e quarti c residu e symbol )
2 3
(3.47) ( - ) :Z[i]^{0,l,M i }
which satisfie s
(3.48) (-] = a 2^1 (mo d TT) .
Note tha t ( ^ ) = 0 i f 7 r | a . I f 7 r f a , the n a?~ l = l(mo d TT) becaus e th e residu e
class ring Z[i]/7rZ[i] i s a finite fiel d wit h p = NTT elements. Thi s propert y (th e littl e
theorem o f Fermat ) implie s th e existenc e an d th e uniquenes s o f solution s t o (3.48 )
with (^ ) = i m fo r som e 0 ^ m < 4. I n particular , w e hav e
(3.49) [ - ] = i ^ ,i f P = TTTT = l(mod4) .
54 3. C H A R A C T E R S
EXERCISE 7 . Prov e th e followin g propertie s o f th e quarti c residu e symbol :
<""> (¥)-(;)(;) •
(3.51) a = /?(mo d n) => ( - ) = ( - ) ,
(3,2) £ ) = (!! ) i f V=J» ,
<«»> a (I
(3.54) ( 4
) = (")•
— 1 = 1 if and onl y if z = a (mod IT) ha s a solution i n Z[i]* .
If ^ = 7T i • • -7rr i s the produc t o f Gaussian prime s (no t necessaril y distinct) ,
then w e se t th e symbo l
<»» (?) = £)-£)•
Clearly th e propertie s (3.50)-(3.53 ) hol d tru e wit h TT replaced b y 7 . I n particular ,
(3.53) give s
(f)=l i f
(«.P ) = 1 -
Note tha t ( 1 + i)2 = 2i . W e sa y tha t a G Z[i] i s odd i f ( 1 + i) \ a an d
primary i f a = l(mo d 2( 1 + z)). Ever y od d numbe r i n Z[i] i s associated wit h
exactly on e primary element , i.e. , ea = l(mo d 2( 1 +z)) fo r exactl y on e unit e = i m.
A primar y elemen t ca n b e writte n a s th e produc t o f primar y irreducible s uniquel y
up t o permutations . W e hav e th e followin g
T H E O R E M 3. 6 ( T H E LA W O F QUARTI C R E C I P R O C I T Y ) . If TTI^2 are distinct
primary Gaussian primes, then
where p\ = NTTI and p 2 = N7T2 . If TT = a + bi is primary, then
2 2 >
' l + ^)\ .(„.-•[-h-h
Q) =i (l-)/ a> (i±l }/d /
= j (-l-6-^)/4 f ( ^ r6'2.
The quarti c residu e symbo l (— ) is a multiplicative characte r o f th e finite field
¥p ~ Z[i]/irZ[i\. Th e Gaus s su m o f thi s characte r is
,3,7) ,< „ = £ (=).(| )
a ( m o d p)
where a runs ove r th e rationa l residu e classe s modul o p and a is the representativ e
of a i n Z[i] modulo 7r . If 7r is primary, the n
2
(3.58) S M= -(-l)^V^ .
We shal l b e intereste d i n th e quarti c residu e symbo l a t rational integer s
(3.59) X7r (n)=(-), forneZ .
3. C H A R A C T E R S 55
This is a Dirichlet characte r o f conductor p = TTTT and o f order four. Sinc e (71 - , n) = 1 ,
we have xi( n) = n^~ (mo d p). Therefor e
(3.60 ) ^») = G )
is th e quadrati c residu e symbol . Clearl y Xir{ n) an d X7f( n) a r e distinc t Dirichle t
characters bu t thei r square s yiel d th e sam e quadrati c character . Mor e generally ,
if q = p\ • • - pr i s th e produc t o f distinc t prime s pj = 1 (mod 4) , the n ther e ar e 2 r
distinct Dirichle t character s x ( m ° d q) satisfying \ 2 — Xqi namely x = Xm ''' Xn r =
X7, say , fo r an y 7 = 7r i • • • irr wit h 7 7 = q.
3.7. Th e Jacobi-Dirichle t an d th e Jacobi-Kubot a symbols .
Let q b e th e produc t o f prime s = l(mo d 4 ) (no t necessaril y distinc t primes) .
The Jacob i symbo l Xq( n) = (~ ) c a n b e extended t o Gaussia n domai n Z[i ] in severa l
ways correspondin g t o representation s o f q as th e su m o f tw o square s
(3.61) q = u 2+v2 with(u,v )= l .
By requirin g w — u + iv t o b e primar y w e distinguis h u fro m v an d w e fix the sig n
of u. Not e that u = l(mo d 2 ) and i ; = u — l(mod 4) . A Gaussian intege r w — u + iv
is sai d t o b e primitiv e i f (u,v) = 1 , thu s a n od d it ; i s primitiv e i f (w,w) — 1. Fo r
any primar y primitiv e w w e defin e th e symbo l (^ ) : Z[i]—> > {0,1 , —1} by
z\ /Re wz^
<"»> (=) - ( I = F )
where o n th e righ t sid e i s the Jacob i symbol . Mor e explicitl y w e hav e
/ ~ Nf z\ fur — vs\
(3.63) ( _ )= ( _ _ ) , li z = r + rs
where q = ww. I f q i s prim e = l(mo d 4) , w e ge t tw o differen t symbol s (^ ) an d
(=|) whic h wer e considere d b y Dirichle t [Dir] . Throughou t w e call (~) th e Jacobi -
Dirichlet symbo l wheneve r w i s primary an d primitive .
We coul d introduc e th e Jacobi-Dirichle t symbol s usin g root s o f quadrati c con -
gruences: ther e i s one-to-on e correspondenc e betwee n th e root s o f
2
(3.64) UJ + 1 = 0(mo d q)
and th e factorization s q = ww wit h w — u + iv primar y whic h i s given b y
(3.65) u = —uv(mod q).
Here u denote s th e multiplicativ e invers e o f u modul o q. W e obtai n
i i z = r + l s
(3.66) ( « )= ( —)
by (f ) = ( ? ) = ( f t ) = ( g ) = 1 - Henc e i t i s clea r tha t
1(3.67) ( ) = Q if reZ.
For z = i we hav e
(«8> (s) l2 .
56 3. C H A R A C T E R S
Clearly (^ ) i s periodi c i n z o f perio d q, an d i t i s multiplicativ e a s wel l sinc e
(r*i + ujsi)(r 2 + ws 2) = r i r 2 - sis 2 + o;(ris 2 + r 2 si) (mo d g) .
EXERCISE 8 . Deriv e fro m th e quadrati c reciprocit y la w (3.31 ) th e followin g
reciprocity la w fo r th e Jacobi-Dirichle t symbo l
<-> (j ) - (f)
for an y z an d w which ar e primar y an d primitive .
For Gaussia n integer s z = r -\- is = 1 (mo d 2) , w e defin e
(3.70) W=^(H )
where ( A ) stand s for the Jacob i symbol. Not e that [z] vanishe s if z i s not primitive .
We are intereste d i n the multiplicativ e structur e o f [z] withi n th e Gaussia n rin g Z[i]
rather tha n wit h respec t t o th e co-ordinate s (r , s) G Z x Z. Fo r thi s reaso n w e refe r
to [z] a s th e Jacobi-Kubot a symbol . Indeed , thi s symbo l i s relevant t o th e Kubot a
homomorphism SX(2,Z )—> • {±1 } whic h ha s muc h t o d o wit h metaplecti c modula r
forms. O f course , [z] is not multiplicativ e i n th e stric t sense , ye t i t i s nearl y so , u p
to a facto r whic h i s the Jacobi-Dirichle t symbol .
EXERCISE 9 . Prov e tha t i f w i s primary primitiv e an d z = \ (mo d 2) , the n
(3.71) [wz]=e[w\{z}(^)
with e = ± 1 dependin g onl y o n th e quadrant s t o whic h z , w an d wz belon g (se e
the detail s i n [FI1 ]) .
Both symbol s o f Jacobi-Dirichle t an d Jacobi-Kubot a ar e utilize d i n th e proo f
of th e asymptoti c formul a fo r prime s p = X 2 + Y" 4. The y pla y a rol e throug h th e
estimation fo r genera l bilinea r form s i n [wz] (see Propositio n 21 . 4 o f [FI1 ]) .
3.8. Heck e c h a r a c t e r s .
E. Heck e [Heel ] generalize d th e Dirichle t character s t o an y numbe r field. Hi s
characters ar e multiplicativ e function s o n ideals . W e restric t ou r consideration s t o
imaginary quadrati c fields. Ever y suc h field is obtained b y a n extension o f rational s
by y/d, wher e d i s a negative , squarefre e integer . Denot e K — Q(\/d). Th e rin g o f
integers o f K i s a fre e Z-modul e O = Z + o;Z , wher e
d = 2(mod4) ,
d= l ( m o d 4 ) .
The discriminan t o f K
if d = 2,3(mod4) ,
D=
if d= l ( m o d 4 ) .
Note that i n an y cas e K = Q(y/D) an d O = Z + \{D + y/D)Z. Th e extensio n
has tw o automorphisms , th e identit y an d th e comple x conjugation . Th e grou p o f
3. C H A R A C T E R S 5 7
units U C O i s finite, cyclic , generate d b y th e roo t o f unit y ( w — e27Tl^w o f orde r
w = 4,6 , 2, precisel y
I7 = { ± l , ± i } i f d = - l ,
(3.72) U = {±1 ,±UJ,±UJ 2} i f d = - 3 ,
U = {±1 } i f d < —3 .
The Kronecke r symbo l Xjr>(n ) = ( ^ ) i s a real primitiv e characte r o f conductor —D,
we cal l i t th e field character . Notic e tha t XD{— 1 ) = — 1 (se e (3.39)) . Ever y idea l
a ^ 0 i n O i s a fre e Z-module , an d th e rin g o f residu e classe s O/a i s finite, th e
number o f it s element s bein g th e nor m o f a
(3.73) Na = |{a(mo d a ) : a e 0}\.
The nor m i s multiplicative . I f a — (a) i s a principa l ideal , the n iV a = Na = \a\ 2.
Every idea l a ^ 0 factor s uniquel y int o power s o f prim e ideals . Ever y prim e idea l
of K appear s a s a facto r o f a rationa l prime . Th e la w o f factorizatio n o f rationa l
primes int o prim e ideal s i n K assert s th e following :
- i f XD(P) — 0 > then p ramifies ; (p) — p 2 wit h Np — p,
- i f XD(P) — 1? then p splits ; (p) = pp wit h p ^ p an d Np = p ,
- i f XD(P) — —1 > then p i s inert; (p) — p with Np = p 2.
An integra l idea l a ^ 0 which ha s n o rational integra l divisor s othe r tha n ± 1 i s
said to be primitive. Ever y primitiv e idea l can b e written uniquel y a s the Z-modul e
(3.74) a = a Z + \{b + y/D)Z = [a , \{b + V3)]
with a = iV a an d 6 the intege r determine d b y — a < b < a , b 2 = D(mo d 4a) . Not e
that w e hav e 6 2 — 4a c = D wit h (a , 6, c) = 1 . Thi s i s clea r fro m th e followin g
computation:
(a) = (aa ) = [a, \{b + y/D)][a, \{b - VB)]
2
= [a ,^(b+VD),^(b-VD),ac}
C [a 2, afr, ac] = a [a, 6, c] C (a) .
For a primitiv e idea l a we se t th e poin t i n th e uppe r half-plan e
b
(3.75) z a = A^~ e H.
The invers e idea l a - 1 i s th e fre e Z-modul e generate d b y 1 and z a ,
1
(3.76) a" =Z+6~Z .
-1
Let D denot e th e differen t o f K, s o by definitio n c> i s th e fractiona l idea l
d'1 = {a £ K : Tr a £ Z } .
In ou r cas e th e differen t o f th e quadrati c field K = Q(yD) i s th e principa l idea l
X> = (y/D). Not e tha t d = d an d Nd = |£>| .
58 3. C H A R A C T E R S
Let / b e th e grou p o f al l non-zer o fractiona l ideal s
— : ai,a2 C C 1 a1 a2 ^ 0
<*2
and P e l th e subgrou p o f principa l ideal s (a ) = aO wit h a G K*. Th e facto r
group H = I/P i s calle d th e clas s grou p o f if . I t i s a finit e abelia n grou p o f orde r
h = \H\ = [I : P] which is called the class number o f K (als o of the discriminant D).
Every clas s A i s represente d b y a uniqu e primitiv e idea l a wit h z a i n th e standar d
fundamental domai n (suc h idea l i s called reduced ) F = F~ U F+ , wher e
(3.77) F~ = {z = x + iy G H I : -\ < x < 0 an d \z\ > 1 }
+
F = {z = x + iy e M : 0 ^ x ^ | an d \z\ = 1 } .
Therefore th e clas s numbe r ft = h(D) i s equa l t o th e numbe r o f primitiv e reduce d
ideals. Fo r a primitiv e idea l writte n a s a = [a , ±(b + VD)} wit h b 2 -4ac = D w e
have \z a\2 = ^ , whenc e fo r a t o b e reduce d i t say s tha t eithe r — a < b ^ a < c or
0 ^ b < a = c . Conversel y an y integra l solutio n t o b 2 — Aac = D satisfyin g th e
above inequalitie s yield s a primitiv e reduce d idea l define d b y (3.74) .
Now we go to characters. Firs t w e consider Dirichle t character s o n O. Le t m be
a non-zer o integra l idea l i n O. Th e residu e classe s a (mo d m) , a G O , (a,m ) = 1
form a multiplicativ e grou p (0/m)* . An y homomorphis m
X : (O/m)* - > C*
is calle d a Dirichle t characte r modul o m . Naturall y w e exten d x t o O b y settin g
x(oi) — 0 i f (a,m ) ^ 1 . I f x i s a characte r t o modulu s m then (b y natura l restric -
tions) i t ca n b e als o viewe d a s a characte r t o an y modulu s n C m (recal l tha t thi s
inclusion o f ideal s mean s m divides n ) . Fo r an y x(mo d m ) an d an y f3 G (Dm)-1 w e
define th e Gaus s su m
(3.78) r x(/3)= J2 x(<*)e(Tr(a0))
a. (mo d m )
This i s well defined, becaus e i f a\ = a 2 (mo d m ) then (5{ai — a2) G ( M )_ 1m = D _1
and T r (/3(ai — a 2)) G Z . Th e sam e argumen t show s tha t r x (/3) depend s onl y o n f3
(mod D" 1 ), i.e . r x{pi) = r x (/3) i f 7 - / 3 G D"1. Clearl y
rx(a/3) = x(a)r x (/3) i f (a,m ) = 1 .
The conducto r o f x (mo d m ) i s the larges t idea l f Dm (tha t i s the smalles t diviso r
f o f m ) suc h tha t x factor s throug h (0/f)* . I f f = m the n th e characte r i s sai d t o
be primitive .
PROPOSITION 3.7 . If x (mo d m ) is primitive, then
2 J
(3.79) | r x(/3)| -n \ ^ \
I 0 otherwise.
After havin g introduce d character s o f (0/m)* , w e look fo r (unitary ) character s
of C*. B y definition thes e ar e continuou s homomorphism s o f C* into the uni t circl e
S1 = {a G C I \a\ = 1}.
3. CHARACTER S 59
Any suc h characte r Xo o : ^ * —> S 1 i s give n b y
for som e £ G Z . W e cal l £ the frequenc y o f Xoo-
Next w e turn t o character s o n idea l classes . Recal l w e have alread y introduce d
the followin g groups :
- / th e multiplicativ e grou p o f fractiona l ideal s ^ 0 .
- P th e subgrou p o f principa l ideals .
- H = I/P th e idea l clas s group .
Now fo r an y integra l idea l m C 0 , m ^ 0 , w e se t th e subgroup s
Im = {ael | (a,m ) = l } c J ,
Pm = {(a)eP I (a,m ) = l } c P .
Here (a , m) = 1 means a = be - 1 fo r som e b , c C (9 , with (bc,m ) = 1 .
EXERCISE 1 0 . Sho w tha t (a,m ) = 1 mean s a = ^~ x fo r som e /3, 7 G O,
(/37,m) = l .
The facto r group s I m/Pm ar e isomorphi c t o I/P — H. An y homomorphis m o f
H int o S 1 i s calle d a clas s grou p character . Ther e ar e exactl y h = \H\ = \H\ clas s
group characters .
Finally w e introduc e th e Heck e character s ("Grossencharakteren") . A Heck e
character modul o m i s a continuou s homomorphis m ip : I m—> • S 1 fo r whic h ther e
exist tw o character s
X : {O/mf - > S 1
l
Xoo : C * - > S
such tha t
^((«)) = x(a)Xoo(c O
for ever y a G 0 , ( a , t n) = l . Not e that \ (mo d m ) an d Xo o are determined uniquel y
by ijj. Indeed w e hav e
XOO{OL) = %l)([a)) i f a = l (mo d m) .
Moreover, th e grou p { a G K * | a = 1 (mo d m) } i s dens e i n C * s o th e abov e
formula define s Xo o uniquely o n C * by continuity . Afte r havin g determine d Xo o we
find x b y
X(a) = ^((a))/xoo(a) .
Not ever y pai r o f character s x (mo d m ) an d Xo o come fro m a Heck e characte r
ip (mo d m) . Indeed , i f e G J7, the n w e hav e x(^)Xoo(^ ) = ^((^) ) = 1 - O n e c a n
show tha t th e conditio n
(3.80) X (e)Xoo{e) = 1 for all e G U
is sufficien t fo r th e pai r o f x (mo d m ) an d Xo o to resul t fro m a Heck e characte r ip
(mod m) . However , a pai r o f character s x (mo d m ) an d Xo o satisfyin g th e abov e
"units consistenc y condition " determin e ip (mo d m ) onl y u p t o a clas s grou p char -
acter, i.e . i t induce s exactl y h Heck e characters .
60 3. CHARACTER S
In ou r cas e th e grou p U i s cycli c o f orde r w = 4 , 6 or 2 generated b y th e roo t
of unity £ = e 2nl/w. Therefor e th e conditio n (3.80 ) reduce s t o tha t fo r e = £ , whic h
becomes
(3.81) X(C ) = C £-
In othe r words , thi s i s a conditio n fo r £ (mo d w). I f D < —4 , we hav e ( = — 1, s o
the unit s consistenc y conditio n become s
(3.82) < ? = A ( 1
1X ( - ) - ) (mo d 2) .
The conducto r o f a Heck e characte r ip (mo d m ) i s th e smalles t diviso r f o f m
such tha t tp is th e restrictio n o f a Heck e characte r t o modulu s f . Thi s i s th e sam e
as th e conducto r o f x (mo d m) . I f f = m , the n ip is said t o b e primitive . Therefor e
tp (mo d m) i s primitiv e i f an d onl y i f th e correspondin g \ (mo d m ) i s primitive .
The Heck e character s o f modulu s m = (1 ) an d o f th e frequenc y £ = 0 correspon d
exactly to the class group characters. Henc e there can be many primitive character s
of conducto r one .
Let ^ b e a primitiv e Heck e characte r o f conducto r m an d o f frequenc y £. W e
consider ip as a multiplicativ e functio n o n al l non-zer o integra l ideal s b y settin g
i/j(a) = 0 if (a , m) ^ 1 . T o ip we associat e th e Heck e L- function
L(s,V0= Yl V>(a)(iVa)- s.
0/aCO
This serie s converges absolutel y fo r R e (s) > 1 and ha s a n Eule r produc t ove r prim e
ideals
P
We als o introduc e th e loca l facto r a t th e infinit e plac e
(3.83) Loo(*,V 0 = (\D\Nm) s/2(27r)-sr(s+ ±\£\).
THEOREM 3. 8 ( H E C K E ) . Let \j) (mod m ) be primitive. Then the complete
product A(s,'0 ) = L OG(s,%p)L(s,ip) has analytic continuation to the whole complex
s-plane except for a simple pole at s = 1 when ip = ipo is the trivial character. In
this case m = (1 ) , £ = 0 and L(s, ipo) = CRT(S ) = ((S)L(S,XD) is just the Dedekind
zeta function of K, so
res A(s, t/>o) = hw 1_ .
s=l
Moreover, A(s , ip) satisfies the functional equation
(3.84) A(s , V) = WWOA( l - s ,V O
with the root number W(ip) given by the corresponding Gauss sum
(3.85) W{iP) = rV(^)(iVm)- 1 /2.
The Gaus s su m T(I/J) associate d wit h ip (mo d m ) i s define d b y
(3.86) ^ = W£ ^K^f )
aGc/cm
3. CHARACTER S 61
where 7 G O and c C 0 ar e such that (c , m) = 1 , cDm = (7) . Not e that th e definitio n
of r(ip) doe s no t depen d o n th e choic e o f 7 an d c du e t o th e unit s consistenc y
condition (3.80) .
EXERCISE 1 1 . Sho w tha t fo r a primitiv e Heck e characte r ij; (mod m ) whos e
Dirichlet componen t i s \ (mo d m ) w e hav e
(3.87) r(V 0 = V>(a)r x(/3)
-1
for som e (o,m ) = 1 and / 3 G (Dm) .
EXERCISE 1 2 . Prov e tha t fo r a primitiv e Heck e characte r ip (mo d m) ,
(3.88) \T($)\ = (TVm) 1 /2.
EXERCISE 1 3 . I f ip modulo m an d £ modul o n ar e primitiv e Heck e character s
with (m , n) — 1 , the n
T ( ^ ) = V(n)£(m)TtyOr(£) .
REMARK. I f ip is a primitiv e Heck e characte r o f conducto r m = (1 ) an d fre -
quency £, then r(ip) = i l. I n thi s cas e th e functiona l equatio n (3.84 ) ha s th e roo t
number W(ip) = 1 . Sinc e ip(a) = ^(a ) w e find tha t L(s,ip) = L{s^) b y changin g
a int o a i n th e Dirichle t series .
If ^(nio d m ) ha s frequenc y £, the n it s comple x conjugat e ip (mod m ) ha s fre -
quency —£. Therefore , withou t los s o f generality , w e ca n assum e tha t ^(nioc l m )
has frequenc y £ ^ 0 . The n th e functio n / : HI -> C give n b y
(3.89) f( z) = J2 ^ ( *) (Na) * e{zNa)
a
is a n automorphi c for m o f weigh t k — £ + 1 o n TQ(N) o f leve l A ^ = |Z)|AT m an d
character x(mo d N) define d b y
(3.90) X{n) = XD(n)i;(n) i f n G Z.
If £ > 0 , the n / i s a cus p form , whic h i s primitiv e i f th e characte r ^(mo d m ) i s
primitive. I n particular , i f m = (1 ) , then N = \D\ an d x = XD-
REMARKS. W e say a few words abou t th e proof s o f the abov e results. Firs t th e
Fourier serie s (3.89 ) splits into a finite sum over ideal classes. I n each class we get a n
infinite serie s ove r equivalen t ideal s whic h i s a thet a functio n fo r a positiv e definit e
binary quadrati c for m twiste d b y a suitable harmonic polynomial (se e Section 1 4.3) .
Applying th e Poisso n summatio n formul a on e show s tha t thes e individua l thet a
functions ar e automorphi c form s o f th e sam e typ e (cf . Chapte r 1 0 of [1 4]) . Henc e
the / , bein g a linea r combinatio n o f thet a functions , i s als o a n automorphi c for m
of th e sam e type . Anothe r applicatio n o f Poisso n summatio n show s tha t th e thet a
functions satisf y a Jacob i typ e involutio n formula . Summin g ove r th e classe s thes e
formulas yiel d
„ k f ( z \ _ iT W^f - 1
i
' ( ^ )
(at thi s poin t i t i s crucia l tha t -0(mo d m ) i s primitive) , wher e r ( ^ ) i s th e Gaus s
sum an d N = \D\Nm . Pro m this , on e derive s th e functiona l equatio n (3.84 ) b y
taking th e Melli n transfor m wit h respec t t o z = iy. Se e als o Chapte r 1 4 .
62 3. C H A R A C T E R S
We en d thi s sectio n b y givin g specifi c example s o f Heck e character s o n imagi -
nary quadrati c fields.
EXAMPLE 1 . Fo r K = Q(i) th e discriminan t i s D — —4 , the rin g o f integer s i s
O — Z[i], the grou p o f unit s i s U — {±1, ±i} an d th e clas s numbe r i s h = 1 , thu s
every idea l i s principal. Fi x a n intege r £. Pu t m = (1 ) o r m = 2(1 -h i) accordin g t o
4 | £ or 4 \ L Defin e £ : Im -» C * b y
(3.91) « a )= (^) '
where a is the unique primary elemen t whic h generates a , with a = l(mo d 2( 1 -\-i)).
Then £ is a primitive characte r o f frequenc y £ and conducto r m .
EXAMPLE 2 . Le t K — Q(i ) b e a s above . Tak e a positiv e intege r q = 0(mod 4 )
and x : C^fy]/Q^IA)* ~* ^ * a characte r o n th e multiplicativ e grou p o f primitive
residue classe s modul o q in Z[i]. Defin e £ : Iq —> C * b y
««0 = X(«)Q'
where a is the uniqu e primar y elemen t whic h generate s a . The n £ is a Hecke
character (no t necessaril y primitive ) o f frequenc y £ and modulu s q.
EXAMPLE 3 . Le t K = Q(v^D ) hav e discriminan t D < - 3, D = l(mod 4) .
Then th e rin g o f integer s i s
(3.92) 0 = <U(r a + n\/l>) : ra,nG Z, ra = n ( m o d 2 ) j ,
the grou p o f units i s U = {±1} while th e clas s number h = /i(D) ca n b e arbitraril y
large. Sinc e the principa l idea l (y/~D) ha s norm \D\ the rin g inclusion Z c O define s
an isomorphis m
0/(VD)^Z/\D\Z
( 3 - 93 ) V { m + ny/D m , , , ^ IN
2^ y ( m o d I^D -
Composing \i with th e Jacob i symbo l we get a quadratic characte r e : O —> {0 , ±1},
explicitly
,3,4) , ( = ± ^) =@ .
Note tha t £ i s odd , i.e. , e(—a) = —£(a ) becaus e |D | = —l(mo d 4) . Give n £ odd,
we hav e h Hecke character s £ : J—>• C suc h tha t o n principa l ideal s a = (a),
e
(3.95) £(a) = e( a)(f:) .
These ar e primitiv e character s o f frequenc y £ and conducto r m = (VD).
3. C H A R A C T E R S 63
EXAMPLE 4 . Le t K = Q(VD) b e an y imaginar y quadrati c field wit h w units .
Given £ = O(mo d w), w e hav e h = h(D) Heck e character s £ : / - > C * suc h tha t
£(a) = a/|a|) ^ o n principa l ideal s a = (a) . Thes e ar e primitiv e character s o f
frequency £ and conducto r m = (1 ) . Amon g thes e ther e i s a specia l characte r o f
frequency £ = hw define d b y £(a ) = (a/\a\) w fo r an y o G / , wher e (a) = a h.
EXAMPLE 5 . Le t K — Q(\/D) b e any imaginary quadrati c field of discriminan t
D, an d x(nio d q) a primitiv e Dirichle t characte r o f conducto r q wit h (q,D) = 1 .
Define th e homomorphis m \ ° N : Iq— > C * b y
(3.96) ( X oA0(a) = X (7Va).
Then £ = %oT V is a Hecke character o f frequency zer o which is primitive of conductor
m = (q). I n thi s cas e
2
(3.97) r(0 = XD(q)x(\D\)r (X )
where r(x ) i s th e Gaus s su m fo r th e Dirichle t characte r x(mo d q).
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/05
CHAPTER 4
SUMMATION F O R M U L A S
4.1. Introduction .
Series o f arithmeti c function s ofte n underg o som e kin d o f involutar y transfor -
mations. Thes e ar e derived no t b y re-grouping an d changin g th e orde r o f terms bu t
deeper change s ar e mad e b y applyin g Fourie r analysis . Fo r example , th e equatio n
(4.1) Yl e-™2/2/ = VvYl z~™ 2y
raGZ n E Z
cannot b e verifie d simpl y b y a combinatorial argument . A ric h sourc e o f relation s
of this kin d lie s i n automorphi c theory . Fo r instance , fo r a classical cus p for m f(z)
of weight k on th e modula r grou p T = SL2 (Z) whos e Fourie r serie s is
00
(4.2) f(z) = ^2\(n)n^ 1 e(nz)
1
the automorph y equatio n f(z) = z~kf(—l/z) yield s (b y integratio n alon g th e ver -
tical lin e z — iy) th e formul a
00 0 0
(4.3) ^ A ( m ) ? H = ^A(n)Mn )
11
where g(x) i s an y smooth , compactl y supporte d functio n o n R + an d
/»oo
(4.4) h(y) = 2m k / J 1k- (47ry/^)g(x)dx.
Jo
See mor e genera l result s i n (4.71 ) an d (4.72) .
Modular transformation s ar e typica l bu t ther e ar e plent y o f other importan t
cases. Fo r exampl e th e formula s (1 5.30 ) (th e Selber g trac e formula ) o r (1 6.34 ) (th e
Kuznetsov formul a fo r sum s o f Kloosterma n sums ) d o no t com e fro m automorph y
equations. Naturall y a n equatio n connectin g on e serie s o f an arithmeti c functio n
(weighted b y a test functio n o f certai n class ) wit h anothe r i s called a summatio n
formula. I n thi s chapte r w e develo p a few suc h formula s an d giv e standar d appli -
cations.
65
66 4. SUMMATIO N F O R M U L A S
4.2. Th e Euler-Maclauri n formula .
We begi n b y considerin g a functio n define d o n a segmen t o f rea l numbers .
Suppose a , b £ Z an d / i s continuous i n [a , b\. The n th e su m o f f(n) ca n b e writte n
as th e Stieltje s integra l
(4.5) £ /(n) = f f(x)d[x]
where [x] denotes th e integra l par t o f x (se e (1 .65)) . Puttin g
(4.6) ^(x ) = x-[x]-\
we derive b y partia l integratio n th e followin g Euler-Maclauri n formula .
LEMMA 4.1 . For f of class C 1 on [a, b] with a < b,a,b £ Z we have
(4.7) Yl / W = / V ( s ) + Hx)f'(x))dx + |(/(6 ) - /(a)) .
This formul a produce s goo d result s whe n f f(x) i s relatively small . I f the highe r
order derivative s o f / exis t an d ar e ver y small , i t ca n b e profitabl e t o continu e
integrating b y parts . T o this en d w e nee d th e Bernoull i polynomial s Bk{X).
We defin e Bk(X) G Q[X] b y th e followin g recurrenc e conditions :
(4.8) B 0(X) = 1,
(4.9) B' k(X) = kB h-
1 (X),
(4.10) f B k(x)dx = 0,
Jo
ifA ; ^ 1 . Th e conditio n (4.9 ) define s B' k(X) i n term s o f Bk-i(X) u p t o a constan t
while the last conditio n (4.1 0 ) (th e orthogonality o f Bk(X) t o constants) determine s
the constant . Th e monomia l X k satisfie s th e firs t tw o condition s bu t i t fail s th e
third one . W e fin d tha t B k(X) = X k - | X f c _ 1 - f . . . . I n particular , w e hav e
Bi(X) = X — | an d B 2(X) = X 2 — X + | . Th e generatin g powe r serie s fo r th e
Bernoulli polynomial s Bk(X) i s
(4.ii) ^ > * )= I>Wfc!=^rr
The las t equalit y follow s b y noticin g tha t
— F(t,X) = Y 1iBk1
. (X)jj— yl=tF(t,X).
4. SUMMATIO N F O R M U L A S 67
EXERCISE 1 . Prov e tha t
k
Bk(l-X) = (-l) Bk(X)
Bk{X + l)-B k{X) = kX k~1
x k
£B k(x+-)=q ~ Bk{qX).
0^a<q
Next w e define th e Bernoull i number s a s constan t term s o f the Bernoull i poly -
nomials
(4.12) B k = B k(0).
The generatin g powe r serie s fo r Bernoull i number s i s
tk t
(4.13) F(t) = Y,Bk
o fc! e * -1 '
Since F(-t) = t + F(t), i t follows tha t B k = 0ifk>l,k odd .
EXERCISE 2 . Prov e tha t
k k i
(4.14) B k(X) = ^2( e)BlX - .
REMARK. Th e Bernoull i polynomial s an d number s hav e bee n generalize d wit h
respect t o a character x ( m ° d q) a s follows :
0' l^^ g
The constan t ter m o f B ka(X) i s the generalize d Bernoull i number , namel y
Bka = B k,x(0) = q k~l Y. x(o)Bfe(-) -
0^a<q V
The generalize d Bernoull i number s occu r i n values o f L-function s a t specia l points .
They trul y belon g t o th e theor y o f p-adi c L-function s (du e t o Leopoldt , se e e.g .
[W]).
For an y fc^Owe pu t
(4.15) M*) = B k({x})
where {x} = x — [x] is the fractiona l par t o f x. Thes e function s ar e periodi c o f
period one , s o i s th e generatin g functio n
k 1
^t te ^
k=0
68 4. S U M M A T I O N F O R M U L A S
Therefore the y ca n b e represente d b y Fourier series . Th e n-t h Fourie r coefficien t o f
the generatin g functio n i s
/ ( E ^ y e ( - ^ ) ^ = IF3T/ el*-™****
JO 1 r, ' ^0
tW t \ *
t — 27rin V(—Y-
fe=i
Hence th e n-t h Fourie r coefficien t o f ^ ( x ) i s — k\(2irin)~k givin g th e expansio n
27Tin
(4.16) i/; k(x) = -k\ J2( )~ke(nx)
(there i s n o ter m n = 0 by (4.1 0)) . Thi s serie s converge s absolutel y i f k ^ 2 . Fo r
k = 1 we arrang e th e term s symmetricall y gettin g th e serie s
oo
-1
(4.17) ^(x) = - ^(Trn) sin(27rn^ )
l
which converge s pointwis e an d boundedl y excep t fo r x G Z.
EXERCISE 3 . Prov e tha t fo r an y N ^ 1 and x eR
N
(4.18) VO ) = - X^ 7™)"1 sin 27rnx
( ) + °((! + INI N)-1)
1
where ||x| | i s the distanc e o f x t o th e neares t intege r
(4.19) \\x\\ = min{| z - m\ : m G Z} .
REMARK. Evaluatin g (4.1 6 ) a t x = 0 we ge t
«2™> - ^ ^
for m ^ 1 . Thi s turn s ou t t o b e als o tru e fo r m = 0 b y th e functiona l equatio n
(4.75). Moreover , b y th e functiona l equatio n (4.75 ) i t follow s tha t fo r an y m ^ 1 ,
C(l-2m) = --?-B 2 m .
Furthermore, fo r a primitiv e characte r \ o f conducto r g > 1 one ha s
- 1 / - 2 7 r n m , N _J
rn,x
L(l-m,x)= — 5 m ,x
for an y m ^ 1 with ra = ±( 1 - x ( - l ) ) (mo d 2 ) (se e e.g . [IR] , [W]) .
Integrating b y part s fc — 1 times i n (4.7 ) an d usin g (4.9 ) on e derive s th e Euler -
Maclaurin formul a o f orde r k.
4. SUMMATIO N F O R M U L A S 6 9
THEOREM 4.2 . Suppose f e C k([a, b]) with a < b, a, b e Z . Then
b / (_l\k
{k
(4.20) J2 fW = Ja (f(x)-t-j^M*)f \x))d*
Very ofte n th e firs t orde r Euler-Maclauri n formul a (4.7 ) i s a s goo d a s (4.20) .
Inserting (4.1 8 ) int o (4.7 ) an d integratin g b y part s w e ge t
COROLLARY 4.3 . For f e ^([a, b}) with a <b,a,b e Z we have
where N is any positive integer and the implied constant is absolute. Here and
hereafter the dash indicates that the terms at the end-points of summation are taken
with half values.
4.3. Th e Poisso n summatio n formula .
This sectio n i s based o n classical Fourier analysi s a s described i n the Appendix .
Recall tha t fo r an y functio n / G LX(IR) it s Fourie r transfor m i s denne d b y
(4.22) f(v) = I f(x)e(-xy)dx.
THEOREM 4.4 . Suppose that both /, / are in L l(M) and have bounded variation.
Then
m
(4-23) £ / ( ) = E/>0
where both series converge absolutely.
PROOF. Conside r th e functio n
which i s periodi c o f perio d one . Thi s ha s th e absolutel y convergen t Fourie r serie s
expansion
F(x) = y^CF(n)e(nx)
nez
with coefficient s give n b y
pl /»C O
cF(n)= F(t)e(-nt)dt= f(t)e(-nt)dt = f(n).
JO J-oo
Taking F(0 ) w e ge t th e Poisso n summatio n formul a (4.23) . •
70 4 . S U M M A T I ON F O R M U L A S
EXERCISE 4 . Deriv e (4.23 ) fro m (4.21 ) .
Changing f(x) into f(vx + u) wit h v G M + and u G R the formul a (4.23 )
generalizes to
(4.24) E / ( m , + .)_IE/ (=) e (!2).
rnGZ n £ Z
By th e Fourie r inversio n f[x) = f(—x) thi s ca n als o b e writte n a s
EXERCISE 5 . Usin g (4.24 ) prov e tha t fo r a primitive characte r \ (mo d g) ,
(4-26) J ; /MxM = ^ £ /©*(")
where r(x ) i s the Gaus s sum .
EXAMPLES. Pro m th e Fourie r pair s (4.83) , (4.84) , (4.85 ) w e obtain th e follow -
ing equations :
2
(4,7) E(l -M ) e ( „ l H „ E (Ei^>)
27r| l?
(4.28) ^ e(na;)e- " ' = (TO/)- 1 5 3 I ™ + x +i>\-2.
Tl 77 1
(4.29) Yl e(nx)e-Tn2/y = ^/y^e-*^*?*.
nm
for an y x G R and y G M + . Th e thir d equatio n generalize s (4.1 ) . Th e secon d
equation ha s a geometric serie s o n it s lef t side , s o b y a direct summatio n
sin h 2ny
Yje(nx)e-2^n ly
cosh 2ny — cos 27TX
The firs t su m ca n b e als o execute d b y usin g geometri c series ; it yields
1
(s[n7rxiy}\2 , {y} sm7rx(2[y] + 1)
E (l\ y- M\J(y\) e nx =
si n TTX ) y si n TTX
If y is a positive intege r thi s reduce s t o y _1 (sin7rx?//sin7rx) 2 .
The sam e metho d (averagin g o f integral translations ) work s in several variable s
giving
4. S U M M A T I O N F O R M U L A S 7 1
THEOREM 4.5 . Suppose f is in the Schwartz class S(R £). Then
(4.30) £ /(m ) = £ /(„) .
The formula s (4.24 ) an d (4.25 ) exten d t o severa l variable s i n a simila r fashion .
The assumptio n tha t / i s a Schwart z functio n ca n b e weakene d considerably . I n
two dimensions th e Poisso n formul a (4.30 ) i s just th e trac e formul a fo r th e Laplac e
operator an d a n invarian t integra l operato r o n th e toru s IR 2 /Z 2 , se e [14].
4.4. Summatio n formula s fo r th e ball .
If / i s radial , the n s o i s / (se e Lemm a 4.1 7) . I n thi s cas e th e Poisso n formul a
(4.30) become s a summatio n formul a fo r th e arithmeti c functio n
fi{m) = | { m i , . . . , rri£ G Z : m\ + • • • + mj = m}\.
Precisely, i f g(x) i s smooth an d compactl y supporte d o n R + , the n
oo oo
(4.31) ^2r 2k(m)g(m)mi-i = ^r 2fc (n)/i(n)n^
o o
where h(y) i s the Hankel transform o f g(x) give n by (4.1 03) . Throughou t w e assume
that £ = 2f c is a n intege r > 1 , so fc is a hal f integer .
REMARKS. Th e conditio n tha t g i s compactl y supporte d ca n b e considerabl y
weakened. I t suffice s tha t bot h g an d h ar e o f type (a , /?) with a < | + \ < (3 (see
(4.104)). T o this en d notic e tha t
1 +e
r2k(m) < r a * -
for m ^ 1 , th e implie d constan t dependin g o n e an d k (i f k ^ 2 , the n thi s hold s
without e).
Although ther e i s n o contributio n fro m m = 0 o n th e lef t sid e o f (4.31 ) ther e
is stil l a contributio n fro m n = 0 o n th e righ t sid e o f (4.31 ) becaus e h(y) i s no t
compactly supporte d i n R + eve n i f g{x) is . I n practic e thi s contributio n yield s a
main term , s o it i s desire d t o singl e ou t th e zero-t h term . W e hav e r 2fc(0) = 1 an d
rk
ft y 2 dx
( )~ Tuj J ^){^y)
as y —> 0 by the asymptoti c J u(x) ~ x v jTY{y + 1). Henc e (4.31 ) ca n b e writte n a s
THEOREM 4. 6 (SUMMATIO N FORMUL A FO R A BALL) . Suppose g is smooth and
compactly supported on R + . Then
OO £. OO
(4.32) ^r 2k{rn)g{m)m^-i = ^ y M ( ^ ) + ^r 2/c(n)/i(n)n^-*,
where M(g) is the Mellin transform of g at s — ^^, i.e.,
POO
h
(4.33) M(g) = / g{x)x ^dx,
Jo
72 4 . SUMMATIO N F O R M U L A S
and h(y) is the Hankel type transform of g
/>o o
(4.34) h(y) = 7 T / g(x)J 1k- (27Ty/xy)dx.
Jo
COROLLARY 4. 7 (SUMMATIO N FORMUL A FO R A CIRCLE) . Suppose g is smooth
and compactly supported on R+ . Then
OO pQQ OO
m
(4.35) 2 _ V ( ) # ( m ) = 7 r / g(x)dx + y^r(m)h(m)
l^ °l
where
•>o o
(4.36) % ) = IT / g(x)J 0(27ry/x^)dx,
Jo
and both series converge absolutely.
Summation formula s fo r the circle wer e establishe d i n one form o r another b y
Hardy-Landau [HaLa ] an d Voronoi [Vor] . Ther e ar e plenty o f expressions fo r the
Bessel functio n i n th e kerne l o f (4.36) . Her e w e selec t a fe w particularl y usefu l
integral representations :
TTJO(Z) = / cos(zsmO)dO
Jo
OO
{t2 -l)-^sm(zt)dt
/
« r °. , , N , ~ f°° • f zw\ f z\dw
= 2 / smize n t)dt = 2 / si n — co s — — .
Jo Jo V2 / \2wJ w
Moreover, w e have th e following asymptoti c expansio n (se e (23.451.1 ) o f [GR])
(4.37) TT J0(z) = ( ^ ) * {cos(z - f ) + 1 sin( s - f ) + O (1 ) }
valid fo r z > 0. Insertin g thi s int o (4.36 ) w e get
/•O O
/
/i(y) = / (xt/)"^(x)cos(27r v Sy- l)dx
Jo
+ x
1 67 T / ( y)~*9(x)sin(2n\^-j)dx + °{ (xy)-i\g(x)\dxy
Integrating b y parts i n the second integra l w e obtain
/>o o
(4.38) h(y) = / (x2/)- i (/(x) C O S ( 2 T T V ^ - f )dx + 0(R(y))
Jo
where
/>0 0
(4.39) R(y)= (xy)-i(\g(x)\+x\g'(x)\)dx.
Jo
Next integratin g b y parts i n the first integra l w e obtain
1 f°°
(4.40) h{y) =/ (a;y)ty(x)sin(27rV^ - f )ds + 0(R(y)).
ty Jo
4. SUMMATIO N F O R M U L A S 73
In the case of sphere (£ = 3 , k = § ) w e encounter th e Bessel function o f order | ,
which i s an elementar y function , namel y Ji{z) = (2/nz)^ sinz , s o (4.34 ) simplifie s
to
/»oo
h(y)= /
(xy)~ig(x)sin(27ry/xy)dx.
Jo
Setting G(x) = g(x)x~* an d H(y) = h(y)y* th e formula (4.32 ) withA; = | become s
COROLLARY 4. 8 (SUMMATIO N FORMUL A FO R A SPHERE) . Suppose G is smooth
and compactly supported on R + . Then
(4.41) y V 3(m)G(m) = 2T T / x*G(a:)da : + yV3(n)n-*JJ(n )
i^ oi
w/iere
/•OO
(4.42) # ( y ) = / G(x) sm{2<Ky/xy)dx,
Jo
and both series converge absolutely.
To illustrate th e result s w e apply th e summatio n formul a (4.35 ) t o improv e th e
error ter m i n th e Gaus s circl e proble m (se e (1 .70)) .
COROLLARY 4.9 . We have
r m = nX
(4.43) Yl ( ) + °(x*)-
P R O O F . W e shall establis h a n uppe r boun d fo r th e su m i n question , th e lowe r
bound ca n b e derive d b y applyin g simila r arguments . I n wha t follow s i t i s crucia l
that r(n) i s nonnegative, s o we can smoot h ou t th e su m b y enlargin g it s range . T o
this end w e choose the tes t functio n g{x) ^ 0 supported o n O ^ : r < X - H y i n whic h
segment g{x) — min{x , 1 , (X+ Y — x)Y~1}. Thi s i s not smooth , nevertheles s (4.35 )
holds true . Her e Y i s a paramete r a t ou r disposa l t o b e chose n late r t o minimiz e
the resultin g uppe r bound . W e assum e tha t H 7 ^ l 2 . B y (4.35 ) w e obtai n
r
r(m) < ^r(m)g(m) = 7 r(x+ - y - J + ^ (n)^n)'
where h(n) i s th e integra l transfor m o f g give n b y (4.36) . Usin g (4.40 ) an d (4.39 )
one show s tha t
(4.44) % ) < j r W ( l + 3//Z)- i
where Z = XY~ 2. Henc e
OO
1a b
Therefore
] T r(m) < TTX + 0(Y + X*Y~*).
m^X
Choosing Y = X^ w e balanc e th e erro r term s t o O(X^). B y a simila r argumen t
we establis h a lowe r boun d wit h th e sam e mai n an d erro r terms . Thi s complete s
the proo f o f (4.43) . •
74 4. S U M M A T I O N F O R M U L A S
EXERCISE 6 . Prov e tha t
r(n)
(4.45) T r(m) (l - - ) = ^x + V ^ J 2(27rv^) = \x + 0(x~i).
1m^.x
Note th e erro r ter m i s very smal l du e t o smoothing .
Similarly t o (4.43 ) on e ca n deriv e fro m (4.41 ) th e followin g approximat e for -
mula:
(4.46) £r 3 (m) = yX§+0(X*).
The strongest erro r term her e was obtained b y Chamizo - Iwaniec [Chi ] and Heath -
Brown [HB5] . Interestingl y enoug h i n eithe r pape r characte r sum s wer e brough t
into pla y independently ; the y obtaine d th e exponent s | | an d | | i n plac e o f |
respectively.
4.5. Summatio n formula s fo r th e hyperbola .
First w e establis h summatio n formula s fo r th e diviso r functio n r(n). The y
are essentiall y du e t o Vorono i [Vor ] excep t fo r ou r arrangement s o f th e integra l
transforms.
THEOREM 4.1 0 . Suppose g(x) is smooth and compactly supported on M + . Let
ad=l (mo d c) . Then
(4.47 )
Y^r(m)9[—)cos(——) = /\ log
~ + 2yg(x)dx
, x (n\ /27rdn\
Ex r(n) Hc) C0S l^-)'
(4.48)
m
Ei r (tm\ ) nf 7\)•s i n/27rara \ ^- v , fn\ . /27rdn\
x
( — ) = X > <n H c M — ) •
where p(y) and q(y) are the integral transforms
/>oo
p(y) = / C(27Ty/xy)g(x)dx,
Jo
^(y) = / S(27r^xy)g(x)dx,
Jo
with kernels
C(z) = 4 / cos^w ) cos ( —y ) —
Jo v ^ w
S(z) — 4 / sin(2:u> ) sin( — ) —.
wJ w
Jo ^
4. SUMMATIO N F O R M U L A S 7 5
These kernel s ar e also give n by (see (3.864.1) an d (3.864.2) o f [GRj)
C(z) = AK0(2z) - 2TTY 0(2Z),
S(z) = AK0(2z) + 27rY0{2z);
therefore bot h result s ca n also be written i n the following singl e formula :
^r(m)e(^ jg(m) = - / (logx+ 2j - 2logc)g(x)dx
(4.49) _ ^ g r(n)e(-^)jf~y0(^Vni)^)dx
4 ^. N ( dn\ f°° T ^ /4T T ^ - \ , N J
+ - /. r\n)e[—) / -ft-ol— \Jnx\g(x)dx.
We shal l deriv e (4.49 ) fro m a stil l mor e genera l summatio n formul a fo r the
tempered diviso r functio n
(4.50) T g(m)= ] P g(m um2).
mim2=m
Here g(x,y) i s a functio n o f clas s C 1 , compactl y supporte d o n R 2 . B y the way ,
the arithmeti c functio n r g(m) i s a prototype o f Fourier coefficient s o f automorphi c
forms i n the space o f continuous spectrum .
PROPOSITION 4.1 1 . Suppose g is a compactly supported function of class C l
2
onR . Let ad =1(mode) . Then
= ( n ) e
(4.5i) £ ^ M < v ) ^ (-T)
where h is given by the Fourier transform of g, namely
(4.52) h(x,y) = -g(-J).
c V c cJ
For n — 0 we have
(453) (
^<°>=/^OlH!}|;> ^>'"<'
P R O O F . Firs t w e open r g(m) o n the left sid e of (4.51) a s in (4.50) an d w e spli t
the summation int o residue classes (mi , 777-2) = (^1 , ^2) (mo d c) . The n in each clas s
we appl y th e Poisson summatio n formul a (4.24 ) gettin g
Y^rg(m)e(^ J = ^ T ^ e(^-u 1 u2J ^ ^ # ( ^ 1 + cv u u 2 + cv 2)
771
ui,u 2 (mo d c) « iV 2
= c~ 2 ^ ^ ] P ^2 e
c(auiu2 + niixi + n 2u2)gy—, — J
^ 1 ™ 2 iii,tt 2 ( m o d e )
^"TE e (44(7'7) = S^(-7)-
76 4. SUMMATIO N FORMULA S
The ter m n = 0 is special sinc e ther e ar e infinitel y man y divisor s o f zero . W e com -
pute Th(0) by reversing th e abov e analysis (o f course, thi s analysi s coul d b e avoide d
altogether, nevertheless , fo r th e econom y o f presentation sometime s a repeate d ap -
plication o f a n involutio n i s justified). Firs t w e arrang e th e summatio n ove r ni , r&2
as follows :
..I«o, 0 ) + il:j („,=) + i E ,(=,o).
nn
Then w e apply Poisson' s summatio n an d th e Euler-Maclauri n formul a (4.7 ) gettin g
/ / ( -c + { l 9{x y)dxdy
c}^) ' -
Similarly w e execut e th e summatio n o f #(^,0) . Gatherin g th e result s w e obtai n
(4.53). •
P R O O F O F (4.49) . On e woul d lik e to appl y (4.51 ) directl y fo r g(x,y) = g(xy),
but thi s i s no t possibl e becaus e suc h a functio n doe s no t hav e compac t suppor t
in R 2 eve n i f g(t) doe s hav e compac t suppor t i n R . Fo r thi s reaso n w e attac h
to #(771 1 777,2 ) redundan t factor s 77(7771 ) , 77(7712) t o localiz e th e rea l variable s x,y i n
compact segment s o f R + . Precisel y w e se t g(x,y) = 7](x)rj(y)g(xy) wher e rj(t) =
min{£/£, 1 } wit h 0 < e < 1 s o tha t r g(m) = r(m)g(m). Not e tha t rf(t) ha s
discontinuity a t t = e, bu t thi s i s allowed fo r applicatio n o f Proposition 4.1 1 as ca n
be see n fro m th e proo f b y inspectio n (w e coul d choos e rj(t) smooth , however , th e
involved computation s woul d have been less explicit). Fo r th e abov e choice we have
by (4.53 )
MO) = JJ{\ + {-C}1 + {\ }^Hy)9( Xy)dxdy
= - 9{y)L{y)dy,
where
WC^'G)^}! 3 ^))*-
Since y i s bounded fro m belo w an d above , an d e is small, w e find by examinin g th e
support o f th e involve d function s th e followin g representation :
^-/('-^{fD-x^Of-^AfWDi-
4. S U M M A T I O N F O R M U L A S 7 7
The las t integra l range s ove r x > s~ 1 y, s o it is bounded triviall y b y 0(ec/y). The n
we hav e
f°° , N /y\dx n [^ , ,dx n ,
/ ^ H x J "/ v(x)— = 2 + logy-2loge,
The par t o f th e las t integra l wit h ^ < x < 1 equals lo g | an d th e remainin g par t
equals 1 — 7, wher e 7 is the Eule r constan t (se e (1 .69)) . Fro m thes e evaluation s w e
get
L(y) = logy + 27 - 21 og c + O ( - ) .
Hence
(4.55) 77,(0 ) = - c Jg(y)(logy + 27 - 2 log c)dy + 0(e).
As £ tends t o zer o thi s give s th e leadin g ter m o n th e righ t sid e o f (4.49) .
Now w e comput e Th(n) fo r n ^ 0 . Sinc e th e factorizatio n nin 2 = n permit s
the sig n chang e o f both n\,U2 w e ca n replac e g in (4.52) b y th e cosine-Fourie r
transform, i.e. , w e ma y tak e
9 f f 9-7T
h(u,v) = - / / rj(x)rj(y)g(xy) cos— (ux + vy)dxdy.
Changing variable s w e write thi s a s
M«,«) = f/«(»)(/^(*h(f)oo8(^(« a :+^))|)dy.
If w e omit r)(x)rj(y/x), the n th e pur e cosin e integra l equal s
0
f°° /2TT / ^ ^ ^ - i " ^ ^ ^ ifw>0 ,
(see (3.871 .2 ) an d (3.871 .4 ) o f [GR], respectively , o r ou r Appendix) . Thi s yield s
exactly th e remainin g term s o n th e righ t sid e o f (4.49) .
Now w e onl y nee d t o show tha t th e distortio n introduce d b y rj(x)rj(y/x) i s
negligible. W e denot e th e differenc e b y ho(u,v) = h\(u,v) + h\(v,u), wher e
hi(u,v) = -/ ( 1 ) / g{xy) cos— (ux + vy)dydx.
c v £J c
Jo Jo
Integrating b y parts twic e with respec t t o y and the n applyin g Fubini' s theore m w e
get
e r°° f£ f x\ 2TT
2
hi(u,v)= 2 2 / / [l )x g"(xy)cos—(ux + xy)dxdy.
78 4. SUMMATIO N F O R M U L A S
Now integrating b y part s twic e wit h respec t t o x w e estimate th e inne r integra l b y
0(eu~2). Th e inne r integra l i s als o estimate d triviall y b y
2 ,f
/x \g (xy)\dx<^y-3.
Jo
Combining bot h estimate s w e arriv e a t
2 2
hi(u,v) <^v~ / mm(su~ ,y~3)dy
Jo
Next, addin g th e correspondin g boun d fo r h\{v,u) w e obtain ho(u,v) < C £ 3 | i ^ | ~ 3.
Therefore th e tota l distortio n o f the righ t sid e of (4.51 ) , which wa s caused b y intro -
duction of the localizing factors r](x)r](y) t o the left side , is bounded b y 0(e 2/3) sinc e
the serie s ^2r(n)n~ 4^3 converges . Lettin g e ten d t o zer o th e distortio n vanishe s
and w e complete th e proo f o f (4.49) , henc e als o o f Theore m 4.1 0 . •
The summatio n formul a (4.51 ) fo r th e tempere d diviso r functio n (4.50 ) wa s
established i n [DI] . Th e specia l cas e (4.49 ) i s les s flexible , bu t i t i s quit e effectiv e
when applicabl e ( a different, rathe r indirect , proo f o f (4.49 ) wa s give n b y M . Jutil a
[Jul]). W e procee d t o compos e som e variation s o n (4.49) .
First, w e ge t a summatio n formul a fo r certai n serie s o f Kloosterma n sums : i f
(c,q) = 1 , the n
2^ T(m)S(h,m;c)g(m) = -5(/i,0;c ) / (lo g \-^)g(x)dx
m^l C J 0 \ C /
(4.56) - — Vr(n)S(/i-n,0;c ) / Y 0( — y/r^)g(x)dx
C
n>l J 0 \ C J
-\—Y^ T~(n)S(h -f- n,0;c) / K$( —y/nx)g(x)dx.
for an y intege r h. T o prov e this , w e open th e Kloosterma n su m (1 .56 )
e
S(fc,ra;c) = ^ {J >
a (mo d c )
then fo r eac h a , w e appl y (4.49 ) an d the n exchang e th e orde r o f summatio n again .
Note tha t th e Kloosterma n sum s o n th e lef t sid e collapse d t o th e Ramanuja n sum s
on th e lef t side . Thi s featur e i s significant i n applications , se e e.g. [DFI2] , [KM1 ] ,
[KMV1].
Next, playin g wit h additiv e character s w e derive fro m (4.49 ) a summatio n for -
mula fo r th e diviso r functio n i n a n arithmeti c progression .
COROLLARY 4.1 2 . Let (g,r ) = 1 , and g be smooth, compactly supported on
R+. Then
(4.57 )
V r(m)g(m) = ^ / (log x + 27 - 2 V(q))g(x)dx
=±1,,^
ra=r(mod q) ? Jo
+ \ E \c E r(n) {S(r, n ; c)T+ ( J) + S{r, - „; c)T~ ( £) }
QV
c\q
4. SUMMATIO N F O R M U L A S 7 9
where rj(q) is the additive function
(4-58 ) -K.)-E^ .
P\Q
S(r,n\c) is the Kloosterman sum, and T +(y), T~(y) are the integral transforms
(4.59) T+(y) = -2T T / Y 0(4iryfty)g(x)dx,
Jo
POO
(4.60) T~(y) = 4 / K 0(4Tryfty)g(x)dx.
Jo
REMARKS. Sinc e YQ(Z) ha s th e asymptoti c expansio n (4.37 ) bu t wit h co s an d
sin interchange d (se e (23.451 .2 ) o f [GR]) , w e ge t th e approximat e formul a
T+(y) = — / (xy)ig'(x) c o s ( 4 7 r ^ - -)dx + 0(R(y))
ny Jo 4
by th e sam e argumen t whic h le d u s t o (4.40) , wher e R(y) i s given b y (4.39) . Simi -
larly on e derive s
4
T~(y) = — / (xy)ig'(x)e- ^dx + 0(R(y)).
^y Jo
EXERCISE 7 . Usin g the Wei l bound fo r Kloosterma n sum s (1 1 .1 6 ) an d th e tes t
function g(x) a s i n th e proo f o f (4.42) , deriv e fro m (4.57 ) tha t fo r (q,r) — 1 an d
X^2,
(4.61)
2
J2 r ( m ) - ^ X { l o g X + 27-l-277(g)}+ 0(r (g)(^+X*)logX)
m=r (mo d q)
where th e implie d constan t i s absolute . I n particular ,
r
(4.62) Y, (m) = x l
( °gx + 2 7 - 1 ) + O p d logX) .
This improve s th e erro r ter m i n th e Dirichle t diviso r proble m (1 .75) .
EXERCISE 8 . Prov e tha t fo r an y primitiv e characte r x(mo d q) with q > 1 , an d
any smoot h functio n g(x) compactl y supporte d o n 3R + w e hav e
oo oo
r
(4.63) ^ r(m)x(m)g(m) = 2
r (x)Q~ J2 2
(n)x(n)Mn2~2)
ii
where r(% ) i s th e Gaus s sum , an d h{y) i s th e integra l transfor m
/•O O
(4.64) h(y) = / K(2nyfty)g(x)dx
Jo
with kerne l
(4.65) K(z) = A X(-l)K0(2z) - 2TTY 0(2Z).
[Hint: Expan d \ int o additiv e character s (se e (3.1 2)). ]
80 4. SUMMATIO N FORMULA S
Next w e deriv e a summation formul a fo r
n
(4-66) T„{n, X)= £^ ^{~Y
niri2=n
where x(niod q) is a primitive character o f conductor q > 1 and v is a fixed complex
number. Thes e ar e eigenvalues o f the Heck e operators Tfi i n the spac e o f Eisenstei n
series fo r To(q) an d characte r x- I* 1 principle th e metho d tha t w e use d fo r r(n)
applies fo r 7v(n,x) , thoug h som e arithmetica l argument s nee d t o b e revise d an d
a fe w modification s i n integra l transform s ar e required . However , th e analysi s o f
convergence is the same, so here we do not repea t it . W e treat onl y the two extrema l
cases q\c and (g , c) = 1. Se e als o [KMVI ] fo r othe r simila r formulas .
THEOREM 4.1 3 . Let ad = 1 (mo d c ) and x be a primitive character of con-
ductor q > 1 with q\c. Then for any smooth, compactly supported function g on R +
we have
f^ r„(m , X )e(^)g(m) = ^ {^\{ X)L{
1 + 2i/, x) J°° g(x)x»dx
(4.67) + mf2T„(n,X)e(-^) J™g(x)J±^V^c)dx
+ ^ E ^ x ) e ( f) [ 9{x)Kt(^)dX
where r(x) is the Gauss sum, L( l + 2z/,x ) is the Dirichlet L-function, and
Kl(z) =4cos(7ru)K 2l/(z), ifx(-l) = 1,
K;v(z) = 4ism(irv)K 2v(z), » / x ( - l ) = - 1-
REMARK. Fo r i / = 0we hav e
(4-68) r(n )X ) = £x(d)-
If x is °d d (th e cas e o f modula r form s o f weigh t one) , ou r formul a reduce s t o
f>(m,x)e(^Wm) = ^-T(X)L(l,x) f°°g{x)dx
1 \ c /c J o
+ 27n^jrT(n,x)e(-^) J g(x)J 0(^-V^)dx.
4. SUMMATIO N FORMULA S 8 1
P R O O F . O n th e lef t sid e o f (4.67 ) w e spli t th e summatio n int o residu e classe s
(mi, 777,2 ) = (ui, U2) (mo d c) and fo r eac h clas s we apply th e Poisso n formul a (4.24 )
getting
— ^2^2
1 Y1Y x(ui)e c(auiu2 - rixui - n 2U2)I(n
1 ,n2)
Tii n 2 ui,u 2 (mo d c )
where I(1 21 ,71 2) i s th e Fourie r integra l
I(nlln2)= / / (- J g(xy)e c(xni+yn2)dxdy.
JO JO y
The su m ove r w 2 (mo d c ) vanishes unles s au\ = 1 1 2 (mo d c) in which cas e it equal s
c. Therefor e w e showed tha t th e lef t sid e o f (4.67 ) i s equal t o
]P^x(n2)e(—ni7i2J/(ni,n2).
n\ n 2
Note tha t x{°) — x{d)- Fro m nin 2 = 0w e ge t
X(d) "
£X(n)[/(0,n) + x(-lK(0,-n) ]
c
1
2l/
= ^(^) L ( - 2 ^ , X ) ( 1 ° ° g ( x ) x v d x ) j°°{jv + xC-lJc^Jy-^^dy .
The las t integra l i s equa l t o 2T(—2v) COSTTV or — 2ir(—2v) smirv accordin g t o
x(—1) = 1 o r x(—1 ) = — 1 b y (4.1 08 ) an d (4.1 09 ) respectively . Her e w e trans -
form L(—2i/,x) m t ° L(l + 2i/,x ) b y th e functiona l equatio n
n -1
«-*-*-^Tw^ \*Tw ,VTT VTT / t -iv(i + ^ ) / r ( f - 1 / ) , i
It!! '.
f x ( - i ) = - 1,
see (4.73) . I n th e cas e o f eve n characte r w e encounte r
2 r
(^+^)w o ^ _ , _ o =
r(-2i/)cos7n/ - 2 2 -„ 2 ^
r(-i/)
and i n th e cas e o f od d characte r w e encounte r
-2r(i + i/)w 0 ; ^ _._ 0 _ ,2
-T(-2v)smTTV = 2 -2 v
^.
r a + i/)
Hence w e conclude tha t i n bot h case s the term s wit h n i n 2 = 0 contribut e th e sam e
amount whic h i s exactl y th e firs t ter m o n th e righ t sid e o f (4.67) . Pro m n i n 2 ^ 0
we ge t
^E E X(n 2)e(--cnin2) [J(m , n2 ) + x( - l ) / ( - n i , -n 2)]
+c ^ - ^ X(»2)e(-nin 2) [ J ( - n i , n2) + X ( - l ) J(ni, - n 2 ) ] -
82 4 . SUMMATIO N F O R M U L A S
Changing th e variables (x,y) - * {u^Jvn2/n\,u~ l\Jvn\jn2) w e ge t
J ( ± n i , ±772 ) = ( — ) / g(y) I /e c(>/?jn 177,2(±u ± u~ 1 ))u2v~ldu J dt> .
Hence th e remaining term s o n the right sid e of (4.67) emerg e b y applying (4.1 1 2) -
(4.115). •
REMARK. I n the proof o f Theorem 4.1 3 we assumed tha t — v is a small positiv e
number, bu t the result extend s t o all complex v by analytic continuation .
THEOREM 4.1 4 . Let ad = 1 (mod c) and x (mo d q) be a primitive character
of conductor q > 1 with (c , g) = 1 . Tfte n /o r any smooth, compactly supported
function g on M+ we have
J2 TA™, X)e[°f)g{rn) = x ( c ) c 2 ^ 1 L ( l - 2i/, X) J 9(x)x~ v
dx
1
(4.69) +^(x)^ £-^)e(-^)/o 5 (^(^>
c
X(c).
» T W -i2 T _v(n>x) C (^)jr f l (a:)^(^)dx
c
where qq = 1 (mo d c), and J ^ , K^ ar e £fte sam e a s m Theorem 4.1 3.
REMARK. Fo r v — 0 and x °d d our formula reduce s t o
(4.70)
^c / \ /»o o
£ r ( m , X ) e ( ^ ) c , ( m ) = ^ L ( l > x ) / sO*) ^
-2^iM£T(„,Be(^)^9(l)Jo(^)fc
P R O O F . O n the left sid e o f (4.69 ) w e split th e summation int o classe s rai=
Til (mod cq)i ^ 2 = U2 (mod c) an d fo r eac h clas s w e apply th e Poisso n formul a
(4.24) gettin g
^ E E ILYI x(wi)ec(awit£ 2-yMi-n2U2)/I/(y,n2)
n\ ™ 2 n i ( m o d cq)
u-2 (mo d c)
where Iv(z\,Z2) i s the same integra l a s in the proof o f Theorem 4.1 3 . Not e tha t
The su m over u 2 (mo d c) vanishes unles s au\ = 772 (mo d c) in which cas e it equal s
c. Puttin g u\ = dn2qq + ct , where £ ranges modul o g we obtain
=
E ^ S ce(-—nin 2Jx(-cni)r(x).
4. SUMMATIO N FORMULA S 83
Hence th e lef t sid e o f (4.69 ) i s equa l t o
v v
n i ri2
This look s ver y simila r t o th e su m w e hav e alread y deal t wit h i n th e proo f o f
Theorem 4.1 3 . Therefor e (4.69 ) i s derive d fro m (4.67 ) b y makin g th e substitutio n
(x,v,d)— » (x , — v, dq) i n appropriat e places . •
Theorems 4.6 , 4.1 0 , 4.1 3 an d 4.1 4 ar e specia l case s o f summatio n formula s
for th e Fourie r coefficient s o f modula r forms . Actually , thes e result s confir m i n
themselves th e modularit y o f relevan t forms . A s a matte r o f fac t w e wer e abl e t o
establish suc h modula r relation s directl y b y applyin g th e two-dimensiona l Poisso n
summation onl y because in each case our arithmeti c functio n appear s i n the Fourie r
coefficients o f a GL2 for m whic h i s lifte d fro m GL\ forms , i.e. , th e correspondin g
L-function factor s int o two Dirichlet L- functions. Fo r genuine cusp form coefficient s
one canno t appl y th e ordinar y Poisson' s summatio n formula . I f on e know s tha t a
Fourier serie s
n
is a modular form , sa y / G Sk(q,x) ( s e e Chapte r 1 4 fo r notatio n an d surve y o f
holomorphic modula r forms) , the n th e correspondin g summatio n formul a fo r th e
coefficients A/(n ) woul d b e just anothe r expressio n fo r th e equatio n
EXERCISE 9 . Suppos e / i s a cusp for m o f weigh t k ^ 1 , leve l q ^ 1 an d
character x niodulo q. Le t c > 1 , c = 0(mo d q) an d ad = 1 (mo d c) . Prov e tha t
the Fourie r coefficient s o f / satisf y
(4.71) ±XM)e(^)9(rn) = ^±X f(n)e(^)h(n)
for an y smooth , compactl y supporte d functio n g o n R + , wher e
h(y) = 2m h / g(x)J k-i( — y/xy)dx.
Jo c
[Hint.] Appl y th e modula r equatio n fo r z = — - + — and integrat e th e resultin g
Fourier serie s o n bot h side s wit h respec t t o y agains t a suitabl e tes t functio n G(y).
If / i s primitiv e (se e Sectio n 1 4.7) , s o tha t i t satisfies th e Frick e involutio n
equation
with 1 77/ 1 = 1 (Proposition 1 4.1 4) , the n w e als o hav e
00 0 0
m
(4.72) 2 > / ( Mm) = W-
1 /2
^A/(n)%)
11
84 4. S U M M A T I O N F O R M U L A S
where h i s th e sam e Hankel-typ e integra l transfor m o f g a s abov e wit h c = ^Jq
(apply (4.71 ) fo r
(a b\ = ( 0 -l/y/q\
\c d) \y/q o ;
and chang e \(d) int o rjf t o ge t (4.70)) .
In moder n analyti c numbe r theor y ther e i s a grea t deman d fo r summatio n for -
mulas fo r arithmeti c function s o f typ e y(m) = a(m)f3(m - f h) wher e a(ra),/3(n )
are essentiall y modula r form s coefficient s an d h i s a fixe d intege r (bu t a unifor -
mity i n h i s crucial) . Thes e kin d o f problem s ar e ofte n encountere d whe n eval -
uating power-moment s o f familie s o f L-function s o n th e critica l line . See , fo r in -
stance, [DFI1 ] , [Sal] , [Ml] .
4.6. Functiona l equation s o f Dirichle t L-functions .
A summatio n formul a fo r arithmeti c function s ofte n ha s it s realizatio n a s a
functional equatio n fo r th e correspondin g generatin g Dirichle t series . Conversely ,
a functiona l equatio n connectin g tw o Dirichle t serie s lead s (b y wa y o f takin g th e
inverse o f Melli n transform ) t o a summatio n formul a fo r th e coefficien t sequences .
In thi s sectio n w e establis h thi s correspondenc e i n ful l detai l fo r th e Dirichle t L -
functions (3.6 ) usin g th e origina l metho d o f Riemann . Thes e idea s wer e develope d
further b y E . Heck e [Heel ] fo r L-function s o f numbe r field s an d fo r automorphi c
L-functions, an d late r pu t i n th e genera l adeli c settin g i n Tate' s Thesi s [Tal] .
Furthermore th e connection s betwee n functiona l equation s fo r twiste d L-function s
and modularit y vi a summatio n formula s i s th e essenc e o f th e so-calle d convers e
theorems i n th e theor y o f automorphi c form s i n general . Ther e ar e delicat e issue s
of compatibilit y whic h w e d o no t discus s here ; se e fo r exampl e th e Wei l convers e
theorem fo r GLi form s i n Theore m 1 4.21 , o r [1 4] .
Let q ^ 1 an d le t x b e a primitiv e characte r modul o q. W e le t S(x) = 0 i f
q jtz 1 and S(x) = 1 if < / = 1 , i n whic h cas e x = 1 a n d L(x , s) = ((s). Als o w e pu t
K = | ( 1 — x(—1 )) , s o K = 0 if x i s eve n an d n — 1 if x i s odd .
T H E O R E M 4.1 5 . With the above notation, L(s,x ) extends to a meromorphic
function on C which is entire if x 7 ^ 1 an d otherwise admits a unique simple pole
with residue 1 at s = 1 . The completed L-function
A(„)-(J)^r(i±i)L(.,x)
is entire ifqj^l and has simple poles with residue 1 at s = 0 and s = 1 otherwise.
Moreover, it satisfies the functional equation
(4.73) A ( s , X)=£(x)A(l-5,x)
where
(4.74) £ (x) = i -«I&.
Recall tha t th e Gaus s su m r(x ) i s o f modulu s yfq (se e (3.1 0) , (3.1 4) ) fo r x
primitive, s o tha t th e "roo t number " e(x) i s o f modulu s 1 . Theore m 3. 3 o f Gaus s
shows tha t e(x) = 1 for an y rea l primitiv e characte r x -
4. SUMMATIO N F O R M U L A S 85
For th e Rieman n zet a function , th e functiona l equatio n i s therefor e
(4.75) A(s ) - 7r- s / 2 r(s/2)C(s) - A( l - s) .
PROOF. W e sho w th e argument s fo r q =/= 1 , th e cas e o f ((s) i s onl y slightl y
different becaus e o f th e pol e a t s — 1 . Le t 9(y,x) t> e the thet a functio n
nGZ
for y > 0. B y splittin g int o progression s modul o g , w e hav e
a (mo d q)
where
K
0(y,q,a)= J2 ri e-™2y'«.
n=a (mo d q)
We appl y th e Poisso n formul a (se e (4.24) ) t o thi s sum . Th e functio n e~ nx i s it s
own Fourier transfor m (4.29) . Differentiatin g th e correspondin g Fourie r integral , i t
follows tha t th e Fourie r transfor m o f f y(x) = x Ke~nx y i s give n b y
fy(t)=i-Ky-K-hy->{t).
By (4.24 ) w e obtai n
e(y,q,a.) = i-"y-"-hq-* £ e (-)e(-;q,b)
o (mo d q)
after splittin g agai n i n progressions . Usin g th e formul a (3.1 2 ) fo r a primitive char -
acter i t follow s tha t
(4.76) HV,X) = e(x)v- K~* 0 (~>x)-
Now w e appea l t o th e Melli n transfor m formul a (se e (4.1 07) )
+ 00 7
Kp-nn2y/q (S + K)/2 ^V_
ne
to deriv e
v
^'2 J0 y
for R e (s) > 1 . Spli t th e integra l a t y = 1 and appl y (4.76 ) fo r 0 < y < 1 getting
K/2
(4.77) (~) Hs,x) = \ l +°° {e(x)0(y,x)y {1 -s+K)/2 + 0(y,x)y (s+K)/2
}j
from whic h bot h th e analyti c continuatio n o f A(s,x ) (henc e tha t o f L(s,x) ) an d
the functiona l equatio n (4.73 ) follo w sinc e 9(y,\) decay s exponentiall y fas t a t oo. D
86 4. SUMMATIO N F O R M U L A S
REMARK. Thi s proof shoul d b e compared wit h that o f Theorem 1 4.7 , the func -
tional equatio n o f Heck e L-functions .
Quite ofte n i n application s a formul a fo r th e su m o f a n arithmeti c functio n
A(n) wit h a shar p cu t n ^ x i s mor e convenien t tha n tha t wit h smoot h ending . I n
this situatio n i t i s better t o wor k wit h a n approximat e formul a havin g a good erro r
term rather tha n wit h an exact infinit e serie s which is only conditionally convergent .
Here w e stat e a classica l resul t fo r a twiste d diviso r functio n whic h ca n b e derive d
from th e functiona l equatio n fo r th e Dirichle t L-serie s (se e [FI2]) .
THEOREM 4.1 6 . Let Xi,...,X d be primitive characters to moduli q\,...,qd re-
spectively. Put
Kn) = Yl Xi(ni)>-Xd(n d).
n\ •••nd=n
For any 1 ^ y ^ x we have
Y X(n) = R(x) + w ^- ] T -jz(-) cos(2^(- j+ - )
Here q = q\ • • • q d, u = d — 3 — 2(« i + • • • + no) and w is the root number of the
L-function L(s) = L(s , xi) • • • L(s, Xd) ( w is a product of the normalized Gauss
sums). The main term R(x) is equal to the residue of s~ 1 xsL(s) at s = 1 , so
R(x) = xP(\ogx), where P(x) is a polynomial of degree ^ d. The implied constant
depends only on d and e.
1
1 d-
Estimating th e dua l su m triviall y an d choosin g y = qd+i xd+i w e ge t
h£
] T X(n) = R{x)^0(qdTixdTT- )
where the implie d constan t depend s onl y o n d and e. This , of course, i s not th e bes t
error term . Usin g exponentia l sum s method s on e ca n improv e th e resul t slightly .
The bes t erro r ter m wit h respec t t o x whic h on e ca n hop e fo r i s 0(x2 -2d - — ++ £
).
Indeed, th e first ter m o f th e dua l su m i s abou t a s large . Not e tha t th e Exponen t
Pair Hypothesi s (se e Chapte r 8 ) doe s yiel d th e bes t estimate , whil e th e Rieman n
Hypothesis doe s not !
4.A. Appendix : Fourie r integral s an d series .
Let L 1 (M) denot e th e spac e o f Lebesgue integrabl e function s o n R. Th e Fourie r
transform o f / G L1 (R) i s define d b y
(4.78) Tf(y) = f(y) = f f(x)e(-xy)dx.
Jm
4. SUMMATIO N F O R M U L A S 87
Though f(x) i s no t necessaril y continuou s it s Fourie r transfor m f(y) i s uniforml y
continuous an d f(y) —) • 0 as \y\ —> o o (th e Riemann-Lebesgu e lemma) . Th e convo -
lution o f tw o function s / , g G L1 (R) i s defined b y
(4.79) (f*g)(x)= If{*-y)g{y)dy
for almos t al l x, an d i t belong s t o L 1 (R), indee d th e L 1 -norm satisfie s | | / * p | | i ^
H/llill^lli- Th e convolutio n i s a smoothin g operator , precisely , i f g ha s th e firs t j
derivatives i n L 1 (R), the n s o doe s f*g fo r an y / i n L X (R).
The Fourier transform f(y) determine s its original f(x). Precisely , i f / G L1 (R),
then
/ (l-y)Kv)e(xy)dy^f{x)
as Y —> o o i n th e L 1 -norm. I f bot h / , / ar e i n L 1 (R), the n th e abov e integra l
converges uniforml y i n x givin g
(4.80) f(x) = j f{y)e(xy)dy.
JR
In othe r words , f(x) = f(—x). Thi s i s the inversio n formul a fo r th e Fourie r trans -
form. Th e Fourie r transfor m o f p(Y) = max{l — M^ , 0 } is calle d th e Feje r kerne l
Y
We have f*g = f-g, i n particular, <p*f = p-f, whic h show s tha t th e linea r spac e
of function s i n L 1 (R) havin g compactl y supporte d Fourie r transfor m i s dense .
Here ar e basi c Fourie r pairs :
{ 1 if \x\ < 1
sin 2iry
(4.82) f{x) = \ i f M = i f(y) iry
v 0 if |x | > 1 ,
2
(4.83) / ( * ) = m a x { l - H , 0 } , f(y) = ( ! ^ ) ,
(4.84) /(x ) = e- 2^l, /(y)=7r- 1
(l + y 2)-1 ,
v
(4.85) f(x) = e-™\ Hv) = e-* \
1
(4-86) /(* ) = -J—, f(y)
ch TTX c h Try
There i s a correspondin g Fourie r analysi s o f function s o n th e circl e T =
These ar e regarded a s functions o n R which ar e periodic o f period one . Th e Fourie r
coefficient o f a functio n / G Ll(T) i s define d b y
(4.87) c n(f) = [ f(x)e{-nx)dx.
Jo
88 4 . SUMMATIO N FORMULA S
They for m th e Fourie r serie s fo r /
(4.88) f(x)^Y,Cn(f)e(nx)
n£Z
which i s nothin g bu t a forma l expressio n a s lon g a s th e convergenc e i s no t estab -
lished. Th e Riemann-Lebesgu e lemm a assert s tha t c n{f)— > 0 a s \n\ —> oc , and , o f
course, thi s i s not sufficien t fo r th e convergenc e o f (4.88) . Nevertheless , th e Fourie r
coefficients c n(f) determin e / , precisely , i f c n(f) = 0 fo r al l n , the n / = 0 almos t
everywhere.
The convolutio n
(4.89) (f*9)(*)= I f(x-y)g(y)dy
Jo
is defined almos t everywhere , an d ha s it s Fourie r coefficient s equa l t o th e produc t
Cn(f*g) =Cn(f)Cn(9)-
In particular , convolvin g / G Ll{T) wit h a trigonometri c polynomia l
P c
N(X) = Yl n(P)e(nx)
\n\^N
gives a trigonometri c polynomia l
(f*PN)(x)= ^2 c n{f)cn(PN)e(nx).
\n\^N
This trick solve s many summabilit y problems . Th e mos t popula r i s the Feje r kerne l
N
(4.90) F N(x) = £ ( l " y )e(nx) = {-^r) •
\n\^N
This produce s th e Feje r partia l sum s
(4.91) J2 (l-^)c„(/)e(nx )
\n\^N
which d o converg e t o f(x) i n th e L 1 -norm o n T . Th e exac t partia l sum s o f th e
Fourier serie s fo r / ca n b e tailore d similarl y b y convolvin g / agains t th e Dirichle t
kernel
/.^s^ / x v ^ / x sin7r(27 V + l)x
4.92 D N(x) = > e(nx) = ^ —
*-^ sin7n r
for an y positiv e intege r N.
Concerning a pointwis e convergenc e fo r symmetri c partia l sum s w e hav e
c
(4.93) J2 nU)<nx) -+ § ( / (* + 0) + f{x - 0) )
\n\^N
for al l x G T provide d / G L l{T) ha s bounde d variation . Th e convergenc e i s
uniform o n close d interval s o f continuit y o f / .
4. SUMMATIO N F O R M U L A S 89
In man y way s th e proble m o f representin g a periodic functio n b y it s Fourie r
series simplifies i f / i s square integrabl e becaus e L 2(T) i s a Hilber t space , th e inne r
product bein g
(f,9) = / f(x)g(x)da
and th e additiv e character s e n(x) = e(nx) for m a complete orthonorma l system .
Note tha t ||/|| i ^ ||/|| 2 b y Cauchy-Schwar z inequality , s o L 2(T) C L1^). Fo r an y
/ , g i n L 2 (T) w e have th e Parseva l formul a
(4.94) 5> n (/)c n ( 5 ) = {f,g).
n
Hence th e Fourie r coefficient s ar e squar e summable . Th e symmetri c partia l sum s
of th e Fourie r serie s converg e t o th e functio n i n th e L 2 -norm.
Next w e conside r function s i n severa l variable s x = ( x i , . .. ,#& ) G R fc. Say
/ : Rk -» C i s a Schwartz functio n i f
A
(4.95) f^(x)^\x\-
for an y a = ( a i , . . . , o^) an d A ^ 0, wher e \x\ 2 = x\ -f • • • + x\. Le t 5(R fc ) denot e
the clas s o f Schwart z functions . Fo r an y / G S(W k) w e se t th e Fourie r transfor m
(4.96) f(y) = f /(.x)e(—x • y)dx
where x -y — x\y\ -\ h XkVk is the scala r produc t i n R k. Th e Fourie r transfor m
A;
maps th e spac e *S(IR ) into itsel f (chec k (4.95 ) fo r f(y) b y partia l integration) , an d
it ha s th e followin g properties : f(x) — f(—x\ (f,g) = (f,g), explicitly ,
(4.97) f{x) = [ f(y)e(x • y)dy,
(4.98) / f{x)g{x)dx = f f(y)g(y)dy.
JRk JR k
In othe r words , th e Fourie r transfor m i s a n isometr y o n <S(M fc). Th e convolutio n
(4.99) (f*9)(*)= [ f(x-y)g(y)dy
is turne d int o th e produc t T(f • g) = f • g.
One ma y thin k o f R^ a s a homogeneous spac e acte d o n b y th e grou p o f trans -
lations G = R k. I n additio n t o th e translation s th e orthogona l grou p SO(R k) act s
on R k by rotation s makin g R k the euclidea n space . Th e riemannia n metri c o n R k
(of curvatur e zero ) i s given b y th e differentia l ds 2 = (dxi) 2 H h (dxk)2:, an d th e
corresponding Laplac e operato r i s
d2 d 2
^• 10 °) D =
M + '" + M-
The exponentia l function s <p(x) = e(x • y) fo r y G R fc ar e eigenfunction s o f D,
precisely (D + X((p))(f = 0, wit h eigenvalu e X(ip) — 47r 2|y|2. A n importan t poin t i s
that th e Laplac e operato r i s rotation invarian t (i.e. , th e rotation s ar e isometrie s o f
90 4 . SUMMATIO N F O R M U L A S
Rfc), therefor e th e Fourie r transfor m o f a radia l functio n i s als o radial . Precisely ,
employing th e Fourier inversio n (4.97 ) on e shows
LEMMA 4.1 7 . Let k^2. Put v = § - 1 . Suppose
(4.101) f(x)=g(\x\*)\xr
where g is a smooth compactly supported function on R+ . Then
2
(4.102) f(y) = h(\y\ )\y\-"
with
(4.103) h(y) =7T / J v(2v^y)g(x)dx
Jo
where J v(x) is the Bess el function of order v.
If P i s a homogeneous polynomia l i n k variable s an d complex coefficient s suc h
that DP = 0 (it i s called a spherical harmonic) , the n w e have th e Fourier pai r
d
F(x) = P{x)e~^\ F(y) = i P(y)e-*M2
where d = de g P. Thi s self-dualit y (du e to Hecke ) actuall y characterize s spherica l
harmonics. I n principl e i t say s tha t th e multiplication b y spherical harmonic s ha s
little effec t o n Fourie r transform . Fo r example , Lemm a 4.1 7 generalize s t o th e
following equatio n (du e to Bochner )
T(P(x)g(\x\2)\xD = P(y)h(\y\ 2
)\yr
where h i s give n b y (4.1 03 ) wit h th e Besse l functio n o f orde r increase d fro m v t o
v + d (recall tha t v = § — 1). Thes e fact s appea r naturall y i n the context o f unitary
representations.
We en d thi s surve y o f classica l Fourie r analysi s b y comment s o n th e Melli n
transform o n R+. Sa y / : R+ - > C is of type (a, /?) if
(4.104) f(y)y 1s
~ e L X (R+) fo r a < Re s < f3.
For suc h / t h e Melli n transfor m
/>oo
1s
(4.105) M(f)(s) = / f(y)y - dy
Jo
is defined fo r complex variable s in the vertical strip a < Re s < /?. Clearly M(f)(s)
is holomorphi c i n this strip . Sinc e th e Melli n transfor m o n R + i s merely a versio n
of th e Fourie r transfor m o n R derive d b y th e chang e o f variable s y — ex, s = it,
the result s fo r th e latte r translat e appropriatel y fo r th e former . Fo r example i f /
is continuou s o f typ e (a , ft) and ha s bounded variation , the n th e Melli n inversio n
formula holds :
(4.106) /(!/ ) = 5 ^ / M(f)(8)y-ds
where (a) indicate s tha t th e integration i s on the vertical lin e R e (s) = a.
4. S U M M A T I O N F O R M U L A S 9 1
Here i s a selectio n o f Melli n transform s fro m Chapte r 3 o f [GR ] (formula s
(381.4), (761 .9) , (761 .4) , (41 1 .3) , (523.3) , (222.2) , (293.3) , (871 .4) , (871 .3) , (871 .2) ,
(871.1) respectively )
•>o o
y1
8
(4.107) / e- y - dy = T(s), a > 0,
Jo
1
s
(4.108) / (cosy)y - dy = T(s)cos~- , 0 < cr < 1 ,
Jo 2
/>00
s l
(4.109) / (smy)y - dy = 1
r(s)sm—, - <a < 1,
(4.110) / (l + y)- 1 ys-1 dy=-. , 0 < a < 1,
(4.111) / ^( l + y ) ^ - 1 dy
. = —:, - 1 < cr < 0.
For x > 0 we hav e
1 7TS
(4-112) J cos(^(y-^)y- dy = 2K(x)cos—,
s(:
7T 5
(4.113) ^ sin( | (y - fyv^'dy = 2K s(x) si n
7TS
(4.114) / cos(-(y + -jjy s~1 dy = -7rJ s(x)sm-— -7ry s (x)cos
7T 5
(4.115) / sin( | [y + -^y s~xdy = TTJ S{X) CO S y - 7ry s(s) sin
T"
In th e las t fou r formula s — 1 < cr < 1 an d J s , / ^ , ! ^ ar e th e standar d Besse l
functions.
In th e las t tw o formula s w e als o hav e
(4.116) J s (x)Siny+ys(a;)cosT= 2 s i n ^•
and
(4.117) J J x j c o s — — Y s(x) sm — = — .
v) s V) y J
2 2 2cos ^
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/06
CHAPTER 5
CLASSICAL ANALYTI C
THEORY O F L-FUNCTION S
We wil l no w discus s a ver y classica l par t o f analyti c numbe r theory . Indeed ,
most o f th e result s discusse d i n thi s chapte r ca n b e trace d bac k i n som e for m
to Riemann' s Memoi r o n th e zet a function . The y wer e subsequentl y extende d
to Dirichle t characters , wit h tw o essentiall y ne w phenomeno n appearing . On e i s
that uniformit y o f estimate s wit h respec t t o th e conducto r i s vital . Th e other ,
not unrelated , i s th e stubbor n appearanc e o f th e so-calle d exceptiona l zero s (o r
Landau-Siegel zeros ) fo r quadrati c characters . Rulin g ou t th e existenc e o f suc h
zeros remain s on e o f th e mai n problem s o f numbe r theory .
Nowadays mor e refine d L-function s occu r frequentl y (se e Chapter s 1 4 an d 1 5
for a survey o f automorphic forms , on e of the sources for L-functions) , bu t i t i s well-
known tha t th e classica l analyti c result s remai n vali d i n grea t generality . Findin g
explicit statement s i n th e literature , unifor m i n al l parameters , i s howeve r some -
times difficult . Fo r thi s reaso n w e prov e al l mai n result s i n a genera l abstrac t
context. Then , startin g fro m Sectio n 5. 9 , we survey th e mos t importan t classe s o f
L-functions tha t aris e i n practice :
(1) Dirichle t L-functions ; se e Sectio n 5.9 .
(2) Dedekin d zet a function s an d L-function s o f Hecke Grossencharaktere n o f
number fields ; se e Sectio n 5.1 0 .
(3) Th e automorphi c L-function s o f classica l cus p form s (tha t i s t o sa y o f
holomorphic an d Maas s form s o n GL(2)) ; se e Sectio n 5.1 1 .
(4) Genera l automorphi c L-function s o n GL(ra) , m ^ 1 ; see Sectio n 5.1 2 .
(5) Th e L-function s o f algebrai c varietie s an d Galoi s representation s (al -
though mos t o f thos e ar e onl y conjecture d t o fi t th e framewor k discusse d i n thi s
chapter); se e Section s 5.1 3 an d 5.1 4 .
In eac h o f these case s we spell out th e concret e meanin g o f the abstrac t results ,
and giv e a fe w additiona l results .
5.1. Definition s an d preliminaries .
In orde r t o trea t al l th e L-function s tha t interes t u s i n on e stroke , w e in -
troduce som e abstrac t definitions , althoug h w e ar e no t tryin g t o creat e axioms .
The alternativ e i s to introduc e automorphi c L-function s a t th e outse t (se e Sectio n
5.12), bu t som e translation woul d stil l be require d fo r th e mos t classica l case s (an d
Rankin-Selberg convolution s woul d requir e a separat e treatment) . W e tr y t o bal -
ance familiarit y an d eas e o f us e wit h sufficien t generality . Th e reade r unfamilia r
with L-functions , o r acquainte d onl y wit h Dirichle t L-functions , shoul d rea d th e
introductory paragraph s o f Sectio n 5. 9 first , the n rea d wha t follow s wit h thi s im -
portant cas e i n mind . Basi c idea s ar e alread y presen t i n th e contex t o f Dirichle t
L-functions. W e the n encourag e hi m o r he r t o ge t acquainte d wit h automorphi c
93
94 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
L-functions (se e [BG ] for instance) . Th e analog y with Dirichlet o r Hecke character s
on ideal s i n a number field d o no t reall y revea l th e beaut y an d th e dept h o f the
genuine modula r forms . Thi s i s especially tru e whe n Rankin-Selber g L- functions
occur, whic h ar e instrumenta l fo r establishin g zero-fre e regions .
We denot e L-function s b y L(/ , s), L(g, s) , althoug h th e symbol s /, g carry n o
specific meanin g unti l Sectio n 5.9 . The y ar e use d merel y fo r th e suggestio n tha t
L-functions usuall y aris e a s attache d t o som e interestin g arithmeti c object .
We sa y tha t L(f,s) i s an L- function i f we hav e th e followin g dat a an d condi -
tions:
(1) A Dirichle t serie s wit h Eule r produc t o f degre e d ^ 1 ,
(5.1) L(f, a) = £ X f(n)n-S = JJ (1 - ^(p)?-)" 1 • • • (1 - ^(pjp-)" 1
with A/(l ) = 1 , Xf(n) G C, cti(p) G C. Th e serie s an d Eule r product s mus t be
absolutely convergen t fo r Re (s) > 1. Th e cti(p), 1 < i < d, are calle d th e loca l
roots o r loca l parameter s o f L(f, s) a t p , an d the y satisf y
for a1 1
(5.2) |<*i(p) l <P P-
(2) A gamm a facto r
ds/2
(5-3) 7 ( / , S)=7r- nr(£^)
J =l
where th e number s Kj G C are calle d th e loca l parameter s o f L(/, s) a t infinity.
We assum e thes e number s ar e eithe r rea l o r come i n conjugate pairs . Moreover ,
Re (K,J) > — 1. Thi s las t conditio n tell s u s tha t 7 ( / , s) ha s n o zer o i n C an d n o pol e
for R e (s) ^ 1 .
(3) A n intege r q(f) ^ 1 , called th e conducto r o f L(/ , s), suc h tha t ai{p) ^ 0
for p \ q(f) an d 1 ^ i ^ d. A prime p \ q(f) i s sai d t o b e unramified .
Prom these , th e so-calle d complet e L- function
(5.4) A(f,s) = q(f)2 1 (f,s)L(f,s)
is defined . Clearly , i t is holomorphic i n the half-plan e R e (s) > 1, ye t it mus t
admit analyti c continuatio n t o a meromorphic functio n fo r s G C of order 1 (see
the Appendix) , wit h a t most pole s a t s = 0 and s = 1. Moreover , i t must satisf y
the functiona l equatio n
(5.5) A(f, 3)=e(f)A(f,l-s).
where / i s an objec t associate d wit h / (th e dua l o f /) fo r whic h Xf(n) = A/(n),
7 ( / , 5 ) = y(f.s). q(f) = q(f) and s(f) is a complex numbe r o f absolute valu e 1,
called th e "roo t number " o f L(/ , s).
We le t r(f) denote th e orde r o f the pol e (i f positive ) o r zero (i f negative) of
A(/, s) at s = 0 and 5 = 1 , equal because o f (5.5) . Sinc e 7(7,1 ) 7 ^ 0, 00, r(f) is also
the orde r o f the pol e o r zer o of L(/ , s) at s =1. Late r w e shall sho w i n al l concret e
cases tha t L(/ , s) doe s no t vanis h a t s = 1, so r ( / ) ^ 0.
We see k unifor m estimate s fo r various analyti c quantitie s relate d t o L(f,s).
For a n individua l L- function, th e onl y paramete r i s s G C, bu t whe n L(f,s) varies ,
we also have to deal with th e degree, conductor , loca l parameters, an d eac h of thes e
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 95
or a combinatio n ma y be unde r investigation . I t turn s ou t tha t mos t result s fo r
L(f, s) are expressed convenientl y i n terms o f the analytic conductor . Firs t pu t
d
(5.6) qoo(5 ) = n^ + ^' l + 3 )-
i=i
Multiplying thi s b y q(f) w e get the analytic conducto r
d
(5.7) q(/ , s) = g(/)qo o W - q(f) U (\s + K,\ + 3).
j=i
We als o denot e
d
q(/) = q(/,0) = <?(/) n(K-1 + 3)-
Note tha t q(/ ) ^ 3 d g(/), s o d < logq(/), an d tha t
d
(5.8) q ( / , S)<q(/)(|S|+3) ,
so estimates could be performed i n terms of this last quantit y without compromisin g
much o f strength .
From th e definition , w e see that i f L(/ , s) an d L(g,s) ar e L-functions , the n
L(f,s)L(g,s) i s one with conducto r q(f)q(g) an d analytic conducto r q(/ , s)q(#, s),
gamma facto r 7(^,5)7(^,5) , roo t numbe r s(f)e(g). Moreover , L ( / , 5) i s als o a n
L-function b y construction, wit h sam e degree , conductor , gamm a facto r an d wit h
s(f) — e(f). Als o i f L(/ , s) i s entire , the n fo r an y fixe d t G R, th e shifte d L -
function L(g, s) = L(/, s + zt)L(/, 5 — it) is another L-function , wit h gamm a facto r
7(17, 5) = 7'(/ , s + it)j(f, s — ii), conducto r q(g) = q(f) 2, roo t numbe r e{g) — 1. The
analytic conducto r i s q(# , 5) = q(/ , 5 + it)q(f, s — it). Th e condition tha t L(/ , 5)
be entir e i s required becaus e w e are not allowe d t o shif t th e poles. W e take th e
product o f two shifte d L-function s b y it an d —it to maintain th e local parameter s
coming i n complex pairs . O f course, thes e ar e only cosmeti c arrangements .
In the sequel we often simplif y th e above notatio n b y not displaying th e depen-
dence o n /, 5 ; we write q = q(f), q^ = q ^ s) , q = q(/ , 5), r = r(f).
If / = / , the n L(/ , 5) is said to be self-dual. Thi s means that th e Dirichlet serie s
of the L-function ha s real coefficients . Fo r a self-dual L-function , th e root numbe r
is real , henc e e(f) = ± 1 . I t i s the n calle d th e sig n o f th e functiona l equation .
The followin g observatio n i s originall y du e to Shimura ; despit e it s simplicity , i t
is significan t i n a numbe r o f application s (se e Sectio n 23. 6 and Chapte r 2 6 for
instance):
PROPOSITION 5.1 . Let L(/, 5) be self-dual with s(f) = - 1. Then L(/ , | ) = 0.
PROOF. B y definition w e have 7 ( /, |) 7 ^ 0, and the functional equatio n applie d
to th e central poin t 5 = \ yield s L ( / , |) = — L(/, | ) , hence L ( / , \) = 0 . •
The loca l root s ai(p) ar e well define d u p to permutatio n o f the indices. I f for
any i w e have |a^(p) | = 1 for al l p \ q(f) an d |a^(p) | ^ 1 otherwise, the n L(/ , 5)
is sai d t o satisfy th e Ramanujan-Petersson conjecture . Thi s implies , i n particular ,
96 5 . CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
that |A/(n) | < Td(n). Similarly , i f fo r an y j w e hav e Re(Kj) ^ 0 , the n L(f,s) i s
said t o satisf y th e (generalized ) Selber g Conjecture , o r th e Ramanujan-Petersso n
conjecture a t th e infinit e place . I n thi s cas e j(f, s) ha s n o pole s fo r R e (s) > 0.
Because A(/ , s) = 7 ( / , s)L(/, s) i s assumed t o b e holomorphic , excep t possibl y
at s = 0 and s = 1 , it follows that pole s of 7(/, s) a t s ^ 0 are zeros of L ( /, s). Thes e
are called th e "trivia l zeros" . The y ar e located exactl y a t th e point s —2 m — K,J ^ 0 ,
1 ^ j ^ d , wher e ra ^ 0 i s a n integer . Fo r example , th e trivia l zero s o f £(s ) ar e
at s = - 2 , - 4 , - 6 , . . . whil e £(0 ) = —\. Th e othe r zero s o f L ( / , s) ar e calle d th e
non-trivial zeros . The y ar e locate d i n the critica l stri p 0 ^ R e (s) ^ 1 . Presumabl y
some trivia l zero s ma y la y i n th e critica l stri p a s well , however , on e conjecture s
they ar e neve r i n th e interior .
As mentioned i n th e introduction , w e will discus s example s startin g i n Sectio n
5.9. I t i s important t o note that man y interestin g L-serie s exis t beyon d ou r context ,
either becaus e o f lac k o f strengt h i n ou r fingers t o prov e th e require d assumption s
(for instanc e Arti n L-function s fo r non-trivia l irreducibl e character s ar e no t know n
to b e entir e i n general , an d genera l L-function s o f varietie s ar e no t eve n know n t o
be meromorphic , se e Section s 5.1 3 an d 5.1 4) , o r becaus e som e o f th e condition s
fail. Fo r instance , i n th e secon d clas s com e Dirichle t L-function s o f non-primitiv e
characters (the y hav e Eule r produc t bu t th e functiona l equatio n ha s extr a factors) ,
L-functions o f genera l modula r form s (se e Theore m 1 4. 7 fo r instance ) whic h hav e
no Euler product s an d a functional equatio n relatin g L(/ , s) wit h som e function no t
directly relate d t o L ( / , s). Stil l interesting ar e partial zet a function s o f ideal classe s
of numbe r fields (se e (22.55 ) fo r imaginar y quadrati c fields) whic h hav e functiona l
equations (relatin g a n idea l clas s a wit h D — a , wher e c ) is th e different) , bu t n o
Euler produc t i f the clas s numbe r i s > 1 . However , thes e partia l zet a function s ca n
be reconstructe d a s linea r combination s o f th e ful l L-function s fo r th e clas s grou p
characters.
The moder n analyti c theor y o f L-functions i s much influence d b y two concepts :
that o f a famil y o f L-functions , an d tha t o f (Langlands ) functoriality . Th e forme r
lacks a formal definitio n bu t i s a strong guiding principle (se e [M2]) . I n this book we
will indicat e whe n example s o f familie s appear . Not e tha t i n thi s context , varyin g
the imaginar y par t t o f s should coun t a s varying i n a family. Th e mai n parameter s
in a famil y o f L-function s ar e encode d i n th e analyti c conductor .
Functoriality, o n th e othe r hand , reflect s th e principl e tha t sometime s ne w L -
functions ar e buil t ou t o f ol d one s b y makin g som e operatio n (simpl e looking , ye t
quite subtle ) o n thei r coefficient s A/(n) , suc h a s raisin g t o som e powe r (Xf(n k)
or Xf(n) k). A prope r formalizatio n require s quit e majo r notatio n an d setup , an d
the naiv e idea s mus t usuall y b e changed ; moreover , muc h o f th e globa l settin g i s
still conjectural. W e will do here with a simple-minded definitio n o f Rankin-Selber g
type L-functions, an d mentio n elsewhere the symmetric square an d other symmetri c
power L-function s (se e Sectio n 5.1 2 an d th e discussio n o f the Sato-Tat e conjectur e
in Chapte r 21 ) .
Let L ( / , s) an d L(g,s) b e L-functions o f degrees d and e , with local component s
at infinit y KI an d v^ an d loca l root s (a^(p) ) an d {/3j(p)) respectively . Fo r p \
q(f)q(9), le t
1
(5.9) L p(f ® g, s) = J] ( 1 - OiipWjWp-')- .
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S 97
We say that / an d g have a Rankin-Selberg convolutio n i f there exists an L-functio n
L(f ® g,s) o f degre e de suc h tha t
8
L(f®g,s)= n L p(f®g,s) J ]H p(p~ )
p\q(f)q(g) p\q(f)q(g)
where
de
s
(5.10) fl p(p- ) = n ( l - 7 j ( p ) p - a r 1 wit h| 7 ,-(P)|<P.
3=1
The gamm a facto r mus t b e writte n a s
7(/®,lS) = , -f a / 2 n r ( ^ )
with Re(fiij) < Re(« i + fj) an d |/x^- | ^ |K; | + | ^ j | , where ^ an d Vj ar e th e loca l
components a t infinit y o f / an d g. I n addition , th e conducto r o f / ® g must divid e
q(f)eq(g)d an d L(f<8>g, s) mus t hav e a pole at s — 1 if # = / . I n fact, unles s L(/ , 5)
or L(# , 5) factor , w e will se e tha t i n concret e case s L(f (g ) g, s) i s entire i f g ^ / .
We cal l L(f ® g,s) th e Rankin-Selber g L-functio n o r convolutio n o f / an d g ,
or th e Rankin-Selber g squar e i f g = f.
Note tha t i f L{f <S> / , 5) o r L(f 0 / , s) exists , the n th e estimate s o n th e loca l
roots an d o n Kj ca n b e improve d t o |c^(p) | < -y/p and R e (KJ) > — \.
The condition s als o impl y th e inequalit y fo r th e analyti c conducto r
e
(5-11) *(f®g:S)^q(f) q(g)d(\s\ + 3) de
(see (5.8)) . Althoug h it i s by no means obvious that L(f®g,s) shoul d exist, i t is the
case whe n L(/ , s) an d L(g, s) ar e automorphi c //-functions , a s discusse d i n Sectio n
5.12. Thi s cover s mos t case s o f importanc e i n analyti c numbe r theor y (includin g
the L-function s o f numbe r fields). Notic e tha t th e dua l L-functio n i s L ( / 0 g, s).
5.2. Approximation s t o L-functions .
We begi n wit h a crud e resul t whic h will b e use d extensively , an d improve d
later.
LEMMA 5.2 . Any L-function L(f, s) is polynomially bounded in vertical strips
s ~ a + it with a ^ a ^b, \t\ > 1 .
PROOF. Th e L-functio n i s bounded i n the half-plan e a ^ 1 + e by the absolut e
convergence of the Dirichle t series . Henc e we deduce a polynomial boun d fo r L(/ , s)
in the half-plane a < — e by the functional equatio n an d the Stirling formula (5.1 1 4) .
Then the polynomial bound in the remaining strip follows by the Phragmen-Lindelo f
principle (se e Theore m 5.53) . •
We no w stat e a n exac t formula , know n a s th e "approximat e functiona l equa -
tion", whic h gives analytically convenien t expression s fo r L(/ , s) i n the critica l stri p
where th e serie s doe s no t converg e absolutely .
98 5 . CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
THEOREM 5.3 . Let L(f,s) be an L-function. Let G(u) be any function which
is holomorphic and bounded in the strip — 4 < Re (u) < 4, even, and normalized by
G(0) = 1 . Let X > 0. Then for s in the strip 0 ^ a ^ 1 we have
( , 12 , «,,., =? M0V.(^)+^..,E^-.(^)+I»
where V s (y) is a smooth function defined by
j(f,s + u)du
(5.i3) v s(y) = ^i J v uG(u)
7C/» u
(3)
and
1
(5-14) e(f,s)=e(f)q(f) 2-^^s)S).
The last term R = 0 if A(fJs) is entire, otherwise
« = ( re s + „ )^L^m x..
S
\u=l — s u= — s/ 0 /Z7 f,S ) U
PROOF. Conside r th e integra l
I(XJ,s) = -±i [x»A(f,s + u)G(u)^.
(3)
This integra l exist s becaus e A(/ , 5) decay s rapidl y a s t —> +0 0 fo r fixed a b y
Stirling's formul a (se e (5.1 1 3)) . Fo r th e sam e reaso n on e ca n mov e th e integratio n
to R e (u) = — 3 by Lemm a 5.2 . Applyin g th e functiona l equatio n ther e yield s
s/
(5.15) A(/ , s) = I(X, f, s) + e(f)I(X-\f, 1 - s) + Rq Hf,«)
where A(/ , s) come s fro m th e simpl e pol e o f u~ 1 G(u) a t u = 0 and th e las t ter m R
comes fro m possibl e residue s at u = 1 — 5 and u = — s if A(/, s) i s not entire .
By expandin g int o absolutel y convergen t Dirichle t serie s w e hav e
n>l (3 )
We d o th e sam e fo r I(X : , /, 1 — 5) an d combin e the m wit h (5.1 5) . Dividin g bot h
sides b y g s / 2 7(/, s) yield s (5.1 2) . •
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 99
EXERCISE 1 . Let L(f,s) b e an L-function whic h i s entire. Fo r a smooth ,
compactly supporte d functio n F on M+ with Melli n transfor m F, prove tha t
(5.16) Y. WW = e_U±
^ £ M»W»)
where
(3)
(More explici t formulas , i n the case of classical automorphi c forms , ar e described
in Sectio n 4.5 ; in particular, se e the formula (4.71 ) i n which th e test functio n goe s
through th e Hankel transform) .
Show tha t i f L(/, s) = £(s) is the Riemann zet a function , (5.1 6 ) hold s wit h an
additional ter m
/>+o o
R = re s C(s)F(s) = / F(y)dy
s==1
Jo
on th e right side . Prov e tha t i n this cas e
r+oc
H(y) = 2 / F(y) cos(2irxy)dy,
Jo
/o
and (5.1 6 ) i s the Poisson summatio n formula .
E X E R C I S E 2 . Le t L(/, s) be the L-function associate d wit h a holomorphic prim -
itive cus p for m / o f weight k = 2 and level g (think o f the Hasse-Wei l zet a functio n
of a n elliptic curve) . Deriv e fro m (5.1 2 ) th e following formul a fo r the central valu e
where X is any positive number [Hint : tak e G(u) = 1.] Differentiatin g wit h respec t
to X this yield s the summation formul a
A/(n) / 2nn \2 xV^A/(n) / 2irnX\
E /jP
Taking X = 1 we get for self-dual L-functio n wit h roo t numbe r e(f) — 1 ,
I( /.i)= S E^-p(-^)-
For suitabl e tes t function s G(u), both sum s in (5.12) ar e effectively limite d to
the term s wit h n < C \Al(/> s )* W e shall se e this clearl y fo r a particular choic e of
G(u),
G(u) = (co s —J
where A is a positive integer .
100 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
PROPOSITION 5.4 . Suppose R e (s - f K,J) ^ 3 a > 0 for 1 < j ^ d. Then the
derivatives ofV s(y) satisfy
a
(5-17) y V}a\y)^(l + J=y A
,
a
(5-18) y V}a\y) = 5 a + 0((^=y) V
^ \Alo o 7 7
where 5Q = 1 , S a = 0 if a > 0 an d £/i e implied constants depend only on a, a, A
and d. Recall that q^, = qoo(s ) is given by (5.6).
P R O O F . Fo r s an d u wit h R e (s) = a > 0 an d R e (u) = (3 > —a we deriv e b y
Stirling's formul a (5.1 1 2 )
~ W« ' i exp (2 (|s| -| s + u | ) )
«(H+3)' 3 exp(|H;
where th e implie d constan t depend s onl y o n a an d /3 . Henc e
7(/,s + w ) 5/ /7r d
<q^2exp(y|U|).
7(/,s)
We hav e
s
yavs{a\y) = ^ Jy- u
G{u){-ufsalUi + u )du
7(/,s) ^ '
(3)
Moving th e integratio n t o th e lin e R e (u) = fi = —a we derive (5.1 7 ) whil e movin g
to th e lin e R e (u) = A. we deriv e th e boun d 0((y/q^/y) A). Thi s combine d wit h
(5.17) yield s (5.1 8) . •
The formul a (5.1 2 ) expresse s L(/ , s) i n the critica l strip , essentially , a s two par -
tial sum s o f th e Dirichle t serie s L(f,s) an d th e dua l one , eac h o f lengt h \ / q ( / , s).
Hence on e ca n easil y deriv e estimate s i n th e critica l strip . W e giv e her e th e indi -
vidual estimate , an d th e unifor m versio n fo r L-function s satisfyin g th e Ramanujan -
Petersson conjecture . Se e Sectio n 5.1 2 fo r stronge r result s i n th e cas e o f automor -
phic L-functions .
Because L(f,s) ca n hav e a pol e o r zer o a t s = 1 of orde r r , i t i s appropriat e t o
kill this singularity befor e estimating. Therefor e w e shall often estimat e p r(s)L(f) s)
rather tha n L(/ , s), wher e
r
(5.19) Pr(s)= ( i _ il V
\
EXERCISE 3 . Prov e tha t fo r s wit h 0 < a < 1 we hav e
1
(5.20) p r(5)L(/,,)«q(/,5)( -)/2^
for an y £ > 0 , wit h th e implie d constan t dependin g o n e an d / . I f L ( / , s) satisfie s
the Ramanujan-Petersso n conjecture , the n th e implie d constan t depend s onl y o n e
and th e degre e o f th e L-function . [Hint : Us e th e absolut e convergenc e o f L(/ , s)
in a > 1 (o r th e boun d |Aj(n) | ^ T^(n) , respectively) , th e functiona l equation ,
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 101
Stirling's estimat e fo r th e gamm a functio n an d th e Phragmen-Lindelo f convexit y
principle.]
In particular , (5.20 ) yield s th e followin g boun d o n th e critica l lin e
+£
(5.21) L ( / , S)«q(/,S)I
which i s known a s the convexit y bound. Usin g the approximat e functiona l equatio n
rather tha n th e convexit y principle , an d th e Ramanujan-Petersso n conjecture , on e
can deriv e a slightl y bette r estimat e
(5.22) L(f, s) « q(/ , s) i (lo g q(/, s ) ) 4 " 1
where the implied constan t depend s only on the degree d of L(/, s). I t i s conjecture d
that
L(/,S)«q(/,s)£
for R e (s) = \ wit h an y e > 0 , th e implie d constan t dependin g onl y o n e. Thi s i s
the so-calle d Lindelo f Hypothesis , whic h i s on e o f th e consequence s o f th e Gran d
Riemann Hypothesi s (se e Corollary 5.20) . However , i n many application s the majo r
step i s t o improv e o n th e convexit y boun d fo r relevan t L- functions, i n th e sens e o f
reducing th e exponen t \ b y a positive number , howeve r smal l that numbe r ma y be .
Such result s g o back t o H . Wey l fo r th e Rieman n zet a function , namel y
+£
C(s) « \s\-*
(see (8.22)) . Fo r Dirichle t characters , on e derive s fro m Burgess' s estimat e fo r shor t
character sum s tha t
L(X,s)^\s\qi6+£
(see Theore m 1 2.9) , whic h i s wors e tha n (5.21 ) i n ^-aspect , bu t muc h bette r i n
^-aspect: th e latte r i s ofte n th e mos t importan t (e.g . fo r application s t o algebrai c
number theory) .
In vie w o f th e proo f o f (5.22 ) b y th e formul a (5.1 2 ) i t i s clea r tha t an y sub -
convexity estimat e amount s t o provin g tha t ther e i s considerable cancellatio n whe n
_i_.
summing th e oscillatin g value s Xf(n)n 2 % ? and thi s prove s t o b e a n arithmetica l
problem.
Currently, i t is known that a subconvexity boun d hold s in ^-aspect an d ^-aspec t
separately (bu t no t ye t fo r bot h jointly ) fo r an y L-functio n attache d t o classi -
cal modula r form s (se e th e survey s [M2] , [IS1 ] , an d th e paper s [DFI2] , [DFI3] ,
[KMV2], [PSa] , [Ml] , amon g others) .
5.3. Countin g zero s o f L- functions.
One o f th e deepes t subject s o f th e theor y o f L-function s i s th e distributio n o f
the zero s o f L(f,s). W e discus s firs t wha t ca n b e don e usin g basi c propertie s o f
holomorphic function s throug h th e metho d o f Hadamard an d d e l a Valle e Poussin .
LEMMA 5.5 . Let L(f,s) be an L-function. All zeros p of A(f Js) are in the
critical strip 0 ^ a ^ 1 . For any £ > 0, we have
102 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
PROOF. Sinc e th e Eule r produc t expansio n o f L(/ , s) i s absolutel y convergen t
and 7 ( / , s) doe s no t vanis h fo r R e (s) > 1 , ther e ar e n o zero s o f A(/ , s) i n thi s
region. B y th e functiona l equation , th e sam e hold s fo r R e (s) < 0 . Th e secon d
statement i s a genera l resul t o f comple x analysi s fo r entir e function s o f orde r 1 . •
Notice tha t i f p = ft + r y i s a zer o o f A(/ , s), the n p i s a zer o o f th e dua l
A(/, s). Hence , b y th e functiona l equation , th e poin t reflecte d i n th e critica l lin e
p* = 1 — p = 1 — / 3 -H7 i s als o a zer o o f A(/ , s) (wit h correspondin g multiplicity) .
Of course, ther e i s no symmetry amon g th e trivia l zero s of L(/, s) whic h com e fro m
the pole s o f T ( ^ ) .
THEOREM 5.6 . Let L(f,s) be an L-function. There exist constants a = a(f)
and b = b(f) such that
ss
(5.23) (,( 1 - s)) rA(f, s) = e^ bs J J ( l - -)e P /p
where p ranges over all zeros of A(/, s) different from 0,1 . Hence
(5.24) --(^ = 1^ ^ ) - ^ ^ ^- E (^- + l).
both expressions being uniformly absolutely convergent in compact subsets which
have no zeros or poles (recall that r is the order of the pole or zero of L ( / , s) at
8=1).
PROOF. Th e expansio n (5.23 ) i s simpl y th e applicatio n o f th e Hadamar d fac -
torization theore m o f entir e function s o f finite order , an d (5.24 ) follow s b y takin g
the logarithmi c derivative . •
We denot e
s
(5.25) _ ^ ( /)S) = £ A/(n)n-
the expansio n o f th e logarithmi c derivativ e o f a n L-functio n i n Dirichle t serie s
supported o n prim e powers . I n term s o f th e loca l root s ai{p) o f th e Eule r produc t
(5.7) w e hav e
d
k k
(5.26) A f(p ) = Y^^(p) logp.
3 =1
For Dirichle t characters , A x (n) = x(n)A(n) , an d i n genera l A/(p ) = Xf(p) logp fo r
p prime. Not e tha t Aj(n ) = A/(n) .
PROPOSITION 5.7 . Let L(/, s) be an L-function of degree d ^ 1 and let p denote
the zeros of A(/, s) different from 0,1 .
(1) The number of zeros p = (3 + ij such that \j — T\ < 1 , say m(T, f), satisfies
(5.27) m ( T , / ) < lo g q(/,zT)
with an absolute implied constant.
5. CLASSICA L ANALYTI C T H E O R Y O F L
1 -FUNCTION S 0 3
(2) For any s in the strip — ^^a^2we have
\S + Kj\<l J | S —p|<l
with an absolute implied constant.
(3) The constant b(f) in (5.23) satisfies
1
(5.29) Re(6(/) )= -^Re(p- ).
PROOF. Firs t w e prove (5.29) . Takin g th e logarithmic derivativ e o f the func -
tional equatio n (5( 1 - s)) rA(f, s) = e(f)(s(l - s)) rA(f, 1 - 5) , we get from (5.23 )
11
(5.30) 2R e (6(/)) = b(f) + b(f) = - £ (-^ -+ * + ^- 4+- -
z 5 — p l — s — p p p
0 , l \s — p
p #--' 1 —s— p p
We have (s — p) x — (1 — 5 — p) l < C |p| 2 for 5 ^ p, the implied constan t dependin g
on s , an d similarly p~ 1 + p _ 1<C | p |- 2 so the series
are absolutel y convergen t b y Lemm a 5.5 . S o we can separat e the m i n (5.30 ) an d
the firs t on e vanishes (th e terms cance l ou t sinc e p and 1 — p are zeros o f A(/, s)),
giving (5.29) .
Let T ^ 2 and 5 = 3 + iT. B y (5.26 ) w e have |A/(n) | < dnlogn . Henc e
(5.31) | ^ ( / , * ) | < < ( 2 ) < l o g q ( / ) .
By Stirling' s formul a (5.1 1 6 ) w e have ^logq + — (/, s) < C logq(/, s). Observ e als o
that fo r any zero p = / 3 + 2 7 we have
< Re <
9 + ( T -7 ) 2 Vs-p 74 + (T- 7)2'
Hence w e can tak e th e rea l par t i n (5.24 ) an d rearrang e th e resultin g absolutel y
convergent serie s t o derive usin g (5.29 ) tha t
(5.32) £ _ _ ! _ _ <<logq(/ ,iT).
This implie s (5.27) . T o derive (5.28) , we write s = a + it an d
~(f,s) = ~(f,s) + j(f,3 + it) + +0{\ogqV,s)).
by (5.31 ) , and we get by (5.24 ) an d (5.1 1 6 ) agai n
-£</,.) -£</,.) + r- + ^ - E (7=7 - ST^;)+OP°«'(/-'» -
104 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
In the series, keep the zeros with \s — p\ < 1 and estimate the remainder b y log q(/, s)
using
i 1 1 i 3
\s — p 3 -h it — p l+ (T-7)2
and (5.32) . Moreove r w e have (se e (5.116))
7' v d ^1 v^T' /S + K
JJ 3
-E -^ - + O(iogqoo(s)).
5 + ft-?
|S+«J|<1 ^
Thus (5.28 ) follows . •
T H E O R E M 5.8 . Le t L(f,s) be an L-function of degree d. Let N(T,f) be the
number of zeros p = f3 + ry o / L(/, 5) ST/C/ I t/m t 0 ^ / 3 < 1 and |7| < T. W^ e ftave
(5.33) N(T, f) = l lo g ^^ + 0(log q(/, iT))
for T ^ 1 wit/i a n absolute implied constant.
P R O O F . Le t N'(T, f) b e the number o f zeros p of A(/, s) wit h 0 ^ / 3 ^ 1 an d
0 < 7 < T. W e hav e
(5.34) JV(T , /) = N'(T, f) + N'(T, f) + 0(log q(/) )
where the error term account s for possible real and trivia l zeros of L(f, s) in 0 ^ a <
1. B y (5.27) w e can assume tha t A(/ , s) does no t vanish o n Im (s) = T b y adding
to T a n arbitraril y smal l numbe r i f necessary , modifyin g N(T, f) b y a quantit y
<C log q(/, iT). Choosin g smal l 5 > 0 such tha t A(/ , s) ^ 0 in — 5 < I m (s) < 0, w e
have N(T, f) = I(T) + 0(logq(/,tT)) wit h
'^db/£<'••>*
where C is the rectangle wit h vertice s 3 — iS, 3 + iT, — 2 + iT and —2 — i<5 . W e cut
this rectangl e symmetricall y a t the points \ — iS and \ + iT . B y the functiona l
equation, th e integral o n the left par t o f the contou r equal s tha t o n the right part .
Therefore I(T) equal s twice the integra l ove r th e righ t par t o f C whic h we are goin g
to estimat e b y observing th e variation o f arguments o f each facto r in
Mf,*) = qs/Hf,s)L(f,s) = Tr-^y2 rj r(i±^)L(/,*). ' S + Kj
3=1
ds 2 s 2
The variatio n o f the argumen t o f ^~ l q l give s a contribution t o I(T) equa l to
(5.35) £ lo g £ + 0 ( 1 ) .
By th e Stirling formul a (5.1 1 3) , th e argument o f T(a + it) i s tlogt — t + O(l) i f
t ^ 1 , so the contribution o f gamma factor s t o I(T) i s
lo
(5.36) ^(f ^-f)+°( l o ^(/))-
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 105
For L(/ , s) o n th e vertica l segmen t fro m 3 — id t o 3 + iT , w e estimat e
s
log L(f, s ) = - £ ^ - n ~ « lo g q(/)
t-^ log n
as i n (5.31 ) , s o th e integra l o n thi s segmen t i s < C logq(/). The n o n th e remainin g
horizontal part s fro m \ — id t o 3 — id and 3 + iT t o \ + iT, w e appeal t o (5.28 ) an d
(5.27) t o ge t th e estimat e 0(logq(/,£T) ) .
By (5.34) , (5.35 ) an d (5.36 ) applie d t o L(f,s) an d L ( / , s ) , w e ge t (5.33) . •
REMARKS. Th e mai n ter m o f (5.33 ) come s ou t fro m ou r computatio n o f th e
variation i n the argumen t of q s/2ry(f.s) alon g certain (right ) segments , the contribu -
tion fro m th e L-functio n itsel f bein g smal l (well , onl y fro m th e relevan t segments) .
This illustrate s ho w th e L-functio n i s intrinsicall y connecte d wit h th e companio n
at th e infinit e place .
5.4. T h e zero-fre e region .
Analytic applications of L-functions mak e use of complex integration i n a region
free o f zero s of L(f,s) fo r variou s / . Th e result s ar e stronge r i f the zero-fre e regio n
spreads deepe r int o th e critica l strip . Th e basi c metho d t o cu t on e i s stil l tha t
of Hadamar d an d d e l a Valle e Poussin . I t i s remarkabl e tha t thi s metho d als o
reappears i n Deligne' s secon d proo f o f th e Rieman n Hypothesi s fo r varietie s ove r
finite fields [De2] (th e basic trigonometric inequality (5.69 ) is misprinted there) . W e
present th e traditiona l arguments , bu t i n somewha t mor e elegan t fashion , startin g
with a usefu l genera l lemm a o f Goldfeld , Hoffstei n an d Liehma n [GHL] .
LEMMA 5.9 . Let L(f, s) be an L-function of degree d with R e (A/(n)) > 0 for
(n,q(f)) = 1 . Suppose that at ramified primes \aj(p)\ < p/2. Then L(/ , 1 ) ^ 0 .
Let r ^ 0 be the order of the pole of L(f,s) at s = 1 . There exists an absolute
constant c > 0 such that L ( / , s) has at most r real zeros in the interval
r
S > 1
d ( r + 1 ) log q ( / )'
P R O O F . Le t fy b e zero s o f L(f,s) i n th e segmen t \ ^ f3j < 1 . B y (5.28 ) fo r
s = a > 1 we ge t
(5.37) Yl -V< -^ + ^ y (/,a ) + 0(logq(/))
J
j
on ignorin g zero s othe r tha n /3j's , by positivit y sinc e
Re ^- = ^ ^ 0
a-p \cr-p\2
for an y zer o p — j3 + ij i n (5.28) . Sinc e R e (A/(n)) ^ 0 for (n , q(f)) = 1 we hav e
(5.38) R e ^ ( / , c r ) < 0
106 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
where L nr(f, s) i s the Eule r produc t o f L(/ , s) restricte d t o unramifie d primes . W e
estimate th e contributio n o f ramifie d prime s b y
IE E^SS«E>°^<<>^</> .
p\q(f)KJ^d JKFJF p\q(f)
Hence b y (5.37 )
E— <
a — Bj a
jJ
7 +0(dlogq(/))
— 1
where r i s th e orde r o f th e pol e o r zer o o f L(/ , s) a t s = 1 . Th e abov e inequalit y
shows that fij = 1 is not possible (i n this case r < 0) so we conclude that L(/ , 1 ) ^ 0 ,
in othe r word s r ^ 0 . Suppos e fy > 1 — c(d(r + 1 ) l o g q ( / ) )_ 1 fo r 1 ^ j < n . W e
choose a — 1 + 2c(dlogq(/)) _ 1 an d ge t
ndlogq(/) <
£+0(l))dlogq(/)
2c + c/( r + l )
which implie s n < r + r/2( r + 1 ) + 0(c ) an d n ^ r i f c is smal l enough . •
T H E O R E M 5.1 0 . Let L(/ , s) 6 e an L-function of degree d such that the Rankin-
Selberg convolutions L ( / ® /, s ) and L(f 0 / , s) exist, and the latter has a simple
pole at s = 1 while the former is entire if f ^ f . Suppose that at the ramified
primes \aj(p)\ 2 ^ p/2. There exists an absolute constant c > 0 such that L(f,s)
has no zeros in the region
(5.39) c r ^ l
d 4 log(q(/)(|*| + 3) )
except possibly for one simple real zero f3f < 1 , in which case f is self-dual.
PROOF. Fo r t e R , le t L(g, s) b e th e L-functio n
L(g, s) = C(s)L(f, s + it) 2L(f, s - it) 2L(f<g > /, s + 2it)L(f®f, s - 2it)L(f ® /, s) 2
of degre e ( 1 + 2d) 2. It s analyti c conducto r satisfie s
(5.40) q(g) < q(/ , it)\(f 0 / , 2it) 4 q(/ 0 f) 2 < q(/) 4 + 1 2 d (|t| + 3) 6d2
by (5.1 1 ) .
The produc t i s arrange d s o tha t it s Dirichle t serie s coefficient s ar e real , non -
negative. I t i s sufficient fo r u s to chec k this a t unramifie d places . Fo r a n unramifie d
prime p , th e loca l Eule r facto r a t p fo r L(g , 5) i s o f th e for m (5.1 ) wit h "roots " 1
with multiplicity one , c\jp lt an d ajp~ lt wit h multiplicity two , ctjOck wit h multiplicit y
two, aj(XkP 2lt an d OLjC\^p~ 2lt wit h multiplicit y one , where a^ , a^ ru n ove r th e root s
(5.1) fo r L(/ , s). Therefor e fo r an y k ^ 1 , the su m o f the fc-th powers o f these root s
is
2
I + X ^ + E ^ V -kit >0
JJ
so that A g(n) ^ 0 for an y n coprim e wit h q(f)
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 107
Let p — /3 + 27 be a zero of L(/, s) with / 3 ^ | an d 77^ 0. The n w e take £ = 7
so L(g , s) has a pol e a t s = 1 of order ^ 3 whereas f3 is a zer o o f L(g, 5) o f order
^ 4 . Henc e b y Lemma 5. 9 we have
cd
(5 41) < l <
' ^ ~ dHog^g) * " dMog(q(/)(|t|+3) )
for som e absolut e constant s c > 0 and c' > 0.
Now consider rea l zero s of L(/, 5). Fo r this purpose w e take the product L(g , s)
with ^ — 0 . An y real zer o o f L(/, 5) i s a zer o o f !/(#, 5) o f multiplicity a t leas t 4.
On th e other hand , th e point s = 1 is a pole o f L(g, s) of order 3 if / ^ f an d o f
order 5 if / = / . Henc e by Lemma 5. 9 we conclud e the proof, excep t tha t w e must
prove tha t th e possible exceptiona l zer o of a self-dual L(/ , 5) satisfie s /3f < 1.
Consider instea d
L(h,s)=as)L(f,s)2L(f®f,s).
Assuming L(/ , 1 ) = 0 , it follow s tha t L(h,s) i s entire sinc e th e double zer o com -
pensates th e poles o f ((s) an d L(f<g > / , s). I t als o ha s non-negative coefficients ,
precisely fo r an unramified prim e p and fc^lwe hav e
fc
(5.42) A h(p )=(l + ^ ^ ) % 0 ,
3
so the series
A
(5.43) log L(h,s) = J2Yl \ h(Pk)P~ks
p k>0 *
has non-negativ e coefficients . Le t <r0 ^ 1 be the largest rea l zer o o f L(h, s), s o th e
first singularit y o f log L(h, s) when s decreases. Not e ao exists sinc e £(—2 ) = 0 . By
Landau's Lemm a (se e Lemma 5.56 ) th e series (5.43 ) converge s for any rea l a > cr0,
so b y (5.42 ) w e have log L(h, a) ^ 0 and |L(/i , <r)| ^ 1 . Lettin g a —> cr 0, we get a
contradiction. •
REMARK 1 . I f L(/, s) i s self-dual on e may refin e th e rea l zer o issue of Theorem
5.10 slightly, arguin g with L(h, s) more efficiently. Indee d (5.42 ) shows that A/ l(n) ^
0 fo r all n wit h (n , q(/)) = 1 . Le t I denote th e order o f the pole o f L(f ® /, s) at
5 = 1 . The n L(/i , s) has a pole of order / + 1 at s = 1 while an y real zer o of L(/, s)
is a zero o f L(h, s) of order ^ 2 . Henc e ther e ar e at mos t ^ rea l zero s o f L(/, s)
in th e region (5.39) .
REMARK 2 . Not e tha t i f the Rankin-Selber g L-function s exist , bu t L ( /0 / , 5)
has a pol e o f order > 1 at s — 1, it i s often th e case tha t L(f, s) i s a produc t of
other L-functions . I t is then mor e efficien t t o apply Theore m 5.1 0 t o those tha n to
L(f,s) itself . Fo r instance, conside r th e Dedekind zet a functio n o f a quadratic fiel d
(K(S) = C(s)L(s,x) . The n th e Rankin-Selberg L-functio n i s £(s) 2 L(s, x)2 whic h
has a double pol e a t s = 1.
108 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
EXERCISE 4 . Le t L(f,s) an d L(g,s) b e tw o L-function s o f degree s d an d e
respectively suc h tha t g ^ f an d g ^ f. Assum e tha t Rankin-Selber g L-function s
L{f ® g,s), L(f ® #,s), an d henc e thei r dual , exist , an d tha t th e loca l root s a t
ramified prime s satisf y |a^(p)| 2 < p/2. Sho w tha t ther e exist s a n absolut e constan t
c > 0 such tha t L(f ® g,s) ha s n o zer o i n th e regio n
r.
(d + e)*log(q(/)q(s)(|i|+3)) '
What ca n yo u prov e i f / = g o r / = g? [Hint : Us e th e auxiliar y functio n
L(gis) = L(f^ g)s+^)L(f0g,s)L(f^g,s)L(f^g,s-^)
and appl y Lemm a 5.9. ]
In genera l nothin g stronge r i s know n tha n Theore m 5.1 0 . Fo r certai n impor -
tant L-functions , on e ca n d o better , individually , an d als o b y considerin g families .
Examples, includin g th e mos t importan t on e o f Dirichle t characters , ar e discusse d
in Sectio n 5. 7 an d 5.1 2 .
5.5. Explici t formula .
A summatio n formul a somewha t analogou s t o (5.1 2 ) ca n b e derive d b y inte -
grating th e Dirichle t serie s fo r th e logarithmi c derivativ e o f a n L-function . Fo r
historical reasons , thi s typ e o f formul a i s usuall y calle d a n "explici t formula" , em -
phasizing th e lin k i t expresse s betwee n sum s ove r th e zero s o f a n L-functio n wit h
sums o f coefficient s a t prim e powers .
THEOREM 5.1 1 . Let tp :]0,+oo[— > C be a C°° function with compact support,
and
/*+oo
(p(s) — / (p(x)x 1 s
~ dx,
Jo
be its Mellin transform. Put ip(x) = x~ 1 <p(x~1 ), which means ifj(s) = <p(l — s).
Then we have
(5.44) V (A f(n)<p(n) + Aj(n)^(n)) = ^ ( l ) l o g g ( / ) + r / ^{x)dx
(1/2) '
where p runs over the zeros of L(f, s) in the strip 0 < a < 1 (the non-trivial and
the trivial ones) with relevant multiplicity.
PROOF. Star t wit h the first ter m on the left sid e of (5.44) using Mellin inversio n
] T A/(n)<p(n ) = — / - — (f,s)<p(s)ds.
(2)
Let c be a smal l positiv e numbe r s o tha t L ( / , s) ha s n o zero s i n — c ^ R e (s) < 0 .
Move the lin e of integration t o R e (s) = —c , which i s legitimate b y (5.27) . A simpl e
pole i s picke d u p a t s — 1 with residu e r J ip(x)dx, an d a t th e zero s s = p wit h
residues m(p{p), wher e m i s the multiplicity . S o those residue s giv e the secon d an d
the fourt h term s o n th e righ t sid e o f (5.44) .
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 109
The functiona l equatio n implie s
-^(f,s) = logq(f) + ^ ( /, s) + ^ ( / , 1 - s) + ^(f,1 - s)
L7 7 L
for a = — c. Integratin g ove r a = —c , the facto r logg(/ ) give s th e firs t ter m o n th e
right sid e o f (5.41 ) b y Melli n inversion . Moreover , b y absolut e convergenc e
n).
(-c)
And finally on e ca n mov e th e lin e o f integratio n fo r th e gamm a factor s t o a = |
getting the integral of gamma factors i n (5.44 ) without residue s (th e poles of — (/, 5)
cancel ou t wit h thos e o f — (/, 1 — s)). •
R E M A R K . Becaus e o f th e appearanc e o f th e conducto r q(f) o n th e righ t sid e
of (5.41 ) , thi s formul a turn s ou t t o provid e a goo d analyti c too l fo r it s study . Se e
Theorem 5.3 2 below fo r a n example . However , i t i s quit e usefu l fo r othe r things .
First o f al l th e explici t formul a i s seen a s expressin g a su m ove r prime s b y th e su m
over the zero s and vic e versa. Certainl y ther e ar e other explici t formula s connectin g
primes wit h zeros .
EXERCISE 5 . Le t <p be smoot h an d compactl y supporte d o n [l,+oo[ . Sho w
that
g A(nMn ) = ^ ( 1 - (g _ 1 )g(g +!)>(*)** " I>(P) .
where p run s ove r th e non-trivia l zero s o f C(s) . [Hint : Star t a s above , bu t mov e
the lin e o f integratio n fa r t o th e lef t instea d o f usin g th e functiona l equation ; th e
first ter m come s fro m th e pol e a t s = 1 and th e trivia l zero s o f £(s ) a t s = — 2/c,
k ^ 1 an integer. ]
Sometimes i t i s more convenien t t o wor k wit h th e explici t formul a i n a Fourie r
form rathe r tha n th e Melli n for m (5.44) . Thi s ca n b e derived fro m (5.44 ) b y simpl e
substitutions (p(x) = x _1 //2 #(log:r) an d x — e y gettin g
THEOREM 5.1 2 . Let g(y) be a function of Schwartz class on R which is even.
Let h(t) be the Fourier transform of g{y). We have
(5.45) ^ ( A/(n)+A7(n))^|^=5(0)logg(/) + r/i(^ )
+
5FSZ (^ + *>+ T"'* " a)M5F)* " ?*(£)
where p = | + ^ 7 runs over the zeros of L(f, s) in the strip 0 ^ a ^ 1 (the non-trivial
and the trivial ones) with relevant multiplicity.
110 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
5.6. Th e prim e numbe r theorem .
By Prime Numbe r Theore m fo r a n L- function L(/ , s), we mean first th e asymp -
totic behavio r o f th e su m
(5.46) <&(/,* ) = £ A,(n )
n^.x
which i s essentiall y th e su m o f Xf(p)\ogp ove r primes . Whe n L(f,s) — C(s), thi s
amounts indeed to counting primes p < x. Muc h of the arithmetic interest lie s in th e
dependency o f th e erro r ter m i n asymptoti c approximation s o n extr a parameters ,
notably o n the conductor whe n / varie s in a family. Therefor e w e are seeking result s
which ar e explici t i n ever y possibl e parameter , althoug h no t th e stronges t ones .
The strengt h o f results depend s o n the dept h o f the zero-fre e regio n fo r L(/ , s).
The bes t w e can hope for today i s that L(/ , s) ha s no zeros in the region (5.39 ) wit h
at mos t on e exceptio n o f a simpl e zer o fif i n cas e o f / self-dual . B y ou r definitio n
(only fo r th e purpos e i n thi s section ) th e exceptiona l zer o i s i n th e segmen t
where c i s th e absolut e positiv e constan t fro m (5.39) . Thi s definitio n ma y caus e
some discomfor t fo r ne w enthusiast s o f analyti c numbe r theory , becaus e i t i s loos e
with respec t t o th e constan t c. However , th e practic e show s tha t on e benefit s fro m
such a flexibl e concept . Afte r all , c ca n b e improve d b y ne w tool s i n th e future ,
not t o mentio n tha t w e don't believ e i n th e existenc e o f the exceptiona l zer o of an y
L-function.
In ful l generalit y w e d o no t ye t hav e a stron g boun d fo r th e individua l coeffi -
cients o f L(/ , s). Bu t a crud e boun d fo r A/(n ) o n averag e will suffice . Specifically ,
we postulate th e following :
2
(5.48) ^ | A /(n)| «^2log2(a;q(/))
n^x
for x ^ 1 , where th e implie d constan t i s absolute .
EXERCISE 6 . Deriv e fro m (5.48 ) tha t
2
(5.49) W , a ) = £ A/(p ) logp + 0{^d 2 log (xq(/))
p^x
where th e implie d constan t i s absolute .
Recall tha t th e postulate d zero-fre e regio n (5.39 ) wa s establishe d i n Theore m
5.10 unde r th e assumptio n tha t th e Rankin-Selber g convolutio n L(f 0 / , s) exist s
and ha s a simpl e pol e a t s — 1 . W e shal l se e tha t th e sam e assumptio n i s als o
sufficient fo r th e secon d postulat e (5.48) . T o thi s en d w e appl y Propositio n 5. 7 fo r
L{f ® f,s) rathe r tha n fo r L(f,s). Combinin g (5.28 ) an d (5.27 ) (se e als o (5.1 1 ) )
we ge t
l o
-T(/®/»«^T gl(/)
LG — 1
for 1 < < J ^ 2 , where th e implie d constan t i s absolute. Thi s wit h a = 1 + (log3x) _ 1
yields (5.49 ) b y th e non-negativit y o f coefficients .
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S 111
From (5.49 ) w e derive usin g Cauchy' s inequalit y tha t
n
(5.50) J2 IM )l « d^ylog(xq(f))
x<n^.x+y
for x ^ y ^ 1 where th e implie d constan t i s absolute .
Now w e ar e read y t o stat e an d t o prov e th e mai n Prim e Numbe r Theorem .
THEOREM 5.1 3 . Let L(f,s) be an L-function for which (5.39) is a zero-free
region with at most one exceptional real zero f3f in the segment (5.47), where c is a
positive absolute constant. Suppose (5.48) holds with an implied constant absolute.
Then we have
(5.51) « / . , )- „ - %+°('^(^',%; ( / ) )(''l°«"l(/>>').
for x ^ 1 , the implied constant being absolute. By convention the term —xP f/(3f is
not in (5.51 ) if the exceptional zero does not exist.
REMARKS. Th e approximat e formul a (5.51 ) ha s meanin g whe n th e erro r ter m
is smaller tha n th e mai n term , an d thi s i s the cas e fo r
1
x ^ ^4c- d 4 log(dlog < ? )
where q = q(/ ) i s th e conductor , d i s th e degre e an d c i s th e relevan t absolut e
positive constant . On e ca n simplif y th e erro r ter m b y compromisin g slightl y th e
strength o f th e result . Fo r example , w e hav e
/
(5.52) V ( / , ^ ) = r - x - ^ + 0 ( v q(7)^exp(-^iV^g^))
where th e implie d constan t i s absolute . Indee d thi s hold s true , becaus e whe n th e
error ter m her e i s smaller tha n tha t i n (5.51 ) th e resul t i s trivial. Th e condition s o f
Theorem 5.1 2 ar e satisfied i f L ( / 0 / , s ) exist s an d ha s a simpl e pol e a t s — 1, bu t
we d o no t restric t (5.51 ) t o thi s cas e only . Indee d w e shal l establis h th e zero-fre e
region (5.39 ) an d w e shall se e the crud e boun d (5.48 ) directl y withou t appealin g t o
the Rankin-Selber g convolutio n L-functio n i n man y classica l cases .
P R O O F O F THEORE M 5.1 2 . Firs t w e smoot h thing s ou t b y writin g
where (j>(z) i s a functio n suppor t o n [0 , x + y] , such tha t </>(z) = l i f l ^ 2 ; < x an d
\(b(z)\ ^ 1 elsewhere. Th e paramete r y wil l b e chose n late r subjec t t o 1 ^ y ^ x.
For example , w e tak e
(p(z) = m i n f - , 1 , 1 H )
\y y J
for 0 ^ z ^ x + y an d <p(z) = 0 if z > x + y. Th e Melli n transfor m o f <f)(z) satisfie s
rx+y x <y / x \
1
s
(t)(s)= / (P(z)z ~ dz < —mi n 1 ,—)
Jo \s \ v
\s \y J
for s = a + it, \ ^ a < 2 .
112 5 . CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
Now w e ca n star t wit h
£A f(n)4>(n) = ±-[-*L(f, 3)fa)ds,
(2)
the integra l convergin g absolutely . W e shall b e operatin g i n th e regio n
{s = a + it | a > 1 - Cl /d4 log(q(/)(|t | + 3)) }
where c\ = 2c/3 , or c\ — c/3, dependin g o n whethe r th e exceptiona l zer o i s i n th e
right hal f o f th e segmen t (5.47) , o r elsewher e (wit h possibilit y o f non-existence) .
Let Z denot e th e boundar y o f this region . Th e poin t i s that al l the zero s o f L(/ , s)
are distance d fro m Z b y a t leas t c/6d 4 log(q(/)(|t| + 3)) . Henc e i t follow s b y (5.27 )
and (5.28 ) tha t fo r se Z,
- ^ ( / , 5 ) « d 4 l o g 2 ( q ( / ) ( M + 3)) .
Moving th e integratio n fro m th e lin e R e (s) = 2 to Z w e ge t b y Cauchy' s theore m
£ A/(n)0(n ) = r0(l) - 4>(M + 4- / ~T if' S ^(s)ds-
z
Here th e presenc e o f th e exceptiona l zer o ter m depend s o n whethe r w e passe d th e
point s = (3f or not . Fo r s = 1 , or s = f3f we hav e
0 ( 5 ) = I* sz s-1 dz + 0(y) = -+0(y)
Jo
while th e contou r integra l ove r Z i s estimate d b y
d4 J ^ m vi n ( l , r ^ ) l o g 2 ( q ( / ) ( | t | + 3 ) ) | ^ | < < ^ 4 x ^ T ) l o g 3 ( q ( / ) T )
7z M \s\y'
where T = x/y an d cr{T) = 1 — ci/d 4 log(q(/)T) . Gatherin g th e abov e result s w e
arrive a t
^ /1 x
^(/, a) = r x - ^ + 0 ( V (xT" 2 + x'(T) ) log 3 (xq(/))J
where T i s a t ou r disposa l subjec t t o 1 ^ T < x . W e choos e T = exp(|v / l°g x )
getting (5.51 ) . W e stil l hav e t o addres s th e questio n o f th e exceptiona l zer o term .
An explanatio n abou t it s contributio n t o th e obtaine d formul a i s neede d i f th e
exceptional zer o doe s exis t an d i t lie s i n th e lef t hal f o f th e segmen t (5.47) , bu t i n
this cas e th e ter m —x^f/fif i s consume d b y th e existin g erro r term , s o i t ca n b e
ignored. Thi s complete s th e proo f o f Theore m 5.1 2 . •
If on e i s willing t o emplo y mor e zero s o f L(/ , 5), then on e ca n deriv e alon g th e
above line s stronge r approximation s t o ip(f,x).
EXERCISE 7 . Assum e tha t L(f,s) satisfie s th e Ramanujan-Petersso n conjec -
ture. Deriv e (usin g Perron' s formul a (5.1 1 1 ) ) th e followin g approximat e expansio n
(5.53) tjj(f,x)=rx- £ ^ - ±+ o(|(logx)log(x dq(/)))
5. CLASSICA L ANALYTI C T H E O R Y O F 1
L1
-FUNCTION S 3
where p = ft + 2 7 runs ove r th e zero s o f L(/ , s) i n th e critica l stri p o f heigh t u p t o
T, wit h an y 1 ^ T ^ x , an d th e implie d constan t i s absolute .
EXERCISE 8 . Assum e tha t L(f,s) satisfie s th e Ramanujan-Petersso n conjec -
ture an d Theore m 5.1 3 hold s fo r / . Prov e tha t r ^ d.
5.7. Th e Gran d Rieman n Hypothesis .
The Gran d Rieman n Hypothesi s (GR H fo r short ) refer s t o th e followin g con -
jectural statemen t abou t zero s o f L-functions :
GRAND RIEMAN N HYPOTHESIS . Let L(f,s) be an L-function. Then all zeros
of L(/, s) in the critical strip 0 < R e (s) < 1 are on the critical line R e (s) = \.
REMARKS. Needles s t o sa y w e believ e tha t ever y L-functio n (subjec t t o ou r
definition i n Sectio n 5.1 ) satisfie s th e Gran d Rieman n Hypothesis . Yet , provin g
this eve n fo r on e L-functio n woul d b e a n achievemen t o n a historica l scal e fo r
human beings . Not e tha t a n L-functio n ma y hav e zero s o n th e lin e R e (s) = 0
(certainly som e trivia l zeros , probabl y no t th e genuin e ones) , bu t a s w e alread y
proved no t o n R e (s) — 1 . Th e GR H als o ensure s tha t th e trivia l zero s i n th e
open critica l stri p ar e al l o n th e critica l lin e a s well ; d o the y exis t i s no t clear ,
most likel y the y d o not . I t i s ver y importan t t o conside r onl y L-function s wit h
Euler products . Indee d i t i s know n tha t Dirichle t serie s withou t a n Eule r produc t
(still civilized ones ) hav e many zero s even in the half-plan e o f absolute convergence .
Arithmetic geometer s shoul d no t b e concerned , becaus e thei r L-function s originat e
from a n Eule r produc t o f som e kind . O f course , thi s eas y star t make s i t harde r
to investigat e L-function s furthe r i n analyti c aspects . Fo r example , i t wa s onl y
recently tha t analyti c continuatio n o f th e Hasse-Wei l L-function s o f ellipti c curve s
was establishe d b y wa y o f modularity . On e shoul d no t underestimat e th e analyti c
continuation o f an Euler produc t beyon d th e regio n of absolute convergence, fo r on e
often implement s dee p arithmeti c resource s i n on e wa y o r another . A s envisione d
by Riemann , thi s ventur e int o th e critica l stri p illuminate s prim e number s an d
relevant arithmeti c object s buil t o n the m b y revealin g thei r dua l companion s - th e
zeros of L-functions. Mysteriou s a s they ar e today, thes e zero s (mos t likel y comple x
transcendental numbers ) wil l eventuall y b e cracke d fo r furthe r analysis . However ,
for th e tim e being , a s fa r a s th e GR H goes , w e ar e onl y sur e the y ar e fine tune d
to yiel d surprisingl y stron g estimates . Yes , th e uniformit y o f estimate s i n term s o f
involved parameter s whic h ca n b e derive d fro m th e GR H ar e astonishing . O n a
negative not e thi s super b uniformit y coul d b e a reminde r ho w sli m prospect s ar e
to prov e th e GR H i n ou r lifetime . N o tool s o f curren t analyti c numbe r theor y ar e
capable o f treatin g th e question s fo r whic h th e GR H provide s eas y answers . I n
this sectio n w e present a few traditiona l analyti c consequence s o f the GRH , payin g
attention t o th e uniformit y i n term s o f th e conductor .
First w e state a fe w equivalen t fact s whic h hel p t o gras p th e analyti c meanin g
of th e Rieman n hypothesis .
PROPOSITION 5.1 4 . Let \ ^ a < 1 . The following statements are equivalent
for an L-function:
(1) There are neither zeros nor poles of (s — l ) r L ( / , s) in a > a, where r is a
non-negative integer.
(2) The inverse, the logarithmic derivative and the logarithm of (s — l ) r L ( / , s),
the latter normalized so that\ogL(f, s) - » 0 as a - » +oo ? are holomorphic in a > a.
114 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
(3) Let fJ>f(n) denote the coefficients in the Dirichlet series expansion for the
inverse L(f,s)~ x. Then
+£
(5.54) M(f,x) = £ > / ( r c ) « *" ,
the implied constant depending on f and e > 0.
(4) Let r ^ 0 be the order of the pole of L(/, s) at s — 1. Then
a
(5.55) xl>(f,x) = rx + 0{x +e),
£fte implied constant depending on f and e > 0.
The Riemann hypothesi s assert s that th e abov e statements ar e true with a = \.
What i s interestin g (perhap s somewha t surprisin g fo r a beginner ) i s tha t th e esti -
mate (5.55 ) i s self-improving b y passag e throug h zeros . Indeed , first (5.55 ) implie s
that th e zero s ar e o n th e lin e R e (s) = | , whenc e on e derive s b y th e expansio n
(5.53) wit h T — x usin g th e crud e boun d (5.26 ) th e followin g
THEOREM 5.1 5 . Assuming the Riemann Hypothesis for L ( / , 5) and the Rama-
nujan-Petersson Conjecture for L(f, s) we have
d
1(5.56) >(f,x) =rx + 0(xh\ogx)\og(x q(f)),
for x ^ 1 , the implied constant being absolute.
The Rieman n hypothesi s show s tha t L(/ , s), th e invers e L ( / , $ ) - 1 , th e loga -
rithmic derivativ e ^ r ( / , s) an d th e logarith m lo g L ( / , s) ca n b e wel l approximate d
by extremel y shor t partia l sum s o f th e correspondin g Dirichle t serie s uniforml y i n
Re 5 = a ^ a > \. Clearl y thi s i s no t possibl e fo r s o n th e critica l line , wher e th e
zeros are . W e begi n b y considerin g
Let <j)(y) b e a continuou s functio n o n [0 , 00[ whose Melli n transfor m (j>(w) satisfie s
(5.57) w(w + l)(f>(w) < 1 , fo r w suc h tha t - \ < R e (w) ^ \.
Suppose <p{w) ha s a simpl e pol e a t w — 0 wit h residu e 1 (normalization) . Fo r
example, </>(y) = e~ y wit h 4>(w) = T(w), o r <j)(y) = max( l — T/,0 ) wit h <f>(w) =
w~1(w+ 1 ) _1 .
PROPOSITION 5.1 6 . For \ < Re (5) < § we have
(,58) ^ ( / , s ) . ? ^ ( | )
/or an y X ^ 1 , where the implied constant depends only on that in (5.57).
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION
11 S 5
PROOF. B y no w it shoul d b e a standar d argumen t fo r a reade r t o deriv e th e
formula (5.58 ) b y contour integratio n wit h th e error ter m bein g equa l exactl y
Indeed thi s i s obtained b y starting wit h the above integral o n the line Re (w) =
\ an d movin g i t t o th e lin e R e (w) = — \. Th e simple pole s a t w = 0,1 — s , p — 5
produce th e corresponding term s i n (5.58) . No w on the line R e (w) = — \ w e have
Re (s -f w) — a — | , an d we deduce b y (5.26 ) an d (5.27 ) tha t
- \ ( / , s + w) < (2 a - 1 ) - 1 log q(/, s + w).
Integrating thi s agains t th e bound fro m (5.57) , we get the error ter m i n (5.58) . D
The su m ove r th e zero s i n (5.58 ) i s estimate d directl y (withou t cancellation )
by mean s o f (5.26) , (5.27 ) an d (5.57 ) givin g
(5.59) - £ ( / , , )= £ ^ Q )- , 0 ( 1 - ,)X*- + o(^±x\-°)
n
for an y X ^ 1 , where th e implied constan t depend s onl y o n that i n (5.57) .
Next w e are going to estimate th e sum of Aj(n). Essentiall y an y crude bound ,
but independen t o f the conductor , lik e |A/(n) | ^ n , woul d suffic e fo r ou r purpose .
Nevertheless, sinc e w e already assume d th e GRH, there i s no point i n ignoring the
Ramanujan-Petersson conjectur e fo r the local roots , whic h yield s |A/(n) | ^ dA(n).
Prom thi s w e get
E *£>*(£)«'» ,-'+-"<«*•
n
Moreover, th e polar ter m i n (5.59 ) i s
- n £ ( l - s)X 1 ~s = r{s - l ) " 1 + OirX 1 -* + rlogX).
Hence (5.59 ) simplifie s t o
Finally choosin g X = log 2 q(/, s) w e obtain
T H E O R E M 5.1 7 . Assume the Riemann Hypothesis and the Ramanujan-Peters-
son conjecture for L ( / , s). We have
"T ( / ' s ) = 7 ^ 1 + 0 (2^T ( l o g q ( / ' s ) ) 2 " 2 < 7 + dloglogq(/ s)
' )
for any s with \ < a ^ | 7 the implied constant being absolute.
116 5 . CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
COROLLARY 5.1 8 . For a ^ \ + (lo g log log q)/(log log q) we have
where q = q(/ , s) is the analytic conductor and the implied constant is absolute.
To ge t an estimate fo r the logarithm o f the L-function w e integrate th e loga-
rithmic derivativ e alon g th e horizontal lin e
z.5/4 rj
log L(/, s) = log L(/, I + zt) - y - (/ , a + it)da .
Applying (5.59 ) w e deduce th e following estimate s
T H E O R E M 5.1 9 . Assume the Riemann Hypothesis and the Ramanujan-Peters-
son conjecture for L(/ , s). We have
]og(pr(8)L(f,8)) « f ° g q ( /' ^ " /T ,x + d log log q (/, s)
(2cr- l)loglogq(/,s )
/or am / 5 wi£/ i | < a ^ | , £/i e implied constant being absolute (recall p r(s) =
(s-l)r(s + l)- r.
As immediat e consequence s o f Theorem 5.1 9 w e have lowe r an d upper bound s
for L(/ , s) in the half-plane a ^ ^ + £:
q(/> *)"" < Pr(s)L(f, s) < q(/ , s) £
with implie d constant s dependin g onl y on £. Th e upper boun d ca n be extended up
to th e critical lin e b y the convexity principle . I n particular, w e obtain
COROLLARY 5.20 . For s with R e (s) = | and any e > 0 we have
(5.60) L ( / , S )«q(/,Sr
where the implied constant depends only on e.
The las t estimat e receive d considerabl e attentio n (i t is known a s the Lindelof
Hypothesis), becaus e it is sufficient fo r solving many fundamental problem s of arith-
metical nature du e to its uniformity i n the conductor. W e just prove d tha t i t follows
from th e Riemann hypothesis . On e ma y thin k tha t th e Lindelof hypothesi s shoul d
be easie r t o establish, bu t there i s a popula r opinio n tha t th e Riemann hypothe -
sis mus t b e solved first , th e point bein g tha t th e Riemann hypothesi s i s linked t o
more natura l mathematica l structures . Clearl y th e Lindelof conjectur e implie s the
following estimat e fo r sums o f coefficients o f the L-functio n
(5.61) ^ A /(n)=xP/(logx) + 0(xi(xq(/)) e)
where Pf(X) i s the polynomial o f degree r — 1 given by
s
Pf(logx) = res L(f,s)x -\
S =l
Some unconditional results , weaker yet useful, ar e given in Chapter 4 (see Corollar y
4.9 fo r ((s)L(s, x±) a n d Exercis e 7 of Chapter 4 for ((s)2 fo r instance).
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 117
5.8. Simpl e consequence s o f GRH .
We illustrat e som e o f the simpl e arithmeti c consequence s o f GRH, beside s the
Lindelof Hypothesis , whic h ha s important application s o f its own.
The firs t proble m i s to estimate th e order o f vanishing o f an L-function a t a
given poin t o n the critical line , i n term s o f the conductor o f /. W e consider th e
case of the central poin t s = 1 /2 . Not e tha t (5.27 ) implie s
ord L ( / , 5)^m(l,/)«logq(/)
s=l/2
unconditionally, whic h w e can improve slightly .
PROPOSITION 5.21 . Let L(f,s) be an entire L-function satisfying GRH. We
have
logq(/)
(5-62) or d L(/,s)« C , „ ,
' -i/ 2 log(j|logq(/) )
with an absolute implied constant (recall that logq(/ ) > d).
PROOF. W e apply the explicit formul a (5.45 ) t o a test functio n g(y) supporte d
on [-2Y,2Y] wit h 0 ^ g(y) < 2 and g(y) ^ 1 if \y\ ^ Y suc h tha t it s Fourie r
transform h(t) i s non-negative o n R. B y positivity w e can drop al l the zeros p =
\ + 27 ^ \ i n (5.45 ) gettin g
mh(0) ^ - 2 ^ R e ( A / ( n ) ) n - ^ ( l o g n ) + 0(logq(/)) ,
n
where m i s the multiplicity o f the zer o p — \. O n the left sid e we have h(0) x Y.
Since we have |A/(n) | ^ dA(n)n b y (5.2), the sum ove r prim e power s n on the right
side i s bounded by
2d ^2 A(n)n 2 < de Y',
logn^2Y
where th e implied constan t i s absolute. Choosin g 2Y = l o g ( | logq(/)) w e ge t
logq(/)
log(|logq(/))
with a n absolute implie d constant . D
Proposition 5.2 1 is optima l althoug h th e estimate o f the sum over prime s i s
quite crude , whic h migh t sugges t som e improvemen t i s possible. Th e issue i s tha t
this su m is extremely shor t wit h th e choice o f y, i n fact n ^ e 2Y ^ | logq(/ ) and
it i s conceivable tha t Af(n) doe s not change sig n i n this range . I f we take L(f, s) k
for k —> +o o instead o f L(/, s), w e see that (5.62 ) canno t hol d withou t th e facto r
| . However , i n a number o f cases, i t is possible t o do better; se e Proposition 5.34
and Exercis e 1 3 below.
The proble m o f detectin g sig n change s o f Af(n) fo r ver y smal l n i s relate d
to anothe r classica l applicatio n o f the Grand Rieman n Hypothesis : ho w many co-
efficients (i n term s o f th e conducto r q(f)) o f L(f,s) a t prime s ar e require d t o
distinguish L(/ , s) amon g L-function s o f a certain class ?
118 5 . CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
PROPOSITION 5.22 . Let L(f,s) and L(g,s) be two L-functions with the same
degree d and gamma factor. Assume that L(f <S> / , s) and L(f 0 g, s) exist and
the latter is entire, that GRH holds for both and that the local roots at ramified
primes for L(f (g ) / , s) and L(f 0 g, s) are of modulus ^ 1 . There exists a prime
p ^ C(dlogq(f)q(g)) 2 unramified for f and g such that the local roots of L(f,s)
and L(g, s) at p are different. Here C is some absolute positive constant.
P R O O F . Th e poin t i s that b y assumptio n th e L-functio n L ( /<g> /, s) ha s a pol e
at s = 1 whereas w e assum e tha t L(f ® g, s) i s entire . Assum e th e loca l root s o f
/ an d g coincid e fo r al l prime s p ^ 2X whic h ar e unramifie d fo r / an d g. The n
Af®g(n) — Af^f(n) fo r n ^ 2X suc h that (n , q(f)q(g)) = 1 . B y the explicit formul a
(5.44) applie d t o a functio n o f th e typ e 0(n/X ) wher e 0 ^ 0 is smooth, compactl y
supported o n [1 , 2], an d 0 / 0 , w e hav e
^Am(n)<l>(n/X)<z:Vxiogq{f®g),
n
J2 K^f{n)4>{n/X) = 4>(0)X + 0(VX\og q( / ® / ) )
n
where 0(0 ) = J <p(t)dt > 0 . Thes e estimate s ar e smoot h version s o f th e prim e
number theore m fo r th e Rankin-Selber g convolutio n L-functions . W e use d GR H
to estimat e th e su m ove r zeros . O n th e othe r hand , th e tw o lef t side s coincid e
by assumptio n u p t o th e contributio n fro m ramifie d primes . Thi s contributio n i s
small, precisel y i t i s
A 2
«E l / 0 / » l + |A/ M(n)| « d (\ogq{f)q{g))\ogX.
n^2X,{n,q{f)q{g))^l
Therefore 0(O) X « VX\ogq(f 0 f)q(f ® g) + d 2 (iog q{f)q(g)) logX , o r X <
2
(d\ogq{f)q(g)) wher e th e implie d constan t i s absolut e (recal l tha t th e definitio n
of th e Rankin-Selber g convolutio n implie s th e boun d q( / 0 g) ^ (q{f)q(g)) d)- •
REMARK. Th e bes t genera l unconditiona l resul t i s muc h weaker . Suppos e
L(/, s) ha s degre e d with root s |QJ^(p ) | < p 1 / 4 an d tha t i t admit s L(f (g ) /, s). The n
for ever y e > 0 ther e exist s C > 0 , dependin g o n e an d o n / wit h th e followin g
property: I f L(g, s) ha s th e sam e gamm a facto r a s L(/ , s) (s o th e sam e degree) , i f
L{f<8 > g, s) i s entire an d i f X g(p) = Xf(p) fo r al l p ^ Cq(g) d/2+£, the n g = / .
To prove this, notice that i f X ^ 1 is the largest intege r fo r whic h Xf(p) = X g(p)
for p < X , the n b y multiplicativit y w e hav e Xf(n) = X g(n) fo r n \ N(X) wher e
N(X) i s th e produc t o f prime s ^ X. Fo r h = / , o r ft = g, on e ca n facto r
L(f<g > ft,s) = L\f <g > ft,s)H(f,ft, s)
where L b(f ® ft, s) i s th e Dirichle t serie s restricte d t o squarefre e integer s an d
H(f,h,s) converge s absolutel y fo r a > \. I n particular , L b(f ® ft, 5) i s entir e i f
ft = g an d ha s a pol e a t 5 = 1 for ft = / , an d satisfie s th e sam e convexit y boun d a s
the Rankin-Selberg L-function . W e get by complex integration o n the line a = | + 5 ,
with S > 0 small enoug h
J2 A / ^ ( n / X ) = 6(f,g)XP(\ogX) + 0(xh^{f ® ff )i+=)
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 119
where r i s the orde r o f the pol e o f L ( / ® / , s) a t s = 1 and P ^ 0 is the polynomia l
of degre e r — 1 given b y
P(logX) = re s L b(f<g > h, s)X s
~\
s=l
Comparing th e formul a fo r h = f an d tha t foih = g yield s th e result .
5.9. Th e Rieman n zet a functio n an d Dirichle t L-functions .
The Rieman n zet a functio n ((s) i s a self-dua l L-function , wit h conducto r 1 ,
gamma facto r 7(5 ) = 7r - s / 2 r(s/2) an d roo t numbe r 1 . It s Rankin-Selber g squar e
is als o C{s).
More generally, le t x b e a primitive Dirichlet characte r modul o q. Th e Dirichle t
//-function L(s , x) i s a n L-functio n o f degre e 1 with conducto r g , gamm a facto r
7(,) = ^ / 2 r ( ^ ) ,
where 6 = 0 i f x(~~l ) = 1 an
d< 5 = 1 i f x ( — 1 ) = ~1 > a n d it s roo t numbe r i s
e(x) = r ( x ) / ^ w h e r e
a: (mo d q)
is the Gaus s su m associate d t o x (se e Sectio n 3.4) . I f x i s non-trivial , the n L(s , x)
is entire , otherwis e i t ha s a simpl e pol e a t 5 = 1 with residu e 1 .
Any L(s , x) satisfie s triviall y th e Ramanujan-Petersso n conjecture . Th e ana -
lytic conductor i s given by q(x?s) = <?(| £ + 5| + 3) x g(|t | + 3) and q(x ) = (2 + 5)q x g ;
see Theore m 4.1 5 .
A Dirichle t L-functio n L(«s,x ) i s self-dua l i f an d onl y i f x i s quadratic . I n thi s
case e(x) = 1 by Theore m 3.3 .
Any tw o Dirichle t character s x i a n d X 2 admi t a Rankin-Selber g convolutio n
given simpl y b y L(\i <8> X2, s) — L(X3i 5 )> where X 3 is th e primitiv e characte r tha t
induces th e produc t XiX2 - W e hav e X 3 — X1 X 2 if ? for instance , th e conductor s qi
of X i ar e coprime , bu t no t i n general . I n particular , th e Rankin-Selber g squar e o f
L(s,x) i s C( 5) a n d th e finite produc t (5.1 0 ) i s
Fixing a modulu s q and considerin g al l primitiv e character s modul o q gives a n
example o f a "family " o f L-functions . Fixin g X ^ 2 an d considerin g al l primitiv e
quadratic character s o f conducto r q < X give s anothe r family .
By (5.22 ) w e deduc e th e convexit y boun d fo r Dirichle t L-functions .
THEOREM 5.23 . Let x modulo q be a primitive Dirichlet character. We have
(5.63) L( 5 ,x)«(<7|*|)i
for Re s — \, the implied constant being absolute.
The followin g exercis e provide s ver y simpl e estimate s whic h ar e quit e hand y
anyway.
120 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
EXERCISE 9 . Le t x modul o q be a non-trivia l Dirichle t character . Sho w tha t
for 0 ^ ( J < 1 we have
1
1 k+
(5.64) LW(<T, X)<q -''(logq)
for / c ^ 1 , th e implie d constan t dependin g onl y o n k. [Hint : Us e the boun d
l E n ^ x X W I ^min(x,g). ]
Theorem 5. 8 becomes:
T H E O R E M 5.24 . Let x modulo q be a primitive Dirichlet character, N(T,x)
the number of zeros p = (3 + i^ of L(s,x) ^ n the critical strip 0 ^ / 3 ^ 1 with
|7| ^ T. We have
7V(T,X) = - l o g | ^ + 0 ( g ( T + 3)),
7T lire
with an absolute implied constant.
In additio n t o th e general explici t formul a o f Theore m 5.1 1 , i t i s sometime s
more practica l t o work wit h th e following approximat e formul a fo r
A
ip(x,x) = ^ (n^(n)-
PROPOSITION 5.25 . For any character we have
(5.65) >(x,
1 X) = 5xx- ] T ?—± + o(|(log^) 2)
|Im(p)|<T
where S x — 1 if X — Xo and Sx = 0 otherwise, 1 ^ T ^ x and the implied constant
is absolute.
P R O O F . Thi s i s a specia l cas e o f (5.53 ) fo r primitiv e characters . However ,
(5.65) als o hold s a s it i s fo r x n ° t primitive . Indeed , suppos e %*(mo d q*) wit h
q* | q is the primitive characte r whic h induce s x(mo d q). The n
pa^.x,p\q
Moreover, th e zero s o f L(s,x) agre e wit h thes e o f L(s,x*) excep t fo r th e one s
coming fro m th e product
n v-p")
which ar e p = 27ril/\ogp. Henc e thes e zero s contribut e a t mos t
E E | £^L|«aog^)2.
p\q pill^T
logp ^
The abov e correction s ar e absorbe d b y th e erro r ter m i n (5.65 ) completin g th e
proof. •
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 121
By (5.65 ) w e derive a strong approximatio n t o
A
ip(x,q,a)= ^2 (n)
n=a (mo d q)
in term s o f zeros o f L-functions. Firs t b y the orthogonality o f characters w e have
ip(x, q, a) = —- Y^ X{a)ip{x, x)
(P[Q)
X (mo d q)
Applying (5.65 ) w e obtain
(5.66) ^ ( W )= ^ -^ ^ m ^ £^ l+ 0 (£(Iogarg)a)
WJ r\HJ x ( m o d q) L{p ,x)=0 F
|Im(p)|^T
for q ^ 1 , (a, q) — 1 and 1 ^ T ^ x , with a n absolute implie d constant .
Theorem 5.1 0 gives th e zero-fre e regio n fo r Dirichle t L- functions, excep t fo r
the possibl e exceptiona l zer o /3 X for a rea l primitiv e characte r x - Becaus e o f its
importance w e include her e th e slightly differen t traditiona l proof .
T H E O R E M 5.26 . There exists an absolute constant c > 0 such that for any
primitive Dirichlet character x modulo q, L(s, x) has at most one zero in the region
(5.67) a ^1 -
logq(\t\ + 3)'
rea
The exceptional zero may occur only if x is h an d it ^ s then a simple real zero,
say (3 X, with
(5.68) ! _ _ £_ </3x < !.
P R O O F . Assum e x i s non-trivia l (th e case o f ((s) i s similar , bu t i t need s a
change of vocabulary becaus e our definition o f L-functions doe s not include ((s+it)
with t ^ 0 for the reason tha t i t has a pole a t s = 1 — it ^ 1 ) . The n conside r the
L-function o f degree 8
L(/, s) = C(s) 3L(s + it, X)4L(s + 2% X2)
for som e t G R where X 2 is the primitive characte r modul o # 2 | q which induce s x 2 •
By the trigonometric inequalit y o f de la Vallee Poussi n
(5.69) 3 H + 4 R e ( ^ ) + Re(£ 2 ) = |z|( 3 + 4cos# + cos2(9 ) = 2|z|( l + cos(<9)) 2 ^ 0
for z = \z\ exp(i#), i t follows tha t R e (A/(n)) ^ 0 for all n > 1 such tha t (n , q) = 1 .
In fact , a simpl e computatio n usin g th e definition o f X2 show s tha t thi s remain s
valid fo r all n ^ 1 .
Let p = / ? + ry be a zer o o f L(/, 5) wit h / 3 ^ \. I f x i s not real, o r if 7 ^ 0 ,
then takin g t = 7 it follow s tha t L(/ , s) has a pole o f order 3 at s = 1 while / 3 is a
real zer o of order 4 of L ( /, s). Henc e b y Lemma 5. 9 we have
/3<1-T -4VT<1~
logq(/,p) log( 9 (|7| + 3))
122 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
with a n absolut e positiv e constan t c. I f \ 2 — 1 > P — P i s r e a l w e tak e t — 0, the n
L(f,s) ha s a pol e o f orde r 4 a t s — 1 . B y Lemm a 5. 9 again , L(f,s) ca n hav e a t
most fou r rea l zero s wit h multiplicit y satisfyin g th e abov e inequality , whic h mean s
that ft mus t b e a simpl e zer o o f L(s , x). •
REMARK. On e ca n als o recove r tha t / ? < 1 by Theore m 2.1 .
As a n applicatio n o f Theore m 5.1 3 , on e derive s th e followin g prim e numbe r
theorem .
THEOREM 5.27 . Let x modulo q be a primitive Dirichlet character. We have
r@x / / —c\0gX
(5.70) Y. x(n)Mn) = S xx - ^ + + O
0 I( xx exp (-7=i^f-)(log^4)
e x p( —
V
^P x VVlog x + logg / /
for any x ^ 1 , wt/ i som e absolute effective constant c > 0, the implied constant
being absolute.
For any q ^ 1 an d (a , g) = 1 we /iai/ e
S 4
(5.71) V A(» )= 4,- ^ ^ + 0 ( x e x p ( -!!° f )(log g) )
n = a (mo d g )
for x ^ 1 , where x (mo d q) is the possible exceptional real character modulo q and
fix the corresponding exceptional zero.
Have i n min d tha t ther e coul d b e on e o r tw o rea l primitiv e character s modul o
q, fo r example , i f q = Sr wit h r od d squarefre e w e hav e tw o suc h character s XsXr
and X4:XsXr- Nevertheless, a t mos t on e o f thes e ma y giv e a n exceptiona l zero . I n
fact, th e exceptiona l character s ar e very rare , an d th e exceptiona l zer o itsel f ca n b e
somewhat controlled .
T H E O R E M 5.28 . (1 ) (Landau) Let xi and X2 be distinct real primitive charac-
ters modulo q\ and q^. Assume Xi an d X2 have real zeros fii and fa respectively.
There exists an absolute constant c > 0 such that
(5.72) m i n O S u & K l -°
loggig 2
(2) (Siegel) For any primitive real Dirichlet character \ modulo q, we have
(5.73) P x < 1 -C M
for any e > 0 , with a constant c(e) > 0 depending only on e. If e < \, this constant
is ineffective, meaning it is not numerically computable.
PROOF. (1 ) Conside r th e L-funetio n o f degre e 4
(5.74) L(/ , s) = ((s)L(s, X i)L(s, X 2 )L(s, X1X2)
which ha s conducto r a t mos t (gi^) 2 - Sinc e th e X1 5X 2 are non-trivial , an d distinct ,
L(/, s) ha s a pol e o f orde r 1 at s = 1 . Moreover ,
~U, s) = E ( ! + X i ^ K 1 + X2(n))A(n)n-°
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S 123
has non-negativ e coefficients . B y Lemm a 5.9 , ther e exist s c > 0 suc h tha t L(/ , s)
has a t mos t on e rea l zer o with (3 > 1 — c/logq(/). Sinc e q(/ ) ^ 9(^i^2) 27 thi s give s
(5.72).
(2) Fo r th e proo f o f (5.73) , w e follo w th e nic e argumen t o f Goldfel d [Gol] .
Let xi-> X2 be (fo r th e moment ) arbitrar y primitiv e rea l character s modul o q\ an d
#2 an d conside r agai n th e L-functio n (5.74) . W e hav e A/(n ) ^ 0 . Le t r/(x ) b e a
function o n [0 , -foc[ suc h tha t 0 < rj ^ 1 , rj(x) = 1 for 0 ^ x ^ 1 and r)(x) = 0 fo r
x ^ 2 . Conside r th e sum s
Z x
^)= T,^ rl(^)>1 (sinc e A/(l) = 1 )
n
for | ^ / S ^ 1 and x ^ 1 . B y contou r integratio n w e hav e
Z(f3, x) = -^ij Hf, * + P)x sf)(s)ds = L(f, /3 )
(2)
+ L(l,xi)L(l,X2)i(l,XiX2)r?(l-/3)^ 1 - / 3 + ^ J L(f,s + p)f)(s)x s
ds.
By (5.63) , th e las t integra l i s < C Q i ^ ^2 • Thus w e hav e
L(f,p) + L(l, X i)L(l, X 2)L(1 , XiX2)fi(l - f3)x l-P > 1 + Ofatoxh-?).
Now assum e tha t /3 i > | i s a zer o o f L(s,xi) - The n i t follow s
L ( l , x i ) £ ( l , Xa)i(l , XiX2)r/(l - A)* 1 "' 3 1 > 1 + 0( qiq2x^).
4
We take x = c(qiq2) wit h c sufficiently large , s o the ter m o n th e lef t i s > | . Pro m
(5.64) w e have L ( l , x i ) < logg i an d L(l,xiX2 ) < ^ogq 1 q2. Moreover , 77( 1 —/3i) <
(1 — /3i) _ 1 sinc e 7/(5 ) ha s a simpl e pol e a t s = 0 wit h residu e 1 . Gatherin g thes e
estimates w e ge t ou r basi c lowe r boun d
1
(5.75) L( , X2) » ( 1 -/3i)(9i9 2 )- 4 ( 1 - / 3 l ) (log9i92)- 2 .
We will use (5.75 ) t o prov e (5.73) , usin g th e hypothetica l zer o Pi o f L(s , xi) a s
a too l t o contro l othe r characters . Le t 0 < e < ~. I f al l L(s,x) f° r X r e al modul o
# d o no t vanis h o n th e rea l segmen t s ^ 1 — £ , the n (5.73 ) i s obviou s (jus t tak e
c(s) = e) . Suppos e otherwis e tha t fo r som e x i modul o qi , th e L-functio n L(s,xi )
has a zer o s = f3\ ^ 1 — £. The n fo r an y othe r rea l primitiv e characte r X 2 modul o
#2, we can writ e (5.75 ) a s
L(l,x2)»g2-4£(logg2)-2
where th e implie d constan t depend s onl y o n £ , sinc e #i , x i a n d /^ i ar e fixe d onc e
e i s chosen . O n th e othe r hand , i f P2 i s a rea l zer o o f L(s,X2 ) w e hav e b y th e
mean-value theore m
_/32
L(1,X2) = ( 1 - 02)L'(lT, X2) « ( 1 - /?2)<?2 0og<Z2)2
for som e a wit h /3 2 < c r ^ 1 by (5.64) . Combinin g thos e tw o bounds give s 1 — fa ^>
g^"4e, henc e (5.73 ) afte r re-definin g e. D
124 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
REMARK. Essentiall y th e boun d (5.73 ) wa s first prove d wit h e = | b y Landa u
[La2] (als o no t effective) , an d wit h an y e > 0 shortly afterward s b y Siege l [Siel] .
The ineffectivenes s o f th e Siege l boun d (5.73 ) i s apparen t i n applications . Fo r
instance, i t i s no t possibl e t o us e i t t o solv e th e clas s numbe r on e proble m fo r
i_
imaginary quadrati c fields. Yes , b y Siegel' s lowe r boun d L(l,x ) > A 2 £ an d b y
the Dirichle t Clas s Numbe r Formul a (2.31 ) , o r (22.59) , w e ge t
(5.76) h(Q(V-A)) = ^ i ( l , X) » A * " '
for — A a negativ e fundamenta l discriminant , \ bein g th e correspondin g Kronecke r
symbol. Henc e h tend s t o infinit y a s doe s |A| . Bu t whe n computin g al l A wit h
h(Q(\/—A)) = 1 it i s no t possibl e t o kno w whe n t o stop . I n Chapter s 2 2 an d 23 ,
we us e muc h mor e sophisticate d idea s o f Goldfel d involvin g L-function s o f ellipti c
curves t o prov e a n effectiv e bound , althoug h muc h weake r (se e Theore m 23.2) .
Siegel's estimat e implie s th e Siegel-Walfis z theore m abou t prime s i n arithmeti c
progressions.
COROLLARY 5.29 . Let q ^ 1 and A > 0 . We have
(5.77) „ (w) = !g + 0(_i_),
< 5 - 78 ' ^ ;
«'«» = ^ )+ 0 ( ( i ^ F ) '
for any x ^ 2 . Moreover, for any primitive Dirichlet character % modulo q > 2, we
have
(5.79) ^ x ( p ) « v ^x(logx)-'4)
p^x
A
(5.80) Yl x(nMn) < y/qx(\ogx)-
n^x
for any A > 0 . The implied constants depend only on A, but are ineffective.
PROOF. I f q ^ (lo g x)A+1 on e gets (5.78 ) from th e prime number formul a (5.71 )
by employin g th e boun d (5.73) . Fo r large r q there i s nothing t o prove , becaus e th e
formula (5.78 ) i s trivial . The n (5.77 ) follow s fro m (5.78 ) b y partia l summation .
Similarly on e derive s (5.79 ) fro m (5.52 ) b y employin g th e boun d (5.73 )
The cas e o f (5.80 ) i s a bi t harder . On e coul d follo w th e proo f o f (5.51 ) usin g
I / - 1 instea d o f V jL, bu t ther e ar e seriou s complication s whic h occu r nea r multipl e
zeros or cluster s o f zeros (s o far w e cannot rul e ou t thi s scenario) . Therefor e w e ar e
going t o deriv e (5.80 ) directl y fro m (5.79) . Firs t w e writ e
Yl XWM W = 1 - Yl X{™>)v(m) ^ xip)
n^.x m^x Pm<P^^
where p m stand s fo r th e larges t prim e diviso r o f m . Le t r ^ 0 b e th e numbe r o f
prime divisor s o f m . The n th e summatio n condition s impl y m 1 //r ^ p m < x/m, s o
m < x r^r+l\ Hence , usin g (5.79 ) th e inne r su m ove r p satisfie s
A
^fqx / x\~ ^/qx /r + l\ A r(m) s/qx
y
*—* m m \ mJ m \V log x / 7 m(logx)
log2
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 125
because (r + l)A < C 2r = r{m). Finally , summin g ove r m trivially w e obtain (5.80 )
with extr a facto r (lo g a;)2. Thi s complete s th e proo f b y resettin g A. •
For Dirichle t characters , th e problem discusse d i n Proposition 5.2 2 becomes
that o f finding th e smallest prim e p wit h x(p) ¥" 1 f°r a non-trivia l characte r x
modulo q > 2, say p m in(x)- On e derive s directl y fro m variou s estimate s fo r short
character sum s th e followin g bounds :
+£
Pmmix) < Vqiogq, 4* , <f , (logg) 2
using respectively th e estimate s o f Poly a-Vinogradov, Burgess , Lindelof hypothesis ,
Riemann hypothesis . Actuall y w e only ge t (logg) 4 by (5.56), but wit h a little extr a
effort on e ca n squeez e fro m th e GRH th e bound (logg) 2 . On e ca n also d o better
unconditionally usin g Burgess's estimate enhanced b y combinatorial argument s (se e
e.g. [HI]) . Linni k proved (b y the large sieve inequality) tha t th e smalles t quadrati c
non-residue fo r a primitive rea l characte r modul o q ^ N, q prime, i s < N £ wit h at
most finitely man y exceptions , th e numbe r o f exceptions dependin g onl y o n e > 0.
See Sectio n 7. 4 for the proo f an d a n analogue fo r elliptic curves .
The Rieman n zet a functio n i s further discusse d i n Chapter s 1 0 , 24, 25, an d
Dirichlet character s i n Chapters 1 7 , 1 8 .
5.10. L- functions o f number fields .
Let K/Q b e a number field of degree d. Th e Dedekin d zet a functio n
at*) = £ (;va)- s = n a - (^prr1
defines a self-dual L- function o f degree d — [K : Q] wit h conducto r D = |dx| , the
absolute valu e o f the discriminan t o f K, and roo t numbe r + 1 . Th e gamm a facto r
is
dS/2r
(5.81) ^ ) - - (I)n+r2r(^)r2
where r\ i s the number o f real embedding s o f K an d r2 the number o f pairs of
complex embedding s s o that d = T\ + 2r2- Th e zet a functio n ha s simple pol e at
s — 1 with residu e
„ , N 2 ri (2yr) r2 hR
res (K(s) = - =,
5=1 wy/D
where h is the class numbe r o f K, R the regulator an d w the number o f roots of
unity. Fo r a proof, se e e.g . [La] .
The Ramanujan-Petersso n conjectur e i s obviously satisfie d fo r C K ( 5 ) - Th e an -
alytic conducto r is
q(CK, s) = D(\t\ + 3 P +r 2 ( | i + 1 | + 3f2 < D(\t\ + 4) d
and (\(CK) = 2 r i + r 2 3 r 2 D ^ 3 dD. Ther e exist s a n absolut e constan t c > 1 such tha t
D > cd~1 (se e Exercis e 1 0 below) s o we hav e logq(Cx ) x \ogD.
126 5 . CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
REMARK. Dedekin d zet a functions facto r a s products o f Artin L-function s (se e
Section 5.1 3) . Le t E/Q b e th e Galoi s closur e o f K, G = Gal(E/Q) an d H =
Gal(E/K). Le t TQ/H D e th e permutatio n representatio n o f G o n th e cose t spac e
G/H. Thi s is the representation induced by the trivial representation of H, henc e by
the invarianc e o f Arti n L-function s unde r inductio n w e hav e L{TQJ H,S) — (K(S)>
One ca n decompos e TQ/H m term s o f irreducibl e representation s o f G, TQJ H —
(Bpnpp, gettin g th e factorizatio n
n
(5.82) ( K(s) = l[L(s,p) ».
P
In cas e o f a failur e o f th e Arti n conjectur e ( a possibilit y w e canno t ye t rul e out )
this i s not a factorizatio n i n term s o f L-function s i n th e sens e o f Sectio n 5.1 .
If K/Q i s abelian , the n E = K, H — 1 and r G/H = 0 x wher e \ r u n s o v e r a
group o f Dirichle t character s s o tha t
CK(s) = l[L(s,x)
X
is a produc t o f L-function s fo r distinc t primitiv e Dirichle t character s o f conductor s
whose produc t i s D = \CIK\ and exactl y on e characte r i s trivial .
We deduc e fro m th e abov e an d th e result s o f th e previou s sections :
THEOREM 5.30 . Let K/Q be a number field of degree d. We have
(s-l)CK(s)^\Dsd\^-^2+e
for 0 ^ a ^ 1 , the implied constant depending only on d and e.
THEOREM 5.31 . Let K/Q be a number field of degree d. Let N K{T) be the
number of zeros p = (3 -f- ij of Or(s) in the critical strip 0 ^ / 3 ^ 1 with \*y\ ^ T.
We have
d
T DT
NK(T) = - lo g —~ d + 0(log DT d)
for T ^ 2, the implied constant being absolute.
One can use the analyti c properties of L-functions t o deduce information o n th e
discriminant o f numbe r fields. Fo r instance , th e explici t formul a yield s quit e goo d
lower bound s fo r D = |d^| . Thi s powerfu l metho d wa s implemente d b y Odlyzk o
[Od], Poito u [Poi] , Serr e [Se7 ] an d others . Her e i s a simpl e result .
T H E O R E M 5.32 . Assume GRH for the Dedekind zeta functions of number
fields. Then we have
(5.83) liminfi{log D+ (d-r 2 )^(i)+r2^(f)}^log7r,
as d = [K : Q ] = n + 2r 2 -5 - +00 .
5. CLASSICA L ANALYTI C T H E O R Y O F 1
L-FUNCTION S 2 7
PROOF. Appl y th e explicit formul a (5.44 ) t o (K(S) wit h
a x
(P(x) = x~h- ^ ^\
for som e a > 0. Althoug h thi s tes t functio n i s not of compact support , i t decay s
very fas t a t infinity , s o (5.44) holds . Th e Mellin transfor m i s
<P(± +lt) = \ e 4a
for t G 1. Not e tha t thi s i s positiv e (compar e wit h Propositio n 5.55 ) and tha t
4>(x) = x~ 1 cj){x~l). Sinc e (K(S) is self-dual w e ge t
11 1 p+°° i p v
X)A/f(n)0(n) + - ^ 0 ( p ) = -log£ > + - / <l,{x)dx + —J j-(s)4>(s)ds.
np - .
(5)
By positivity , w e derive
11 1f +°° f V
(5.84) -\ogD + -J 4>(x)dx+— I ^(#)d S )0.
By (5.81 ) an d the Fourier inversio n formul a w e hav e
(5 85) S
' h l ^ M*)ds = ~\ l0g7T + ^_lAj^ {lA + «)^(1 + i t ) d t
When a—>> 0, the function (27r) -1 ^)(^ +it) converge s to the Dirac delt a a t 0, hence
Therefore, dividin g (5.84 ) b y d, then lettin g d—>> +00 , an d then lettin g a —> 0 , w e
derive (5.83) . •
Similar argument s ca n be applie d unconditionall y wit h slightl y weake r resul t
using tes t function s a s constructed i n Proposition 5.55 .
EXERCISE 1 0 . Sho w tha t ther e exist s a n absolut e constan t c > 1 such tha t
D = \dx\ > cd for any number field K/Q o f degree d ^ 2 , and, in particular, D ^ 3
if K 7 ^ Q so if ha s at least on e ramified prime . [Hint : Procee d a s in Theorem 5.51
below.]
One canno t appl y directl y Theore m 5.1 0 becaus e th e Rankin-Selberg squar e of
CK(S) involve s convolution s o f Artin L- functions o f K whic h ar e not known t o be
entire, althoug h the y ar e conjectured t o be. However , on e can adapt th e traditional
arguments (a s in the proof o f Theore m 5.26 ) using (K(S) 3CK(S + ^ ) 4 C K ( S + 2i£ )
128 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
and deriv e th e zero-fre e region , the n th e prim e numbe r theore m fo r K. W e phras e
it i n term s o f
f lo g A^p i f a = p k fo r som e fc ^ 1 ,
\ 0 otherwise .
Thus thes e ar e coefficient s i n th e Dirichle t serie s
THEOREM 5.33 . Let K/Q be a number field of degree d. There exists an abso-
lute constant c > 0 such that (K(S) has no zero in the region
d2\ogD(\t\+3)d
except possibly a simple real zero (3 < 1 .
We have
XP
2
}] Atf(a ) = x - — + 0(vDx exp(-ccT \/logx) )
Na^x P
for x ^ 2, where c > 0 is an absolute constant, and the term —x^/p should be
removed if there is no exceptional zero.
Be awar e tha t th e zero-fre e regio n o f Theore m 5.3 3 i s no t necessaril y th e bes t
accessible. Indee d b y (5.82) , on e ca n factoriz e (K{S) an d i f on e know s tha t th e
factors ar e L- functions, the n zero-fre e region s fo r the m produc e on e fo r (K whic h
can b e muc h bette r becaus e o f a smalle r conductor . Fo r instance , i f K — Q(/ip ) i s
the fiel d o f p-th root s o f unity , w e hav e
(K(S) = ((S)
1 [L(S,X)
where the produc t i s over non-trivial Dirichle t character s modul o p. Thu s b y Theo -
rem 5.2 6 an d Theore m 5.2 8 ther e exist s a n absolut e constan t c > 0 such that 0<r( s)
has a t mos t on e simpl e rea l zer o (3 in th e regio n
r.
logp(|*| + 3 )
whereas d = p — 1 and D = |d# | = p p~2 s o Theore m 5.3 3 i s muc h weaker .
Confirming tha t th e zero s o f rea l Dirichle t character s ar e th e mos t difficul t t o
deal with, Star k [St3 ] has show n tha t i f K/Q doe s not contai n a quadratic subfield ,
then ther e i s n o exceptiona l zer o o f CK(S) .
For Dedekin d zet a functions , Propositio n 5.2 1 ca n b e improved .
PROPOSITION 5.34 . Let K/Q be a number field. Assume GRH holds for ( K(s).
We have
, . , N lo g 3D
s=i lo g log 3D
where D = \dic\ ^nd the implied constant is absolute.
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 129
P R O O F . Le t m b e the order o f vanishing a t \. W e argue a s in the proof of
(5.62). I n applying th e explicit formul a (5.45 ) a n additional ter m arise s fro m th e
simple pol e o f CK(S) a t s = 1 , hence we ge t
/»+o o
mh(0) + J2 h(^)= J 9(y)e y/2
dy
-2J2^7=-9(logNa) + 0(\og3D).
a vW a
Both sum s ove r th e zeros and the prime ideal s can be dropped b y positivity. Sinc e
h(0) x Y an d the integral i s 0(e Y) w e infer tha t mY < C eY + lo g 3D. Takin g
Y = log log 3D yield s th e result. •
Generalizing Dirichle t character s t o number fields ar e Hecke character s (Gros -
sencharakters). Thos e are defined thoroughl y i n Section 3.8 for imaginary quadrati c
fields, bu t her e w e use the basic terminolog y i n the general cas e (fo r characters of
weight 0) . See e.g . [La ] fo r details.
Let K/Q b e a number field an d £ a primitive Heck e Grossencharakte r modul o
(m, f2) wher e m is a non-zero integra l idea l in K an d ft is a set of real infinite place s
where £ is ramified. Th e Hecke L-functio n i s defined by
-1
L(Z,8) = l[(l-Z(p)Np->)
for R e (s) > 1 . Heck e showe d tha t L(£ , s) i s an L-function o f degree d — [K : Q ]
which i s entire i f £ ^ 1 and which coincide s wit h CK(S) for £ = 1 . The conducto r is
A — \dK\N K/qxn, wher e N K/Q i s the norm an d the gamma facto r i s given by
7«,*) = .r--/>r(ip'-|n|r(i±A)"+|01,
so q(£ ) ^ 4 d\dK\NK/qm = 4 d A. The root numbe r ca n be explicitly writte n a s a
normalized Gaus s su m for £.
One get s th e following generalizatio n o f Theorem 5.26 .
T H E O R E M 5.35 . Let K/Q be a number field, £ a Hecke Grossencharakter mod-
ulo (m , ft). There exists an absolute effective constant c > 0 such that !/(£,« ) has
at most a simple real zero in the region
dlogA(|t| + 3)'
The exceptional zero can occur only for a real character and it is < 1.
R E M A R K . Thi s sectio n highlight s th e fac t tha t L-function s o f variou s type s
are naturall y define d ove r an y number field K, usin g Dirichle t serie s ove r non-zer o
integral ideals , an d Euler product s ove r prime s ideals . Thu s ther e ar e Art in L-
functions ove r K, automorphi c L-function s ove r K, L-functions o f varieties ove r K,
whereas i n a sens e o f analytic numbe r theor y w e like t o see these a s L-function s
over Q . Thi s vie w i s possible becaus e an y L-function o f degree d ove r if , wher e
[K : Q ] = / , i s a n L-functio n ove r Q , a s define d i n Sectio n 5.1 , of degre e df.
For instance , (K(S) i s of degre e 1 over K. Thu s i t i s easy fo r the reader t o see
what result s ther e ar e for L-function s ove r numbe r fields, wit h uniformit y i n the
130 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
number field. Sometimes , however , th e proo f ha s t o b e adapted , a s fo r Theore m
5.33, becaus e th e Rankin-Selber g //-function s hav e a differen t for m dependin g o n
whether th e L-function s ar e see n ove r K o r ove r Q . Th e reade r shoul d hav e n o
difficulty supplyin g ne w proof s i f necessary . I t i s wort h pointin g ou t tha t fo r al l
known L- functions, th e conducto r o f a n L-functio n o f degre e d ove r K take s th e
form q = \dK\ dNK/qf fo r som e non-zer o integra l idea l f of K. Thu s th e conducto r
and th e analyti c conducto r contai n th e dependenc y i n th e discriminan t o f the bas e
field K.
Analytic numbe r theor y flourished beautifull y i n th e las t tw o centurie s i n th e
garden o f L- functions ove r th e rationals . Th e extensio n t o numbe r fields i s fruitfu l
as well, bu t i t need s t o b e mor e explored . T o encourage th e newcomer s w e end thi s
section wit h a fe w cut e application s o f th e theor y ove r th e Gaussia n domai n Z[i] .
There ar e fou r unit s i n Z[i], namely 1 , —1, i, —i. Ever y idea l o f Z[i] i s principal.
Consider th e "angle " character s
6(a) = ( Ar = eifcarga
if a = (a ) 7 ^ 0 fo r an y k = 0 (mo d 4) . Thi s i s a primitiv e Heck e characte r o f
conductor (1 ) . Th e associate d L-functio n
has conducto r D = 4 an d i t satisfie s th e self-dua l functiona l equatio n
A(5,&) = TT- ST(S + J|1 )L( 5 , &) = A( l - s, &)•
All characters are complex except the trivial character fo r k = 0 giving the Dedekin d
zeta functio n L(s,£o ) = C( s)L(s,X4)- Therefor e ther e i s n o exceptiona l zero , an d
the Prim e Numbe r Theore m (5.52 ) give s
tk c
J2 (^) = 26kU(x) + 0(\k\xe- ^)
where c > 0 an d th e implie d constan t ar e absolute .
As a n exercis e w e propos e t o deriv e fro m th e abov e formul a th e followin g
equidistribution propert y o f Gaussia n prime s i n sectors .
THEOREM 5.36 . Let 0 < / 3 - a ^ f and x ^ 2 . Then
c
\{TT e Z[i ] | K K x , a < argy r < /3} | = ^ ^ L i ( x ) + o(xe~ ^^^
where c > 0 and the implied constant are absolute.
[Hint: Approximat e the characteristic function o f [a , /?] modul o 2n by a smooth
function, periodi c of period 2-7r , and symmetric with respect t o x— > ±x±n. Expan d
this functio n int o Fourier serie s and appl y fo r x = arg7r . B y the symmetr y th e non -
zero Fourie r term s ar e o n frequencie s k = 0 (mo d 4) . A t eac h frequenc y appl y th e
Prime Numbe r Theorem. ]
See Sectio n 1 1 . 8 fo r a n applicatio n o f thi s resul t t o sho w tha t th e Wei l boun d
for th e numbe r o f points o n a curve over a finite field i s optimal i n horizonta l sense .
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S 131
EXERCISE 1 1 . Playin g with arg a an d |a | deriv e the equidistributio n o f Gauss -
ian prime s i n regula r domain s o f C .
EXERCISE 1 2 . Prov e tha t ther e ar e infinitel y man y pair s o f prime s p , q suc h
that pq = a 2 + b 2 with 0 < b < ( 3 log a)2 .
We recommen d W . Duk e [Du4 ] fo r a powerfu l treatmen t o f a variet y o f prob -
lems b y mean s o f Heck e characters .
5.11. Classica l automorphi c L- functions.
By classica l automorphi c form s w e mea n th e form s o n GL(2) , eithe r holomor -
phic o r eigenfunction s o f th e Laplac e operato r (th e Maas s forms) .
Let / b e a primitiv e holomorphi c cus p form , o f weigh t k ^ 1 , leve l q, wit h
nebentypus \ ( s e e Chapte r 1 4 for definitions) . Le t
n^l
be it s normalize d Fourie r expansio n a t th e cus p oo . The n
np
is a n L-functio n o f degre e 2 with conducto r q and gamm a facto r give n b y
(5.86, 7(/,„ = » - . r ( l ± f c i ^ ) r ( l ± i ^) _ f t p.)~r(. + ^i)
with Ck = 2^ 3~h^2y/7r b y th e Legendr e duplicatio n formula . Therefor e th e analyti c
conductor i s
q(/, S ) = g ( | S + ^ | + 3 ) ( | s + ^ | + 3 ) < g ( H + |fc|+3) 2 ,
q(/) = g ( ^ i + 3 ) ( ^ i + 3)x 9 fc 2 .
The roo t numbe r i s i kf](f) wher e rj(f) satisfie s Wf = rjf. I t i s known tha t L(f,s)
satisfies th e Ramanujan-Petersso n conjectur e b y wor k o f Delign e [Del ] fo r k ^ 2
and Delign e - Serr e [DeSe ] fo r k = 1 . Th e dua l for m / satisfie s
x n
f( ) = XWA/W, i f (n,q) = 1.
Have i n min d the n / ca n b e self-dua l fo r non-trivia l nebentypus . Fo r example , i f
£ i s a clas s grou p characte r o f a n imaginar y quadrati c field K — Q(>/!}) , the n th e
form / wit h coefficient s give n b y
A/(n) = Y, * W
Na—n
is self-dual o f weight k — 1, of level q = \D\ an d nebentypu s x = XD, the Kronecke r
symbol. Fo r thes e facts , se e Sectio n 1 4. 7 an d Propositio n 1 4.1 3 , an d th e reference s
there.
132 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
Similarly, le t (p b e a primitiv e Maas s for m o f leve l q with nebentypu s x whic h
is an eigenfunctio n o f the Laplac e operato r wit h eigenvalu e A = \ + r 2 , wher e r £ M
or ir e [0 , \{. Writin g it s Fourie r expansio n a t infinit y i n th e for m
ifi(z) = y/y^2 p(n)K ir(27r\n\y)e(nx),
we associat e wit h (p th e L-functio n
n^lp
of conducto r g , wit h gamm a facto r
where 5 = 0 if <p i s even an d S = 1 otherwise. Thu s th e analyti c conducto r i s
q(tp, s) = q(\s + ir\ + 3)(|a - ir\ + 3) < g(|s | + |r | + 3) 2 ,
q(^)=g(|r|+3)2xA(7.
We hav e p(n) — x( n)p(n) ^ ( n> Q) — 1- Se e Chapte r 1 5 for th e definitio n o f Maas s
forms. Althoug h L- functions an d Heck e operator s fo r Maas s form s ar e no t full y
presented i n thi s book , th e theor y i s very simila r t o tha t o f holomorphic forms ; se e
for instanc e [BG] , [Bu ] an d [DFI3] .
In contras t wit h holomorphi c forms , i t i s no t know n tha t th e L- functions o f
primitive Maas s cus p form s satisf y th e Ramanujan-Petersso n conjecture , althoug h
there i s n o doub t becaus e th e conjectur e fit s correctl y i n th e genera l Langland s
functoriality program . Th e curren t bes t know n estimat e i s
(5.87) Kl\P P\ < P 7/64
an
where 1 — p(p)p~ s + X{P)P~ 2S
— ( 1 ~ a pP~s)(l ~ PpP~ s), d correspondingl y
|Reir| ^ ^ , henc e
(5.88) A = i + r»>^=0.238...
This is due to Kim an d Sarna k [KiS] . For earlier results see e.g. [S2] , [Se8] , [DuI2] ,
[BDHI], [LRS] .
The origina l theor y o f Ranki n [Ra3 ] an d Selber g [S5] , extended b y W . L i [L] ,
and it s adeli c versio n [JPS ] sho w tha t an y tw o cus p forms , holomorphi c o r non -
holomorphic, admi t a Rankin-Selberg convolution . Th e convolutio n L ( / ® # , s) ha s
a simpl e pol e a t s — 1 if g = / an d i s entir e otherwise . Assum e / o f weigh t k an d
g of weigh t k ^ £ (by symmetry) . Th e gamm a facto r o f / 0 g i s give n b y
5. CLASSICA L ANALYTI C T H E O R Y O F 1
L-FUNCTION S 3 3
Let no w / hav e eigenvalu e ^ -f - r 2 , an d parit y 5 = 0 o r 1 , an d g hav e eigenvalu e
\+u2 an d parit y rj = 0 or 1 . The n
/s — i(r - u) + i/\ /s - i(r + u) + v\
where v = 0, 1 accordin g t o whethe r 8 = rj or not . Finall y assum e / i s holomorphi c
of weigh t k an d g i s a Maas s for m wit h eigenvalu e \ + r 2 an d parit y 5 . The n
•r(/8»,.)«-T(l±iiiii)r(l±^i)
These factor s ca n b e derive d b y pleasan t computation s fro m th e fact s convenientl y
gathered i n th e Appendi x t o [RS] .
Although th e loca l factor s o f L(f 0 g, s) a t prime s p | q(f 0 g) canno t b e
expressed i n a simpl e wa y fro m th e Heck e eigenvalue s A/(p) , X g(p) o r p/(p) , Pg{p),
this i s stil l th e cas e i f p divide s exactl y [<?(/) , #(#)]. I n thi s cas e on e ca n sho w tha t
the facto r ove r th e ramifie d place s i n (5.1 0) , o r rathe r 1 /H p(p~s), i s equal t o
(1 - a f(p)ag(p)p-s)(l - a f(p)(3g(p)p-s)(l - f3 f(p)ag(p)p-s)(l - p f(p)(3g(p)p~
s
)-
In othe r words , th e loca l facto r a t al l prime s i s given b y (5.9) . W e emphasize agai n
that thi s i s not tru e i n al l cases. Not e tha t sinc e p \ q(f)q(g), on e or bot h o f a/(p) ,
Pf(p) ( res P- a g(p)i Pg(p)) i s z e r o •
In term s o f Dirichle t series , thi s implie s tha t
s
L{f ®g,s)= L(2s , XfXg) Yl A/(n)A^(n)n"
if [q(f),q(g)] i s squarefree, wher e / an d g can b e either holomorphi c o r Maass form s
and Xfi Xg a r e th e respectiv e nebentypus . Not e tha t L(s,XfXg) ls ^ n e Dirichle t
series o f th e characte r XfXg eve n i f i t i s non-primitive , so , fo r instance , i f Xf =
Xg = 1 the n L(2s,XfXg) = Cq(f)q{g){^ s) 1 S th e Rieman n zet a functio n wit h th e
Euler factor s a t p \ q(f)q(g) removed . I n th e genera l case , th e loca l factor s a t
ramified prime s ca n b e describe d usin g th e loca l Langland s conjectur e fo r GL(2) ,
but thi s i s no t ver y explicit . Se e Sectio n 5.1 2 fo r th e relatio n betwee n L(f 0 / , s)
and th e symmetri c squar e L-function .
REMARK. (1 ) Th e convolutio n o f a Maas s for m an d a n holomorphi c cus p for m
of weight 4 is important i n the Phillips-Sarna k theor y o f spectral deformatio n [PS] .
(2) On e ca n als o conside r convolution s o f GL(1 ) wit h GL(2). Thos e ar e th e
twists define d i n Sectio n 1 4.8 , and th e theor y i s rather elementary . Note , however ,
that th e twiste d modula r for m
1 k )/2
/ 0 i> = J2 ip(n)\ f(n)n( - e(nz)
n
(in case of holomorphic / ) o f level gm 2 , wher e ip is modulo m , ma y not b e primitive ,
although i t i s alway s a n eigenfunctio n o f th e unramifie d Heck e operators . I n thi s
case th e Rankin-Selber g L-functio n L(f 0 -0 , s) i s th e L-functio n o f th e primitiv e
134 5 . CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
form wit h conducto r dividin g qm 2 whic h ha s th e sam e Heck e eigenvalue s a s / 0 ip
at almos t al l prime s (se e Exercis e 5 of Sectio n 1 4.7) .
As an example, le t / b e a theta functio n fo r a class group character (se e Sectio n
14.3) for a n imaginary quadrati c field K — Q(\J—D), an d \D th e Kronecker symbo l
for thi s field, s o that / E Si(D,XD)- The n w e have L(f ® XD-> S) = L(f, s).
More generally , on e ca n expres s th e dua l L-functio n L ( / , s) o f a modula r for m
/ havin g nebentypu s x as a twist
(5.89) L(f,s) = L(f®x,s)
(see (1 4.48)) .
It i s quite instructiv e t o fix a modular for m an d conside r it s famil y o f twist s
/ ® X by character s o f conducto r q, or b y rea l character s o f conducto r < Q.
For ease of reading we restate som e of the general results from previou s section s
now i n th e specifi c contex t o f classica l automorphi c L-function .
THEOREM 5.37 . Let f be a holomorphic primitive cusp form of level q and
weight k ^ 1 as above. We have
1
(5.90) L ( / , s ) « ( v^(|S|+fc)) -+£)
for \ < a < 1.
Let ip be a primitive Maass cusp form of level q with the Laplace eigenvalue
A = \ +r 2. We have
(5.9i) ^ . i X ^ H+ H))1-^
for I ^ a ^ 1 . In both cases, the implied constant depends only on e.
PROOF. Th e inequalit y (5.90 ) i s merel y th e re-formulatio n o f (5.20) wit h th e
claimed uniformity , becaus e L(f,s) satisfie s th e Ramanujan-Petersso n conjecture .
To prove (5.91 ) , on e use s instea d th e estimat e
(5.92) Yl \p( n)\2 < * ( * + ? + \r\) £
proved b y Iwanie c [1 7] , hence agai n th e implie d constan t i n (5.91 ) depend s onl y o n
e, no t o n (p. •
Next (5.33 ) give s
THEOREM 5.38 . Let f be a holomorphic primitive cusp form of level q and
weight k ^ 1 , N(T, f) the number of zeros p = /3 + ry of L(f, s) in the critical strip
0 ^ / 3 ^ 1 with \j\ < T. We have
N(TJ) = ^ l o g ^^ +0(log(i(|T | + fc)),
for T ^ 2 with an absolute implied constant.
Let (f be a primitive Maass form level q with eigenvalue A = \+r2, N(T, <p) the
number of zeros p =/3 + ry of L(f, s) in the critical strip 0 ^ / 3 ^ 1 with \j\ ^ T.
We have
N{T,v) = ~ log j f^ + 0(}ogq(T + \r\))
7r [Z7re)*
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 135
for T ^ 2 with an absolute implied constant.
Now we re-formulate ou r zero-free regio n theorems. Th e first result for modular
forms beyon d th e classical contex t o f Dirichlet character s (an d their extension s t o
number fields) i s due to C. Moreno [Mori] . W e ca n apply Theore m 5.1 0 for both
holomorphic an d non-holomorphic forms , gettin g th e following zero-fre e region :
T H E O R E M 5.39 . Let f be a holomorphic primitive cusp form of level q and
weight k > 1 . There exists an absolute constant c > 0 such that L(/ , s) has no zero
in the region
c
°^ ~log</(|t|+f c + 3)
except possibly a one simple real zero (3 < 1 in which case f is self-dual.
Let (f be a primitive Maass form of level q and eigenvalue A = \ + r 2. There
exists an absolute constant c > 0 such that L(ip, s) has no zero in the region
r
G> 1
logg(|*| + |r | + 3 )'
except possibly a one simple real zero /3 < 1 in which case f is self-dual.
Finally w e state th e prime numbe r theore m fo r classical cus p forms .
T H E O R E M 5.40 . Let f be a holomorphic primitive cusp form of level q and
weight k ^ 1 . We have
x (3
/
] T A / 0 ) logp = - — + 0(y/qx exp(-c v logx))
p^x
for x > 2, where c > 0 is some absolute constant and f3 is the exceptional zero of
L(/, s) if it exists and this term is omitted otherwise.
Let if be a primitive Maass cusp form of level q and eigenvalue A = | + r2. We
have
Y^p{p)^gp= -— + 0(y/qx exp(-c^logx))
for x ^ 2, where c > 0 is some absolute constant and (3 is the exceptional zero of
L(cp, s) if it exists and this term is omitted otherwise.
PROOF. Appl y Theore m 5.1 3 (see th e remark following ) an d (5.92 ) t o verif y
(5.48). •
Classical cusp forms giv e examples of families of L-functions wher e many result s
known otherwis e onl y unde r GR H ca n be proved unconditionally . Fo r instance, the
upper boun d o f Proposition 5.2 1 ca n be improved significantly .
Let q be a prime numbe r an d 5^(q)* th e set of primitive weigh t 2 cusp form s
of leve l q (see Chapte r 1 4) . W e have \S2(q)*\ ~ ^ . Defin e
(5.93) L(J 0(q),s)= n L(f,s).
fes2(q)*
This i s an L-function o f degree d = \S 2(q)*\ wit h q(/ ) = (1 2q)\ S2^*\, henc e d ~ §
and logq(/ ) ~ ^ l o g g . Therefor e i t follow s fro m Propositio n 5.2 1 an d GRH tha t
136 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
the orde r o f vanishin g o f L(Jo(q),s) a t s — \ i s < C (/(loggXloglogg) x . However ,
applying th e Petersso n formul a (1 4.60 ) on e ca n easil y sho w (stil l usin g GRH ) tha t
(5.94) or d L(J 0(q),s) < q.
s=l/2
In Chapte r 26 , w e sho w tha t thi s boun d i s sharp . Actuall y th e boun d (5.94 ) i s
established unconditionall y i n [KM2 ] (withou t recours e t o th e Gran d Rieman n
Hypothesis). Se e Sectio n 5.1 3 fo r link s wit h Hasse-Wei l zet a function s o f abelia n
varieties.
5.12. Genera l automorphi c L-functions .
In this section we talk briefly o f L-functions associate d with cusp forms o f higher
rank, becaus e the y ar e abou t t o mak e a permanen t habita t i n analyti c numbe r
theory i n year s t o come . Le t / b e a cus p for m o n GL(m)/Q wit h m ^ 1 . The n
Godement - Jacque t [GJ ] hav e defined a n L-function L(/ , s) o f degree m associate d
to / . Fo r m = 1 , / correspond s uniquel y t o a primitiv e Dirichle t characte r \
modulo q an d L(f,s) = L(s,x) - Fo r m = 2 , / correspond s eithe r t o a primitiv e
cusp form , o r t o a primitiv e Maas s cus p form . Henc e th e automorphi c L-function s
L(f,s) generaliz e bot h Sectio n 5. 9 an d Sectio n 5.1 1 . Sinc e / i s cuspidal , L(f 1 s)
is entir e excep t fo r L(/ , s) = ((s) wit h m = 1 . I n additio n L-function s o f non -
cuspidal automorphi c representation s ar e als o defined , bu t the y facto r a s product s
of cuspida l ones . W e recommen d Cogdell' s Chapte r 9 o f [BG ] a s a surve y o f L -
functions o n GL(m).
It i s expected that an y automorphic L-functio n satisfie s th e Ramanujan-Peters -
son conjecture . Jacque t an d Shalik a prove d tha t th e loca l component s satisf y
l a i(p)l < yjv a n d Re(ftj ) > — |. Thi s ha s bee n improve d b y Luo , Rudnic k an d
Sarnak [LRS] , namely |c^(p) | < p c an d R e (KJ) > —c with
The absolut e convergenc e o f L(f,s) fo r R e (s) > 1 , du e t o Jacquet-Shalik a an d
Shahidi [JS] , follow s fro m th e existenc e o f Rankin-Selber g L-function s (se e th e
proof o f (5.48)) .
A usefu l featur e o f th e associatio n o f L-function s wit h automorphi c theor y i s
that th e roo t numbe r e(f) i s expressed a s a produc t o f loca l roo t number s
e(f) = T\s P(f), wit h e p(f) = 1 if p i s unramified .
p
In fact , s p(f) depend s o n th e choic e o f a non-trivia l additiv e characte r ip p o f Q p ,
but th e produc t i s independen t o f suc h choices .
For an y tw o cus p form s / an d g o n GL(d) an d GL(e), th e L-functio n L(f 0
g, s) exist s i n ou r sens e b y th e far-reachin g generalizatio n o f th e classica l Rankin -
Selberg metho d du e t o Jacquet , Piatetski-Shapir o an d Shalik a [JPS] . Moegli n an d
Waldspurger [MW ] prove d tha t L(f<8>g, s) i s entire i f g / / , an d ha s a simple pol e
at s = 1 if g = / . Th e boun d q(f <8> g) \ q(f)eq{g)d i s due t o Bushnel l an d Henniar t
[BH].
According t o conjecture s belongin g t o th e Langland s Program , th e "mos t gen -
eral" L-functio n shoul d i n fac t b e a produc t o f function s belongin g t o th e clas s
of L-function s o f cus p form s o n som e GL(m)/Q . Othe r part s o f th e Langland s
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 137
conjectures impl y tha t th e Ramanujan-Petersso n conjectur e shoul d hol d fo r an y
automorphic //-function . Moreover , th e conjecture s impl y th e existenc e (an d de -
termination, i n principle ) o f a n equidistributio n la w fo r th e coefficient s Xf(p) (se e
the discussio n o f th e Sato-Tat e Conjectur e i n Chapte r 21 ) .
The convexit y boun d fo r automorphi c //-function s ca n b e mad e uniform .
T H E O R E M 5.41 . Let L(f,s) be an automorphic L-function of a cusp form of
degree d other than C(s) . Then we have
(5.96) L ( / , s ) « q ( / , S r+£
for a ^ \, where a = m a x ( | ( l — a),0) , the implied constant depending only on e
and d.
PROOF. Thi s follow s fro m (5.1 2 ) wit h X = 1 , Proposition 5. 4 an d th e estimat e
y
£\*f(n)\2«x(xq(f)y
for x ^ 1 with a n implie d constan t dependin g onl y on e and d. Thi s i s the analogu e
of (5.92 ) prove d b y Molten i [Mol] . •
The mos t commonl y use d L-function s o f degre e fou r i n analyti c numbe r the -
ory ar e th e Rankin-Selber g convolution s L(f ® g,s) o f tw o classica l cus p form s
(described i n Sectio n 5.1 1 ) . Clos e relative s t o thi s ar e th e symmetri c squar e an d
adjoint squar e L-function s o f classica l cus p forms . Le t / hav e nebentypu s \- Th e
symmetric squar e o f / ( / i s either holomorphi c o r real-analytic ) i s defined b y
2 1
(5.97) L(Sym /, a) = L(f ® /, s)L(s, x ) "
whereas th e adjoin t squar e i s define d b y
2
(5.98) L(Ad /, s) = L(f ® /, s)C(^)- 1 .
Thus bot h ar e L-function s o f degre e three , wit h conductor s q(Sym 2 f) = q(f ®
fMx)-1 I <7(/)2 an d q(Ad 2f) = q(f ® /) | q(f) 2. I f / ha s rea l coefficient , the n
the symmetri c an d adjoin t square s coincide . On e show s tha t th e roo t numbe r fo r
the adjoin t squar e i s e(Ad 2 / ) = e(f ® /) = 1 . Bot h hav e gamm a facto r equa l t o
< y ( / ® / , s ) / r ( s / 2 ) , henc e
(5.99) 7 (Ad 2 / , s) = 7 (Sym 2 / , s) = - - 3 s / 2
r ( ~ ) r ( i ± | ^ ) r ( ^ )
if / i s holomorphi c o f weight k an d
(5.100) 7 (Ad 2 / , s) = 7 (Sym 2 f,s) = ^3 s / 2
r ( | ) r ( | + t r ) r ( | + ir)
if / i s real-analyti c Maas s for m wit h eigenvalu e A = \ + r 2 (not e th e parit y doe s
not matter) . Th e loca l Eule r factor s fo r L(Sym 2 / , s) an d L(Ad 2 / , s) ar e give n b y
2
(1 - a{p) p-°)-\l - X {P)P-S)-\1 - (3( P)
2
P-Sr\
(i-ffip-rHi-p-r^i-Hftp-r1
respectively i f p \ q(f), wher e a(p) an d /?(p ) are the local roots a t p for L(/ , s). Not e
that fro m th e twis t formul a (5.89 ) o r the relatio n a(p)/3(p) = \{p) for P unramified ,
138 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
one ca n also relat e th e adjoin t an d symmetric square s a s twist s (convolutio n o f
GL(3) b y GL(1 )):
Lp(Ad2f,s) = L 2
p(Sym f®x,s).
If p divide s exactl y g ( / ) , then thi s formul a i s stil l valid . I n particular , fo r q(f)
squarefree a simple computatio n show s tha t
L(Sym 2 /, s) = L(2s, X 2) £ \f(n 2
)n-,
L(Ad 2 /, a) = L(2s, x2) £ X(n)\ f(n
2
)n-S.
At th e point 5 = 1 the adjoint squar e L-functio n take s a special valu e
(6.101) L ( A d V . l ) = resL(/8/,S)= V o l ^ H )
where w — (ATr)k/Y{k) o r w — cosh(7rr)/27 r dependin g o n the weight o r eigenvalue
of / respectively , whil e (/ , /) i s the square o f the Petersso n nor m o f / (se e (14.11)
for holomorphi c cus p form , otherwis e i t i s the standar d L 2 -norm i n th e Hilber t
space of automorphic function s o n ro(g)\H; se e Chapter 1 5 and also (26.47)) . Thi s
formula shows , i n particular, tha t L(A d / , 1 ) > 0 , and it i s well exploite d i n the
spectral analysi s o f cusp form s o n GL(2) (se e for example [Roy]) .
Theorem 5.1 0 gives a zero-fre e regio n fo r automorphic L-function s o f any de -
gree.
T H E O R E M 5.42 . Let L(f,s) be an automorphic L-function of a cusp form of
degree d. There exists an absolute constant c > 0 such that L(/ , s) has no zero in
the region
except possibly a one simple real zero fif < 1. For this exceptional zero to exist, it
is necessary that f be self-dual.
PROOF. B y the properties o f Rankin-Selber g convolution s o f cus p form s re -
called above , al l assumptions o f Theorem 5.1 0 are valid, whic h give s the result. •
R E M A R K . On e ca n show tha t fo r an automorphic L-functio n L(/ , 5), the aux-
iliary L-functio n L(g,s) use d i n the proof o f Theorem 5.1 0 has the property tha t
Ag(n) ^ 0 for all n, no t only unramifie d ones . Th e proof o f this depend s o n the
local Langlands conjectur e fo r GL(d) over p-adic fields, proved by Harris and Taylo r
[HT]. Roughl y speaking , fo r an y prime p , ther e exis t continuou s representation s
pp : W(Q P) - » GL{d,C) ) wher e W i s the Weil-Deligne grou p o f Q p , a varian t of
the Galoi s group , suc h tha t th e p-factor o f L(/, s) is given by
1 s
Lp(f,s) = det(l-p p(Frp)p- y
and tha t o f the Rankin-Selber g convolution s use d i n defining g ar e
s
LP(f ® /, s) = det( l - (p p<g> p p)(Frp)p- )-\
s 1
LpU ® /, s) = det( l - {p p ® / 9 p )(Pr p )p- )- ,
LP(f®f, s) = det(l - (p p ® p p )(Pr p )p- s )- 1 ,
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 139
where p p is the contragredient representatio n o f pp, and Fvp is the geometric Frobe -
nius a t p (se e Tate [Tat ] for more details) ; mor e precisely , Fr p is meant t o act on
the invariant s unde r inerti a for all the above representations. Thi s show s that, wit h
the sam e convention , th e p-factor o f L(g, s) is
det(l-(p0p)(Frp)p-s)"1
where p is the representation
P = ie(Pp®|-| i*)e(/5p®|-rit).
But w e hav e quit e generall y
2
Tr(p®p)(g) = \Trp(g)\
for an y g and p, and this give s A g(pk) ^ 0 for all k ^ 1 .
This kin d o f very dee p arithmetica l argumen t depend s cruciall y o n the auto-
morphic natur e o f L(/, s) and is completely beyon d reac h whe n usin g onl y the bare
data o f the Dirichle t serie s or Euler produc t L(f, s).
Statements lik e Proposition 5.2 2 for automorphic L-function s ar e usually calle d
"multiplicity on e theorems". Th e most basi c result , calle d th e strong multiplicit y
one principle , i s due to Jacquet an d Shalika [JS] :
PROPOSITION 5.43 . Let L(f, s) and L(g, s) be two automorphic L-functions of
cusp forms ofGL{m). If the local components of f and g coincide at all but finitely
many primes, then f — g.
P R O O F . Suppos e the local components coincide for p <£ S, where S is finite (an d
contains th e ramified prime s fo r / o r g). Sinc e th e local factor s fo r the Rankin -
Selberg convolutio n L(f <S> g,s) hav e n o poles a t s = 1 , the order o f the pole of
L(f <3>g,s) i s the same a s that fo r the product ove r prime s p ^ S. Bu t
l[Lp(f®g,s) = Y[L p(f®f,s)
Pis p$s
by assumption . Therefor e L(f ® g, s) has a pole a t s = 1 , which implie s / = g. •
For classica l modula r forms , thi s i s Theore m 1 4.1 2 . I f th e local component s
satisfy |c^(p) | < p 1 / 4 , then th e remar k afte r Propositio n 5.2 2 gives a n explici t
version. Suc h a boun d i s know n i n ful l generalit y onl y fo r n — 2, but usin g the
logarithmic derivative of L(f®g, s) instead of the squarefree par t o f the convolution ,
Moreno [Mor3 ] gav e a general explici t boun d o n the number o f primes required .
Theorem 5.4 2 can be applied i n particular t o the symmetric an d adjoint squar e
//-functions. On e can also appl y i t t o the Rankin-Selberg L-function s L(f ® g, s)
for an y two classical modula r form s sinc e Ramakrishnan [Ra ] has shown tha t thos e
L-functions o f degre e 4 are automorphic . Exercis e 4 show s tha t on e can in fac t
prove non-vanishin g result s fo r Rankin-Selberg L-function s withou t modularity .
T H E O R E M 5.44 . (1 ) Let f and g be classical primitive modular forms, either
holomorphic or real-analytic. There exists an absolute constant c > 0 such that
L(f 0 g, s) has no zero in the region
logq(/®0)(|*|+3)
140 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
except possibly a one simple real zero /? < 1 . For this exceptional zero to exist, it is
necessary that f (g ) g be self-dual.
(2) Let f be a classical primitive modular forms, either holomorphic or real-
analytic. There exists an absolute constant c > 0 such that L(Sym 2 / , s) and
L(Ad2 / , s) have no zero in the region
r
a> 1
logq(/)(|t|+3)'
except possibly a simple real zero (3 < 1. If f is non-dihedral, then the exceptional
zero does not exist.
PROOF. A S mentioned , th e first point follow s directl y fro m th e fact tha t L(f ®
g,s) i s an automorphic L- function.
For th e second, L(Sy m / , s) is automorphic [GJ] , so we need onl y prov e tha t
there i s no exceptional zer o i f / i s non-dihedral, whic h i s due to Goldfeld , Hoff -
stein an d Liehma n [GHL ] (since / i s self-dua l i f ther e i s a n exceptiona l zero ,
L(Ad 2 /, s) = L(Sym 2 /, s)). T o this en d consider th e L-functio n
L(g, s) = as)L(Sym2 / , s)2 L(Sym 2 / ® Sym2 /, s)
= C(*)£(Sym 2 /, s)3 L(Sym 2 Sym 2 /, s).
The las t L-functio n i s a specia l cas e o f the symmetric squar e o f a cus p for m o n
GL(3) an d has been show n b y Bump an d Ginzburg t o have a simple pol e a t s — 1
if / i s non-dihedral (thi s als o follow s b y the first formul a sinc e Sy m / i s a cus p
form o n GL{3)). Henc e L(g,s) ha s a pol e o f order 2 at s = 1 , whereas an y real
zero o f L(Sym 2 /, s) is a zer o o f L(g, s) of order ^ 3 . B y easy loca l computation s
one check s tha t A g(n) > 0 for (n,q(g)) = 1 , hence th e result follow s fro m Lemm a
5.9. * •
REMARK. Th e formul a (5.1 01 ) als o prove s tha t L(Ad 2 /, 1 ) ^ 0 .
The followin g i s a usefu l corollar y (compar e wit h th e lower bound s fo r clas s
numbers).
COROLLARY 5.45 . Let f be a classical primitive modular form, holomorphic
or real-analytic. We have
2
(5.103) | | / | | »Vol(r0(9)\M)g-£
for any e > 0 . The implied constant depends only on e and the weight k or the
eigenvalue A whatever is relevant.
If f is not dihedral, for instance, if f has trivial nebentypus and squarefree
conductor q, then
2 _ Vol(r 0 ((?)\H)
(5.104) ll/l f»
lo g 2g
where the implied constant is effective and depends only on the weight k or the
eigenvalue A whatever is relevant.
5. CLASSICA L ANALYTI C T H E O R Y O F L
1 -FUNCTION S 4 1
P R O O F . Th e firs t par t i s prove d b y Hoffstei n an d Lockhar t [HL] , an d fol -
lows fro m (5.1 01 ) th e bound (3 ^ 1 — c(e)q~ £ fo r the possible exceptiona l zer o of
L(Ad 2 /, s), whic h implie s a lowe r boun d fo r L(Ad 2 /, 1). Th e proof i s similar t o
that o f Siegel's boun d (5.73) , an d the implied constan t i s non-effective .
If / i s not dihedral, the n ther e i s no exceptional zer o an d one derives (5.1 04 )
by th e same metho d fro m th e zero-free regio n o f Theorem 5.44 . Finally , on e ca n
easily se e that a dihedral for m wit h trivia l nebentypu s ha s a non-squarefree level. •
Generally speakin g the problem o f exceptional zero s for the L-functions o f cusp
forms o n GL(m) wit h m ^ 2 turns ou t to be accessible, th e hardest an d stubborn
case bein g wit h m = 1 for real Dirichle t characters . Th e above resul t o f Goldfeld ,
Hoffstein an d Liehman ha s been extende d t o the following theore m o f Banks [Ba] :
P R O P O S I T I O N 5.46 . If L(f,s) is the L-function of any cusp form on GL(3),
then the exceptional zero does not exist for L(f, s).
5.13. A r t i n L-functions .
In thi s Sectio n w e survey, mostl y withou t proofs , som e o f the definitions an d
properties o f Arti n L-functions . I n contras t wit h th e previous sections , th e fac t
that thos e are indeed L-function s a s defined i n Section 5. 1 remains open . However ,
there is great confidenc e an d evidence that th e relevant conjecture s ar e correct, and
together wit h th e analytic theor y develope d i n thi s chapter , an d possibly GRH,
this provide s heuristi c evidenc e fo r results tha t sometime s ar e accessible b y othe r
methods. Se e e.g. [CF ] for more abou t Arti n L-functions . T o avoid confusio n i n
terminology, w e will sometime s refe r t o the L-functions define d i n Sectio n 5. 1 as
analytic L-functions .
Artin L-function s ar e associated t o a continuous finit e dimensiona l Galoi s rep -
resentation p : Gal(Q/K )— > GL(d, C ) wher e K/Q i s a numbe r field . Defin e the
Artin L-functio n o f p by the following Eule r produc t ove r prim e ideal s o f K:
L(p, s) = IJdet(l- p(Fr p)Np-)-\
p
where Fr p is the Probenius conjugac y clas s at p, acting o n the invariants unde r the
inertia grou p a t p . Thu s L(p , s) i s an Euler produc t o f degree deg(p ) ove r if , o r
deg(p)[K :Q ] over Q.
Artin L-function s enjo y a formalis m paralle l t o tha t o f representations . Fo r
instance, th e dua l L-functio n i s als o th e L-functio n o f the contragredien t repre -
sentation p , an d th e Rankin-Selber g L-functio n o f p\ an d p2 is th e L-functio n
L(pi ® P2, s) of the tenso r powe r pi ® pi (thu s th e notation i s compatible). More -
over, L(p i 0 p2 , s) = L(pi, s)L(p2, s) , an d if p is induced fro m a representation p'
from a finite inde x subgrou p correspondin g t o a finite extensio n K' JK, the n
(5.105) L(p,s ) = L(p / , 5 ).
In particular, takin g K — Q and K' — K an y number field , i t follows tha t an y Arti n
L-function o f degree d can be seen a s one defined ove r Q of degree deg(p)[K : Q] .
For th e trivia l representatio n o f Gal(Q/K) , w e get L(l,s ) = Or(s) . Mor e
generally i f deg(p ) = 1 , class-fiel d theor y show s tha t t o p i s associate d a uniqu e
Hecke character £ such that L(p , s) = L(£ , s). S o it is an analytic L-functio n i n this
case (se e Section 5.1 0) .
142 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
It i s conjectured tha t fo r an y p , L(p,s) i s an analyti c L- function. Sinc e L(p i ®
p2,s) — L{p\, s)L(p2, s) , one ca n restric t attentio n t o irreducibl e representation s p ,
and on e expect s tha t L{p,s) i s i n fac t automorphic .
It i s know n tha t L(p , s) ha s meromorphi c continuatio n t o C an d al l o f th e
conditions se t i n Sectio n 5. 1 ar e satisfied , excep t tha t som e o f thes e L- functions
may hav e pole s i n th e critica l stri p (becaus e w e stil l d o no t kno w i f L(p,s) i s
holomorphic i n th e stri p 0 < R e (s) < 1 ) . Th e gamm a facto r ca n b e describe d a s a
product ove r infinit e place s v o f K o f loca l gamm a factor s j v(p, s). Le t a v b e th e
Frobenius conjugac y class , whic h i s o f orde r 2 i f v i s a rea l plac e tha t extend s t o
two comple x place s o f L , an d i s = 1 otherwise. The n
7r_ds/2r(-j r f J i f v i s a comple x place ,
lv(p,s) = {
*""da/2r(!) r
(^T^)^ ^ i s a rea l place ,
where d = deg(p) i s th e dimensio n o f p , an d d~ ,d+ ar e th e multiplicities , respec -
tively, o f th e eigenvalu e + 1 , —1 fo r p(cr v).
The conducto r q{p) i s of th e for m
d
q(p) = \d K\ ^NK/Qf(p)
for som e integra l idea l f(p ) o f K, whic h i s called th e Arti n conductor , define d usin g
ramification group s (se e Serr e [Se4]) . Th e roo t numbe r e(p) ca n b e i n a sens e
defined fro m th e functiona l equation , bu t ther e i s als o a n a prior i definitio n du e
to Dwork , Langland s an d Delign e a s a produc t o f loca l factors . Thu s th e analyti c
conductor satisfie s
d e :Q
q(p,s) < q(p)(\s\ + 4 p « ^ = ( | ^ | ( | s | +4)^ l) d e ^)iVf( / 3 ).
Prom th e Braue r inductio n theore m (se e e.g . [Se5]) , w e ca n writ e
Trp = ^T^TrTT ;
where rii G Z an d 71 ^ is a representatio n induce d b y a n abelia n characte r & o f a
finite (i n fact , cyclic ) extensio n Ki/Q. B y th e invarianc e unde r inductio n (5.1 05 )
we hav e
(5.106) L(p, s) = l[ Lfa, s) n> = H L&, s)" *
ii
and th e cas e o f Heck e character s (ove r numbe r fields) implie s tha t L(p , s) i s mero -
morphic wit h pole s onl y i n th e critica l strip , an d satisfie s a functiona l equatio n
relating s an d 1 — s. Tha t thi s functiona l equatio n i s th e righ t on e involvin g th e
gamma facto r abov e an d th e Arti n conducto r follo w fro m th e formalis m o f both ,
especially th e behavio r unde r direc t sum s an d induction .
To sho w tha t L(p , s) doe s no t hav e extr a poles , w e nee d t o prov e th e
ARTIN CONJECTURE . Let p be an irreducible, non-trivial, Galois representa-
tion of a number field K/Q. Then L(p , s) is entire.
Even th e cas e d — 2 and K = Q i s still no t completel y settled . Mor e precisely ,
if a n irreducibl e representatio n p : Gal(Q/Q ) —> GL(2 , C) i s given , the n it s imag e
in PGL(2 , C) ca n b e classified a s either a dihedral group , th e alternatin g grou p A±,
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 143
the symmetri c grou p S4 o r th e alternatin g grou p A$. I n al l cases , excep t A5 , it i s
known that L(p , s) i s holomorphic (eve n automorphic), an d ther e ar e partial result s
in the A§ case. Th e dihedral case is classical, the A4 case is due to Langlands, th e S4
case t o Tunnell . Th e latte r i s crucia l i n Wiles' s proo f o f Fermat' s Grea t Theorem .
In a certain sense , th e Arti n conjectur e i s true o n average : fo r instance , i f K/Q
is a fixed finite Galoi s extension , the n
(5.107) ( K(s) = t(s)l[L(p,s)
where p runs ove r th e non-trivia l irreducibl e representation s o f Gsl(K/Q) (se e
(5.82)). Sinc e bot h ((s) an d CK(S) a re know n t o hav e onl y a simple pol e a t s = 1 ,
the possible poles of L(p, s) must b e compensated b y zeros for other representations .
The followin g i s als o useful .
COROLLARY 5.47 . Let p be a non-trivial irreducible Galois representation of
K/Q. Then L(p , s) has neither poles nor zeros on the line R e (s) = 1 .
PROOF. Thi s follow s b y (5.1 06 ) becaus e th e L- functions o f non-trivia l Heck e
Grossencharakters ar e entir e an d d o no t vanis h o n th e lin e R e (s) = 1 (Theorem
5.35). •
Assuming th e Arti n conjecture , on e ca n deduc e quit e interestin g arithmeti c
corollaries usin g th e propertie s o f analyti c L- functions.
CHEBOTAREV DENSIT Y T H E O R E M . Let L/K be a finite Galois extension of
degree d of number fields. Let C C Gal(L/K) be a union of conjugacy classes and
l
rl>{x,C)= J2 °ZNS>-
Np^x
FrpGC
We have as x—> +00 ,
\C\x
^ ^ ' ^ I G a K W r
/ / GRH holds for Artin L-functions, then for x ^ 2,
desip)[K:Q]
(5.109) i>{x,C) = | G J ^ ) | + o(V£(log*) E M Mx <l(p)))
where p runs over the irreducible representations of Gel(L/K), and
The implied constant is absolute.
PROOF. Althoug h th e proo f belo w depend s o n th e Arti n conjecture , (5.1 08 )
is know n unconditionall y an d (5.1 09 ) doe s no t nee d th e Arti n conjectur e (se e e.g .
[Se6]).
144 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
Let G — Gal(L/K). Function s o f th e for m Trp , fo r p irreducible , for m a n
orthonormal basi s o f the vecto r spac e o f conjugacy-invariant function s o n the finit e
group G wit h th e scala r produc t
</,<?> = ^ £/(*)<?(*) .
Hence th e characteristi c functio n 5c o f C C G i s give n b y
where c p i s a s i n (5.1 1 0) . Fo r th e trivia l representation , w e hav e c\ = |C|/|G| , an d
thus (5.1 08 ) follow s fro m th e Arti n Conjectur e an d th e Prim e Numbe r Theore m
5.13 fo r L(p,s). Similarl y (5.1 09 ) come s fro m (5.56 ) an d (5.1 1 0) . •
The estimat e (5.1 09 ) ca n b e mad e explici t i n variou s ways . Sometime s on e
can comput e c p explicitly . Otherwise , a simpl e genera l boun d o n GR H i s given fo r
instance b y
::
^ , C ) = |J^^+o(v^(logx)(v1 ci(log a;[* < M ) + l 0g|dL|)).
To se e this , notic e tha t b y orthogonalit y w e hav e
and i t i s well know n tha t
£ d e g ( p ) 2 = |Gal(L/tf)| .
P
Hence b y Cauchy' s inequalit y w e deriv e
G L K
] P \c p\ logfr**^***!) < [K : QKlogs) ( ^^/K)\) ' \ ^ I ^
= [K:q]{\ogx)y/\C\.
On th e othe r hand , sinc e |Tr(p(x)) | < deg(p) , w e have \c p\ < deg(p) , henc e
^ \c p\ log q(p) ^ lo g JJ q(p) d^ = lo g | dL|
PP
by compariso n o f th e conductor s i n th e identit y o f L-function s
For othe r estimate s an d arrangement s o f th e erro r ter m o n GRH , an d als o fo r
unconditional estimate s wit h explici t erro r terms ; see [Se6 ] an d [LO] .
Here i s a sampl e applicatio n o f th e Chebotare v Densit y Theorem .
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 145
PROPOSITION 5.48 . Let n ^ 2 and N > 1 . Let D N be the set of monic
polynomials f G 1\X] of degree n whose coefficients have absolute value < N.
Then we have
^ EE log p = e„ + o(l)
f ha s a roo t m o d p
as x— > o o and N—> • oo , w/iere e n = 1 — | H h (—l)n _ 1 ^ fno£ e £/m £ en— > 1 — ^
as n ^ ooj .
PROOF. W e have |L>w | = iV n . Gallaghe r ([Ga4] , se e Exercise 3 of Chapte r
7) ha s shown tha t th e set CN of elements / € D^ suc h tha t eithe r / i s reducibl e
or th e Galois grou p o f the equation f{x) — 0 is not the full symmetri c grou p S n
satisfies
\cN\^Nn~hogN.
By trivia l estimate , element s i n CN do not contribute t o the limit above .
Let / £ CAT . Choosin g a roo t a o f / i n C, le t L — Q(a) an d K th e Galoi s
closure o f L. W e have Gal(tf/Q ) - S n an d # = Gal(tf/L ) ~ S n _i. B y Galoi s
theory, w e see that
1
{p | / ha s a root modul o p] = {p \ Fv p E I ) a' Ha] = { p | Pr p £ J?}
where H i s the set of element s i n S n fixing a t leas t on e element. Precisely , thi s
holds u p t o finitely man y prime s a t whic h th e localization s o f Z[X]/(f) an d the
ring o f integers o f L differ . B y the Chebotarev Densit y Theore m w e have
I fr\
V " logp= — vx + o{x)
p^x
f ha s a roo t m o d p
By exclusion-inclusion , i t follow s tha t \H\/n\ = e n , hence th e result. D
See e.g. [Se6 ] for other application s o f the Chebotarev Densit y Theorem .
5.14. L-function s o f varieties .
The situatio n wit h L-function s o f algebraic varietie s ove r Q is even les s under -
stood tha n tha t fo r Galois representations. Le t us call them "arithmeti c geometry "
L-functions, o r just "geometric " L-functions . W e will not discuss thing s i n general,
but mentio n the important case s of algebraic curve s and abelian varieties . Th e only
satisfactory theor y exist s for the intersection o f these tw o cases, namely th e abelian
varieties o f dimension on e (that i s the elliptic curves) .
Let E/Q b e a n ellipti c curve . Modularit y o f E, prove d b y Wiles , Taylor -
Wiles an d Breuil-Conrad-Diamond-Taylo r [W] , [TW], [BCDT], mean s tha t th e
Hasse-Weil zet a functio n o f E, normalize d t o hav e critica l stri p 0 ^ R e (s) ^ 1 ,
corresponds t o a modula r for m o f weigh t 2 an d leve l equa l t o th e conducto r o f
.E, henc e i s an L-functio n accordin g t o Sectio n 5.1 1 . Se e Section 1 4. 4 for a mor e
detailed statement , a s well a s a description o f the root numbe r i n most cases .
Conjecturally, th e "modularity " extend s t o th e Hasse-Wei l zet a function s o f
arbitrary smoot h projectiv e curve s ove r numbe r fields. Le t C/Q b e such a curve .
Then fo r all primes p \ TV , wher e N ^ 1 is some integer , th e curve ca n be reduce d
146 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
modulo p to a smooth projective curv e Cp/¥p. A s described i n Exercise 2 of Chapte r
11, th e loca l zet a functio n
\Cp(Fp>)\Tk>
Z(CP,T) = exp(^_ , ^ J
is a rationa l functio n o f the for m
Z(C
1 P,T)= { 1_TP){ _pT)
for som e moni c polynomia l P p wit h P p(0) = 1 , o f degre e 2g wher e g > 0 i s th e
genus o f C. Writ e
pp(T)= n (i-«i(p)p5T) .
Then th e Rieman n Hypothesi s fo r curve s ove r finite field s (se e Chapte r 1 1 ) show s
that |<x?(p) | = 1 . Th e Hasse-Wei l zet a functio n o f C outsid e iV , analyticall y nor -
malized, i s defined b y th e Eule r produc t
1
LN(C,S) = JJ ( 1 - a 1 (p)p-sy1 •••(! - ^ ( r i f T
It i s conjecture d tha t afte r multiplyin g b y appropriatel y define d Eule r factor s a t
p | iV, th e complet e L-functio n i s entire , i t i s necessaril y self-dua l an d th e gamm a
factor shoul d b e
S
7(c, s)=n-°n 2+Jm|+lr=^(2^-^(3+\y
for som e constan t c g > 0 (se e (5.86)) . Not e tha t th e Rieman n Hypothesi s fo r th e
curves C p ove r finite fields mean s tha t thi s L-functio n satisfie s th e Ramanujan -
Petersson conjectur e (u p t o th e ramifie d places , bu t i t i s know n tha t thos e stil l
satisfy th e require d condition) . I n genera l fo r an y g > 1 , thi s i s no t ye t know n
except fo r ver y specia l cases .
REMARK. I n arithmeti c geometr y i t i s mor e practica l t o leav e th e loca l root s
otj{p) a s the y occu r naturall y withou t normalization , i.e . t o writ e
PP(T)= J ] ^-PoiP)T)
with \/3j(p)\ = y/p. Thu s
L(c,,)=n^(p-s+i/2)-i=
In thi s notatio n th e critica l lin e i s R e (s) = 1 and th e functiona l equatio n relate s
the value s a t s an d 2 — 5 . Th e sam e remar k refer s t o abelia n varieties .
Abelian varietie s ar e higher-dimensiona l generalization s o f ellipti c curves , see n
as algebrai c groups . A n abelia n variet y A/Q i s a smoot h projectiv e variet y wit h a
given rationa l poin t 0 G A(Q) an d additio n an d invers e morphism s p : A x A— > A,
i : A —> A whic h ar e map s o f algebrai c varietie s ove r Q an d satisf y th e axiom s o f
a commutativ e grou p (wit h i givin g th e opposite) . I f th e dimensio n o f A i s one ,
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 147
this correspond s exactl y to elliptic curve s E/Q wit h the "usual " chord-and-tangen t
description o f the group la w (see [Sil] and Sectio n 1 1 .8) .
There exist s a n integer N ^ 1 such tha t fo r p \ iV, the reduced variet y A p/¥p
is smooth . Wei l showe d tha t th e local zet a functio n
Z(AP,T) = e x p (^ j. J
is a rationa l functio n o f the for m
Z ( A p i J
' "P pfi(T) • • • P P,23(T)
where P pj ar e polynomials. Wei l prove d tha t P p^ i s of degree 2g and of the for m
PP,I(T)= n (i-«i(p)p5r )
with |oj i (p) I = 1 , whic h i s th e loca l Rieman n Hypothesis . I n additio n on e ca n
arrange th e cti(p) suc h tha t ai(p)ai+ g(p) — 1 for 1 ^ i ^ g. Th e first polynomia l
P p? i determine s P pj a s the j - th exterior powe r
PPAT)= n (i-«ii(p)---«ii(p)p2T) .
In particular , P P:0 = 1 - T , P p^g = 1 — P gT. Th e globa l zet a functio n o f A/Q
outside N i s
LN(A, s) = l[ ( 1 - a i ^ p - ) - 1 • • • (1 - ^ ^ ( p j p - ) - 1
.
pfJV
Again it is conjectured tha t afte r multiplyin g b y appropriately define d Eule r factor s
at p | TV the complete product L(A, s) is an L-function, whic h is entire and self-dua l
and ha s gamma facto r
j(A, s) = 7 r - » T (! + \) 9T(S- + l) 9 = c 9 (27r)-*T( S + \Y
for som e constan t c g > 0 (see (5.86)). I t satisfie s th e Ramanujan-Petersson conjec -
ture, sinc e the correctly define d ramifie d factor s als o satisfy th e required condition .
For g = 1 , this conjectur e i s true b y the modularit y o f elliptic curve s ove r Q .
For an y g > 1 , except fo r som e case s involvin g so-calle d CM varieties, th e analyti c
continuation an d functiona l equatio n remai n ope n problems . Th e conductor q(A)
(which divide s N) o f this L-functio n i s predicted a s part o f the conjecture an d is a
close relativ e o f the Artin conducto r fo r Arti n L-functions .
Note th e resemblanc e betwee n th e shap e an d definitio n o f th e L-function s o f
curves an d abelia n varieties . I t i s not a coincidence . Th e theor y o f the jacobia n
variety o f curves associate s t o ever y smoot h projectiv e algebrai c curv e C o f genu s
g ove r a field k a n abelia n variet y J(C)/k o f dimensio n g, calle d it s Jacobian , i n
such a wa y tha t fo r a curv e C/Q w e hav e L(C , s) = L(J(C),s). Fo r instance ,
the L-functio n L(Jo(g),s ) o f (5.93 ) i s the L-functio n o f the Jacobia n Jo(q) o f the
modular curv e Xo(q) = ro(#)\H, b y work o f Eichler an d Shimura .
148 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
REMARK. A n alternativ e definitio n o f the loca l factors o f L(A, s) (o r of L(C, s)
for a curve ) i s give n i n term s o f th e £-adi c Tat e modul e o f A: le t
Te(A) = QimA[r])®Qi,
n
then th e Galoi s grou p o f Q act s o n Tt(A) an d fo r an y prim e p ^ £ where A ha s
good reduction , w e hav e
P p ,i(T) = d e t ( l - T P r p | T £(A))-\
denoting b y Fr p a Frobeniu s conjugac y clas s a t p. Dually , P P:i i s give n i n term s o f
the etal e cohomolog y grou p i f 1 ^ , (Q^ ) (se e Sectio n 1 1 .1 1 ) b y
1
P P t l (T) = d e t ( l - T < 7 p | T^A))-
where o~ v i s th e invers e o f Fr p , becaus e Ti{A) i s th e dua l o f H l(A, Qi).
The Mordell-Wei l theore m state s tha t i f A/Q i s a n abelia n variety , the n th e
group o f rationa l point s A(Q) i s finitely generated . Th e computatio n o f it s ran k i s
one o f th e grea t problem s o f arithmeti c geometry . W e hav e th e famou s
BIRCH AN D SWINNERTON-DYE R CONJECTURE . Let A/Q be an abelian variety.
Then
rank A(Q) = or d L(A, s).
Of cours e thi s conjectur e assume s tha t L(A,s) i s a n L-function , s o L(A, | ) i s
defined. I f the Birch and Swinnerton-Dye r conjectur e holds , one can get informatio n
on th e ran k o f abelia n varietie s usin g analyti c methods . Ofte n thi s suggest s ver y
interesting problems . Fo r instance , b y Propositio n 5.2 1 w e derive :
COROLLARY 5.49 . Let A/Q be an abelian variety of dimension g ^ 1 with
conductor q. Assume that its Hasse-Weil zeta function is an L-function and that
the Birch and Swinnerton-Dyer conjecture holds for A(Q). Then we have
logq
rank A (Q) <C
log(|logg)'
with an absolute implied constant. Note that q > 3 9 by Theorem 5.51 below.
This ca n b e turne d ove r t o giv e a lowe r boun d fo r th e conducto r o f a n abelia n
variety wit h a give n rank , fo r instanc e fo r ellipti c curves .
COROLLARY 5.50 . Assume that the Birch and Swinnerton-Dyer conjecture
holds for elliptic curves over Q. Then if E/Q is an elliptic curve of rank r ^ 1 and
conductor q, we have
q^>rcr
where c > 0 is some absolute constant and the implied constant is also absolute.
It woul d b e ver y interestin g t o prov e an y unconditiona l estimat e comparabl e
to thos e tw o corollaries usin g algebrai c methods . I n Chapte r 2 6 we study th e orde r
of vanishin g o f th e zet a functio n (ofte n calle d th e analyti c rank ) fo r th e abelia n
varieties Jo(q)/Q, obtainin g quit e precis e results .
As fo r th e discriminant s o f numbe r fields i n Sectio n 5.1 0 , on e ca n us e L-
functions t o investigate , conditionally , th e conducto r o f elliptic curves , or of abelia n
varieties ove r © .
5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S 149
T H E O R E M 5.51 . Let A/Q be an abelian variety over Q of dimension g ^ 1
such that L(A, s) is an analytic L-function of conductor q = q(A). Then we have
q ^ e 1 ,2g, and in particular q > 3.
See [Mes] for even stronger bounds . Th e bound q > 1 is known unconditionally ,
by th e celebrate d theore m o f Fontaine [Fon ] accordin g t o whic h ther e i s no abelia n
variety ove r Q everywhere unramified . Fo r a variety o f dimension g — 1 , this mean s
there i s n o ellipti c curv e define d ove r Q whic h i s unramifie d everywhere , i.e . it s
discriminant i s divisibl e b y a t leas t on e prime . I n thi s cas e i t ca n als o b e prove d
without appealin g t o L-functions o r modularity b y showing that th e discriminant o f
the Weierstras s equatio n (1 4.1 8 ) (wit h integra l coefficients ) canno t b e equa l t o ± 1
(see [Sil] , ex . 8.1 5) . I n th e highe r dimensiona l case , n o suc h explici t reformulatio n
is accessible , an d Fontaine' s unconditiona l proo f i s muc h mor e involved .
P R O O F O F THEORE M 5.51 . Appl y (5.24 ) t o L(A,s) a t s = a > 1 , then tak e
the real part. Becaus e L(A, 5) is self-dual, th e constant b disappears by using (5.29) .
Moreover, b y th e positivit y
Re - -, ^ > 0,
P?
we derive th e inequalit y
1V V
- lo g q - g log 2TT + g — (a + \ ) + —
Next b y th e Ramanujan-Petersso n boun d w e hav e
^W)|<<^(a).
Hence w e ge t
^ l o g O log2T T - V -(o + A ) - £ »
for an y a > 1 . Takin g a = 2.3 , one check s tha t th e righ t sid e i s 0.62... > 0.6 , henc e
q ^ e 1 -2^ an d q > 3 . •
REMARK. Sinc e e 2A > 1 1 , and i t i s known (se e (1 4.41 ) ) tha t th e ellipti c curv e
with smalles t conducto r i s
E : y2 + y = x3 - x 2
with conducto r 1 1 , the argumen t implie s tha t 1 1 is i n fac t th e smalles t conducto r
of a n abelia n variet y ove r Q , assumin g thei r zet a function s ar e L-functions .
5.A. Appendix : comple x analysis .
In thi s sectio n w e presen t i n concis e for m th e concept s an d result s o f comple x
analysis tha t ar e use d i n this chapter . W e omit mos t proof s bu t refe r t o [Tl] , [Ru ]
or [Ahl ] (amon g man y references) .
150 5. CLASSICA L ANALYTI C T H E O R Y O F L - F U N C T I O N S
A.l. Functions of finite order.
DEFINITION A . l . Le t / : C -> C be an entire function . The n / i s said t o be
of finit e orde r i f there exist s f3 > 0 such tha t
|/( S )|«exp(| S n
for s e C . I f one ca n take an y /3 > 1 , / i s said t o be of order ^ 1 , and if no /? < 1
is possible, the n / i s said t o be of order 1 .
Let / b e a meromorphic functio n o n C. The n / i s of order < 1 if there exist s
entire function s g and h of order ^ 1 such tha t f = g/h.
THEOREM 5.52 . Let f be an entire function of order 1.
(1) We have
/w=- p n( i -j) e ' / "
uniformly on all compact subsets ofC, where r is the order of the zero of f at s = 0
and p runs over zeros of f different from 0 .
(2) For any e > 0, the series
A. 2. The Phragmen-Lindelof principle for a strip.
The Phragmen-Lindelof "principle " i s a generic term for theorems which exten d
the maximu m modulu s principl e t o various infinit e region s o f the complex plane .
Only th e case o f strips a ^ a < b interests us.
T H E O R E M 5.53 . Let f be a function holomorphic on an open neighborhood of
a strip a ^ a ^ b, for some real numbers a < b, such that \f(s)\ < C exp(|s|^) for
some A ^ 0 and a ^ a ^ b.
(1) Assume that \f(s)\ ^ M for all s on the boundary of the strip, i.e. for
a = a or a = b. Then we have |/(s) | ^ M for all s in the strip.
(2) Assume that
\f(a + it)\^Ma(l + \t\) a,
\f(b + it)\^Mb(l + \t\)P
for t e R. Then
\f(<r + it)\ < M^Ml- t(a\l + W'W+M-'W
for all s in the strip, where £ is the linear function such that £(a) — 1, £{b) = 0.
A.3. The Perron formula.
Let h(x) b e the function give n by
1i fx>1 ,
l
h(x) = < 22 i f x = 1,
0 iix<l.
5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S 151
PROPOSITION 5.54 . We have
i r rc+iT
c+l
1
ds
s /
(5.111) - ^
27r / x — = h(x) + 0[
% J c_iT s V ,T|logx|
for any x > 0 , x ^ 1 , T > 0 and 0 < c ^ 2 , m'£/ i an absolute implied constant. If
x = 1, the factor | logx| z s omitted.
A.4 • The Stirling formula.
The Stirlin g asymptoti c formul a
*•"*> r 1
W-(T) i
(i)'( +0
(R))
is vali d i n th e angl e |args | ^ TT — e wit h th e implie d constan t dependin g o n e.
Hence fo r s = a -f ^, £ ^ 0 , a fixe d
(5.113) I > + it) = V2^(it)a-2 e~2 1*1 ( ^ ^ {l + °(jji) } •
Let
be a gamm a facto r a s in (5.3 ) with Re(/«j ) > —1 . I t ha s no poles i n Re (s) ^ 1 .
Recall th e corresponding conducto r a t infinit y i s
d
qoo(s) = IJ(|t + /^| + 3).
3=1
Let -\ ^ Re(s) ^ 2 . Pro m (5.1 1 2 ) we deduce
(5.114) |7(s)ini * + ^l = M s ) ^ + a ^ ^
ence w e
whereA: = Re Ylnj- H have
S
7(1 ~ s) _ fMh-*)TT + KJ
(5.H5) ^^tM^Tlj^-.
1
7(5) U - 3 S + Kj
Moreover, usin g th e recurrence T(s + 1 ) = sT(s) w e derive
(5.116) ^ ( s ) - £ __L_«i ogqoo (5).
The implie d constant s i n (5.1 1 4 ) an d (5.116) ar e absolute.
A. 5. Existence of test functions.
For application s o f the explicit formula , especiall y i n the absence o f the GRH ,
it i s useful t o know that ther e exist tes t function s wit h certai n positivit y properties .
152 5. CLASSICA L ANALYTI C THEOR Y O F L-FUNCTION S
PROPOSITION 5.55 . There exist positive, non-zero, C°° functions 7 7 on [0,+oo [
with compact support in [e _ 1 , e] such that rj(l) = 1 , r)(0 ) > 0 , n(x) = r](x~ 1 ) for all
x > 0 and moreover either fj(it) ^ 0 for t G R o r R e (r)(s)) ^ 0 /or all s G C wit/ i
ki<i.
SKETCH O F PROOF . Defin e f(y) — rj(e y), whic h i s a n even , compactl y sup -
ported, C°° functio n o n R , an d
V(») = I f(y)e sydy
JR
for al l s G C, henc e
lty
m= / f(y)e dy
JR
Re(fi{s))= / f(y)e aycos(ty)dy= / f(y)cosh(ay)cos(ty)dy.
JR JR
By th e firs t formul a t o hav e r)(it) ^ 0 i t suffice s t o choos e / ^ 0 suc h tha t it s
Fourier transfor m i s positive. Th e Fourie r pai r (4.83) , suitabl y smoothed , doe s th e
job.
From th e secon d formul a an d th e maximu m modulu s principl e fo r harmoni c
functions, th e inequalit y R e (r)(s)) ^ 0 will hold fo r \a\ < 1 if and onl y i f the Fourie r
transform o f th e eve n functio n g{y) = f(y) cosh(y ) i s non-negative . Conversely , i f
a functio n g (even , smoot h an d compactl y supported ) wit h thi s propert y i s given ,
a suitabl e tes t functio n7 7 is easil y obtaine d b y reversin g thi s procedure . No w i f go
is an y smoot h compactl y supporte d positiv e functio n o n R , the n th e convolutio n
square g = go * go wil l work , sinc e g = p§ .
The suppor t o f r\ an d it s valu e a t 1 can b e easil y adjuste d b y homogeneity . •
Functions o f thi s typ e wer e constructe d b y Poito u an d other s fo r th e purpos e
of obtainin g lower-bound s fo r th e discriminan t o f numbe r field s [Poi] . Not e i n a
way the y ar e simpl y bette r behave d version s o f the basi c functio n f(s) = (a — s)~ l
for a > 1 (se e (5.28 ) an d th e proo f o f Theore m 5.51 ) . Se e als o [PP ] fo r anothe r
construction o f a tes t functio n wit h usefu l propertie s i n applications .
A.6. Landau's lemma.
LEMMA 5.56 . Let X n ^ 0 for all n ^ 1 . Suppose the series
D(s) = J2 X nn~s
converges absolutely for all s with R e (s) > o~o, but not for any s with R e (s) < o"o;
in other words, &o is the abscissa of absolute convergence of D(s). Then s — cr 0 is
a singularity of D(s), i.e. D(s) cannot be continued analytically to a neighborhood
ofo-Q.
http://dx.doi.org/10.1090/coll/053/07
CHAPTER 6
ELEMENTARY SIEV E METHOD S
The siev e method s originate d fro m work s o f Vigg o Bru n o n th e Goldbac h an d
Twin Prim e Conjecture s [Brl] , [Br2] . Moder n siev e method s appl y t o a grea t
variety o f problems, an d whe n combined wit h technique s o f analytic numbe r theor y
they ar e ver y advance d indee d (cf . [FI1 ]) . I n this chapte r w e provid e onl y basi c
results from siev e theory. Firs t w e establish by refining th e combinatorial argument s
of Brun th e so-calle d Fundamenta l Lemma . The n w e develop a n uppe r boun d siev e
of Selber g an d illustrat e it s powe r b y a few popula r applications . Mor e complet e
treatments o f siev e method s ca n b e foun d i n th e book s o f Halbersta m an d Richer t
[HaRi] an d Greave s [Gr] .
6.1. Siev e problems .
Let A = (a n) b e a sequence o f non-negativ e numbers . Th e ultimat e questio n
is ho w ofte n thes e number s ar e supporte d o n primes ? Fo r example , i f A is the
characteristic functio n o f th e shifte d prime s
1 if n = p + 2,
[ 0 otherwise ,
then w e ar e askin g ho w ofte n i s p + 2 prime? I n thi s case , th e heuristi c argument s
described i n Sectio n 1 3. 1 yield th e followin g asymptoti c formul a
2
(6.2) 7T 2(x) = |{ p ^ x ; p + 2 is prime} | ~ Cx(\ogx)~
where C is a positive constan t give n b y
(6.3) C = 2J j ( i _ _ I _ ) =1 -32032.. .
Hence ther e "are " infinitel y man y twi n primes , bu t w e d o no t hav e rigorou s proo f
of thi s ol d conjectur e b y an y method .
The siev e method s alon e (i n thei r classica l format ) ar e incapabl e o f selecting
primes, howeve r the y yiel d satisfactor y result s fo r slightl y modifie d problems . Le t
P b e a se t o f primes , z ^ 2 an d
(6.4) P(z)=Hp.
p£V
Now w e see k estimate s fo r th e sifte d su m
(6.5) S(x,z) -£
(n,P(z))=l
153
154 6. E L E M E N T A R Y SIEV E M E T H O D S
When V i s the se t o f al l prime s an d z — y/x, the n S(x, y/x) agree s wit h
(6.6) S(x) = J2 ap
p^x
up t o a fe w term s a n wit h n < y/x. Th e siev e method s produc e uppe r an d lowe r
bounds fo r S(x,z) o f correc t orde r o f magnitud e a s fa r a s z ^ x a, wher e a > 0 is
a smal l positiv e constant . Not e tha t i f n ^ x ha s n o prim e divisor s les s tha n x a,
then n ha s a t mos t [1 /a ] prim e divisors , s o one ca n sa y n i s almos t prime .
The remarkabl e universality o f sieve is that i t applie s to quite general sequence s
A = (a n). Al l we need ar e asymptotic s fo r th e partia l sum s
a
(6.7) A d(x)= Y, "
n-^x
n=0 (mo d d)
with d | P(z), d < y. W e assum e tha t
(6.8) A d(x)=g(d)X + r d(x)
where g(d)X i s a n expecte d mai n ter m an d rd{x) i s a n erro r ter m whic h w e thin k
of a s being relativel y smal l i f d < y. I n th e mai n ter m X i s approximately equa l t o
(6.9) A(x) = J2a n
n^.x
so g(d) stand s for th e density of the masses a n attache d t o n = 0 (mo d d). Thinkin g
of th e divisibilit y b y distinc t prime s a s independen t events , i t i s not to o surprisin g
that w e assum e tha t g(d) i s a multiplicativ e functio n suc h tha t
(6.10) 0^g(p) < 1 if peV.
Of course , th e approximatio n o f typ e (6.8 ) i s no t unique , howeve r t o ge t goo d
results i n practic e ther e i s no t muc h roo m t o choose g{d) an d X from . Fo r som e
prime modul i q on e ma y hav e a goo d approximatio n (6.8 ) wit h g(q) = 1 , whic h
means tha t almos t al l mas s come s fro m th e a n wit h n = 0 (mo d q). I n thi s case
the chanc e o f finding term s a n supporte d o n primes i s slim; tha t i s why w e exclud e
such prim e modul i fro m th e se t V. Sinc e w e us e (6.8 ) onl y fo r d | P(z), w e ar e
free t o modif y th e multiplicativ e functio n g(d) arbitraril y a t d prim e t o P(z) an d
indeed fo r notationa l simplicit y w e se t
(6.11) g(p)=0ifp£V.
6.2. Exclusion-inclusio n scheme .
Detecting the condition (n , P(z)) = 1 by means of the Mobius inversion formul a
1i fm = 1 ,
(6.12) S(m) = J2^ = { 0 ifra^l ,
d\m ^
the su m (6.5 ) unfold s int o
d
S(x,z) = J2[YJ M ))an-
n<;X d\n
d\P(z)
6. ELEMENTAR Y SIEV E METHOD S 155
Changing th e orde r o f summatio n w e obtain th e Legendr e formul a
d
(6.13) S(x,z)= ^2 »( )Mx)-
d\P(z)
Next introducin g (6.8 ) w e ge t
(6.14) S(x, z) = XV{z) + i?(x , z)
where
(6.15) V(z)= J ] (l-g(p))
p\P(z)
and
r
(6.16) R(x,z) = J2 ^x)-
d\P(z)
On probabilisti c ground s on e ma y expec t tha t th e produc t V(z) represent s th e
true densit y o f the mas s comin g fro m a n wit h n ^ x havin g n o prime divisor s p < z
in V. However , thi s i s almos t neve r correc t excep t fo r z smal l relativel y t o x i n
the logarithmi c scale . I n othe r words , th e divisibilit y b y distinc t prime s ar e no t
completely independen t event s whe n thes e prime s ar e relativel y large . Technicall y
speaking th e proble m i s that , i f z i s large , th e remainde r su m R(x, z) ha s man y
terms, s o i t exceed s th e expecte d mai n ter m XV(z).
In orde r t o reduc e th e numbe r o f term s i n th e Legendr e formul a (6.1 3) , an d
consequently i n th e remainde r ter m (6.1 6) , Bru n truncate d th e Mobiu s functio n
n{d) t o tw o sets , sa y V + an d V~, an d considere d th e incomplet e convolution s
+
(6.17) S (n) = £ //(d )
d\n
dev+
(6.18) 6~(n)= J2 /*(<* )
d\n
dev~
in plac e o f 8(n). H e los t th e exac t propert y (6.1 2 ) bu t maintaine d th e inequalitie s
(6.19) 5-(n) < 5 ( n ) ^ 5+(n)
for al l n ^ 1 . Henc e on e obtain s th e lowe r an d uppe r bound s
(6.20) XV-{z) + R-(x, z) < 5(x , z) < XV+(z) + R +{x, z)
where
± X
(6.21) V {z) = Y, ^(d),
d\P(z)
± X
(6.22) R (x,z) = Y drd(x)
d\P(z)
and Aj , A^ " denote th e Mobiu s function o n the sets V +, V respectively . Assumin g
that
(6.23 ) V+,V~ c[l,y)
156 6. ELEMENTAR Y SIEV E METHOD S
we can estimat e th e remainde r sum s i? ± (x, z) b y
(6.24) R(x,y)= ] T M*)| .
d<y
d\P(z)
Sometimes w e can exploi t th e intrinsi c structur e o f the siev e weight s A J t o ge t
estimates fo r K^^x^z) whic h ar e bette r tha n tha t fo r R(x,y). Thi s i s du e t o a
possible cancellatio n o f th e erro r term s rd(x) i n R ±(x,z)) bu t w e d o no t ventur e
into thi s sophisticate d are a i n thi s boo k (se e e.g . [1 9] , [1 1 0]) .
For notationa l simplicit y w e ar e writin g squarefre e number s a s th e produc t o f
distinct prime s enumerate d i n decreasin g orde r
(6.25) d = Pi'-'Pr wit h Pi>--->p r -
Put
+
(6.26) D = {d = p i • • -p r ; Pm <Vm fo r al l m odd }
(6.27) V~ — {d = pi • • -p r ; Pm < Vm fo r al l m even }
where y m ar e suitabl e parameters . B y conventio n bot h set s V + an d V~ contai n
d = 1 . W e star t fro m th e recurrenc e formul a
(6.28) V(z ) = 1 - £>(p)V(p) .
p<z
Iterating thi s we derive by the well-know n exclusion-inclusio n procedur e th e follow -
ing identities :
(6.29) V(z) = V +(z)- J2 V
n(z),
n od d
(6.30) V(z) = V-(z)+ Y, V n(z)
n eve n
where
(6.31) V n(z)= J2---Y, 9(pi---Pn)V( Pn).
yn^Pn<---<Pi<z
Pm<ym,'m<n->'nT>=n (mod 2 )
Dropping th e non-negativ e term s V n(z) i n (6.29 ) an d (6.30 ) i t follow s tha t
(6.32) V-(z)^V(z)^V+(z).
Now w e ca n easil y verif y th e lowe r an d th e uppe r boun d siev e condition s (6.1 9) ;
indeed (6.32 ) become s (6.1 9 ) o n takin g P(z) = n an d g(d) = 1 .
Prom no w o n th e combinatoria l sieve s A + = (Aj ) an d A - = (A^~ ) depen d o n
the truncation parameter s y m alone . Bru n trie d variou s parameters, hi s best choic e
was ym = y a(3 fo r suitabl e constant s 0 < a < 1 < /? . W e choos e th e parameter s
ym dependin g o n the prime s p i , . .. pm whic h occurre d prio r t o th e ra-th ste p o f th e
iteration, precisel y
(6.33) ym = (y/pi--Pm)*
where (3 is a fixed number , / 3 > 1 . Clearl y ever y d G V + U T>~ satisfies d < y
except possibl y fo r d = p G V~ . Howeve r assumin g z ^ y i n thi s cas e w e als o hav e
d — p < z ^ y. Ou r choic e (6.33 ) i s inspire d b y heuristi c argument s exploitin g th e
6. E L E M E N T A R Y S I E V E M E T H O D S 157
concept o f "sievin g limit" , however , a s we are not going for optimal results , w e are
free t o choose an y / 3 > 1.
6.3. Estimation s ofV+(z), V~(z).
We nee d a n upper boun d fo r V^(z) an d a lower boun d fo r V~(z). B y virtue
of (6.29 ) an d (6.30) i n both case s the problems reduc e t o getting uppe r bound s for
Vn(z). Sinc e V(p n) i n (6.31 ) ar e non-negative som e o f the summation condition s
can be relaxed b y positivity. W e get
Vn(z)^ £ / • • ] £ 9(Pl'-Pn)V(p n)
Vn^Pn<---pl<Z
Pi---Pm-iP^n<y
where th e conditions hol d fo r all 1 ^ m < n regardless o f parity. Thes e condition s
imply tha t
1 (P-l\m
p
Pi" 'Pm <y i fm < n,
by inductio n o n m. I n particular, fo r m = n — 1 this yield s th e following lowe r
bound fo r the least prim e divisor s o f d — p\ • • -pn i n V n(z),
Pn > (y/pi-"Pn-i)fi+1 >y? +l{
* }
^y^^ ]
> z n,
/ / 3 - 1 \n
say, wher e z n — z K $ ' wit h
z = ya, 0 / 3 .
Having establishe d th e lower boun d p n ^ z n w e drop th e other condition s and
estimate a s follows
V„(Z)^ J2---1 2 9(Pl---Pn)V(jp n)
Zn^Pn<---<Pl<Z
To proceed furthe r w e make th e followin g assumptio n abou t th e multiplicative
function g(d): fo r any w with w < z, we have
log z \ K
(6 .34) n <!-««>-<*(&)
^-yypj) ^^y '
w^p<z
where K, > 0 and K > 1 do not depend o n w.
REMARKS. Th e conditio n (6.34 ) is relatively eas y to verify i n practice. W e call
the exponen t K the sieve dimension . Notic e tha t hi is not uniquely defined ; an y
larger numbe r i s also qualified . Th e constant K ca n be disturbingly larg e i f the
densities g(p) ar e near on e for many smal l p. Nevertheles s th e result s ca n be refine d
in thi s respect . Indeed , i n practice on e ca n establis h th e following estimat e
<«*> n <>-*)>-.(^r'( 1 +i ^)
158 6. E L E M E N T A R Y SIEV E M E T H O D S
for an y w wit h 2 ^ w < z, where K! and K' ar e positive number s independen t of
w. Thi s estimat e implie s (6.34 ) wit h K = 1 + K' an d
K'
(6.36) K = 1 + logz
Therefore i f ca n be reduced t o a numbe r nea r 1 at th e expense o f increasing the
dimension K by 1.
By (6.34 ) w e derive usin g th e inequality 1 + x ^ e x tha t
V{zn)^KV(z)(-^) <KV(z)e"l
v/3-
where j3 — Kb + 1 . Henc e
K.W < £(= tkgjf)V'VH <£(=««' ) V vW .
Inserting n ! ^ e(n/e) n w e obtain
1
(6.37) F n (z) < e- anKMV(z)
where a = b~1e1+b . We choose b large s o that a < 1 .
The conditio n p n ^ y n i n (6.31 ) implie sp ™ ^ y , an d this show s tha t th e
sum (6.31 ) i s void i s n ^ s — (3. Therefore (6.37 ) yield s
b + 1
(6.38) ] [ > n (2)= £ y n(2) < _^!_K ^)-
v 7
n>0 n>s-/ 3 " ^
We choos e / ? = 9K + 1 , getting 6 = 9 and a < e~l. Henc e we conclude
THEOREM 6.1 . Let A + = (Aj) , A ~ = (A^~ ) be the combinatorial sieves of level
y > 1 and the parameter ft — 9K + 1 . TTie n /or any multiplicative function g(d)
satisfying (6.1 0), (6.1 1 ) and (6.34) an d an V z ^ V 1 ^ we have
+ 1
s 0
(6.39) V (z) < ( 1 + eP- K )V(z),
(6.40) V~(z) > ( 1 - e ^ - K1 0 ) y ( z )
s
where s = lo g y/ log 2.
COROLLARY 6.2 . Le t the conditions be as above. Then
(6.41) S(x, z)<(l + e^sK10)V(z)X + R(x, z s)
1 0
(6.42) S(s , z) > ( 1 - e ^ ' l T ) VO*) * - i?(x , zs )
where f3 — 9K + I, s is any number ^ j3, and R(x,y) denotes the remainder
sum (6.24).
6.4. Fundamenta l lemm a o f sieve theory .
Combining th e inequalities (6.39) , (6.40 ) wit h (6.32 ) w e conclude th e following
6. ELEMENTAR Y SIEV E METHOD S 159
FUNDAMENTAL LEMM A 6.3 . Let K > 0 and y > 1 . There exist two sets of real
numbers A + = (AJ) and A ~ = (A^~) depending only on K and y with the following
properties:
(6.43) Af = l,
(6.44) | A ± | < 1 if\<d<y,
(6.45) A ± = 0 ifd^y,
and for any integer n > 1,
(6.46) J2x~d^°^J2xt
d\n d\n
Moreover, for any multiplicative function g(d) with 0 ^ g(p) < 1 and satisfying
the dimension condition
logz\K/ K \
/or all 2 ^ w < z ^ y we have
(6.48) J ] A ± 5 ( d ) = ( 1 + O(e-(l + i ^ ; ) 1 0 ))n(l-^))
where P(z) denotes the product of all primes p < z and s = logy/ log z, the implied
constants depending only on n.
REMARK. Th e Fundamenta l Lemm a play s a n assistin g rol e i n siev e theory ; it
usually serve s fo r preliminar y eliminatio n o f the element s o f a sequenc e whic h hav e
small prime divisors, that i s primes p < z — y}l s wit h s —> +oo . I n such application s
the result s ar e asymptoticall y precise , so one loses nothing whil e gaining a practica l
assumption tha t th e sequenc e i n question i s supported o n almos t primes . Quit e
often on e onl y need s a n upper-boun d o f th e righ t orde r o f magnitude , tha t is
w
(6.49) ] T \±g(d)<$:K l[(l-g(p)).
d\P{z) P<z
This holds under th e conditio n (6.34)(whic h i s weaker tha n (6.47) ) wit h th e implie d
constant dependin g onl y o n n.
Although th e sequenc e A = (a n) accompanie d u s whe n constructin g th e sieve s
A + = (Aj) an d A - = (A^"), one shoul d realiz e tha t A + , A - have actuall y nothin g
in commo n wit h A. On e ca n legall y appl y th e particula r sieve s A + , A ~ t o any
sequence o f non-negative numbers , th e onl y risk i s that th e result s ma y no t b e idea l
if th e paramete r K (th e so-calle d dimension ) an d y (th e level ) o f A+ , A " d o no t
match perfectl y th e characteristic s o f A.
Of cours e on e doe s no t nee d t o pus h s to infinity . Th e lowe r boun d (6.42 ) is
already interestin g i f 5 > ft. Assuming that th e remainder ter m R(x, y) i s negligible
for a suitable leve l y, on e infer s fro m (6.42 ) tha t
(6.50) Yl a n »X(logz)-*
(n,P(*)) = l
160 6. E L E M E N T A R Y SIEV E M E T H O D S
where z = y 1 / ^ + £ ) for an y e > 0 provide d y i s sufficientl y larg e i n term s o f e.
Suppose w e ca n contro l R(x,y) fo r y = x a~£ wit h 0 < a ^ 1 . The n (6.50 ) hold s
for z — x ^a _ £ ) / ^ + £ ) , whil e th e conditio n (n,P(z)) = 1 together wit h n ^ x impl y
that n ha s a t mos t a/(3 prim e divisors .
In genera l t o establis h a n uppe r boun d fo r S(x, z) i s muc h easier , becaus e
S(x,z) i s decreasin g i n z s o one ca n reduc e z a t will .
EXERCISE 1 . Deriv e fro m (6.41 ) th e uppe r boun d
(6.51) TT 2(X) <Cx(logx)~ 2 .
EXERCISE 2 . Sho w tha t th e sequenc e (6.1 ) satisfie s th e siev e condition s fo r
dimension K = 1 and leve l a = 1 / 2 (se e Theore m 1 7.1 ) . The n deriv e fro m (6.42 )
that
1 /20 2
(6.52) 7r 2 (x,x ) > x(\ogx)-
where 7^(2 , z) denote s th e numbe r o f prime s p ^ x fo r whic h p + 2 ha s n o prim e
divisors < z. Therefor e ther e ar e infinitel y man y prime s p suc h tha t p + 2 ha s a t
most twent y prim e divisors . Recen t development s o f sieve methods i n this directio n
show (amon g othe r things ) tha t
\{p < x;p + 2 = pi o r P1P2 wit h pi , p 2 > x 3 / / 1 1 }| > x ( l o g x ) -2
for al l sufficientl y larg e x. Henc e p + 2 ha s a t mos t tw o prim e divisor s infinitel y
often. Thi s las t resul t i s du e t o J.R . Che n [Ch] . W e stil l d o no t kno w whic h cas e
p + 2 — p\ o r p + 2 = P1 P2 occur s infinitel y often ? Probabl y both !
6.5. T h e A 2 -sieve.
A powerful an d elegan t metho d o f an upper boun d siev e came from Atl e Selber g
[SI]. Selberg' s metho d yield s result s o f grea t generality , i t i s simple r tha n combi -
natorial sieve s a t th e start , thoug h equall y comple x i n it s advance d forms . W e
modify th e notatio n o f the previou s sectio n slightl y t o displa y clearl y th e econom y
of condition s whic h ar e require d fo r performin g Selberg' s sieve .
Recall tha t a n uppe r boun d siev e o f leve l D i s a sequenc e o f rea l number s A ^
for d < D wit h A i = 1 such tha t
X
(6.53) Yl d > ° fo r al l m > 1 .
d\m
Hereafter th e superscrip t + i s omitted fo r notatio n simplicity . Thi s positivit y con -
dition wa s quit e difficul t t o ge t a hol d o n i n combinatoria l sieve . Selber g mad e i t
very eas y b y choosin g A ^ such tha t
Ad=
(6-54) E (E^)2
d\m d\m
where {pd} i s anothe r sequenc e o f rea l number s wit h
(6.55) p i= 1 .
Since the square s ar e non-negative , suc h a choice guarantee s (6.53 ) n o matter wha t
Pd are!
6. ELEMENTAR Y SIEV E METHOD S 161
Selberg's choic e amount s t o
(6.56) A d= ] Tp dlpd2.
In orde r t o contro l th e leve l w e assume p d ar e supporte d o n integer s < \fl),
(6.57) p d = 0i f O v^D .
Hence th e resultin g siev e {A d} has leve l o f suppor t D. Followin g Selber g w e cal l i t
the A 2-sieve o f leve l D.
Throughout, A = (a n) i s a sequence of non-negative numbers , P i s a squarefre e
number an d fo r an y d \ P w e se t
(6.58) \A d\= J2 a n = g(d)X + r d(A).
n=0 (mo d d)
Here a s before , g(d) i s a multiplicativ e functio n satisfyin g (6.1 0) .
Applying th e A 2-sieve t o th e sequenc e A = (a n) i n th e siftin g rang e P w e ge t
a
S(A,P) = Yl ^ E a - ( E Pdf
(n,P) = l n d\(n,P)
= E 52pd 1 Pd2\A[dud2]\ = XG + R(A,P)
di,d2|P
where
(6.59) G = ^ S^([di,d2])/9 dlpd2
di,d2|P
and
(6-60) R(A,P) = Y J2 PdiP<hr ldlld2]{A).
diMP
The tas k befor e u s i s to mak e thi s genera l inequalit y optimal . Forgettin g fo r a
moment abou t th e remainde r ter m R(A, P , A2) w e wish to minimize G with respec t
to the unknow n number s p d subjec t t o (6.55 ) an d (6.57) . Th e ensuin g number s wil l
satisfy
(6.61) \p d\ <1 ,
hence th e remainde r ter m i s automaticall y unde r control .
The expressio n (6.59 ) i s a quadrati c for m i n p&. In orde r t o find th e minimu m
of G i t help s t o diagonalize . I n th e presentatio n below i t goe s withou t sayin g tha t
Pd is supporte d on , an d th e relevan t variable s o f summatio n ru n ove r th e divisor s
of P (thu s ove r squarefre e numbers) . Furthermor e w e ca n assum e tha t
K
0 < # ( pJ ) < l i f p|p
1(6.62)
; '
g(p) = o ifpfp .
162 6 . E L E M E N T A R Y SIEV E M E T H O D S
Let h{d) be the multiplicative functio n define d b y
(6.63) k( P) = ^ -y
We obtai n
G = ^2 g(abc)p acpbc
abc\P
(a,6) = l
2
i
^ 5 (c)^/,(%(d) 2 (^ 5 (m)p^cdm
c ;
= E^)_ 1( E 5Mp 2
m) .
d|P mE0(mo d d)
Hence b y the linear chang e o f variables
m
(6.64) £ d = p(d) ^2 g( )pm
m E 0 ( m o d d)
we obtain th e diagonal for m
(6.65) G = 5>(d) -1 &
d|P
We stil l hav e t o reinterpre t th e condition (6.55 ) i n terms o f the new variables
£d- To this en d we use the Mobius inversio n t o convert (6.64 ) int o
M6) « = $ | £ fc.
^ V ; d = 0 ( m o d i)
In particular , fo r £ = 1 this give s the linear equatio n
(6.67) £ & = !•
d|P
Moreover, on e observes b y (6.64 ) an d (6.66 ) tha t th e support condition s (6.57 ) fo r
pd are equivalent t o these fo r £d,
(6.68) f d = 0i f O >/# .
Now ou r targe t i s to minimize (6.65 ) o n th e hyperplan e (6.67) . Applyin g
Cauchy's inequalit y t o (6.67 ) w e derive GH ^ 1 where
(6.69) H= Yl M< 0
d<VD,d\P
so G cannot b e smaller tha n H~ l. The equality
(6.70) GH = 1
holds for
(6.71) & = h^H-1 ifd<y/D.
6. ELEMENTAR Y SIEV E METHOD S 163
Note tha t
(6.72) H^Yl h
^ = II^1 + %)) = II^1 - 9(p))-\
d\P p\P p\P
(6-73) ^ U ( i _ 0 ( p ) ) .
P\p
Next w e compute pi by inserting (6.71 ) int o (6.66 ) gettin g
li(£)g(£)peH = £ h(m),
m<y/D
ra=0(mod i)
that is
=
(6.74) ^ i^ E W
d<y/D/£
Now we show (6.61 ) . T o this en d w e group th e term s in (6.69) accordin g to the
greatest commo n diviso r o f d and £ getting
H
= T, H M<* ) = £>(*) Y, Mm )
k\t d<VD k\l m<y/~D/k
(d,£)=k (m,i)=l
^(E^))E Hm)=li(t)PiH
m<\fD/£
(m,l) = l
and thi s prove s (6.61 ) (thi s nea t estimat e i s due to J.H. va n Lin t an d H.-E . Richer t
[LR]). Pro m thi s on e get s directl y b y (6.56)
(6.75) \\ d\^r3(d).
Prom th e above result s w e conclud e th e following
T H E O R E M 6.4 . Let A = (an) be a finite sequence of non-negative numbers and
P be a finite product of distinct primes. For every d\P we write
(6.76) |.A d|= Y, a n=g(d)X + rd(A)
n=0(mod d)
where X > 0 and g(d) is a multiplicative function with 0 < g(p) < 1 for p\P. Let
h(d) be the multiplicative function given by h(p) = g(p)(l — #(p)) _ 1 and
(6.77) H= Y, M< 0
d<VD,d\P
for some D > 1. Then we have
(6.78) S{A,P)= Yl CLn^XH^+RiA^P)
(n,P) = l
164 6. ELEMENTAR Y SIEV E METHOD S
where
(6.79) R(A,P) = ^\ drd{A)
d\P
with \ d given by (6.56) and (6.74)-
Using (6.75 ) on e estimate s th e remainde r ter m crudel y b y
(6.80) \R(A,P)\^ Y, r 3(d)\rd(A)\.
d<D,d\P
The A 2-sieve i s ver y genera l indeed . It s uppe r boun d doe s no t requir e an y
regularity i n the distributio n o f the densit y functio n g(p) ove r primes , therefor e th e
dimension o f a siftin g proble m doe s no t pla y explicitl y a role . I f g(p) = u(p)p~ 1 ,
where u>(p) i s th e numbe r o f residu e classe s (mo d p) whic h on e want s t o exclude ,
then h(p) = oo(p)(p — uj(p))~ 1 i s th e rati o o f th e number s o f rejecte d t o admitte d
classes, an d H incorporate s thes e ratios . O f course , th e large r uj(p) is , th e smalle r
the uppe r boun d (6.78) . However , a fe w loca l deviation s o f uo{p) fro m it s averag e
have insignifican t globa l effect . T o th e contrar y ou r estimate s derive d b y th e com -
binatorial siev e ar e quit e sensitiv e i n thi s respec t becaus e th e hypothesi s (6.35 ) i s
assumed t o hol d fo r al l segment s o f prime s w < p < z, n o matte r ho w short .
The A 2-sieve yield s fantasti c result s fo r siftin g problem s whe n th e numbe r o f
residue classe s t o b e exclude d i s large , i t ca n compet e wit h th e larg e siev e metho d
of Linni k (se e Sectio n 7.4) .
REMARKS. Not e tha t th e number s p d depen d o n th e multiplicativ e functio n
g(d) s o (indirectly ) o n th e siftin g sequenc e A. However , assumin g som e regularit y
of g(d) (a s i n th e K dimensional sieve , se e (6.34) ) on e ca n establis h fairl y goo d
approximation (fo r smal l d a t an y rate )
6.6. Estimat e fo r th e mai n ter m o f th e A 2 -sieve.
To brin g th e Selber g uppe r boun d (6.78 ) t o a practica l for m w e nee d a clea r
lower boun d fo r
H = H(D) = Y. M<*) -
d<y/D,d\P
The uppe r boun d (6.72 ) hold s i n genera l bu t a goo d lowe r boun d require s som e
restrictions o n th e densit y functio n g , s o o n /i , an d th e siftin g rang e P. Pro m no w
on P i s the produc t o f al l prime s p < V^D , so we hav e
h
H(D) = Y! (d)
d<Vn
where th e superscrip t b restricts th e summatio n t o squarefre e numbers .
6. E L E M E N T1
AR Y SIEV E METHOD S 6 5
We ca n estimat e H(D) strongly , elementaril y an d nicel y fo r g(d) = d _ 1 (thi s
occurs whe n on e i s sifting number s i n a n interval) . I n thi s cas e h(d) = (p(d)~ 1 an d
d
H(D)= Y, ~in(l + W + ---)> E m-^logv^ .
d<y/D P\d m<y/D
If g(d) agree s wit h dr x onl y fo r (d , q) — 1 (a s fo r exampl e i n th e cas e o f siftin g a n
arithmetic progression) , the n thi s lowe r boun d hold s wit h th e followin g modifica -
tion:
(6.82) H(D)>l[(l-g(p))(\ogVD).
p\q
The abov e exampl e i s rather special . No w suppos e g(p)p i s n o n average , sa y
(6.83) Yl 9(P) logP = Klog x + 0(1 )
for al l x ^ 2 where K is a positive number . Henc e g(p) logp < C 1. Suppos e als o tha t
2
(6.84) ^ ( p ) logp<oc.
v
Since h(p) = g(p) + 0(g(p) 2), i t follow s tha t (6.83 ) hold s fo r h a s wel l (wit h a
different implie d constant) . Applyin g Theore m 1 . 1 fo r th e multiplicativ e functio n
h i n plac e o f / w e ge t
H{D) = c(lo g VD)" { 1 + 0((logZ?)- 1 )}
where
^rWTylT1-^))-1^-^
P
(the require d condition s (1 .89 ) an d (1 .90 ) ar e satisfie d b y (6.83) , (6.84) ) an d th e
implied constan t depend s o n tha t i n (6.83) . I f D i s larg e i n term s o f thi s constan t
we ca n inver t thi s approximatio n gettin g
(6.85) H(D)- 1
= 2 T ( K + l)H g{logD)-K{l + O^logD)" 1 )}
where
(6-86) H g = H(l-g(p))(l-$)-«.
P
6.7. Estimate s fo r th e remainde r ter m i n th e A 2 -sieve.
Once again w e consider a class of sifting problem s i n which th e individua l erro r
terms satisf y
(6.87) \r d(A)\ ^ g(d)d.
Naturally wit h thi s propert y on e make s th e conditio n
(6.88) g{d)d > 1 i f d\P.
166 6. E L E M E N T A R Y SIEV E M E T H O D S
This implie s g{[d\, c?2])[^i5 ^2] ^ 9(di)g (^2)^1 ^2, therefor e
2 2 2
(6.89) \R(A,P,A )\^ ( ] T \Pd\9iW) < (jj J2 HmMm))
d<\fD m<y/D
by (6.60 ) an d (6.72) , wher e cr(m) denotes th e su m o f divisor s o f m.
Next assum e tha t th e densit y functio n g(p) satisfie s th e followin g crud e boun d
(6.90) J2 9{p)\ogp^\og{2x/y).
Hence w e infer th e sam e propert y fo r h(p)a(p)p~ 1 . The n w e appl y (1 .85 ) gettin g
V^ h(m)a(m) <^i — Y ^ h(m)a(m)m * .
r
m<VD m<\ D
Here w e hav e
1_ 1-
^ /i(ra)cr(m)ra ^ H ^ /i(ra)ra < iJ .
Hence w e conclud e tha t
2
(6.91) R(A,P)^D(logD)- .
Combining wit h (6.78 ) w e get
THEOREM 6.5 . Suppose the conditions of Theorem 6.4 hold. Moreover, assume
(6.87), (6.88) and (6.90). Then we have
(6.92) S { A , P ) ^ + o { ^
where H = H(D) is given by (6.77), D > 1 is arbitrary, and the implied constant
depends only on that in (6.90).
6.8. Selecte d application s o f A 2 -sieve.
Consider th e sequenc e A = (a n) whic h i s th e characteristi c functio n o f a n
arithmetic progressio n i n a shor t interva l
(6.93) n = a(mo d g) , x < n< x +y
where (a , q) = 1 and 1 < q < y. Le t P consis t o f prime s p ^ y/y wit h p \ q. The n
1
(6.94) TT(X + y ; g, a) - TT(X; q, a) ^ S(A, P) + y/yq' .
On th e othe r hand , th e condition s o f Theore m 6. 5 ar e satisfie d wit h X = yq~ x an d
g(d) = d' 1 i f (d,q) = 1 . B y (6.82 ) w e hav e H(D) > ^-logD. Combinin g (6.92 )
with (6.94 ) w e ge t
7T(x + y ; q, a) - TT(X ; q, a) < —— + O - —^ +
z
ip(q) log D \\og Dq
Choosing D = yq~ l w e conclud e
6. E L E M E N T A R Y SIEV E M E T H O D S 167
THEOREM 6.6. For (a,q) = 1 and 1 ^ q < y we have
(6.95) n(x + y; qja)-7r{x;q,a) < —— —-—- + O . 2
2
<p(q)log(y/q) \q\og (y/q)
where the implied constant is absolute.
This i s calle d th e Brun-Titchmars h inequality . Usin g th e larg e siev e metho d
H.L. Montgomer y an d R.C . Vaugha n [MVl ] hav e show n tha t th e erro r ter m i n
(6.95) ca n b e deleted .
Next w e take A — (an) th e characteristi c functio n o f th e polynomia l value s
n = (r a — a±) — • (m — ctk)
with 1 ^ m ^ x , wher e al l aj ar e distinct . I n thi s cas e g(p) — v(p)p~1 wher e v{p)
is th e numbe r o f root s modul o p. I f p i s sufBcientl y large , i/(p) = k s o w e hav e a
^-dimensional siev e problem . B y (6.85 ) an d (6.92 ) w e deduc e
THEOREM 6.7 . Let a = ( a i , . . . , a^) be distinct integers which do not cover all
residue classes to any prime modulus. Then the number of integers 1 ^ m ^ x for
which ra — c*i,.. . , m — oik are all primes satisfies
fe '1 log
0 glog
l 0x >
(6.96) 7 r ( x ; a ) ^ 2 ^ ! ^ x ( l o g x ) - (l + o( 1 Q ^))
logx
where
k
. n ( _ ^u(p)\ /1 ) \- ~
(6 , 7) B 11 )( _i
p J v p)
REMARKS. Th e uppe r boun d (6.96 ) i s large r b y facto r 2 k k\ tha n th e conjec -
tured asymptoti c
7r(x;a) ~ Bx(logx)~ k.
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/08
CHAPTER 7
BILINEAR FORM S A N D TH E LARG E SIEV E
7.1. Genera l principle s o f estimatin g doubl e sums .
Certainly th e mos t versatil e instrument s o f analyti c numbe r theor y ar e doubl e
sums
a
(7.1) tf (a , /?) = ^ J2 rnM(m, n) .
mn
Here a = ( a m ) , ft = (/3 n) ar e finit e vector s o f comple x number s an d
(7.2) d > = (0(m,n) )
is a finit e matri x wit h comple x entries . I n matri x notatio n \I/(a , /3) = a$/3* . Give n
<I> the goa l i s t o giv e a shar p estimat e fo r \I/(a,/3 ) whic h i s vali d fo r arbitrar y
vectors a , /3 . Yes , i t i s essentia l t o assum e nothin g abou t th e number s a m , /3 n ,
because i n practic e the y ar e quit e comple x objects . Ver y ofte n a n attemp t t o
utilize specifi c structur e o f these vector s bring s bac k a mirro r imag e o f the origina l
situation. Henc e a question , wha t i s th e benefi t fro m arrangin g th e origina l sum s
into bilinea r forms ? Th e ke y featur e i s tha t th e sequence s a , (3 do no t se e eac h
other! Therefor e a cancellatio n i n \J/(a,/3 ) i s possible , subjec t onl y t o propertie s o f
the matri x coefficient s </>(ra , n). Indee d w e hav e
2
(7.3) | * ( a , / ? ) | ^A||a||2||/3||2
where A = A($ ) i s the nor m o f th e correspondin g linea r operato r an d
(7.4) IMI 2 = £ K , | 2 , |L0|| 2 = $ > „ | 2 .
A more transparent wa y of estimating \I/(a , /3) goes by Cauchy' s inequality . T o thi s
end choos e firs t m t o b e i n th e oute r su m an d n i n th e inne r su m gettin g
2
(7.5) |*(a,/?)| < ||a|| 2 £|£/W( m ' n )f-
mn
Note tha t th e unknow n coefficient s a m i n th e oute r su m ar e gone . A t thi s poin t
one coul d b e mor e flexibl e b y introducin g ne w coefficient s f(m) i n plac e o f a m t o
smooth th e nex t ste p an d t o optimiz e th e fina l bound .
Smoothing is one among many optional devices which are put int o practice wit h
remarkable effect s i n analyti c numbe r theory . Thi s i s not jus t a technical device , i t
can b e viewe d a s a kin d o f spectra l completion .
Next squarin g ou t th e inne r su m ove r n followe d b y interchangin g th e orde r o f
summation w e ge t
169
170 7. BILINEA R FORM S AN D TH E LARG E SIEV E
For ri\ — ri2 , the so-calle d diagona l terms , on e canno t ge t cancellatio n i n th e inne r
sum
2
(7.7) 5>Kn)| ,
m
so the savin g facto r comes solel y fro m th e fac t tha t th e diagona l i s smaller tha n th e
full square ! However , fo r mos t o f ni, ri2 of f the diagonal one may find a considerabl e
cancellation i n th e su m
(7.8) ^ 0(m , ni)<£(ra, n 2 )
771
because o f independen t variation s i n th e sig n o f 0(ra,ni) , 0(rn,n2) . Ho w on e
executes th e summatio n i n (7.8 ) i s a separat e questio n dependin g o n particula r
characteristics o f <£ . A t thi s poin t th e sum s (7.8 ) ca n b e eithe r estimate d directl y
finishing th e job, o r i t ca n b e transforme d t o anothe r su m (s o to spea k a dual sum )
by a kin d o f spectra l formula . I n th e latte r scenari o eve n a trivia l estimatio n o f
the dua l su m ofte n produce s a non-trivia l boun d fo r th e origina l sum . Moreover ,
one ca n als o exploi t th e extr a summatio n ove r ni , 71 2 . Ou r poin t i n thes e remark s
is t o sa y tha t th e genera l estimat e (7.3 ) obstruct s th e visibilit y fo r improvements .
We advis e t o carr y ou t th e argument s i n practic e alway s b y Cauchy' s inequalit y
because i t allow s yo u t o contro l you r position .
Before launchin g Cauchy' s inequalit y t o th e doubl e su m \£(a,/3) , mak e sur e
to choos e th e oute r an d th e inne r sum s variable s convenientl y a s th e result s wil l
be different . Thi s optio n o f interchangin g m , n i n \I/(a,/3 ) i s th e essenc e o f th e
following dualit y principle .
DUALITY PRINCIPLE . Suppose for any complex numbers f3 nj
2
(7.9) ^ | ] T / 3 n0(m,n)j <A||
mn
Then for any complex numbers a m
2
(7.10) ^ | ^ a m</>(m,n) ^A||a|l2,
where A is the same in both inequalities.
PROOF. Th e lef t sid e o f (7.1 0 ) i s equa l t o (7.1 ) wit h
(7.11) f3 n = ^2 a m(f>(m, n).
m
By (7.5 ) an d (7.9 ) w e get #(a,/3) < A||a|| 2 ||/?|| 2 . Bu t ||/3|| 2 = *(<*,/? ) s o (7.1 0 ) i s
2
derived fro m (7.9) . •
Because o f th e applicatio n o f Cauchy' s inequality , i t i s inevitabl e tha t muc h
attention concentrate s o n estimatin g sum s o f type (7.9 ) o r (7.1 0) . Thes e equivalen t
statements fo r genera l coefficient s a m , fi n ar e calle d Larg e Siev e Inequalitie s fo r
historical reason s althoug h the y d o no t resembl e a siev e o f an y kind . I t i s better t o
regard (7.9) , (7.1 0 ) a s a near-orthogonalit y propert y o f <I>.
We end thi s introductio n t o genera l bilinea r form s wit h a fe w comment s abou t
obvious cases for which the ideas described definitel y canno t work . Suppos e 4>{m,n)
7. BILINEA R FORM S AN D TH E LARG E SIEV E 171
factors, sa y 0(ra,n ) = x(m)'0(^) , o r <p(m,n) is a shor t linea r combinatio n o f suc h
products. I n othe r words , th e variable s m , n o f </)(ra , n) ar e separate d a t no , o r
low, cost . The n w e ma y choos e th e vector s a — ( am ) , /3 = (/3 n) wit h a m = x(ra) ,
/3n = ^(n) . Fo r this biased choice of vectors the bilinear for m i s \I/(a, /3) = ||a|| 2 ||/3|| 2
showing n o cancellatio n o f terms .
The principles of estimating bilinear form s i n analytic number theor y hav e bee n
articulated ver y slowly . Perhap s becaus e i t i s no t th e simplicit y o f th e ideas , bu t
rather th e additiona l device s introduce d b y researcher s i n particula r case s whic h
make thi s technolog y s o effective today . Firs t application s deal t wit h bilinea r form s
in whic h (/>(ra , n) depende d onl y o n th e produc t mn, sa y
(7.12) (p(m,n) = g(mn),
where g i s a n arithmeti c functio n i n on e variable . O f course , i n thi s cas e g canno t
be multiplicative . I.M . Vinogrado v shoul d b e mentione d fo r th e firs t impressiv e
results. Usin g a n exclusion-inclusio n procedur e ( a siev e method ) h e manage d t o
express exponentia l sum s ove r prime s b y doubl e sum s an d estimate d th e latte r
by elementar y mean s (cancellatio n i n geometri c series) . The n utilizin g th e resul t
in th e circl e metho d Vinogrado v gav e a stunnin g solutio n o f th e ternar y Goldbac h
problem. Hi s ideas are still alive today (se e Chapters 1 3 and 1 9) ; however, ther e ar e
more instances which stimulated th e whole business of bilinear forms . Som e of these
will b e explicitl y describe d i n forthcomin g chapters , an d man y ar e embedde d i n
arguments withou t recognition . Linni k [Li2 ] ha s elaborate d th e ide a o f estimatin g
double sum s int o th e dispersio n method , whic h i s missin g i n thi s book .
The notatio n chose n fo r thi s introductio n serve s a s a model . Obviousl y w e d o
not nee d t o conside r 0(ra , n) a s a functio n i n tw o integra l variables . Actuall y i n
some work s i t i s nice r t o inde x th e tw o variable s b y mor e appropriat e parameter s
(rational fractions , characters , eigenvalues , etc.) .
In thi s chapte r w e giv e a selectio n fro m th e grea t variet y o f bilinea r form s
and larg e siev e typ e inequalities . A fe w basi c one s will b e give n detaile d proofs , a
few mor e wil l b e discusse d i n a broa d context , an d man y wil l b e jus t state d wit h
references t o origina l publications .
7.2. Bilinea r form s wit h exponentials .
The bilinea r form s o f typ e (7.1 ) wit h
(7.13) <f>(m,ri) = e{x rnyn)
appear quit e often i n classical analytic number theory . Her e x m , y n ar e real number s
and e(z) = exp(27rz2: ) a s usual .
LEMMA 7.1 . For any complex numbers a m and real numbers x m we have
(7.14) / \^2a me(xmy)\dy^5Y1
Y,Y I
Y
rn 2Y\x rni-xrn2\<l
172 7 . B I L I N E AR F O R M S A N D T HE L A R G E SIEV E
PROOF. Le t g(y) be a non-negative functio n o n R with g(y) ^ 1 if \y\ ^ \ and
supp(#) C [-1,1]. The n th e left sid e of (7.14) i s bounded by
/ #(^)|X^ me
( X m 2 / ) | dy = 2Y^Y^ arni^rn29(<2Y(xrni ~ Xm2))
m mi,?7i 2
2Y\xmi-xrri2\<l
The Fourie r pai r g(y) = ( S1 ^y J , g(y) = ^- max(l — |f|,0 ) satisfie s th e abov e
conditions, givin g th e result. •
Our basi c inequalit y is
THEOREM 7.2 . Let xm ; y m be real numbers with | x m | ^ X, \y m\ ^ Y. Then
for any complex numbers a m, (3 n
\Y^^2a™Pne(Xrnyn)\ < 5 ( 1 7 + 1 ) 2 ^ ^ ^ |^m 1 ^m2|J
™ ™ |a; mi -xm2|y<l
(7.15) ( E E l&^l)
|yni-yn2|X<l
P R O O F . Pu t e = ( 4 X ) "1 an d
7r£mG?m
sin(27r££mJ
so | 7 m | ^ 7rX|am |. Writin g
e(xy) — / eixtjdt
v yj
sm27rsx J y__£ v ;
we find tha t th e left sid e of (7.15) i s bounded by
, rVn+e , rY+e
n
E^ / ^7 me(xmy)d?/ ^ / ^ |/3 n| ^7me(xm7/)
Hence b y Cauchy's inequalit y an d Lemma 7. 1 we find tha t th e square o f the lef t
side o f (7.15) i s bounded by
(/ ( E l^ n0 dy){ \%2'YTn ex
( my)\ d V)
^2e{ E E \PnJn 2\)5(Y + e){*X)2( J2Y, l^a m2 |).
£
\Vn1 -yn 2\< \x mi-xm2\Y<l
Here 1 0e(Y + S)(KX) 2 = *£(XY + \) < 25{XY + 1 ) completin g th e proof. •
If the points x m an d y n ar e well-spaced, s o that onl y the diagonal term s appea r
on the right sid e o f (7.15), we ge t
7. B I L I N E A R F O R M S A N D T H E L A R G E S I E V E 173
COROLLARY 7.3 . Suppose \x m\ ^ X, \y n\ ^ Y, and for m\ ^ m 2, n\ ^ n 2 we
have \x mi - x m2| ^ A, \y ni - y n2\ ^ B . Then
x
1 A I1
A
Y.Y.arnPne{xmyn)\^h{l + XY)2(Kl + —y( Kl + ^ 2
\\a\\W\\.
PROOF. Appl y (7.1 5 ) wit h max(X , B 1 ) an d max(Y , A1 ) i n plac e o f X an d
Y respectively . •
The spacin g proble m ca n be easily solve d fo r points give n by smooth functions .
For exampl e w e derive b y Corollar y 7. 3 the followin g
COROLLARY 7.4 . Let f(m), g(n) be real functions on [M,2M], [N,2N] respec-
tively, such that f < F, g < G and \f\ ^ FM~ l, \g'\ ^ GN' 1 . Then for any
complex numbers a m, f3 n we have
J2 E <*mMf(m)g(n)) « (FG)-\ (FG + M)\(FG + N)\\\a
PROOF. W e have |/(mi ) - f(m 2)\ > |ra i - m 2\FM~l an d |(j(ni ) - g(n 2)\ ^
|ni - n 2\GN~l henc e th e resul t follow s fro m Corollar y 7. 3 for A = F M _ 1 , J B =
GW" 1 , I = F an d F = G . D
Corollary 7. 3 yields als o interestin g result s fo r exponential sum s wit h monomi -
als, becaus e th e rational point s ar e well-spaced .
LEMMA 7.5 . Let a/3 ^ 0, A > 0, K ^ I, L ^ 1 . The number of integral
quadruples {k,k',t,,l f) with K ^ fc, k' < 2K, L ^ £, f ^ 2 L suc/ i that
'k'\<* /t!\P
^A
™I ( I ) - ( T)
is bounded by 0(K L(AK L + log2K L)) where the implied constant depends only on
a, (3.
EXERCISE 1 . Prov e Lemm a 7.5.
Prom Lemm a 7. 5 we derive
COROLLARY 7.6 . Let a/3^6 ^ 0 , X > 0 , K, L, M, N ^ 1 . Let a k,i and 6 m?n
be complex numbers with \a k^\ ^ 1 an d |6 mjTl | ^ 1 for K ^ k ^ 2K, L ^ £ ^ 2L,
M ^ m ^ 2M ; i V ^ n < 2A \ TAe n
7 17 kaePm^n6
( - ) S E E S a k^b^ne (X-
KaLPM^N6,
k£ rn n
<1 (l ++ — J ( l + — ) (XKLMN)2 \og(2KLMN)
where the implied constant depends only on a, (3, 7, S.
174 7. B I L I N E A R F O R M S A N D T H E L A R G E SIEV E
7.3. Introductio n t o th e larg e sieve .
The larg e siev e typ e inequalitie s ar e ubiquitou s i n moder n analyti c numbe r
theory. Th e nam e i s somewhat misleadin g however , a s sieve s d o no t ente r th e
general picture : i t was derive d fro m Linnik' s origina l ide a (se e [Lil]) , whic h wa s
subsequently transforme d beyon d recognitio n int o genera l L 2 -type estimates , fro m
which siev e result s ca n b e derive d i n som e cases .
First w e explain i n genera l term s th e underlyin g philosophy , whic h i s expecte d
to hav e muc h large r applicability . Le t A ' be a finite set o f "harmonics" , well-suite d
maybe t o solv e analyticall y som e interestin g equation . T o eac h x G X, we assum e
that a sequence (x(n)) i s associated , s o to spea k th e "Fourie r coefficients" .
The larg e siev e proble m fo r X is to find C = C(X, N) ^ 0 such tha t th e
following "larg e siev e inequality " hold s fo r th e L 2 -mean o f a linear for m i n th e
x(n):
(7.18) Yl | Yl a - x ( n )f ^ C(*,JV)||a|| 2
xEX n^N
for an y comple x number s a n , wher e ||a|| 2 = ^2\a n\2. B y Cauchy' s inequalit y on e
gets (7.1 8 ) wit h C(X, N) = A/"^!, bu t thi s i s not usefu l i n applications .
The ide a i s tha t th e "harmonics " ar e orthonormal , o r nearl y so , henc e i f on e
expands th e squar e
i2
] T ] P a nx(n)\ = ] T anian2 ^ x(ni)x(n 2 ),
xeX n^N ni,n 2 xeX
the diagona l term s correspondin g t o ri\ — ri2 should b e dominating , contributin g
roughly |^ f j i|a.||2 becaus e x{n) shoul d b e o f size 1 on average . I n an y cas e i t i s har d
to imagin e tha t C(X, N) woul d b e muc h smalle r tha n th e numbe r o f element s i n
X. Moreover , becaus e o f the dualit y principl e fo r bilinea r forms , whic h mean s tha t
(7.18) i s equivalent wit h
2
(7.19) ^\^2b xx{n)\ ^C(X,N)^2\bx\2
n^N xeX x£X
with diagona l contributio n NJ2\b x\2, i t is no t possibl e t o mak e C(X,N) smalle r
than th e lengt h o f th e vecto r (a n ). Therefor e i n vie w o f thes e diagona l limitation s
the bes t estimat e whic h on e ca n hop e fo r C(X,N) i s roughl y
(7.20) C(X,N)~\X\+N.
This hold s indee d i n a number o f crucia l cases .
A shar p larg e siev e inequalit y (7.20 ) say s tha t th e linea r form s
^a nx(n)
of lengt h i V ^ \X\ wit h \a n\ ^ 1 are, o n averag e ove r x G X, of siz e iV 1 / 2 . Thi s i s
best possible , an d althoug h i n many cases it is expected tha t th e individual sums ar e
of thi s size , ver y fe w case s ar e known . Especiall y interestin g i s th e cas e a n = /x(n)
7. BILINEA R FORM S AN D TH E LARG E SIEV E 175
and x(n) = A/(n) ar e coefficient s o f som e L- function (se e Chapte r 5) . The n th e
individual boun d
Y^ »{n)\ f{n) < e N%+ e
(for an y e > 0) i s equivalent wit h th e Rieman n Hypothesi s fo r L(/ , s) (se e (5.52)) .
Hence, a shar p larg e siev e type inequalit y fo r a famil y o f L- functions i s as powerfu l
on averag e a s th e Gran d Rieman n Hypothesi s woul d give .
7.4. Additiv e larg e siev e inequalities .
In th e additiv e larg e siev e inequalit y th e harmonic s considere d ar e additiv e
characters o f Z with x(n) = e(an) fo r som e a G R (of cours e onl y a mo d 1
matters). S o the linear forms t o be estimated ar e just th e trigonometric polynomial s
(7.21) S(a) = ^2a ne(an)
n
where th e comple x coefficient s ar e supporte d i n M < n ^ M + N. Not e tha t i f ar
are well-space d modul o 1 , i.e. ,
||a r - a 3\\ ^5 i fr ^ s,
for som e 5 > 0 (wher e \\x\\ is the distanc e t o th e neares t integer) , the n th e numbe r
of distinc t point s a ri i.e. o f harmonics , i s ^ 1 + 6~ l.
T H E O R E M 7.7 . For any set of 8-spaced points a r e M/ Z and any complex
numbers a n with M < n ^ M + N, where 0 < S ^ \ and N ^ 1 is an integer, we
have
2
(7.22) 5 ^ | Yl Oneiarntf^tf-'L+N-yWaW .
r M<n^M + N
This estimat e i s bes t possible ; it was prove d i n th e for m state d independentl y
by Selber g [S3 ] an d Montgomer y an d Vaugha n [MV2] , an d i s o f th e strengt h o f
(7.20). W e giv e th e proo f o f Montgomer y an d Vaugha n whic h i s based o n th e
following generalizatio n o f th e Hilber t inequality .
LEMMA 7.8 . Suppose X r are distinct real numbers with |A r — A s| ^ S ifrj^s.
Then for any complex numbers z r we have
<"»> lEEsfr^jEw' -
PROOF. B y Cauchy' s inequality , it suffices t o sho w tha t
2 ^2
<™> ElEi^d^EiA — A. r s^r
r s S2 *
Squaring ou t w e arrang e th e lef t sid e a s follow s
L = J2Y,ZsZt Yl
1
r^s,t
z2^2z2 {Xr
r^s s^t
-x sy+
^J^xr^s,t
s-xt z2{\ r-\s x r-\J-
176 7. B I L I N E A R F O R M S A N D T H E L A R G E SIEV E
For the last su m we have
y ^ /1
1 \ _ y ^1 y ^1 2
^-—' V Ar — A s \ r — \t / ~* X r — X s *—f X r — Xt X s — Xt
hence
Zszt
+2
- E M * X. v: <., O T, £ ?(Aa"A
t t )2
+
^J- AT^A ; i \ J~^K ~ \t A^A ; / •
s^t ryts r^t
Note no w that th e estimate (7.24 ) amount s to finding the norm of the matri x (^ r?s )
with fjL r,s = (A r — A s ) _ 1 i f r ^ s and \ir^r — 0, thus w e may assume tha t th e vector
(zr) i s extremal . Sinc e th e matri x i s skew-hermitian , th e extrema l vecto r i s an
eigenvector, i.e.
VZR
Ar — A s
for som e purel y imaginar y v G C. Thi s show s tha t th e last tw o sums i n L cance l
out. Therefor e fo r the extremal vecto r w e have
ZsZt
I+ 2TT .
s r^s s^zt
Applying 2\z szt\ ^ \z s\2 + \z t\2, w e obtain
Since w e have |A r — A s| ^ S\r — s\ the innermost su m is bounded b y 25~ 2£(2) =
7T2/352. Thi s give s (7.24 ) completin g th e proof o f (7.23). •
COROLLARY 7.9 . For any set of 5-spaced points a r G R/Z and any complex
numbers z r we have
(v 7J-25) \t-j — T r^r -s;—a Ts) IN^ s~x E Z I .
2
E Esin7r(a
^\
r^s r
W e apply (7.23 ) to the doubly-indexed se t o f numbers z^^ = (— l) m z r
PROOF.
and Xm^r = m + a r wit h 1 < m ^ K , gettin g
17 TTK 2
E E (-i)™" — k^Ew -
(r,m)^(s,n) r
Here w e can replace th e summation conditio n (r , m) ^ (s , n) b y r ^ 5 because for
r = s th e remaining term s cance l ou t pairwis e (ra , n) agains t (n,ra) . I f we put
k = m — n and divide b y if w e derive
r^s k= — Kr
7. B I L I N E A R F O R M S AN D T H
1 E L A R GE S I E V E 7 7
This yield s (7.25 ) b y letting K —> • +0 0 becaus e fo r a £ Z
fc
y (-i) = *
^ k + a si n na
kez
where th e series is summed symmetrically . •
COROLLARY 7.1 0 . For any real x we have
z z 5 |2
(7.26) \Z^l^ r s—. ; r\^ 1 ^
\*-^ £—j sm7r(a r — a s) I ^-^
r^s \ r sj r
PROOF. Thi s follows by applying Corollar y 7. 9 twice with z r twiste d b y e(xa r)
and e(—xa r). •
Now w e are ready t o prove (7.22) , firs t wit h S~ x + N instea d o f £_ 1 + N — 1 .
By duality , i t is equivalent t o show tha t
z
(7.27) V J \Y, r<nar)\2 ^(N + S-^Wzf
M<n^M+N r
for an y comple x number s z r. Squarin g out , w e see that th e diagonal ter m r = s
contributes iV||2|| 2 , whil e th e off-diagonal term s yiel d
v—v v-^ \~ ^/ / w v ^ v^ /T^/ \\ sin7r7V(ar — a s)
}^^zrzs}^e(n(ar-as)) = }^}^zrzse(K(ar-as)) ^ _
with K — M + \(N + 1). Thi s quantit y i s smaller tha n th e left sid e of (7.26), henc e
is < <5 -1 ||2:||2. Addin g bot h contribution s give s (7.27) .
To save the —1 , w e apply Theore m 7. 7 to the points (a r + k)K~1 wit h i ^ fc ^
K, fo r the trigonometric polynomia l T(a) = S(aK). W e obtai n
Kj2\S(ar)\2 = ^Y,\T(^)\2^(d~lK + NK K +
- VM2
rk r
because th e points (a r + k)K~ l ar e spaced b y 5K_1, an d the sum T(a) range s
over m = nK wit h (MK + K - 1 ) < m < (MK + K - 1 ) + (NK + K - 1 ) . Henc e
dividing b y K an d lettin g K ten d t o infinity w e obtain (7.22 ) a s stated (thi s tric k
is due to Paul Cohen) .
For man y applications , a weaker estimat e wit h th e factor <5 _1 -f N — 1 replaced
by C(5~ 1 + N) wit h a n absolute constan t C is sufficient. Thi s kin d o f large siev e
inequality wa s firs t state d b y H. Davenport an d H. Halberstam [DH1 ] . W e ar e
going to give a quick proo f o f one such estimate . B y duality w e need t o show tha t
J2 | ^ 7 r e ( a r n ) | 2 ^ C ( J -1 + iV)||7|| 2
\n\<N r
for an y comple x number s 7 r . Le t f(x) b e a non-negative functio n wit h f(x) ^ 1
if \x\ ^ 1 . Th e left sid e abov e i s estimated b y the same expressio n wit h extr a
smoothing facto r f(n/N), an d the latter equal s
X^5Z 7r7sCr ( r ' 5 )
178 7. BILINEA R FORM S AN D TH E LARG E SIEV E
where
n
By Poisson' s formul a
(T(r,s) = NY,f((ar-aa + h)N).
h
We choos e f(x) suc h tha t it s Fourie r transfor m satisfie s f(y) < C (1 -f- y 2 ) - 1 givin g
<r(r,s) < N(l + ||a r - a s\\2N2)-\ Finall y applyin g 2 | 7 r 7 s | ^ | 7 r | 2 + | 7 s | 2 an d
oo
1
^2 W(r, s)\ < N ^ i + (SkN) 2)-1 < (T 1 +TV
s k=0
we complete th e proof .
The abov e argument s woul d yiel d th e bes t possibl e constan t C — 1 if th e tes t
function f(x) wa s chose n appropriately . Selber g di d just that , hi s optimal functio n
being quit e complicated .
Now specializ e th e larg e siev e inequalit y (7.4 ) b y takin g fo r th e point s a r th e
rationals a/q wit h 1 ^ q ^ Q an d (a , q) — 1. Thes e point s ar e space d b y 5 = Q - 2 ,
indeed i f a/q •£ a' /qf, the n
\\a/q - a'/q'\\ = ||(ag ' - a W w ' l l > ( w ' ) " 1 £ Q~ 2-
Therefore Theore m 7. 7 yield s
T H E O R E M 7.1 1 . For any complex numbers a n with M < n ^ M -\- N, where
N is a positive integer, we have
( 7 - 28 ) E T jE a ne(^)fW + iV-l)||a||2.
q^Q a (mod q) M<n^M+N ^
Notice tha t i f A = (a n) i s supporte d o n a n arithmeti c progressio n n = £
(mod /c) , and (/c , q) = 1 , we ca n chang e variable s t o deriv e
COROLLARY 7.1 2 . For any complex numbers a n with M<n^M + Nwe
have
2
(7-29) E E *| E a ne(^)\\(Q + k^N) J2 M 2
-
Q^Q a (mo d a ) n=£ (mo d q) n=l (mo d k)
(k,q) = l
EXERCISE 2 . Prov e th e followin g larg e siev e inequalit y i n several variables : le t
d ^ 1 and S > 0 an d le t a r = ( a r , i , . . . , ar ,d) b e 5-space d point s i n R d/Zd, i.e . i f
r ^ s , w e hav e ma x \\a r,i — a s,i\\ ^ S. The n
(7.30) ^ |^> n e ( n • a r ) | 2 < ((T d
+ N d)\\a\\2
rn
where x • y i s th e standar d scala r produc t i n R d , an d a n ar e arbitrar y comple x
numbers fo r n = (rci,.. . , n^) wit h 1 ^ n ^ ^ AT , the implie d constan t dependin g
only o n th e dimensio n d .
7. BILINEA R FORM S AND TH E LARG E SIEV E 179
7.5. Multiplicativ e larg e siev e inequality .
Instead o f additiv e characters , w e now tak e a s harmonic s th e Dirichle t charac -
ters. I f on e consider s X — {x \ X * s a characte r modul o g} , then b y expandin g th e
square an d applyin g th e orthogonalit y relatio n on e finds immediatel y
2
£ \^2a nX(n)\\(q + N)\\a\\
X (mo d q) n^N
which i s comparabl e t o (7.20) .
Much mor e significant , however , i s th e cas e wher e th e harmonic s ar e al l prim -
itive Dirichle t character s t o modulu s q ^ Q. Her e w e hav e th e basi c resul t o f
Bombieri an d Davenpor t [BD ] (thei r origina l resul t i s slightl y weaker) :
THEOREM 7.1 3 . For any complex numbers a n with M < n ^ M + N, where
N is a positive integer, we have
2
(7.31) E T ]^ I £ a nX(n)\\(Q + N-l)\\af.
q
q^Q ^ ' x (mo d q) M<n^M+N
This i s again a s strong a s (7.20) . Not e tha t withou t th e restrictio n t o primitiv e
X, this inequalit y woul d b e fals e b y a larg e facto r (tak e a n = 1 for al l n , the n th e
trivial character s modul o q ^ Q contribut e alread y abou t N 2Q). A n interestin g
feature o f (7.31 ) i s tha t th e boun d depend s onl y o n th e lengt h o f th e interval , bu t
not o n it s position . I n thi s aspec t th e larg e siev e inequality fo r Dirichle t character s
has som e advantag e ove r th e GRH .
P R O O F . W e ca n i n fac t giv e a slightl y mor e precis e estimate , whic h i s some -
times usefu l i n applications .
The primitiv e multiplicativ e character s x (mo d q) can b e expande d int o addi -
tive character s b y mean s o f Gaus s sum s (se e (3.1 2) )
a (mo d q)
for an y n. I f x i s not primitiv e bu t induce d b y a primitive Xs modulo s with q — rs,
then i f (r , s) = 1 , we hav e
. , /an\ v ^ * v^ */ ,x ( n^ cn \
E ^ a
H j ) =
^ £ X(bs + cr)e[ v + - )
a (mo d q) b (mo d r ) c (mo d s)
= x(n)Xs(r)c r(n)r(xs)
where c r(n) i s a Ramanuja n su m (3.1 ) . W e deriv e
^2aTnx(n)cr(n) = ^—- ^ x{o)S{a/q)
n \Xs) a ( m o d q)
for an y comple x number s a n , wher e S(a) i s th e trigonometri c polynomia l (7.21 ) .
180 7. BILINEA R FORM S AN D TH E LARG E SIEV E
By orthogonalit y o f characters w e then hav e
rs^Q V ;
x (mo d s ) n ^ QX (mo d q ) a
(r,s) = l
= £E * |E-«(f) "
g^Q a mo d g n
Applying Theore m 7.1 1 , we conclude tha t
(7-32) E -77 ^ E * I E «nX(«K(n)| 2
^(Q2 + iV-l)||a||2.
rs^Q X (mod s ) M<n^M+A ^
(r,s) = l
This inequalit y contain s (7.31 ) b y positivit y (ignor e al l term s wit h r ^ l ) . •
REMARK. Bombier i an d Davenpor t di d succee d i n getting interestin g applica -
tions of (7.32) b y exploitin g th e extr a summatio n ove r r.
7.4. Application s o f the large-siev e t o sieving problems .
In thi s sectio n w e explain ho w the larg e siev e inequality fo r additiv e character s
implies a sieve resul t i n the "ordinary " sens e (se e als o Chapte r 6 for background
on sieves) , an d giv e Linnik' s firs t applicatio n o f such a result fo r estimation o f the
least quadrati c non-residue .
Consider a finite se t M of integers an d a finite se t V of prime numbers . For
each p e V, le t Qp C Z/pZ b e a set of residue classe s "t o siev e out" . Th e dat a
(Ai,V,Cl) define s a sieving problem; th e correspondin g sifted set is
(7.33) 5 ( M , P , l ] ) = { m e A 1; m (mod p) 4 0, fo r all p G V\.
The goa l i s to estimate S — \S(M,V,Cl)\, th e cardinality o f S(M,V,fi,). Mor e
generally w e consider , fo r an arbitrary sequenc e o f complex number s a = (a n ), th e
sum
a
(7.34) Z = Yl -'
nes(M.v.n)
and w e see k a bound fo r Z in terms o f the ^2-nor m o f a. Naturall y i n this contex t
the nam e "larg e sieve " i s appropriate , becaus e i n contras t wit h othe r method s (lik e
combinatorial sieves , se e Chapte r 6) , w e ar e looking fo r strong estimate s eve n if
\QP\ is rather larg e whe n p is large.
We deriv e no w fro m Theore m 7.1 1 a result somewha t stronge r tha n Linnik' s
original versio n (se e [Rot] , [Mol] , [Hul] , [Bo2]) .
T H E O R E M 7.1 4 . Suppose M is contained in an interval of length N ^ 1 , and
assume Q p ^ Z/pZ for every p £ V, i.e., uj(p) = \Vtp\ < p. Then we have
(7.35) | Z|2<^±«!|H|2
for any Q ^ \, where
(7.36) H = Y^Kq)
7. B I L I N E A R F O R M S AN D T H E L A R G E SIEV E 181
and h(q) is the multiplicative function supported on squarefree integers with prime
divisors in V such that
u(p)
(7.37 ) h(p) =
p - uj(p)
In particular, taking for a n the characteristic function ofS(A4, V, Q), we have
N + Q2
(7.38 ) S^
H
Let S(a) b e th e trigonometri c polynomia l (7.21 ) , so S(0) — Z. Firs t w e estab -
lish th e followin g inequalit y fo r individua l moduli .
LEMMA 7.1 5 . For any positive squarefree number q we have
|2
S
(7.39 ) h(q)\S(0)\^ X T \ 0
a (mo d q)
P R O O F . Le t X(q, v), fo r v e Z/qZ, denot e
X(q,v)= ^ a n
n=v (mo d q)
Using additiv e character s w e hav e
av\„(a
*(«,") = ; £> e [ •-T^(i)'
-
a (mo d q)
so b y orthogonalit y (Planchere l formula ) i t follow s tha t
q £ \x( q,u)\*= Y: K^)f .
v (mo d q) a (mo d q)
If q — p i s prime, w e have X(p,v) = 0 for v G Hp , s o by Cauchy' s inequalit y w e ge t
\z\2 = |S(o)l2 = I J2 X
(P>")\
v (mo d p)
va (mo d p)
which yield s (7.39 ) b y subtractin g th e ter m fo r a = 0 (mo d p). No w i f q — q\q2
with (^1 ,^2 ) = I , w e hav e
a (mo d q) a\ (mo d q{) a 2 (mo d q 2)
Hence i f (7.39 ) hold s fo r q\ an d #2 ? we derive b y successiv e application s tha t
2
]T* | s ( ^ ) | % % 2) X) * \s(^)\ >h(qi)h(q2)\S(0)\2,
a (mo d q) a\ (mo d q\)
so the proo f o f (7.39 ) i s finished b y inductio n o n the numbe r o f prime factor s o f q.O
182 7. BILINEA R FORM S AND TH E LARG E SIEV E
P R O O F O F THEORE M 7.1 4 . Su m (7.39 ) ove r q ^ Q an d the n appl y Theore m
7.11. •
Linnik [Lil ] establishe d a n inequalit y somewha t weake r tha n (7.38) , ye t pow -
erful enoug h t o giv e a spectacular applicatio n t o th e proble m o f the leas t quadrati c
non-residue modul o a prim e p , i.e. , th e smalles t positiv e intege r q(p) suc h tha t
(q(p)/p) = — 1. Not e tha t q(p) i s prime . I t i s conjecture d tha t q(p) <C £ p£ wherea s
the bes t know n estimat e i s
q(p) < s / + £ , 0 = l/4V e = 0.1 51 6.. .
From th e Gran d Rieman n Hypothesi s fo r Dirichle t L-function s on e derive s tha t
q{p) <C (logp)2 wit h a n absolut e implie d constant .
THEOREM 7.1 6 (LINNIK) . Let e > 0. The number of primes p ^ N such that
q(p) > N £ is bounded by a constant depending only on e.
PROOF. Le t X £ b e th e numbe r o f prime s ^ yN wit h q(p) > N £, whic h w e
want t o sho w i s bounde d i n term s o f e. T o thi s en d conside r th e sievin g proble m
with
M = {1 ,2,...,7V} ,
£
V = {p ^ y/N | ( - ) = 1 for al l n < N },
ttp = {v (modp) | ( - ) = - 1 } .
Note tha t uj(p) = \{p— 1 ) and h{p) = (p — l)/(p + 1). Thi s i s indeed a "larg e sieve"
because h(p) ^ | .
The sifted se t contains the set, say Z £ , of all n ^ N whic h have no prime divisor s
larger tha n N £. Therefor e b y (7.8 ) wit h Q = y/N w e ge t Z £ = \Z £\ ^ 2NH~ l.
Combining thi s inequalit y wit h
\X£^ Y, h(p)^H
q{p)>Ne
we ge t X £Z£ ^ 67V . I t remain s t o estimat e Z £.
It i s know n tha t Z £ ~ 5(e)N fo r som e 5(e) > 0 a s N — > o c (se e e.g . [Bo2] ,
pp. 8-9) , s o the resul t follows . Alternatively , a sufficient lowe r boun d fo r Z £ follow s
2
by counting in the set Z £ number s o f special type n = mp\ • • • pk ^ N wit h N £~£ <
Pj < N £ fo r 1 ^ j ^ k = e~ x. W e ge t
_N
z£> V »iV.
^ Lpi---PkLz> i
p i , - ,Pk
Hence I £ < 1 . Finall y changin g i V t o N 2 an d e t o e/2 w e conclud e Linnik' s
theorem exactl y a s stated . •
See Proposition 7.3 0 for a n analogu e fo r ellipti c curv e usin g the larg e sieve typ e
inequality fo r symmetri c squar e L-functions .
7. BILINEA R FORM S AN D TH E LARG E SIEV E 183
EXERCISE 3 . (1 ) Consider th e followin g siev e i n dimension d : fo r any prim e
peP, le t ft(p)c(Z/pZ) d b e a subset wit h uj(p) = \Q(p)\ < p. Le t M C [-N, N}d.
Put
S(M,V,Q) = {m = ( m i , . .. ,m d) e M \ fo r p e V, m (mod p) £ Qp}.
Show tha t
S = \S(M9V,fi)| < (7V d + Q 2d)H-x
where i f i s given b y (7.36) an d the implied constan t depend s o n d [Hint : Us e
(7.30)].
(2) Fi x n > 1 . Let DM be the se t of monic polynomial s / G Z[X] o f degree n
with coefficient s i n absolut e valu e ^ N. Fo r r = ( r i , . . . , r^) wit h ri + 2r2 H = n
let C r C Div b e the subse t o f polynomials / fo r which / (mo d p) doe s no t facto r
as a product o f r\ linea r factors , T2 quadratic factors , ... , for any prim e p. Sho w
that
|cr|<ivn-hogjv,
the implie d constan t dependin g onl y o n n.
(3) Deduc e tha t th e se t C C D^ o f polynomials / fo r which th e Galoi s grou p
of th e splittin g field of / i s not S n satisfie s
\C\ < TV"- 1 logTV
the implie d constan t dependin g onl y o n n . [Hint : Us e the fac t tha t i f / ha s Galoi s
group H whic h i s a proper subgrou p o f 5 n , th e union o f the conjugate s o f H is
distinct fro m S n, henc e ther e i s a "splittin g type " r as in (2) such tha t / G Cr]
This is due to Gallagher [Ga4] .
Another interestin g applicatio n o f two-dimensional larg e siev e i s the proo f b y
Duke [Du5 ] that almos t al l elliptic curves E/Q have their p-torsion field with Galoi s
group GL(2 , Z/pZ) for all prime s p.
7.6. Panoram a o f the larg e siev e inequalities .
There i s a great variet y o f estimates o f the larg e sieve type which ar e used thes e
days i n analytic numbe r theory . I n this sectio n w e selecte d a few importan t one s
(without proofs ) t o make an impression abou t th e scop e of the matter .
Some application s requir e larg e siev e inequalities i n which th e se t o f harmonic s
consists o f characters twiste d b y exponentials o f different kind . Th e first case of
such a hybrid larg e siev e is due to P.X. Gallaghe r [Gal] .
THEOREM 7.1 7 . Let N, Q, T ^ 1 . For any complex numbers a n we have
E E * / T |E a «^ n ) ni f df<< ^ 2T+iV )iHi 2 -
q ^ Q x (mo d g ) T n^N
A discret e versio n o f this result , amon g furthe r generalizations , ca n b e found
in [Mo2] .
Of course , twistin g th e multiplicative character s b y additiv e one s doe s not
produce stronge r results , bu t if the exponentia l componen t i s not linear, the n i t
is likel y t o be orthogonal t o the characters an d one ca n receive extr a savin g (in
Theorem 7.1 7 th e exponentia l componen t i s e(^plogn)). Her e i s another exampl e
of som e importanc e fo r applications (se e [DuI3]) .
184 7. BILINEA R FORM S AN D TH E LARG E SIEV E
THEOREM 7.1 8 . Let v(x) be a real, smooth function on M + such that 0 <
x\v'(x)\ < 1 and \v'(x)\ 2 < \v"(x)\. Then for X ^ Q ^ 1 we have
E E * |Ea^(n)e(x )| 2 < < ( i V + 0 x l o g X )1
' ^ l|a| 2
-
q^Q x (mo d q) n^N
where the implied constant is absolute.
If th e linea r form s Y2 anX(n) a r e lacunar y (man y coefficient s a n vanish) , the n
the powe r o f th e larg e siev e reduce s significantly . Ther e i s a grea t deman d fo r
results whic h woul d b e sensitiv e i n thi s respec t (cf . P . Elliot t [El]) . O . Ramar e
worked ou t man y interestin g estimate s o f suc h kin d (unpublished) .
PROBLEM 7.1 9 . Prove that for any complex numbers a n supported on primes
|2
E E * |E a «*( n )| «(Q 2 + Ar(log^V)-1)IH 2
q^Q X (mo d q) n^N
where the implied constant is absolute.
A mor e genera l proble m i s t o fi x / G Z[X] an d establis h goo d larg e siev e
inequalities fo r linea r form s
J 2 a n e ^ ) , J2a nX(f(n)).
< a
H
2<N n<N
One shoul d realiz e how fortunate i s the cas e o f large sieve for coefficient s o f ful l
density, becaus e th e dualit y principl e work s efficiently. Fo r sequence s o f coefficient s
an whic h ar e ver y sparse , th e dualit y argument s ar e poo r an d ver y littl e ca n b e
done directl y whil e prediction s fo r th e bes t possibl e estimate s ar e risky .
In th e sam e not e i t i s a challengin g proble m t o establis h th e bes t larg e siev e
inequality fo r harmonic s whic h ar e primitiv e character s o f a fixe d order . Her e w e
have a powerfu l resul t o f D . R . Heath-Brow n [HB1 ] fo r quadrati c character s (se e
also th e earlie r resul t o f M . Jutil a [Ju2]) .
THEOREM 7.20 . For any complex numbers a n we have
Eb |E ba
™ Q f «( M7V £ M+iV
)( )Ni
Here the superscript b restricts the summation to squarefree numbers, ( —) stands
for the Jacobi symbol, e is any positive number and the implied constant depends
only on e.
R E M A R K S . Th e fac t tha t w e ca n vie w ( ^ ) a s eithe r a characte r i n n o f mod -
ulus m , o r a characte r i n m o f modulu s 4 n (b y wa y o f th e quadrati c reciprocit y
law) i s th e ke y featur e o f th e proof . Not e tha t th e proble m i s self-dual , i.e . inter -
changing m an d n doe s no t matter . Fo r interestin g application s o f Theore m 7.20 ,
see [HB1 ] , [Sou] , [IM] , [PP] .
In connectio n wit h Kloosterma n sum s on e meet s character s ( ^ p ) wher e / i s
a quadrati c polynomial . Th e correspondin g larg e siev e wa s considere d i n [II ] wit h
the followin g resul t
7. BILINEA R FORM S AND TH E LARG E SIEV E 185
THEOREM 7.21 . For any complex numbers a n we have
I / n 2_ 4 \| 23 2
an £ |2
X \J2 {—^-)\
m
^(MN) (M2 +M3N)\\a
where e is any positive number and the implied constant depends only on e.
EXERCISE 4 . Le t S(ra , n;c) denot e th e classica l Kloosterma n sum . Fo r an y
complex number s a m , /3 n we hav e
E | X Y^PnSim.^c^iC^M + N^alMl
EXERCISE 5 . Fo r an y comple x number s a m , (3 n we hav e
2
E E * | EE ^ „ e ( = ) | % ( Q + MAr)||a||2||/3||2.
q^Q a (mod q) m^M,n^.N
(mn,q) = l
The estimate s i n th e abov e exercise s ca n b e easil y derive d fro m th e larg e siev e
inequalities (7.1 1 ) , (7.1 3 ) respectively . Muc h harde r i s th e followin g estimat e fo r
bilinear form s wit h Kloosterma n fraction s du e t o W . Duke , J . Friedlande r an d H .
Iwaniec [DFI4] .
THEOREM 7.22 . Let a be a positive integer. For any complex numbers am, (3 n
we have
E E <^0ne(a^)<z(MNy(± + ±) m(a + MN)ka\\
(m,n) = l
where the implied constant depends only on e.
REMARKS. Withou t th e facto r ( M _ 1 + i V - 1 ) 1 / 5 8 , thi s boun d woul d b e jus t
trivial. I t i s ofte n crucia l fo r application s t o hav e a savin g factor , les s importan t
how larg e i t is . Th e proo f o f Theore m 7.2 2 use s th e amplificatio n method ; th e
principle o f thi s i s explaine d a t th e en d o f Chapte r 26 .
Bilinear form s i n comple x domain s ar e als o desire d fo r applications . W e selec t
from [FI1 ] tw o result s fo r th e Jacobi-Dirichle t symbo l (^ ) an d fo r th e Jacobi -
Kubota symbo l [wz] (for definitions , se e Sectio n 3.7) .
T H E O R E M 7.23 . Let a w, (3 Z be any complex numbers with \a w\ ^ 1 , \fi z\ ^ 1
for w, z in the discs \w\ 2 < M, \z\ 2 ^ N. Then
X * X a - ^ ( ^ ) <:(M + N)T2(MNo-
where the * restricts the summation to the primary primitive numbers w, e is any
positive number and the implied constant depends only on e.
Using th e twiste d multiplicativit y (3.71 ) o f th e Jacobi-Kubot a symbo l on e ca n
derive from Theore m 7.2 3 the following estimat e (subjec t t o some minor conditions )
X* X*awf3z [^ < c^+^o ^ (MN) 7o+
^£
186 7. B I L I N E A R F O R M S AN D T H E L A R G E SIEV E
With simila r result s bu t i n a differen t fashio n on e ca n trea t bilinea r form s i n
ijjf(mn) wher e ij)f{() ar e th e normalize d Fourie r coefficient s o f a fixed cus p for m
/ G 5/c(To(AO, ^) o f weight k = hal f a n od d integer , an d v i s the compatibl e thet a
multiplier (se e (1 .53 ) an d (3.42)) . Thes e coefficient s ar e no t multiplicativ e s o a
cancellation i n general bilinear form s i s possible. Th e followin g estimat e wa s proved
in [Dull] ,
b
Y! E <*™Mf{rnn) « (MNY{M\ + MIA0||a||||/3| |
where a m , /3 n are an y comple x number s an d £ is an y positiv e number , th e implie d
constant dependin g onl y o n e an d th e cus p for m / . Probabl y thi s genera l estimat e
holds tru e wit h M 1 / 2 + M 1 /4 7V replace d b y M + N. Hav e i n min d tha t th e boun d
i/jf(l) < C I s i s expected t o be true fo r £ squarefree bu t i t i s not ye t proved ; i t i s essen-
tially equivalent wit h the Lindelof Hypothesi s for a suitable automorphi c L- function
twisted b y th e rea l characte r \i m th e ^-aspec t (vi a Shimur a correspondenc e an d
Waldspurger formula) .
7.7. Large-siev e inequalitie s fo r cus p forms .
The Fourie r coefficient s o f cusp forms, o r better th e eigenvalue s o f Hecke opera -
tors i n the spac e o f cusp forms , ar e analogue s o f Dirichlet characters . A s tools the y
are powerfu l du e t o th e orthogonalit y i n th e sens e o f th e larg e siev e inequalities .
There ar e tw o essentia l aspect s o f th e matter , th e spectra l an d th e leve l aspects .
Moreover, som e hybri d aspect s als o appea r i n practice .
Let (UJ) be a n orthonorma l basi s o f Maass cus p form s fo r T = To(q) wit h q ^ 1
(see Theore m 1 5.5 ) an d le t Xj = Sj(l — Sj) wit h Sj = \ + itj b e th e correspondin g
eigenvalues o f th e Laplac e operator . Le t
Uj(z) = ^/yYl pj(n)K itj(27r\n\y)e(nx)
be th e Fourie r expansio n o f Uj a t th e cus p o o (se e Lemm a 1 5.1 ) . Conside r th e
normalized coefficient s
2
(7.40) Vj(n)= ( k ) Pj(n), fo rn ^ 0 ,t 3 > 0.
These ar e expecte d t o b e essentiall y bounde d o n averag e wit h respec t t o n , q, tj.
Using th e Kuznetso v formul a (Theore m 1 6.3 ) on e ca n sho w
THEOREM 7.24 . Let q ^ 1 , T ^ 1 and N ^ 1 . For any complex numbers a n
we have
(7-41) E | E <W n
) f « (<1 T2
+ NlogN)\\a\\ 2
where the implied constant is absolute.
Many result s o f thi s typ e ar e establishe d i n [DI] , bu t wit h a slightl y weake r
term iV 1 +£ i n plac e o fT V logTV (probabl yT V would suffice) . Not e tha t qT 2 i s
approximately th e numbe r o f cus p form s Uj(z) fo r r 0 (g) wit h 0 < t 3 ^ T b y
Weyl's Law .
7. BILINEA R F O R M S A N D T H E L A R G E SIEV E 187
Still more orthogonalit y i s expected fro m th e primitiv e cus p forms . Th e follow -
ing estimat e i s a n ope n proble m
PROBLEM 7.25 . Let Q ^ 1 , T ^ 1 and N ^ 1 . Prove that for any complex
numbers a n
2
(7.42) E E * | E «n^-(n)| « (Q 2T2 + N)\\af
where the implied constant is absolute and ^ restricts Uj to the primitive cusp
forms, so T nUj — Xj(n)uj and Vj{n) = \j{n)h>j(l) for all n ^ 1 .
Similar inequalitie s hol d tru e fo r holomorphi c cus p forms . I n thi s cas e mor e
precise result s ca n b e derive d fro m th e Petersso n formul a (Propositio n 1 4.5) . Le t
T b e a n orthonorma l basi s o f Sk(q). Le t
f(z) = J2 af(n)e( nz)
be the Fourie r expansio n o f / a t oo . W e consider th e normalize d Fourie r coefficient s
(7-43) ^ ( n ) = (^zJl)3 a / ( n ) .
Recall tha t i f k ^ 2 we hav e
1
(7.44) \T\ = dim S k{q) x kq J] ( 1 +P' ).
P\Q
THEOREM 7,26 = Let J be any orthonormal basis of Sk{q) with k > 2 . Then
for any complex numbers a n we have
(7.45) ^ | V J a n ^ ( n ) | 2 « ( 9 + iV)||a|| 2
where the implied constant is absolute.
PROOF. B y (7.43 ) an d Propositio n 1 4.5 , th e Petersso n formul a implie s tha t
the lef t sid e o f (7.45) , sa y L(q), equal s
'AiT\/mn
g||a|| 2 + 27rg i k ^1
c .ama^^a rnanS(m,n]c)Jk-i(-
nb\m,n;c)Jk-i[
c = 0 (mo d q) m,n^N
Opening th e Kloosterma n su m w e deriv e b y orthogonalit y o f additiv e character s
and b y Cauchy' s inequalit y tha t
| ^ ^ a m a n S ( r a , n ; c ) | ^ (c + N)\\a\\ 2.
Next (t o separat e th e variables ) w e derive vi a th e powe r serie s expansio n
(-l)exk-1+2e
J f c _ i ( 2 z ) = E OT
OO (fc + £
)
188 7. BILINEA R FORM S AN D TH E LARG E SIEV E
that
\^2^arnanS{m,n]c)Jk.ly—y- J J ^ I k-i{— J (c + N)\\a\\ 2
m,n^.N
where
k — l-\-2£
The conditio n x — 27Ty/mnc~ 1 ^ 2TTNC~ 1
^ 1 holds fo r any c = 0 (mod q) if
g ^ 2TTN, givin g
1
L{q)^q\\a\\2 + 2nq £ c" (^f)\c + N)\\ a l l 2 ,
c=0 (mod q)
hence (7.45 ) follow s i n this case .
To remov e th e condition q ^ 2TTN w e observ e tha t Sk(q) C Sk(pq) an d the
index i s [ro(p# ) : To(g) ] ^ p + 1 . Usin g thi s embeddin g wit h appropriat e re-
normalization w e find tha t L(q) ^ ( 1 + p~1)L{pq). I f q ^ 27r7V , w e ca n choos e p
with 27ri V ^ pq ^ ATTN an d appl y th e resul t fo r L(pq) gettin g (7.45) . •
With a sligh t modificatio n th e above argument s wor k als o fo r S k(q,x) w r t n
any characte r \ (mo d q) an d fo r k = 2 . However , th e case fc = 1 is very differ -
ent becaus e th e spac e Si(q,x) * s small (probabl y dim5i(g,x ) < ^ v ^ ° S t f i f X *s a
real character) . Technicall y speakin g thi s intrinsi c distinctio n manifest s itsel f by
the lac k o f convergence o f the serie s o f Kloosterman sum s i n the Petersson for -
mula. Wha t lie s behin d th e scen e i s the hug e spectru m o f Maass cus p form s nea r
the botto m o f which onl y a small par t (th e holomorphi c form s a t the bottom) is
detected b y the Petersson formul a (th e lac k o f spectral completenes s cause s the
problem!). I n this scenari o on e ca n procee d b y the duality principl e (n o spectra l
completeness i s required then) , bu t a new obstacl e emerge s wit h th e degre e o f the
automorphic harmonic s (GL(2) x GL(2) i s essentially GL(4) ; t o the contrar y for
Dirichlet character s GL(1 ) x GL(1 ) i s still GL(1 )) whic h amplie s th e conductor.
Nevertheless som e non-trivia l orthogonalit y ca n be established fo r sufficiently lon g
vectors relativ e t o th e level . Fo r exampl e W . Duk e succeeded i n proving alon g suc h
lines th e followin g larg e siev e typ e inequalit y (no t perfec t bu t quit e useful )
THEOREM 7.27 . Let x he an odd primitive quadratic character modulo q. Let
T he an orthonormal basis of Si(q,x) an ^ ^ ( n ) ^ e ^e corresponding normalized
coefficients. Then for any complex numbers a n we have
a
(7.46) £ | E " ^ ) | 2 « ( 9 + A0IHI2
where the implied constant is absolute.
Duke [Dul ] gav e an important applicatio n o f (7.46) fo r estimatin g th e dimen -
sion of Si(#, x)- Specificall y h e proved
dimSi(g,x)«411/12+£
for q prime an d x a r e a l primitiv e characte r modul o g , the first improvement ove r
the boun d dimSi(q,x) ^ tf/l°g# whic h come s fro m th e trac e formula .
7. B I L I N E A R F O R M S A N D T H E L A R G E SIEV E 189
Using th e existenc e o f Rankin-Selber g L- functions fo r arbitrar y degre e auto -
morphic forms , Duk e an d Kowalsk i [DK ] hav e establishe d b y th e dualit y principl e
a ver y genera l typ e o f larg e siev e inequalit y whic h ha s application s especiall y t o
the stud y o f automorphi c //-function s clos e t o th e lin e R e (s) = 1 . W e mention th e
special cas e o f symmetric squar e coefficient s (se e Sectio n 5.1 2 ) an d giv e a sketc h o f
proof t o illustrat e th e wealt h o f ingredients .
THEOREM 7.28 . For q squarefree, let S2(q)* be the basis of primitive forms of
level q and weight 2 . Let Xf(n) be the Hecke eigenvalues for f G S^^)*- For any
complex numbers a n we have
(7-47) E E lE^/^r^^ 1 0
^)1 ^^^^)!!^ 2
q^Qfes2(q)* n<:N
if N ^ Q ^ 1 , with an implied constant depending only on e.
SKETCH O F PROOF . On e use s dualit y an d on e ca n attac h a smoot h tes t func -
tion t o th e su m ove r n , reducin g th e proble m t o estimatin g
where / , g G £ 2 (<?)* > f°r som e fixed smoot h functio n ip with compac t suppor t suc h
that <p(x) = 1 for 0 ^ x ^ 1 . (Th e Heck e eigenvalue s Xf(n) fo r / G Sl(q) ar e real) .
By Melli n inversio n w e hav e
H
(f^) = ^~ [ G(s)0(s)Nsds
2m
(2)
where
G(s) = 5 > / ( n 2 ) A 9 ( n 2 ) n - .
n
2
Using th e fac t tha t A/(n ) are , u p t o smal l perturbation , coefficient s o f th e sym -
metric squar e L- function (se e Sectio n 5.1 2 ) an d th e descriptio n o f Rankin-Selber g
L-functions o f cus p form s o n GL(3) , i t follow s tha t
G(s) = L(Sym 2 / 0 Sym 2 g, 5 )G?i(5)
where Gi(s ) i s given by a n Eule r produc t absolutel y convergen t fo r R e (s) > \. W e
have Deligne' s boun d |A/(n) | ^ r(n) (se e (1 4.54)) , whic h implie s
G1(5)«|C(<T+i)|^«(a-i)^
for R e (5) = a > | , fo r som e absolut e constan t A > 0 , wit h a n absolut e implie d
constant.
It i s know n (b y propertie s o f Rankin-Selber g convolutions ) tha t L(Sym 2 / ®
Sym2 g,s) i s entire excep t i f Sym 2 / = Sym 2 g, i n whic h cas e ther e i s a simpl e pol e
at s — 1 . Moreover , i n th e Appendi x t o [DK] , Ramakrishna n show s tha t thi s
exception i s equivalen t wit h / = g i n th e cas e o f squarefre e level s (i n th e genera l
190 7 . BILINEA R F O R M S AN D T H E L A R G E SIEV E
case, a quadrati c twis t ca n occur) . S o moving th e contou r t o th e lin e a = | + S for
6 > 0, w e ge t
2 2
#(/>$) = 2 ^ / G^LiSym /® Sym g,s)<p(s)N*ds
for / ^ g . Fo r < J = | - f J, w e use th e convexit y boun d (se e Exercise 3 of Chapte r 5 )
L(Sym 2 / ® Sym2 g, a + it ) < q(Sym 2
/ ® Sym2 g, a + 2t) 1 / 4 ~ < 5 / 2 + £
for thi s L-functio n o f degre e 9 , wher e q(-,s ) i s th e analyti c conducto r define d i n
(5.7). W e hav e
q(Sym2 / <g > Sym 2 g, a + tf) < Q 6 (|t| + l ) 9
by the bound (5.1 1 ) o f Bushnell an d Henniar t fo r the conductor o f a Rankin-Selber g
convolution, wit h a n absolut e implie d constant . B y the rapi d deca y o f <£(s) , we get
for f^g,
6
H(f,g) < N^ Q3/2-36+£5-A
for an y S > 0. W e tak e 5 = (logiV) - 1 s o
+£
H(f,g) « iV5 Q3/2
forT V ^ Q an d / ^ # .
If / = <7 , by Deligne' s boun d w e hav e |A/(n 2 )A^(n 2 )| < r(n 2)2 ^ r ( n ) 4 henc e
by direc t estimatio n
# ( / , / ) «iV(logTV) 1 5 .
2
Since ther e ar e < C Q form s o n th e lef t sid e o f (7.47) , i t follow s tha t fo rT V ^ Q we
have (7.47) . •
In thi s connectio n w e suggest th e followin g proble m a s a firs t cas e o f a higher -
degree larg e siev e inequality :
PROBLEM 7.29 . Prove that
J2 | ^ a n A / ( n 2 ) | 2 « ( g i V r ( g + iV)||a|| 2
feHk(q) n<:N
where Hk(q) is a Heche basis of eigenforms on Sk(q), with q squarefree, and the
implied constant depends only on e and k.
We ca n deduc e fro m Theore m 7.2 8 th e analogu e o f Linnik' s resul t (Theore m
7.16) fo r ellipti c curves .
PROPOSITION 7.30 , Let A > 0 and Q > 2 be fixed. For E/Q a semistable
elliptic curve of conductor ^ Q, let M(E) be the number of semistable elliptic
curves F/Q with conductor ^ Q such that ap(p) — Q>E(P) f or a ^ V ^ (l°gQ) A -
Then we have
M{E) < QP' A
where the implied constant depends only on A.
It i s conjectured tha t th e numbe r o f semistable ellipti c curve s of conductor ^ Q
is abou t Q 5 / 6 ; th e lowe r boun d i s eas y b y constructin g explici t families , bu t th e
best know n uppe r boun d i s Q 1 + £ fo r an y e > 0 (se e [DK]) . Therefor e ou r resul t i s
non-trivial fo r an y A > 1 1 .
7. B I L I N E A R F O R M S AN D T H E L A R G E S I E V E 191
PROOF. Fo r a given weight 2 primitive form / o f conductor q < Q , q squarefree,
let M'(f) b e th e numbe r o f primitive modula r form s g, o f weight 2 with squarefre e
conductor ^ Q suc h tha t A/(p ) = \ g(p) fo r p ^ (logQ) A. B y modularit y o f
(semistable) ellipti c curve s ove r Q , w e hav e M(E) < M^/E) wher e JE i s th e
weight 2 form associate d t o E. Not e tha t A/(n ) = A^(n ) i f al l prim e divisor s p \ n
are < (logQ) A.
As i n Linnik' s proof , th e ide a i s t o fin d coefficient s a n supporte d o n prim e
numbers tha t mak e th e linea r for m
Lf=^T a nXf(n)
large, henc e L Q = Lf i s larg e an d repeate d fo r al l g counte d i n M f(f). O n th e
other hand , th e larg e siev e estimat e (7.47 ) wil l sho w tha t thi s canno t happe n ver y
often producin g a boun d fo r M f(f). Becaus e th e coefficient s \f(p) ca n b e quit e
small (i n contras t wit h Dirichle t characters) , a n = A/(n ) ma y no t work . Henc e
one i s tempte d t o us e Propositio n 1 4.22 , bu tT V ther e woul d b e to o smal l (limite d
to N ^ (\ogQ) A). Instead , notic e tha t b y th e Prim e Numbe r Theore m an d th e
formula
\f(p)2-\f(p2) = lfov(p,q(f)) = l,
one o f th e set s T\ o r T 2 give n b y
Ti = {p < (\ogQ) A I \\ f(p*)\ > i an d (p,q(f)) = 1 }
satisfies \T{\ ^> (log(5) A (^41 oglog(5) _1 wit h a n absolut e implie d constant . Assum e
it i s T2 (th e cas e o f T\ i s similar , bu t simpler) .
We hav e |A/(n) | ^ 2~^^ i f n i s squarefre e wit h al l it s prim e factor s p G T2.
Fix m ^ 1 (t o b e chose n later) . Le t a n = Xf(n) fo r suc h n wit h uj(n) = ra, an d
an — 0 otherwise. I t follow s tha t
2
£/=£w =EiA/(n)i2^2"2mc/
where U i s th e numbe r o f value s o f n ^ T V satisfying th e desire d condition . B y
Theorem 7.28 , w e hav e
1 5
M'{f)L) « iV(logA0 ||a|| 2 = iVaogTV) 1 5
^
for an y N ^ Q s. Henc e fo r N = Q s w e ge t
M'(/)«Q8(logQ)1522mC/-1.
Choosing m = [8(logQ)(AloglogQ)- 1 } w e ge t 2 2 m < Q e
an d
showing tha t
8/A
M'(f) « Q +£ « Q 9/A
for e smal l enough . •
192 7. B I L I N E A R F O R M S A N D T H E L A R G E SIEV E
7.8. Orthogonalit y o f elliptic curves .
A straightforwar d approac h t o the large siev e inequalitie s require s near-ortho -
gonality an d almost completenes s o f the outer harmonics . However , som e space s
contain smalle r subspace s whic h preserv e completenes s wit h respec t t o thei r own
structure. The n provin g a larg e siev e typ e inequalit y fo r the subspace become s a
harder ye t feasible proble m o f independent interest . A clear exampl e i s offered b y
the famil y o f elliptic curve s
(7.48) E : y 2 = x 3 + ax + b
with a, 5 G Z , K a ^ i , 1 ^ K 5 . Th e corresponding modula r form s f ab ar e o f
weight 2 and level q ^Q = 16(4A 3 + 27B 2). W e hav e abou t AB suc h curve s whil e
the tota l numbe r o f all primitive cus p form s o f weight tw o and level q ^ Q is about
Q2. Therefor e w e are dealing wit h a very smal l subse t indeed .
In thi s sectio n w e present a large siev e typ e inequalit y fo r the harmonics asso -
ciated wit h th e family o f elliptic curves . I f m is squarefree th e Hecke eigenvalu e for
the cus p for m associate d t o (7.48 ) i s given b y the character sum
(7.49) X ab{m) = fi(m) ^ (
m
x (mo d m )
(see Sectio n 1 4.4) . However , i t i s mor e interestin g t o conside r th e sum over th e
reduced classe s x (mo d m) , (x,m) = 1 . Put
' x3 + ax + b
(7.50) K b(m)=,(m) £ * ( ~ ^ ) -
x (mo d m )
Note tha t
W = E"M(^(7)-
d\m
THEOREM 7.31 . For any complex numbers a a, /3 & we have
where e is any positive number and the implied constant depends only on e.
PROOF. W e can assume that m is odd squarefree. B y Gauss sums (se e Theorem
3.2)
m 3
Kb(m) = M )"T y ]( — ) Y\ e m (2:(x + ax + b))
z (mo d m) x (mo d m)
e
= /x(m ) ~p= ^2 ™ (ax) Yl ( — )em(z2x3 + zb).
x (mo d m) z (mo d m )
Hence the inner su m in (7.51 ) equal s
a 2 3
//(m)-p= ^2 \^ aern(ax))(^2f3b ^2 e m(z x + zb)j
m
x (mo d m) <*> b z (mo d ra)
7. B I L I N E A R F O R M S A N D T H E L A R G E S I E V E 193
By Cauchy' s inequalit y th e lef t sid e o f (7.51 ) i s bounde d b y (AB) 1 ^2 wher e
A=
H E * |E a«e™(ax)f < I H | 2 ( A + M 2 )
mx (mo d m) a
and
B
= E^m E * I E A E e m(^ + zb)
x (mo d m) b z (mo d m)
<E~E IE ^ E e ^ v + zft )
ra a : (mo d m ) fc> z (mo d m)
TO z\ = z\ (mo d m ) *> i ,& 2
2
<£73(m)T(m) ^ * |/? 6em(^)| « ||/J|| 2 (B + M 2)M*.
mz (mo d m )
This complete s th e proo f o f (7.51 ) . •
REMARKS. Pro m th e Hass e boun d A* 6(m) < m 1 /2-^?™) (se e Theore m 1 1 .25 )
it follow s tha t th e lef t sid e o f (7.51 ) i s < C \\a\\\\f3\\(AB) 1 /2M3/2log2M, whic hwe
regard a s a trivial bound , becaus e n o cancellation i s used. I n application s on e ofte n
wishes t o hav e a savin g slightl y mor e tha n M 1 / 2 . Ou r theore m guarantee s thi s
saving relative to the trivial bound i f AB i s larger than M 2. Choosin g A = X 1 / 3 an d
B = X 1 !2 w e have the elliptic curves with discriminant A a& = — 16(4a3 +276 2 ) < C X
and w e obtaine d th e desire d savin g i f M < X 5 / 1 2 . Thi s i s quit e a larg e rang e fo r
M, bu t no t ye t completel y satisfactory . Indeed , fo r application s t o L- functions on e
needs t o pas s th e barrie r M — X1/2. Th e followin g conjectur e woul d provid e th e
passage
CONJECTURE 7.32 . For any complex numbers (7 m ) supported on squarefree
numbers we have
I |2
EE 7rnKb{m)\ <(A + M)M J2 1 7
mT(m)
where the implied constant may depend on b slightly.
When b i s a squar e w e ar e havin g ellipti c curve s o f positiv e rank . Simila r
estimates ar e expecte d t o b e tru e fo r specia l familie s o f ellipti c curve s o f a give n
rank.
EXERCISE 6* . Le t m b e a positive , odd , squarefre e number . Prov e tha t fo r
any comple x number s a a , /?& ,
J 2^ 2 ^ a «&A a 5 (m) < r 4(m)(A + m) 2 (B + ra) 2 ||a
B^b^2B
194 7. B I L I N E A R F O R M S AN D T H E L A R G E SIEV E
7.9. P o w e r - m o m e n t s o f L-functions .
We hav e remarke d tha t a shar p larg e siev e typ e inequalit y (7.1 8) , (7.20 ) fo r
coefficients o f a famil y o f L-functions , ha s strengt h comparabl e t o th e Gran d Rie -
mann Hypothesi s o n average . S o it i s not surprisin g tha t th e larg e siev e can als o b e
employed t o deriv e estimate s fo r average s o f L-functions whic h ar e a s stron g a s th e
Lindelof Hypothesi s woul d giv e (se e Corollar y 5.20) . Ther e i s a vas t literatur e o f
relevant results . Fo r classica l L-function s w e recommen d th e boo k b y A . Ivi c [Iv] .
In thi s sectio n w e only gras p th e industr y b y showin g a fe w representativ e results .
THEOREM 7.33 . For T ^ 2 we have
2
(7.52) / |C(|+it)| dt<TlogT.
THEOREM 7.34 . We have
8
(7.53) J2 E * l^+^x)| « Q 2 ( * 2 + l)(logQ(|<|+2))17
g ^ Q x ( m o d q)
for any t G R , where the implied constant is absolute.
THEOREM 7.35 . Let k ^ 2 and J 7 be a Heche orthogonal basis of Sk(q)- We
have
(7.54) Y, l L(/» \ + ft )l4 « <*? + l)(loK^r(|*| + 2))17
fer
for any t E l , where the implied constant depends only on k.
REMARKS. I n al l cases , slightl y mor e precis e estimate s ar e know n (i n som e
cases asymptoti c formulas) . Se e [T2 ] fo r th e Rieman n zet a function , an d [KMV1 ]
for cus p forms . Als o (7.53 ) an d (7.54 ) ar e onl y comparabl e t o th e Lindelo f hy -
pothesis i n q- aspect. I n t-aspec t the y ar e a s goo d a s th e convexit y bound , extr a
averaging ove r t woul d improv e th e situation .
In Chapter 26 , as a by-product o f a deeper study of special values of L-functions ,
we prov e asymptoti c formula s fo r th e first an d secon d moment s o f derivative s
L ; (/, | ) wit h respec t t o th e od d weigh t 2 cusp form s (se e (26.35 ) an d (26.36)) . I n
fact, th e moder n method s use d i n the analyti c stud y o f special values of L-function s
(and thei r zeros ) rel y heavily o n various type s o f averaging, som e of which ar e quit e
sophisticated indee d (the y requir e orthogonalit y o f a capacity beyon d th e diagonal ,
so to speak) . Se e the introductio n t o Chapte r 2 6 for som e references , t o whic h ca n
be adde d [CI1 ] (amon g man y others) . Se e als o Sectio n 9.2 .
SKETCH O F PROO F O F THEOREM S 7.33 , 7.34 , 7.35 . Th e argument s ar e quit e
familiar s o w e onl y indicat e th e essentia l steps . Th e metho d start s wit h suitabl e
short approximation s o f L-function s b y Dirichle t polynomial s (se e Chapte r 5) , an d
then applie s a large sieve inequality t o th e resultin g linea r forms . Withou t compro -
mising estimate s on e ca n tak e a momen t o f orde r 2k a s lon g a s th e lengt h o f th e
approximation fo r th e fc-th power i s bounde d b y th e tota l numbe r o f "harmonics" .
For instance , i n Theorem 7.34 , ther e ar e abou t Q 2 character s involved , an d fo r eac h
one th e approximatio n t o L ( | + it, \) * s °f lengt h abou t (q(\t\ -\- l)) 1 ^ 2 , s o w e ca n
apply Theore m 7.1 3 fo r th e mea n squar e o f L( ^ - f z£,x) 4 , an d ge t a boun d fo r th e
eighth power-momen t whic h i s nearl y bes t possible .
7. B I L I N E A R F O R M S AN D T H E L A R G E SIEV E 195
We sketc h Theore m 7.3 4 thi s way , leavin g th e othe r tw o theorem s a s exercise s
(use Theore m 7.1 7 wit h Q = 1 and Theore m 9. 1 i n th e firs t case , o r Theore m 7.2 6
in th e las t case) .
Let x b e primitiv e modul o g , 2 ^ q ^ Q. W e hav e
s
L(s,xY = ^x(ji)T A(n)n-
w e se e
for R e (s) > 1 . B y Theore m 5. 3 fo r L(s, \Y tha t L ( | - f z£ , x)4 i s equal t o
^X(n)r4(n) / n \ 4-4^(5-^ ) V - x H ^ w p \
where V(2/ ) — Vs(y) an d W(y ) = W s (y) ar e give n b y (5.1 3 ) an d (5.1 4 ) wit h th e
gamma facto r 7(5 ) = TT~ 2ST(^(S + ft)) 4 fo r n = | ( 1 — x( — l)) an d som e suitabl e
choice o f auxiliar y functio n G(u). Th e function s V(y) an d VF(y ) deca y rapidl y fo r
y ^ > t 2 + 1 (se e Propositio n 5.4 ) s o practicall y w e ar e lef t wit h sum s o f lengt h
TV x q 2(t2 + 1 ) . B y Theore m 7.1 3 w e ge t
£ E * |£^r«« !+»><'<*»> 17
Using thi s estimat e essentiall y fo r N x Q (t + 1 ) on e derive s (7.53) . •
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/09
CHAPTER 8
EXPONENTIAL SUM S
8.1. Introduction .
As usua l w e denot e th e additiv e characte r o n R b y e(x) = e 27Tlx. Exponentia l
sums whic h w e consider i n thi s chapte r ar e o f typ e
(8.1) S f(N)= £ e(/(n) )
where / i s a smooth, rea l value d functio n o n th e interva l [1 , N] whic h i s calle d th e
amplitude functio n an dT V is a positive intege r calle d th e lengt h o f the sum . We
could begi n th e summatio n a t any poin t an d conside r
e
(8.2) S f(M,N)= Yl (/(n))
M<n^M+N
where M is an y intege r an d / i s a smooth functio n o n [M + 1 , M + N]. O f course ,
the latte r ca n b e transforme d t o th e forme r b y shiftin g th e argument ,
Sf(M,N)= V J e(/( n + M)).
We have th e trivia l boun d |5/(M , N)\ ^ N and ou r goa l wil l b e t o improv e o n thi s
bound a s muc h a s w e can . However , i n man y case s eve n a slight improvemen t o f
the trivia l boun d i s sufficient fo r applications .
Exponential sum s Sf(N) fo r suitable / ar e encountere d i n various area s of
analytic numbe r theory . Fo r example , the y ca n b e use d t o estimat e th e Rieman n
zeta functio n ((s) o n vertica l lines . Indee d b y th e simpl e approximatio n
s
/V 1_s
(8.3) <(*) = J2 n- + j-^+0(N-n
for s = aH- it with a ^ \ an d 1 ^ t ^ N the proble m reduce s t o estimatin g sum s
which ar e o f typ e (8.2 ) wit h f(x) — =£ logx. Anothe r exampl e i s the proble m o f
counting lattic e point s insid e a planar domain . Thi s reduce s t o estimatin g integra l
points unde r a curve y = g(x), wher e g is a smooth, positiv e decreasin g functio n
on [1 , TV]. Th e numbe r o f point s i n questio n equal s
E \9(n)]= E 9(n)~ E ^W ) - f •
l<n<N l<n<N l<n<N
197
198 8. EXPONENTIA L SUM S
On th e right sid e the first su m i s well approximated b y an integral (se e (1 .67) ) whil e
the secon d su m i s equa l t o
by th e truncate d Fourie r serie s (4.1 8 ) fo r ip(x). Henc e th e proble m reduce s t o
estimation o f sum s (8.1 ) wit h f(x) = hg(x). I n particular , i n th e Gaus s circl e
problem (lattic e point s i n th e circl e x 2 + y 2 = R 2) w e encounte r sum s Sf(R) wit h
/(x) = hy/R 2 — x 2, an d i n th e Dirichle t diviso r proble m (lattic e point s unde r th e
hyperbola xy = N) w e encounte r sum s Sf(N) wit h f(x) = hNx~ x.
In this chapte r w e present classica l method s fo r estimatin g genera l exponentia l
sums S f(M,N) du e t o H . Weyl , J . G . Va n de r Corpu t an d I . M . Vinogradov .
Our result s d o no t depen d o n particula r propertie s o f / bu t onl y o n estimate s fo r
derivatives. I t i s alway s understoo d tha t th e function s involve d ar e o f clas s C fe,
where k i s th e larges t orde r o f derivatio n appearin g i n th e statements .
Note tha t i f / i s wel l approximate d b y g, the n 5/(M , N) ca n b e estimate d b y
Sg(MJ N f) fo r som e N' ^ N. Precisely , i f / = g + /i , then b y partia l integratio n
(8.4) Sf(M,N) = S g(M,N)- / S g(M,x - M)de(h(x)).
JM
Hence w e ge t
f
(8.5) \S f(M,N)\ < C h max \S g{M,N )
where
rM+N
Ch = l + 2ir \ti(x)\dx.
JM
If h i s monotoni c an d bounde d (s o Ch i s bounded) , the n th e origina l su m Sf an d
the modifie d su m S g ar e practicall y equal . Sometime s i n application s w e find tha t
a smal l alteratio n o f / i s necessary t o mee t th e condition s o f ou r results , i n whic h
case (8.5 ) ca n b e used .
8.2. Weyl' s method .
The first application s o f exponential sum s i n numbe r theor y wer e given i n 1 91 6
by H. Weyl [Wl ] t o the problem of equidistribution o f sequences modulo one. I n th e
second pape r o f 1 92 1 Wey l [W2 ] develop s a genera l metho d whic h i s particularl y
good fo r Sf(N) wher e / i s a polynomia l
f{x) = ax k + px 1k
- ^'"eR[x]
with a > 0 . I n thi s cas e w e cal l Sf(N) a Wey l su m o f degre e k.
The Wey l su m fo r a linea r polynomia l f{x) = ax i s a su m ove r geometri c
progression
^ /,r x v ^/ x sin7rai V /a / 7 % r _ \
l<n<N
8. E X P O N E N T I A L SUM S 199
Since | sin7ra| ^ 2||a|| , wher e ||a| | denote s th e distanc e o f a t o th e neares t integer ,
we obtain fo r a £ Z,
(8.6) I V e(an)\ ^ mi n (iV , — -
I ^— ' I \ 2 a
The Wey l su m fo r a quadratic polynomia l f(x) = ax2 + /3x is also calle d a
Gauss sum . I n the specia l cas e wher e a , b are integer s wit h (2a , N) = 1 we hav e
the exac t formul a (3.38 ) fro m whic h it follows b y completin g th e squar e tha t
MI £ 4°-^"' N.
l<n<N V
Though ther e i s no simpl e expressio n fo r a general Gaus s su m
2
Sf(N)= ] T e(an +/3n),
we ca n estimat e thi s quit e well . Firs t w e arrange \Sf(N)\ 2 |2a s follow s
\Sf(N)\2= J2 e(<*t2 + Pt) J2 e
(2ain)-
\£\<N l^n,n+£^N
Then applyin g (8.6 ) w e ge t
2 1
(8.8) \S f(N)\ ^N + Yl min(2JV,||2crf||- ).
Hence on e ca n deduc e tha t
1, 1
(8.9) \S f{N)\ < 2y/aN + —
— loi ogg r -
y/a a
if 0 < a ^ \, whic h restrictio n ca n alway s b e arranged. However , a more pre -
cise estimat e depend s o n th e diophantin e natur e o f the leadin g coefficien t a . By
Dirichlet's approximatio n theore m ther e exist s a rational approximatio n t o 2a o f
type
l
3.10) ' 2" a"- - " <"
2Nq
'ith (<
with (a,g ) = 1 and 1 < q ^ 2N. Henc e ||2a£| | ^ \\\a£/q\\ fo r an y 1 ^ £ < NJ ^
0(mod q) an d
2 ||2a£||-^2( ^ + l) £
1<*<JV V ^ 7
^(mo
^(mod d cq)
^ 0 ( m o d q) ^ 0 ( m o ^ 0 ( m o d <? )
Then estimatin g th e partia l su m of (8.8) ove r 1 < £ < N, £ = 0(mod q) triviall y b y
2N2q~1 w e arriv e at
\Sf{N)\2 ^N + 2N 2q~l + 4(7V + g) logg.
This togethe r wit h th e trivia l boun d |5/(iV) | ^ N implie s
20 0 8. EXPONENTIA L SUM S
THEOREM 8.1 . If f(x) = ax 2 + fix with 2a satisfying (8.1 0), then
(8.11) \S f(N)\ ^ 2Nq~^ + q* logq.
Theorem 8. 1 i s essentiall y bes t possible . Slightl y bette r result s hol d tru e fo r
almost al l coefficient s a,/ 3 (mo d 1 ) wit h respec t t o th e Lebesqu e measure . Indee d
we have the followin g estimat e fo r th e second , th e fourt h an d th e sixt h powe r mea n
values
2
(8.12) / Yl e(an +/3n) da = TV ,
J° l<n<N
2
(8.13) f \ ^2 e(an +pn)\4da<$:(Nlog2N)2,
2 6
(8.14) [ [ \ Y2 e(an + f3n)\ dadfi<^(Nlog2N)3.
PROOF. Th e firs t formul a i s th e Parseva l identity . Th e secon d integra l i s
bounded b y th e numbe r o f solution s t o n 2 - f n\ — n2 + n\ i n positiv e integer s
ni,ri2,ri3,n4 < TV , and th e latte r i s bounded b y
]T r r yt)2{i)<^{N\og2Nf
^ ^ v iug ^J V
^2N2
The thir d integra l i s equa l t o th e numbe r o f solution s t o th e syste m
J m + n 2 + n 3 = n 4 + n 5 + n 6,
\ n\ + n\ + n\ = n\ + n § + n §
in positiv e integer s 71 1 ,71 2,723,724,71 5,72 6 ^ AT . Puttin g k v — nv — n v+s an d ^ =
71^+71^+3 for 1 ^ 1 / ^ 3 we get the system /ci-f/C 2 + fc3 =0 an d kiii+k2i2 + k$is = 0 .
This reduce s t o on e equatio n k\(i\ — ^3 ) + ^2(^ 2 — ^3) = 0 an d k 3 i s determine d
uniquely b y &i,/c2 - I f &i(^ i — is) — 0> then £2(^ 2 — is) — 0, therefor e ther e ar e a t
most 1 6-/V 3 suc h solutions . I t remain s t o coun t th e solution s wit h k\(i\ — is) ^ 0 .
Given m wit h 1 ^ m ^ 4./V 2 an d £ 3 wit h 1 < is ^ 2i V ther e ar e a t mos t r 2(m)
solutions t o m = fci(£i — £3) = —^2(^ 2 — ^3) i n fci, /C2, ^1, ^2- Thu s th e tota l numbe r
of solution s i n questio n doe s no t excee d
4N 2
Yl r (m)<^(N\og2N)3.
2
Km^4iV
This complete s th e proo f o f (8.1 4) . •
Now w e estimate Weyl' s su m Sf(N) fo r a polynomia l o f an y degre e k ^ 2 . W e
begin b y
\Sf(N)\2= E J2e(f(m)-f(n))
0<ra, n^N
=E E e(/( £ + n)-/(n)).
|£|<AT 0<n^i V
0<^+n^n
8. EXPONENTIA L SUM S 201
If / i s a polynomia l o f degree k, the n g(x) = f(£ -f x) — f(x) i s of degree k — 1
(for an y £ ^ 0) . B y repeated applicatio n o f the abov e "differencin g process " we
ultimately arriv e a t a Weyl su m of degree on e for which w e can us e th e non-trivia l
bound (8.6) . Precisel y w e deriv e by induction
PROPOSITION 8.2 . If f(x) = axk -\ with k ^ 1 , then
m i n
\Sf(N)\ < 2N{N~k X)"' "S ^ H afc!^ '''^-lU"1)} •
-N<eli...1ek.1<N
PROOF. Fo r k = 1 this boun d i s interpreted a s that i n (8.6) . Moreover , we
have alread y prove d thi s resul t fo r k = 2 in (8.8) . Suppos e th e result i s true for
k ^ 2 . Let f(x) = ax k+1 + . . . , the n g(x) = f(£ + x) - f{x) = a(k + \)£x k + . . .,
therefore
|S/(A0| 2 ^ E | E e(a(k + l)nk + --.)\
\£\<N 0<n^N
0<£+n^N
2l~k
k 1
^2N ^2{N~ J^-"Y
1 niin^Hf c + l)!^..-^-!^- }
\£\<N -iV<£i,...A-i<^ V
by th e inductio n hypothesis . Henc e b y Holder' s inequalit y w e ge t
|5 / (iV)| 2 ^4iV 2 {Ar- f e - 1 £ • • • £ ; m i n ^ M f c + lJWx.-.^ll-1)}21 * .
-JV<£i,...,*fc<N
This yield s th e resul t fo r /c + 1 . •
Next w e use Proposition 8. 2 to estimate sum s Sf(M,N) fo r / whic h ca n b e
well approximate d b y polynomials . Le t k ^ 2 . Suppos e
(8.15) ^ | / « 0 ( a ; )|<F
in M ^ x ^ M + N. The n w e have /(a ; + M) = p(x) + r(x), wher e
p(x) = f(M) + xf'(M) + ••• + ^^/(fc-^M )
and th e derivativ e o f r(x) satisfie s
\r'(x)\ = ^^\f( k)(y)\ < kx k l
~ M'kF
for som e M ^ y ^ M + N. Therefor e b y (8.5) w e obtai n
k
\Sf{M,N)\ ^ (l + 27rFN M-k)\Sp(Nf)\
for som e N f ^ N. Thi s estimat e i s reasonably goo d fo r short sum s S/(M , TV) , i.e.,
if A 7" i s much smalle r tha n M , becaus e w e ca n mak e th e facto r 1 + 2irFN kM~k t o
be bounded , b y choosing k whic h i s not very large . Applyin g Propositio n 8. 2 fo r
the Wey l su m S P(N') w e ge t
202 8. E X P O N E N T I A L SUM S
COROLLARY 8.3 . Let f(x) be a smooth function in the interval [M , M -\- N] in
which it satisfies (8.1 5). Then
k
\Sf(M, N)\ ^ 2N(1 + 27rFN M~k)V22~k
where
V = N'~ k E ' ' ' E min(7V , \\f^ k-l\M)h • • • 4-2H"1)-
-iV<^,...,4-2<^
We shall use the above results to estimate Sf(M, M f) fo r an y M' wit h 1 ^ M' <
M. T o thi s en d w e assum e tha t (8.1 5 ) hold s i n th e whol e interva l M ^ x < 2M.
We choos e 1 ^ N < M an d spli t 5/(M , M') int o shorte r sum s o f lengt hT V getting
\Sf(M,M')\^ E \Sf(M + jN,N)\+2N
where J = [M/N]. Henc e b y Corollar y 8. 3 an d Holder' s inequalit y w e obtai n
\Sf(M,M')\^2N(l + 2nFN kM-k) E vf~ k
+ 27V
o^j<J
^ 2M( 1 + 27rFN kM-k)W22~k + 27V
where Vj ar e th e correspondin g sum s associate d wit h th e point s M + jM an d W
is the mean-valu e o f thes e sum s
0<j<J
1
=J -iNi-k J- E - E min(7V,||/^- )(M + j i V K 1 . . . 4 - 2 i r 1 ) .
0 ^ j < J -N<l!...,£ k-2<N
r an
Gathering term s accordin g t o th e produc t 4 • • • 4 -2 — d fo r r = 0 estimatin g
trivially w e arriv e a t
W^ k2 k 1
N~ +2kN
1 k
- E °r J Y
~ E min(AT , ^ / ^ - ^ (M + jiV)!!- 1 )
where R = N k~2 an d c r denote s th e numbe r o f representations r = 4 • • • 4 -1 wit h
1 ^ J 4 , . . . , 4 -1 < -N" - Fo r eac h r w e conside r separatel y th e inne r su m
U 1
=-j E minfAU^-ir )
J
oc?<J
where y 3 = rf^^^M + jN). Not e tha t1 3/^- 1 ^ rk\M 1 ~kF = y , say . W e wan t th e
points t/ j t o b e wel l spaced , an d fo r thi s reaso n w e requir e a lowe r boun d fo r th e
derivative o f / o f orde r k throughou t th e whol e segmen t [M , 2M]. Precisel y fro m
now o n w e assum e tha t
I/W(l)l<f
(8.16) j 4
8. E X P O N E N T I A L SUM S 20 3
with A ^ 1 for M ^ x ^ 2M . The n b y th e mean-valu e theore m w e deduc e tha t
\Vj' "Vol ^ &\f - J\ w n e r e A = rk\(2M)~ kNFA~1 . No w i t i s clear tha t
\u\^Y0^j<J
2F + l / _ , v - 2
^ — y - ( ^ + 5 Z T ^ - J ^ ( 2F + J- 1 )(7V + 2A- 1 log3M).
0<j<J
This yield s
W<m^(%^+%Z).
Finally
k 1k
/N \/FN A N AM ~ \22~k
where th e implie d constan t i s absolute . Thi s hold s fo r an yT V with 1 ^ N ^ M.
Suppose M k ^ F ^ 1 , so we may tak eT V = [MF _1 / fc ] givin g
T H E O R E M 8.4 . Let k ^ 2 . Suppose f satisfies (8.1 6) in the segment [M , 2M].
T/ien for I ^ M f ^ M we have
(8.17) S f(M,M') <^ A 4/2k(FM~k + F' 1
)4^2"M\og3M
where the implied constant is absolute.
REMARK. A t th e en d o f th e proo f o f (8.1 7 ) w e assume d tha t M k > F ^ 1 ,
however thi s conditio n i s no t necessar y becaus e otherwis e th e resul t i s trivial .
COROLLARY 8.5 . Suppose f(x) satisfies (8.1 6) for fc = 2 and k — 3 in the
segment [M , 2M]. TTie n for I ^ M' ^ M ^ F we have
(8.18) S f(M,M') < AF^Mi log3M .
where the implied constant is absolute.
PROOF. Thi s follows by taking the minimum o f the two bounds (8.1 7 ) fo r fc = 2
and fc = 3 . •
Note that (8.1 8 ) i s trivial if M ^ F s . However , fo r an y M whic h i s comparabl e
with F i n the logarithmi c scal e Theore m 8. 4 yield s a non-trivial boun d b y choosin g
k appropriately . Fo r example , choosin g fc = [ 2 log Fj lo g M] w e deriv e
COROLLARY 8.6 . Suppose f(x) is smooth in [M , 2M] and all its derivatives
satisfy
2 2 k
(8.19) A - V^|/<*'(i)|a F
with some A^\. Then for 1 < M' ^ M < F we have
(8.20) S f(M,M') < A*M l~4~" log3 M
where 7 = lo g Fj lo g M and the implied constant is absolute.
20 4 8. EXPONENTIA L SUM S
As application s o f th e abov e result s w e deriv e estimate s fo r th e Rieman n zet a
iction o n th e line s a — \ an d a — 1 . Fo r t ^ 3 we hav e
21) ((a + it)=J2 n ~a~U + 0{t- a).
2<t
Hence w e deriv e b y (8.1 8 ) th e followin g subconvexit y boun d
2
(8.22) C ( | + ^ ) < ^ ( l o g t ) .
Moreover, w e derive b y (8.20 ) th e followin g approximation :
(8.23) C( l + it) = Y, n ~1 ~U + 0(e~ VT rt
° )
with y = t 3 / l o g log f, whic h yield s b y trivia l estimatio n
8.3. Va n de r Corpu t method .
This metho d emerge d fro m tw o paper s o f 1 92 1 an d 1 92 2 b y J . G . va n de r
Corput [Corl] , [Cor2] . Briefl y speakin g th e novelt y consist s i n replacin g th e su m
by integral s an d estimatin g th e latter . Thi s i s execute d b y a n applicatio n o f th e
Poisson summatio n formul a followe d b y a n asymptoti c evaluatio n o f th e resultin g
Fourier integral s (b y using various techniques such as stationary phase) . On e arrive s
at a n approximat e equatio n fo r tw o sums o f different length ; thi s transformatio n i s
called th e jB-process . Th e amplitud e function s i n bot h sum s ma y chang e shap e bu t
not size , an d the y hav e comparabl e derivatives . Nex t thes e amplitud e function s
are reduce d b y a differencin g proces s whic h i s a refinemen t o f tha t use d i n Weyl' s
method; thi s routin e i s called th e yl-process . Th e final estimat e fo r th e exponentia l
sum 5/(M , N) i s obtained b y repeate d applicatio n o f th e tw o processes .
Now w e procee d t o th e first ste p o f th e va n de r Corpu t method , whic h i s a
truncated versio n o f th e Poisso n formul a
(8.25) ^ F ( n ) = ^ F ( m ) .
nm
PROPOSITION 8.7 . Let f(x) be a real function with f"{x) > 0 on the interval
[a, b}. We then have
(8.26) J2 e
a<n<b a
(/(n))= E /
— E<m<fi+E
r
e(f(x)-rnx)dx + 0(e- 1 +\og(P-a + 2))
an
where a,(3,e are any numbers with a ^ /'(& ) ^ /'(& ) ^ fi d0 < £ ^ 1 , the
implied constant being absolute.
8. EXPONENTIA L SUM S 205
PROOF. Le t M stan d fo r th e interva l [a — e,f3 -f e]. Firs t w e giv e th e trivia l
bound
fb\ i P i
dx
/V ^ e(—mx)\dx^2(b — a-\-l) / Y ^ e(—mx)
a
^ meM ^ ° rneM
(8.27)
2
< 2( 6 - a + 1 ) / min(/3 - a + 2 , x_1)cfo < ( 6 - a + 1 ) log(/3 - a + 2) .
This proves (8.26 ) if O < 6 —a < 2 . Therefor e fro m no w on we assume that b — a > 2 .
We appl y th e Poisso n formul a t o th e tempere d su m
fb
^2 9{^)e{f(n)) = Y1 9{x)e(f(x) - mx)dx )
Ja
a<n<b m
where g(x) i s a function whic h planes the edges, precisely g{x) = min( x —a, 1, b—x).
This functio n i s introduce d temporaril y fo r convergenc e o f th e su m o f th e Fourie r
integrals an d wil l b e remove d later . Fo r m ^ M w e write b y partia l integratio n
2TT Z
fb f ( g( x b
)V
/ d(x)e(f(x) - mx)dx = - / I _ (x) ——m;J e(f(x) - mx)d
V X m /
Ja Ja J\ )
b
e(f(x) - mx) g{x)f"{x)
•e(f(x) — mx)dx
\Jb-l J a f'(x)-m Ja (/ ' (x) — my
= Tb(m) - T a(m) + Tab(m),
say. Fo r 7^>(ra ) w e hav e
rb
\Tab(m)\ < / f"(x)(f(x) - m)- 2dx = ( a - m ) " 1 - (/ 3 - m ) " 1 .
For T a(m) w e hav e
/•a+l
|Ta(m)| < / \f'(x) - ml^dx ^ \a - m]' 1 + |/ 3 - ml" 1 .
Integrating b y part s onc e mor e w e obtai n anothe r boun d
\Ta(m)\ < (/'( a + 1 ) - m ) - 2 + (/'(a ) - m ) " 2 + I f + d(f(x) - m) \ "- 2
= 2max{(/ / (a + 1 ) - m)~ 2 , (/'(a) - m ) " 2 } ^ 2( a - ra) -2
+ 2(/ 3 - ra)" 2
.
The sam e bound s hol d fo r Tb(m). Combinin g thes e bound s w e deriv e
\Ta(m)\ + |7i(m) | + |T a 6(m)| <
(/3 — m)(a — m)
for an y m £ M.. Henc e w e conclud e tha t
fb
] T 9(n)e(f(n))= ^ / g(x)e(f(x) - mx)dx + 0{e~ l + log(/ 3 - a + 2)) .
^^ ^Is ™a
a<n<b m£A4
KA J &
206 8. EXPONENTIA L SUM S
It remain s t o remov e th e weight s g(ri) an d g(x). Th e correctio n o n th e lef t sid e i s
0(1) an d o n th e righ t sid e i t i s 0(log(/ 3 - a + 2) ) b y (8.27 ) fo r b = a + 1 . Thi s
completes th e proo f o f (8.26) . •
EXERCISE 1 . Deriv e (8.26 ) fro m th e Euler-Maclaurin formul a (4.7 ) using (4.1 8 )
or directl y fro m (4.21 ) .
It i s easy t o generaliz e (8.26 ) fo r a weighte d su m a s follows :
(8.28) ] T g(n)e(f(n))= £ / g(x)e(f(x) - mx)dx
a
a<n<b a-£<m<P+e
+ 0 ( G ( s - 1 + l o g ( / 3 - a + 2)) )
where g i s an y smoot h functio n o n [a, b] and
(8.29) G=\g(b)\+ f \g'(y)\dy.
Ja
For th e proo f pu t
e V
S(y)= E (/W)-E f e(f(x)-mx)dx.
a
a<n^y a-s<m<(3+e
Thus S(y) < C e~ x 4 - log(/3 — a + 2 ) b y virtu e o f (8.26) . No w applyin g partia l
summation w e obtain (8.28 ) wit h th e erro r ter m
g(y)dS(y) = 5(b)g(b) - [ S(y)g'(y)dy
which i s bounded b y G(e~ l 4 - log(/3 — a -\- 2)) a s claimed .
In a specia l cas e (8.28 ) simplifie s t o
LEMMA 8.8 . Let f(x) be a real function with \f f(x)\ ^ I — 0 and f"{x) ^ 0 on
[a, b]. We then have
1
(8.30) E 9(n)e(f(n))= f g(x)e(f(x))dx + O^- )
a
a<n<b ^
where G is given by (8.29) and the implied constant is absolute.
Our nex t tas k i s t o evaluat e th e exponentia l integral s i n (8.26) . W e begi n b y
proving simpl e estimate s fo r
*.31) I f(a,b)= f e(f(x))da
Ja
where f(x) i s a real , smoot h functio n o n [a , b]. Notic e tha t th e trivia l boun d
|i/(a, 6)| ^ b — a cannot b e improve d substantiall y i f f^ < C (6 — a)~ k fo r al l k ^ 1 ,
because suc h a functio n i s more o r les s constant. However , i f one o f the derivative s
is somewha t larger , the n ou r resul t wil l b e non-trivial .
8. E X P O N E N T I A L SUM S 20 7
LEMMA 8.9 . Suppose f'(x)f"(x) 7 ^ 0 on [a , 6]. Then
(8.32) \I f(a,b)\ ^\f{a)\-^\f\b)\-\
PROOF. W e ca n assum e withou t los s o f generalit y tha t f"(x) > 0 . The n b y
partial integratio n w e ge t
Hence
^ |//(a ' 6)K M + i?k + 7^-?k
which i s bette r tha n wha t i s claimed. •
LEMMA 8.1 0 . Suppose that for some k ^ 1 we have
(8.33) | / « ( x ) |^ A
for any x on [a , b] with A > 0 . Then
1
k /k
(8.34) | J / M ) I ^k2 A~ .
P R O O F . W e appl y inductio n wit h respec t t o k. Fo r k = 1 the resul t follow s
from (8.32) . Suppos e tha t f( k+1 ) > A > 0 on th e whol e interva l [a , b]. Thus f^ i s
increasing, s o it ma y hav e a t mos t on e zer o i n [a , 6], sa y c if it exists . I f f^ ^ 0 on
[a, 6], w e stil l defin e c by puttin g c — a o r c — b according t o whethe r f^ > 0 o r
f(k^ < 0. Le t 5 be a positive numbe r t o b e chose n later . W e put a\ = max(a , c — S)
and b\ = min( c + 5 , b) and w e spli t //(a , b) = / / ( a , ai ) + If(o>i, &i) + Ifipi, b). Fo r
the integra l ove r th e middl e segmen t w e us e th e trivia l boun d
|//(ai,fci)|Oi-ai ^25 .
To estimat e th e integra l ove r th e lef t segmen t [a , ai] w e ma y assum e tha t a i > a ,
that i s a i = c — S > a , o r els e w e ge t nothing . No w w e verify tha t fo r a ^ x ^ ai ,
-/ ( f c ) (*)= f 1
C {k+ )
f (y)dy-f{k)(c)
Jx
{k+1 )
>I f (y)dy >{c- x)A ^6A>o.
JX
Hence b y th e inductio n hypothesi s
\If(a,a1)\^k2k(5A)-1/k.
The sam e boun d hold s tru e fo r th e integra l ove r th e righ t segmen t [bi , b]. Addin g
these thre e bound s w e ge t
k 1 /k
\If(a,b)\ < 25 + 2k2 (5A)~ .
This i s tru e fo r an y 5 > 0 . O n puttin g 5 = A - 1 /( f e + 1 ) w e complet e th e proo f o f
(8.34) fo rif c + 1 . D
Combining Lemma s 8. 8 an d 8. 9 w e obtai n
208 8. EXPONENTIA L SUM S
COROLLARY 8.1 1 . Let f(x) be a real function with 6 ^ \f'(x)\ ^ I — 0 and
f"(x) ^0 on [a, b]. Then
(8.35) Yl 9(n)e(f(n)) « G0" 1
a<n<b
where g is given by (8.29) and the implied constant is absolute.
EXERCISE2 . Prov e tha t fo r an y real number s f n satisfyin g 6 ^ f n — f n-\ ^
/n+i ~ fn < 1 - 0 we have
(8.36) X X - M ^ cotan -
Combining Propositio n 8. 7 with Lemm a 8.1 0 for k = 2 we derive
COROLLARY 8.1 2 . Let b — a ^ 1 . Let f(x) be a real function on [a , 6] with
f"(x) ^ A > 0 on [a , b]. Then
e
(8.37) J2 (/( n )) « (A 6) " A*) + 1) A _ i
a<n<b
where the implied constant is absolute.
PROOF. B y Lemma 8.1 0 the integrals in (8.26) ar e bounded b y 8A"1/ 2 an d the
number o f these integral s i s less tha n f'(b) — f'(a) + 1 (take a = f'(a),(3 = f f(b)
and e = | ) , hence (8.37 ) follows . D
Note tha t f'(b) - f'{a) = (b - a)f"{y) fo r som e y wit h a ^ y ^ b, thus (8.37 )
implies
COROLLARY 8.1 3 . Let b — a ^ 1 . Let f{x) be a real function on [a, b] such that
A < f"{x) ^ rjA with A > 0 , rj ^ 1 . Tfter c
(8.38) ] T e(/(n))<77A*(6-a ) + A-i
a<n<6
ic/iere the implied constant is absolute.
Now w e ar e goin g t o establis h a n approximat e formul a fo r th e exponentia l
integrals (8.31 ) whic h i s mor e precis e tha n th e estimat e (8.34) . W e begi n b y a
special cas e
LEMMA 8.1 4 . Let h(x) be a real function on [0 , X] such that
(8.39) h{0) = 1 , h{x) > 1 , (xh{x)) f > 1 ,
(8.40) ti(x) < X " 1 , h"(x) < X~ 2
.
Then for a > 0 we have
h dx +0
.41) [<^ ^ = ^Wa ^X
where the constant implied in O depends only on the constant implied in ^> of (8.39)
and that in < C of (8.40).
8. E X P O N E N T I A L SUM S 20 9
PROOF. Fo r notationa l simplicit y w e se t h(x) = g 2{x). Clearl y g{x) satisfie s
all th e condition s o f h(x). W e chang e th e variabl e x 2g2(x) = t t o obtai n
2 2
/ e(ax g (x))dx = e{at)dU - e(at)f(x)dt
Jo Jo Jo
2
where T - (Xg(X)) an d
f(x) = ((xg(x))'-l)/2xg(x)(xg(x))'.
Using th e condition s (8.39) , (8.40 ) fo r g(x) an d th e Taylo r expansion s
g(x) = 1 + ^ ' ( 0 ) + 0(x 2X~2), g'(x) = g'(0) + 0{xX~ 2
)
we deduc e tha t f(x) < C X~ l an d f'(x) < C X~ 2. Henc e w e obtai n b y partia l
integration
,Tf T
2nia / e(at)f(x)dt = / f(x)de(at)
Jo Jo
= e(aT)f(X) - /(0 ) - / e(at)f'(x)dx(t)
Jo
fx
<^X~X+ \f {x)\dx ^ X~ l
Jo
because x f{t) i s positive. Thi s boun d i s absorbed b y th e erro r ter m i n (8.41 ) . Nex t
we exten d th e firs t integra l t o
1 f°° / 1
/ e(at)dt2
/o vo = e - / y/Sa
Jo ^°
and w e estimat e th e exces s b y partia l integratio n a s follows :
f°° I I f°° I
2nia / e{at)dU = -T~ 2 e(aT) - / e(at)dt~2
JT JT
<r"i = (^(X))- 1 <A:- 1 .
This boun d i s als o absorbe d b y th e erro r ter m i n (8.41 ) completin g th e proof . •
COROLLARY 8.1 5 . Let f(x) be a real function on [a,b] such that
(8.42) /"(c )> A,
3 1 2
(8.43) \f^ \x)\ < AX' , \f«\x)\ < AX"
/or som e A > 0 and X > 0 . Suppose /'(c ) = 0 a i som e poin t c m (a , 6). T/ie n w e
(8.44) / /(0)6)=c(/(c) + | ) / " ( c ) - 5 + 0 ( I ( ^ l -+ - l -+ i
where the implied constant is absolute.
210 8. EXPONENTIA L SUM S
PROOF. W e writ e
pb pb — c pc —a
If(a,b)= / e(f(x))dx= / e(/( c + x))dx + / e(f(c-x))dx.
Ja JO JO
By Taylor' s expansio n w e hav e f(c + x) — f(c) + ax 2h{x), say , wher e a = \f"{c)
and h(x) i s a functio n whic h satisfie s
and /i'(z ) < X " 1 , ^ ^ ) < X - 2 . Thes e estimate s follo w fro m (8.42) , (8.43) .
Hence th e condition s o f Lemm a 8.1 4 ar e satisfied i n th e interva l 0 < x ^ Y wit h
Y = min( 6 — c , X) gettin g
1
j\(f(c + x))dx = e(f(c) + l)(4f(c)r 2+0(Kl^).
For th e remainin g par t o f th e integra l w e appl y Lemm a 8. 9 gettin g
' #+ #« I/'C C + K)!"1 + i/'^r1«(Ar)- 1.
/y
From bot h estimate s w e obtai n
rb
jT e(/(z))ct e = e(/(c ) + §)(4/"(c))- § + o Q (^ + 1 )) .
Similarly w e evaluat e th e integra l ove r th e segmen t [a , c] completin g th e proo f o f
(8.44). D
Now w e ar e read y t o sho w th e mai n resul t o f th e va n de r Corpu t metho d
THEOREM 8.1 6 . Let f(x) be a real function on [a , 6] whose derivatives satisfy
the following conditions: A ^ f ' ^ 77A , |/( 3 >| ^ 7]A(b - a ) " 1 , |/< 4>| < r]A(b - a)~ 2
for some A > 0 and rj ^ 1 . Then we have
e
(8.45) Y, (/H)=E e(/(x m)-mxm + |)r(x m )-i+i?/(a,6)
a<n<b a<ra</ 3
where a — /'(a),/ 3 = f'{b) and x m is the unique solution to f'{x) = m. Here
Rf(a,b) is considered as an error term; it satisfies
(8.46) R f(a, b) < A" 5 + r/ 2 log(/3 - a + 1 )
where the implied constant is absolute.
P R O O F . W e appl y (8.25 ) wit h e = | , a = /'(a ) an d / 3 = /'(b) . Fo r m wit h
a + £ < m < / 3 — s (not e thi s rang e i s voi d i f /3 — a ^ 1 ) w e evaluat e th e involve d
exponential integral s b y (8.44 ) gettin g
fb _ I
/ e(f(x) -mx)dx = e(f(x m) - mx m + \)f"(x m) 2
+ 0
( ? ( r J - + ^ - ) )-
V A V o — x m Xm — a/J
8. EXPONENTIA L SUM S 211
Here w e hav e rjA(b - x m) > /'(& ) - f'(x m) = f3 - m an d r]A(x rn - a) ^ f(x m) -
f'(a) — m — a , therefor e th e erro r term s contribut e a t mos t
^2 [ ^~ ~ m ) _ 1 + ( m ~ a) _ 1] < rf log(/ 3 - a + 1).
a+e<m</3 — e
For th e tw o missin g integral s wit h a — e < m ^ a -\- e an d / 3 — e ^ r a < / 3 + £w e
use th e boun d 8A - 1 / 2 whic h follow s fro m (8.34 ) wit h k = 2 completin g th e proo f
of (8.46) . •
REMARKS. I n genera l th e boun d (8.46 ) canno t b e significantl y improve d be -
cause i t i s almos t a s smal l a s th e individua l term s i n bot h sum s (8.45) .
EXERCISE 3 . Usin g partial summatio n deriv e from (8.45 ) th e following formul a
for a weighte d su m
(8.47)
n
Y^ g(n)e(f(n))= ^ 9(x m)f {xm)~2e{f{xm) - mx m + | )
a<n<b a<ra</ 3
+ 0(GA~l + Grj 1 log(/3 - a + 1 ) )
where g i s an y smoot h functio n o n [a , 6] an d G i s give n b y (8.29) .
EXAMPLE. Le t X > 0 ,TV > 0 and a> l,v > 1 . W e the n hav e
(8.48)
N<n<uN M<m<fiM
+ 0(./V~ 2 l 0g(7V + 2 ) + M " 2 log( M + 2) )
where ^ + i = 1 , //^ = u a an d MA/ " = X , th e implie d constan t dependin g onl y o n
a, v.
The approximat e formul a (8.47 ) (whic h constitute s th e B-proces s o f va n de r
Corput) transform s a n exponentia l su m o f amplitud e f(x) t o a n exponentia l su m
of amplitud e h(y) relate d b y
(8.49) h(y) = f(x)-xy, y = f'(x).
Thus / an d h hav e essentiall y th e sam e size . However , th e lengt h o f th e sum s
changes fro m b — a t o / 3 — a = f'(b) — f f{a). Not e tha t th e operatio n (8.47 ) i s
involutary, s o it woul d b e useless t o appl y i t tw o times i n a row. I t i s recommende d
to appl y th e ^-proces s i n th e directio n whic h reduce s th e lengt h o f the exponentia l
sum. The n o n th e shorte r sid e on e ca n estimat e triviall y (gettin g fo r exampl e
(8.24)), bu t i t i s not necessar y t o terminat e i n this way . A new possibilit y i s offere d
by a differencing proces s which reduces the amplitud e function , bu t doe s not chang e
the lengt h o f summation. Thi s i s called th e ^-proces s o f Weyl-van de r Corput . W e
begin b y th e followin g genera l inequalit y
212 8. EXPONENTIA L SUM S
LEMMA 8.1 7 . For any complex numbers z n we have
(8.50) ^ ^ ( i+ ^ ^ i - M) £ w „
a<n<b \q\<Q a<n,n+q<b
where Q is any positive integer.
P R O O F . Settin g z n = 0 if n £ (a , b) we obtain
^= / v Zn =
2_J
Z
"a = / j ^n+q
a<n<b n n
for an y g G Z. Su m up thi s ove r q with 0 ^ q < Q getting
a-Q+l<n<b 0^q<Q
Hence b y Cauchy's inequalit y
Q2\S\2 < (b -a + Q) £| £ z
-+^
n 0^q<Q
Z
= (b-d + Q) Yl YJ n+qi~Zn+q2
0^<?i,<?2<Q n
= (b-a + Q) ^2 uti) ]C Zn+^
\q\<Q n
where u(q) is the numbe r o f integers 0 ^ qi,q2 < Q with # i — # 2 — q, i.e., v(q) —
Q — \q\. Thi s complete s th e proof o f (8.50). D
Taking z n = e(f(n)) w e obtain th e differencing proces s
PROPOSITION 8.1 8 . Let f(x) be real function in (a, b) and Q a positive integer.
We have
(8.51) I £ e(f(n))\\ (l + ~) £ (l-^ ) £ «(/( * + «)-/(»))
wftere a(g ) = max(a, a — q) and b(q) = min(6 , b — q).
R E M A R K S . Not e tha t q = 0 is always there , s o the resulting boun d fo r th e
original su m X^ e (/( n )) c a n n e v e r b e better tha n ( 6 — a ) Q - 1 / 2 , i.e. , one cannot
save mor e tha n Q 1 / 2 from th e trivial boun d b — a. I n practice Q is chosen t o b e
much smalle r tha n b — a, so the new amplitud e functio n h(x) = f(x + q) — f(x) has
reduced derivatives , whil e a(q) and b{q) do not change much . Her e th e flexibility
in choosin g Q is another innovatio n introduce d b y van der Corput t o the original
differencing metho d o f Weyl.
1
EXERCISE 4 . Prov e tha t fo r 1 ^ Q ^ ( 6 - a) 2 we have
1
1 1 1 i
e
(8.52) Yl e(/(n))<(6-a)Q"2+(6-a) 2 ^ (M™))
a<n<b a+Q<n<b-Q
where ft (a;) = f(x + q) — f(x — q) for some 0 < q < Q and the implied constan t is
absolute [Hint : Us e only eve n integer s fo r Weyl shifts. ]
8. EXPONENTIA L SUM S 213
By successiv e applicatio n o f Propositio n 8.1 8 an d Corollar y 8.1 3 w e deriv e
COROLLARY 8.1 9 (VA N DE R C O R P U T ) . Let b - a ^ 1 . Let f(x) be a real
function on (a , b) such that A ^ f^ 3\x) ^ 77 A where b — a > 1 , A > 0 and rj > 1 .
Then
(8.53) ] T e(/(n))«r|2A^&-a)+A^(6-a) 2
a<n<6
where the implied constant is absolute.
PROOF. Fo r 0 < q < Q and a<x<b-qwe hav e h"(x) = f"(x + q) — f"(x) =
qf^(y) fo r som e y G (a, b). Henc e A g < h"(a; ) ^ r]Aq an d b y (8.38 )
£ e(f(n + q)- f(n)) « 7 ] ( A ^ ( 6 - a ) + ( A g ) " i
a<n<6—g
A simila r resul t i s true fo r th e sum s wit h negativ e q. Fo r q = 0 we hav e th e trivia l
bound b — a -f 1 . Collectin g thes e result s w e deduc e b y (8.51 ) tha t
2
J£ e(/(n))| «(l+ ^ ) { f e - a+ V J ( „ A M ( 6 - a )+ (AqH) }
a<n<6 ^ 0<g< Q
&-a\/, . 1
< ( l + - £ - ) ( 6 -o + r;A5Qi(6-a)+A-iQ§ )
~Q
for an y positiv e intege r Q. However , th e abov e estimat e remain s vali d fo r an y
positive rea l numbe r Q fo r obviou s reasons . Takin g Q = A _ 1 / / 3 w e obtai n (8.53 )
provided A ^ (b — a )- 3 , an d otherwis e th e resul t i s trivial . •
The Weyl-va n de r Corpu t inequalit y (8.51 ) ca n b e use d repeatedl y t o reduc e
further th e amplitud e functio n unti l (8.38 ) become s useful . Thi s lead s t o th e fol -
lowing generalizatio n o f (8.38 ) an d (8.53) .
THEOREM 8.2 0 (VA N DE R C O R P U T ) . Let b-a^l. Let f(x) be a real function
on (a , b) and k ^ 2 such that A < f^ k\x) < nA, where A > 0 and rj ^ 1 . Then
n
(8.54) Yl <f( ^< ^ ^ ( b~ a) + A~ K(b - a) 1 2
' *^
a<n<b
where K = (2 k — 2)~ 1 and the implied constant is absolute.
For larg e k th e bound s (8.1 7 ) o f Wey l an d (8.54 ) o f va n de r Corpu t ar e com -
parable.
8.4. Discussio n o f e x p o n e n t pairs .
Throughout Sf (N) denote s a n exponentia l su m o f typ e
Sf(N) = V J e (/(n))
N<n^Nf
with N ^ N' ^ 2N an d / i s a smoot h functio n o n [iV , 2N]. W e hav e establishe d
numerous estimate s fo r Sf(N) whic h depen d essentiall y o n th e lengt hT V and o n
the siz e o f th e amplitud e functio n / . I n thi s sectio n w e describ e thes e result s i n
a unifie d for m an d discus s possibl e extensions . Fo r th e sak e o f simplicit y w e dro p
some minor , ye t necessar y conditions , s o ou r assertion s ar e no t strictl y correc t i n
214 8. E X P O N E N T I A L SUM S
this section. Rigorou s assertion s ca n be found i n the boo k b y S . W. Graha m an d G .
Kolesnik [GK ] a s well a s i n th e origina l paper s b y va n de r Corpu t [Corl] , [Cor2] ,
E. Phillip s [Ph ] an d R . A . Ranki n [Ralj .
Suppose / behave s lik e a monomial, precisel y w e assum e tha t th e derivative s
satisfy
j
(8.55) \fW(x)\^FN~
for N ^ x ^ 2N an d an y j ^ 0 , wher e th e implie d constan t depend s o n j . In
particular, \f(x)\ x F and \f'(x)\ x FN~ X. I fT V i s large r tha n F by a sufficientl y
large factor , the n Corollar y 8.1 1 yield s
1
Sf(N) 4ZNF- .
This boun d i s the bes t possible , becaus e i n thi s cas e the su m Sf(N) i s well approx -
imated b y th e correspondin g integra l (se e Lemma 8.8) . I n what follow s w e stay ou t
of thi s specia l cas e b y assumin g tha t
(8.56) A = FN^ > 1.
Inspired b y man y example s w e postulat e th e existenc e o f universa l number s
(8.57) 0 < p , < K §
having th e propert y tha t
q
(8.58) S f (TV) < APN ^F£
for an y exponentia l su m o f length N and th e frequenc y functio n / satisfyin g (8.55 )
and (8.56) . Her e e i s an y positiv e numbe r an d th e implie d constan t depend s onl y
on e an d o n th e sequenc e o f constants implie d i n (8.55) . W e call (p , q) an exponen t
pair (notic e tha t ou r definitio n o f q differs b y | fro m tha t i n th e literatur e [GK]) .
Clearly i f (p, q) i s a n exponen t pair , the n an y pai r o f large r number s for m a n
exponent pair . Th e trivia l estimat e fo r Sf(N) correspond s t o th e exponen t pai r
(8.59) (P,q)=(0,\).
The followin g statemen t i s the mos t optimisti c fo r thi s aspec t o f th e theor y o f
exponential sums , an d i s widel y believe d t o b e correc t i n it s essential s (mino r as -
sumptions o n / migh t b e require d t o avoi d pathologies) :
E X P O N E N T PAI R HYPOTHESIS . The pair {p,q) = (0,0 ) is an exponent pair.
In other words, an exponential sum of type Sf(N) with / , N satisfying (8.55) and
(8.56) should satisfy
£
(8.60) S f(N)<z:N^F .
This bound , i f tru e fo r reasonabl e functions , woul d lea d t o solution s o f man y
problems i n analyti c numbe r theory . Fo r example , it implies th e Lindelo f Hypoth -
esis
8. E X P O N E N T I A L SUM S 215
Moreover, i t would solve the Gaus s circle problem an d the Dirichlet diviso r problem ;
namely i t implie s
~ ^ r(n) = TTX + 0 ( > i + £ ) ,
^2r(n) = xlog x + ( 2 7 - l) x + 0(x^ + f f ),
n^.x
where th e exponent s ar e bes t possible . Actuall y fo r thes e problem s on e onl y need s
to kno w tha t (p , q) = (0, \) i s a n exponen t pair . However , ou r curren t knowledg e
is very poor .
Corollary 8.1 3 tell s u s tha t
(8.61) (P,q)=(\,0)
is an exponen t pair . Clearl y a linear conve x combinatio n o f exponent pair s i s agai n
an exponen t pair . Therefor e (8.59 ) an d (8.61 ) impl y tha t (p,q) = (\,\) i s an
exponent pair , however , w e have alread y prove d a stronge r resul t (8.1 8 ) whic h ca n
be no w interprete d a s sayin g tha t
(8.62 ) M = {\>\)
is a n exponen t pair . Theore m 8. 4 yield s th e exponen t pair s
4
/x / 1 4 ( / c -1 ) \
and Theore m 8.2 0 yield s
(8-64) fe ,)=^____, _ _ _ _ _ _
for an y intege r k ^ 2. Notic e that a s k tend s t o infinit y i n both sequences , p — p(k)
tends to zero slightly faste r tha n q — q(k) tend s to |. This featur e make s i t possibl e
to clai m non-trivia l bound s fo r th e exponentia l sum s Sf(N) whos e lengt h N i s
comparable wit h th e amplitud e F in th e logarithmi c scale . I n th e nex t sectio n w e
shall produc e b y Vinogradov' s metho d a sequence o f exponent pair s (p , q) in whic h
p tends t o zer o muc h faste r tha n q tends t o \. S o far w e d o no t hav e a n exponen t
pair wit h p — 0 and q<\\ an y suc h exampl e woul d mar k a majo r breakthroug h i n
the theor y o f exponential sums .
A continuu m o f exponen t pair s ca n b e create d b y alternatin g th e A and B
processes. B y th e Weyl-va n de r Corpu t differencin g inequalit y (8.51 ) on e derive s
A-PROCESS. If(p,q) is an exponent pair, so is
*-65» ^«»-(_?2-_^) -
By th e va n de r Corpu t approximat e functiona l equatio n (8.45 ) on e derive s
216 8. E X P O N E N T I A L SUM S
^-PROCESS. If(p,q) is an exponent pair, so is
(8.66) B(p,q) = (q,p).
Here ar e som e example s o f exponen t; pairs generate d fro m th e trivia l pai r
B(O,
D- -(5.' 0),
AB(O,
2
$ • < • J).
2
A B(O,
\)- -(n ' 7.)/ '
A3B(O,
\)- V30 - )
'3<V'
A4B(O, = (1 - )
\)- '31/'
V62
2
ABABA B(O,
* ) •
»(n • i ) -
For th e purpos e o f estimatin g th e Rieman n zet a functio n o n th e critica l lin e
one need s a n exponen t pai r (p , q) with p + g as smal l a s possibl e sinc e
(8.67) ((l + it ) < *2(P+9)+ e, i f t ^ 1 .
In 1 95 5 Ranki n [Ral ] compute d th e minimu m o f 0 = | ( p + g ) ove r al l exponen t
pairs generate d b y th e A an d B processes , namel y # m i n = 0.1 6451 067... . I n othe r
applications o f exponent pair s one seeks the minimu m o f a fractional linea r functio n
#(p> a) = ( aP + PQ + M ) / ( ^ P + 7 ^ + v). Havin g thes e application s i n min d i n 1 98 5
Graham gav e a truly elegan t algorith m fo r th e solutio n t o thi s genera l problem ; se e
[GK].
Today ther e ar e know n exponen t pair s whic h canno t b e obtaine d b y th e va n
der Corpu t method . Fo r example , Huxle y an d Wat t [HW ] showe d tha t
(8.68) M=G|<S )
is an exponen t pai r b y elaboratin g th e wor k o f Bombier i an d Iwanie c [BI] . Furthe r
improvements alon g thes e line s ar e give n i n [Hu4 ] an d a ver y nic e expositio n i s
contained i n [GK] .
The Wey l shif t (fro m n t o n + q) wa s introduce d fo r th e purpos e o f reducin g
the amplitud e functio n (fro m f(ri) t o h(n) = f(n + q) — /(n)) . However , thi s shif t
also offer s a n additiona l variabl e o f summation . On e ca n tak e advantag e o f this b y
an appea l t o th e theor y o f two-dimensiona l exponen t pairs , bu t th e improvement s
are no t impressiv e (i n th e contex t o f the Rieman n zet a functio n th e result s ar e stil l
weaker tha n wha t follow s fro m th e Huxley-Wat t exponen t pai r (8.68)) .
8.5. Vinogradov' s method .
In th e lat e thirtie s I . M . Vinogrado v [Vi2] , [Vi3 ] invente d a ne w metho d fo r
estimating exponentia l sum s o f Weyl' s typ e whic h i s ver y stron g fo r sum s o f larg e
amplitude (relativ e t o th e length) . H e spen t a considerabl e tim e o f hi s lif e t o
8. EXPONENTIA L SUM S 217
massage th e results , s o w e hav e toda y severa l elegan t version s du e t o hi m an d hi s
followers.
Let f(x) b e a rea l smoot h functio n i n [iV , 27V]. Ou r ai m i s t o estimat e expo -
nential sum s
(8.69) 5/(0,6) = V J e (/(n))
a<n<b
for an y a, b with N ^ a < b ^ 2N. W e shal l approac h th e proble m i n severa l
distinct steps .
Step I (Smal l Shift) . W e begin, a s i n Weyl's method , b y shiftin g th e summatio n
n
5/(0,6)= J2 <f( + l))-
a—q<n<b—q
We choose q to b e relatively smal l s o we can approximat e f(n - f q) by a polynomia l
in q of reasonably smal l degree. Precisel y w e apply th e Taylo r expansio n f(n + q) =
Fn{q) + Rn(q), wher e F n(q) i s the polynomia l
with coefficient s otj{n) = f^\n)/j\ an d R n{q) satisfie s th e boun d
\Rn(q)\^qk+1f{k+1)/(k+l)l
with / ^ + 1 ) — max|/( fe+1 )(a;)|. Insertin g thi s approximatio n w e ge t
1
k+ 1
(k+ )
5/(o,6)= £ e(F n(q))+0(2q + q f N)
a<n<b
where \9\ ^ 1 . Her e th e first erro r ter m 2q represents th e trivia l boun d fo r expo -
nential sum s ove r tw o non-overlappin g interval s an d th e secon d erro r ter m come s
from th e inequalit y \e(R n(q)) - 1 | < 2n\R n(q)\ < q k+if(k+i)m W e s h a l l r e q u i r e a n
the implie d constant s i n th e forthcomin g estimate s t o b e absolute . Thi s i s impor -
tant, eve n with respec t t o /c , because fo r som e application s (firs t o f all for boundin g
£(1 -|- it)) on e use s k dependin g o n th e lengt h o f summatio n an d th e siz e o f th e
amplitude function .
At thi s poin t Vinogradov' s approac h diverge s fro m tha t o f Weyl . Th e first
new ide a i s tha t no w w e ar e goin g t o averag e ove r q in a specia l subset , sa y Q C
{ 1 , 2 , . . . ,Q} , rather tha n i n th e ful l set . W e obtai n
1
(8.70) 5/(o,6) = J2 i 2 r E e ( F " ^ ) ) + 0 ^ + ^ f e +1 / ( f c +1 )7V
)
a<n<b q£Q
where \Q\ denote s th e numbe r o f point s i n Q. Thoug h F n(q) i s a polynomia l i n q
the innermos t su m
Sn = ^e(F n(q))
q£Q
cannot b e considere d a s Weyl' s su m o f degre e k, becaus e o f restriction s fo r Q (t o
be specifie d soon) . Moreover , w e ar e no t goin g t o appl y th e ide a o f differencin g b y
way o f squarin g th e su m Sf(a,b) ) bu t rathe r w e procee d directl y t o estimatio n o f
218 8. EXPONENTIA L SUM S
each inne r su m S n separatel y fo r a < n < b. I n othe r words , w e no longe r nee d th e
variable n unti l th e las t step , an d eve n there , i t i s no t essential .
Since the variabl e n play s a minor rol e in what follow s w e simplify th e notatio n
by considerin g th e exponentia l su m
S=5>(F(<?))
qeQ
for arbitrar y polynomia l wit h rea l coefficient s
F(q)= Y, a ^-
Obviously S depend s o n th e fractiona l par t o f th e coefficient s aj . Fo r thi s reaso n
we shal l exploi t onl y th e coefficient s wit h \a.j\ < 1 , becaus e thes e ca n b e easil y
controlled. O n th e othe r hand , th e ver y smal l coefficients , specificall y \aj\ < Q~ J,
do no t mak e a variatio n i n th e argumen t o f e(F(q)). Therefor e onl y th e middl e
terms o f ou r polynomia l F(q) pla y a role , tha t i s thes e wit h Q~i < \otj\ < 1 , th e
other term s ar e wasted . I n th e final argument s w e shal l appea l t o th e origina l
coefficients aj — ot.j(n) = f^\n)/j\ whos e particula r shap e i s no t important , w e
only nee d t o estimat e thei r size .
STEP I I (creatio n o f bilinea r forms) . W e choos e
Q = {xy:l^x iy^P}
where th e number s q = xy ar e counte d wit h multiplicit y o f suc h representations ,
thus \Q\ = Q = P 2 an d th e exponentia l su m S become s a bilinea r for m
(8.7i) s= J2 E <n*v))-
It i s importan t tha t th e variable s x , y ru n independentl y ove r tw o set s (o f som e
cardinality), ye t i t i s of n o significanc e wha t th e point s are . Fo r example , w e coul d
restrict x, y t o specia l number s o r conside r a genera l bilinea r for m
with an y comple x coefficient s a(x) , 6(y), wher e X, y ar e th e set s o f point s o f typ e
x = [ 1 , . . . , xk], y = [1 , y , . .. , yk] wit h x,y e Z, 1 ^ x ^ X, 1 ^ y ^ Y an d
(x, y) = F(xy). I f X, y ar e sufficientl y larg e an d well-space d set s the n a non-trivia l
bound fo r B(X,Y) follow s easil y b y Fourie r techniqu e (se e Chapte r 1 0) . However ,
this i s no t th e cas e unde r consideratio n here ; ou r set s ar e quit e sparse .
STEP II I (expandin g th e variables) . Ou r nex t goa l i s t o creat e mor e point s
of summatio n an d mak e man y variable s wit h n o larg e gaps . T o thi s en d w e rais e
the bilinea r for m (8.71 ) t o a hig h power , appl y Holder' s inequalit y an d rearrang e
8. E X P O N E N T I A L SUM S 219
the resultin g multipl e su m a s follows:
|5|'<P'-1^|^e(F(a;i,))|'
x y
e 1
= P ~ J2(* E e(F(x yi) + • • • + F(xy e))
x yi,---,ye
Al
= pi ~X 1 L ^ ' ''' ' X
^ Yl Cxeia^x + • •• +a k\kx
k
).
A i , . . . ,Af c x
Here £ x ar e complex number s wit h \( x\ = 1 and z/(Ai,... , A&) is the number of
solutions t o the system o f equations
( 2/ 1 H + yt = A i
2/i + • • • + 2/1 = A 2
(8.72 )
yf- + 2/ * = A fe
in integer s 1 ^ yj ^ P fo r j = 1 ,2,.. . ,£ . Thu s £ ^ X 3 ^ £Pi, or else ther e are
no solutions . Sinc e £ will be much large r tha n k (approximatel y £ x k 2) w e expec t
that mos t vector s (Ai,.. . , A&) have goo d numbe r o f representations b y the system
(8.72). Thi s numbe r i/(Ai,.. . , Xk) ma y var y onl y slightly . I n order t o smooth the
outer summatio n w e raise \S\ £ again t o power 2£, getting
\S\2i2 < P 2 1
^-^I 2{e )
t,k(P) - JtAP)Ze,k(P)
where
h,k{P)= £ KAi,...,A fc ),
A i , . . . ,Afc
2
JiAP)= Yl ^ (Ai,---,A fc ),
A i , . . . ,A fe
21
k
Ze,k(P)= Yl \52(xe(a 11\ x+ -<- + a k\kx )
A i , . . . ,A & 3 :
For th e average o f v(\i,... , A&) we hav e th e exact valu e
itAP) = P 1 ,
but th e estimatio n o f the averag e of i/2 (Ai,... , A&) is a much harde r problem . Not e
that Ji,k(P) i s the number o f solutions t o the system o f homogeneous equation s
f 2/ i H Yyi- yi+i • yu = 0
2 , , 2 2
- 2/2 , = o
.73) 2/i + * • * + Vi ~ Vi+i
U i + • • • + yt - 2/ m v\t = o
in integer s 1 ^ yj < P for j = 1 , 2 , . .. , 2£. W e hav e
21
(8.74 ) JiAP)= '" \ Yl e (aix + "- +akx )
k
da\ . . . da k.
J J
° ° l<x<P
22 0 8. E X P O N E N T I A L S U M S
More generally , i f the right-hand sid e of the system (8.73 ) i s replaced b y constants
<Ti, . . ., ak then th e number o f solutions J^k{Pj] cri, • • • , crk) to the system o f inho-
mogeneous equation s i s given b y the integral
/" '/ M C e (aix~\ \-a kx
k
)\ e(a 11a -\ h akak)dai . . . dak.
Hence it is clear tha t
(8.75) J ^ ( P , ; e n , . . . ,<r fc) ^ J ^ ( P j ; 0 , . . . ,Q) = J itk(P).
STEP I V (tunin g dow n t o linear polynomials) . Estimatio n o f Jijk(P) lie s
in the heart o f Vinogradov's metho d (se e the next step) , however , th e vital savin g
comes fro m Z^^ k{P) a s it is the only su m left wit h non-trivia l characters . W e have
e
Ze,k(P) ^ ^2 5 Z ( a i ^ i ^ i H ha fcAfccrfc)
a s i , . . . ,x 2£ A i , . . . ,Afe
where a k = a k(xu . . . ,x2i) = S i (x j _ xh
j+t)-> s o
l°"fe l ^ tP k'- Henc e b y (8.75) we
arrive at
Ze,k{P) ^ Ji,k(P) Yl J 5Z e (^i A i^i + Va k\kak)
C T i , . . . ,CTf c A i , . . . ,Af c
Here a i , . .. , 0>, A i , . .. , A^ run independently ove r al l integers with |<J^ | ^ £P h an d
|A^| ^ £P^. Not e tha t th e multiple su m factors s o we write th e result as
2k
1 k{k+ )
Zi,k{P) ^ Jt, k(P)t P ^
where
A= J ] D(a hJPh),
D{a,X)=X~2 Y, I J2e (amn)
\m\^X \n\^X
Now, as in Weyl's method , th e source of cancellation i s in the exponential sum s for
linear polynomials . Thi s is reached ver y differentl y her e by Vinogradov's techniqu e
of bilinear form s tha n befor e b y Weyl's differencin g process . Bu t above all , the new
process is much faster. Indeed , Weyl' s differencing require s applications of Cauchy's
inequality k — 1 times (eac h applicatio n reduce d th e degree by one), whic h amount s
to raisin g th e original su m to power 2 / e _ 1 , wherea s Vinogradov' s metho d get s the
job don e wit h th e power 2£ 2 x k 4. Anothe r ne w feature i s that w e produced linea r
polynomials ou t of every monomia l i n F, not just on e from th e highest degree ;
consequently th e saving factor s appea r abou t k times.
Gathering th e above estimate s w e obtain
2£2 2k
(8.76) \S\ ^ e P"V-V+k(k+Vjlk(P)A.
8. E X P O N E N T I A L SUM S 221
STEP V (estimation o f Ji^(P)). Le t u s give some thought abou t th e inequalit y
(8.76). Her e S is the origina l su m
S{au . . . , a k) = ^2 e ( Q i < ?1
+ " a k<ik)
while J^f c i s the L 2^-norm o f
e
P ( a i , . . . ,a k) = ^2 ( a i ^ H \~OL kx
k
)
with respec t t o the coefficient s a i , . . . , a^ ove r the toru s (R/Z) fc . A t first glance we
may thin k tha t th e proble m o f estimatin g Jt, k(P) i s no t muc h differen t tha n tha t
of th e origina l su m S. Yes , this woul d b e a n eve n harde r proble m i f w e dealt wit h
individual sums. But , du e to complete integration ove r the coefficients, w e are reall y
having a diophantine proble m (countin g solution s t o th e syste m o f homogeneou s
polynomial equation s (8.73)) .
By a heuristic argumen t on e i s lead t o th e boun d
12€ fc+
(8.77) J ^ ( P ) < P -^ >
(the linea r equatio n i n (8.73 ) fixes on e variable , th e quadrati c equatio n fixes two
additional variables , etc.) , bu t w e canno t prov e this . Vinogrado v succeede d in
proving a slightly weake r bound .
THEOREM 8.21 . Let k,m be positive integers and £ ^ k(k + m). We then have
for any P ^ /^(i-V*)"" 1 .
We begi n b y provin g
LEMMA 8.22 . Let p be a prime number, p > k, and A i , . . . ,A & be integers.
Then the number T = Tp (Ai,... , Xk) of solutions to the system of congruences
x\ + • • • - f xk = Ai (mo d p)
3.79 )
[xil h ^ E A f c (mo d pk)
in # i , . . . ,x k (mo d p k) such that Xi ^ Xj (mo d p) is bounded by
1
1hih
(8.80) T^k\p ^ .
PROOF. W e writ e eac h x r uniquel y i n th e for m
k l
xr = xrl + xr2p H h x rkp ~
with 0 ^ x ri < P for 1^ r $ J k and 1 < i ^ k. Th e first coordinates x r\ satisf y
the syste m (8.79 ) (mo d p). Pro m th e theor y o f symmetri c polynomial s i t follow s
that th e j - t h symmetri c function s i n £ i i , £ 2 i , . .. , £/ci, sa y Sj , ar e determine d b y
A i , . . . , Afc and tha t x r\ ar e roots o f xk — sixk~x -\ [-(—l) k
sk. Ther e ar e a t mos t
k roots , s o there ar e at most fc!solutions ( x n , x 2 i , . . • , xk\).
22 2 8. EXPONENTIA L SUM S
Now suppos e ( x n , x 2 i , . . . ,Xki), .. • , (xih,x2h,.-- ,Xkh) wit h 1 < ft < k ar e
given. W e shal l find condition s fo r th e coordinate s o f orde r ft + 1 . Lettin g
Xr\ + ^r2 P H + X rhph~X = y r/l
for 1 < r < fc, we hav e
h+1
^r = 2/r/ i + aj^+iP^mo d p ).
Consider th e system (8.79 ) (mo d p h+l) reduce d b y omitting th e first ft congruences,
i.e.,
k
1
h+
Y,(Vrh + Xrh+lP^™ = A m(mod p )
r=l
for ft < m ^ k. Henc e
k
(8.81) ^Xrh+iy?h = Mm(mo d p), ft < ra < /c ,
r=l
where // m ar e som e number s determine d b y th e coordinate s x r j o f orde r 1 ^ j ^ ft
and b y th e constant s A m wit h ft < m < k. Th e number s y rhA ^ r ^ k, ar e
different modul o p becaus e s o ar e x r. I n particular , a t mos t on e y rh vanishe s
modulo p , sa y y ^ , i f any . I f w e choos e £1 /1 +1 ,.. . ,Xhh+i arbitraril y th e remainin g
ones Xh+ih+i, • • -^/c/i+i ar e determine d modul o p uniquel y fro m th e linea r syste m
(8.81) becaus e it s determinan t (Vandermonde ) i s non-zero . Therefore , th e numbe r
of coordinate s o f orde r ft + 1 does no t excee d p h. W e conclud e tha t
O/ 1 fc(fc-l)
T < klpp l---pk-1 = klp—^—.
a
COROLLARY 8.23 . Let p be a prime number, p > k, and let Uk P(P) be the
number of solutions to the system of congruences
(8.82) £ {xt - x^ +k) E E 0 (mo d p m)
l^h^k
for 1 < m ^ k, in X < Xh ^ X + P, 1 < ft ^ 2k, such that Xh 1 ^ Xh 2 (mo d p) for
1 ^ fti 7 ^ h2 ^ k. We have
(8.83) U kp(P) < fe!(pfc 4- P ) f c
P V ^ .
Our nex t ai m i s t o establis h a recurrenc e inequalit y fo r J ^ ( P ) . W e assum e
that £ > k ^ 2 and P ^ pq, wher e p i s a prim e numbe r > k t o b e chose n late r an d
q i s a n integer . Clearly , w e hav e Ji,k{P) ^ Je,k(pq)- Fo r estimatin g Ji,k(PQ) we
divide the solution s o f the syste m (8.73 ) int o two classes . W e say tha t ( y i , . . . , y 2i)
is o f th e first clas s i f bot h sequence s ( y i , . . . , yi) an d (y^+i),.. . , y2i) hav e a t leas t
k differen t number s modul o p. Th e remainin g solution s ar e o f th e secon d class .
Let u s estimate th e numbe r Ji , say , of the first clas s solutions. Sinc e k number s
can b e pu t i n £ places i n £(£ — 1 ) . . . (£ — k + 1 ) way s w e obtain J\ < £ 2kJn, wher e
8. EXPONENTIA L SUM S 223
J n stand s fo r th e numbe r o f solution s ( y i , . . . , y<ii) suc h tha t al l ( y i , . . . , yu) ar e
different modul o p an d al l (y^+i,.. . , yi+k) ar e differen t modul o p. Lettin g
S(a) = J2 <F a(y)),
where F a(y) = a xy -\ h akyk, w e split
s(<*)=u(mod < F«(v)) = Ed p) 5 ^
E p)El^y^P u(mo
u(mod p) l^y^l
y=u(mod
y=u(moc p)
say. B y Holder' s inequalit y w e obtai n
pi pi. , , 2I 2£-2k
Jn= •• • / Yl S Ul(a) - - S Uk(a)\ ^ S u(a) da
iti,... ,Ufc(mo d p ) u(mo dp)
< p *- a *-i £ Jll(tt) i
0<u<p
say, wher e ^ } * means tha t th e summatio n i s restricte d t o th e residu e classe s ui,
. . . , Uk modulo p tha t ar e al l different , an d
2
Jn(u)= [ •• • I I V*S Ul(a)-'SUk(a)\ \Su(a)\2i-2kda.
This i s equa l t o th e numbe r o f solution s ( y i , . . . ,2/2^ ) o f th e syste m (8.73 ) suc h
that al l y i , . . . , i/fc ar e differen t modul o p , al l y^+i,.. . , yg+i,.. • , y^+/c ar e differen t
modulo p an d al l th e remainin g one s ar e congruen t t o u modul o p. Fo r thes e w e
set yh = u + pvun with 0 ^ Vh < q if k < h < £ or k + £ < ft < 2^ . W e obtai n
E ( ^ - ^ ) + E [(«+ pvh)m - ( « + ? w i = 0,
0<h^k k<h^£
for 1 ^ m ^ /c . No w w e appl y a n obviou s fac t tha t i f ( y i , . . . , y2^) satisfie s th e
homogeneous syste m (8.73) , the n (y i —it,.. . , y2^ — u) als o does . Therefor e w e
obtain
x
E( h-*H+t)+pm E « - v
W = o,
0<h^k k<h^£
for g ^ m ^ fc, where 2 ^ = y ^ — w for 0 < ft ^ k an d £ < h ^ £ + k. I n particular ,
we observ e tha t x ^ satisf y th e syste m o f congruence s (8.79) . Give n a solutio n t o
(8.79) w e se e tha t Vh satisfy th e inhomogeneou s syste m o f equation s
E w - v ™+^ =A-' l < m ^ k>
with som e constan t \ m independen t o f Vh- Thus th e numbe r o f the v^s i s bounde d
by Ji-k,k(q)i se e (8.75) . Combinin g wit h (8.83 ) w e conclud e tha t
J n ( u ) < W(p fc + P) k k1
1
=ii ±
P p S Jt-k,k(Q)
and
(8.84) J x < f kk\V2l-2k-m^1 Pk{pk + P) k
Je-ktk(q).
22 4 8. EXPONENTIA L SUM S
Now w e estimat e J2 , th e numbe r o f solution s o f th e secon d clas s o f (8.73) . Amon g
the number s ( y i , . . . , y^) ther e ar e a t mos t k — 1 different residu e classe s modul o
p, o r amon g th e numbe r (y^+i,.. . ,2/2^ ) ther e ar e a t mos t k — 1 different residu e
classes modul o p. Therefor e
J<2=
" yZ S Ul(a) • - S Ui (a)S Ui+1 (a)... S u^ (a)da
where U i s th e sequenc e o f vector s u = ( i ^ , . . . , i ^) (mo d p) wit h th e relevan t
property, s o \U\ < 2k ipiJrk1
~.
By th e inequalit y (x\ • • • x n)^ ^ -(x\ + • • • 4 - xn) w e obtai n
11
(ui,... ,u 2i)£U h=l
say, wher e
2i 2k
J2(u)= I •• • f \S u(a)\ da ^ J^ k(q) ^ q Je^k(q).
Jo Jo
Hence
t i+k l 2k
(8.85) J 2 < 2k p - q Je.kM{q).
By (8.84 ) an d (8.85 ) w e conclude tha t
2fc
JeAP) < f* f c ! p 2 < - 2 f c - ^ P V + P) fc + 2fcV +fe - 1 9 2fc l J t.ktk(q).
Finally assumin g tha t P ^ k k w e may tak e a prime numbe r p wit h 2 + P 1 / ^ ^ p ^
2p1/k a n ( i th e intege r g = \p~ 1 P] + 1 getting
LEMMA 8.24 . Let k ^ 2 , ,£ > \k(k + 3 ) - 1 and P ^ & fc. We then have
4
(8.86) J,, fc (P) ^ 2 ^+ ^ J ^ - M ^ 1
"*).
Now Theore m 8.2 1 follow s easil y fro m Lemm a 8.1 4 b y inductio n i n m.
Combining (8.76 ) wit h (8.78 ) fo r £ = k(k - f m) w e obtai n
2
(8.87) | 5 | ^ 4P (APfc(H1 )(1
^r)^2(H-)2
where m i s an y positiv e intege r an d P > k k^l~k) m .
STEP V I (estimatio n o f A) . I t remain s t o estimat e A . W e hav e
m i n X
X>D(a,X)^ £ 2||am| 1 ( ' ^ )
, ,< v
1
^^ V '2am-r /
|r|<Cc*A~+i | a m - r | < ^
^ 2 ( a X + l)( X + ^ min(2X , —)).
0<u<l/2a
8. E X P O N E N T I A L SUM S 22 5
The las t su m i s bounde d b y a n integra l whic h equal s a - 1 logeX , therefor e
D(a, X) ^ 2X~ 2(aX + 1 )(X + a' 1 lo g eX) ^ 4( a + a^X' 2) log3 X
for a > 0 an d X ^ 3 . W e als o hav e trivia l boun d D{a,X) ^ 4 . Fro m bot h
estimates w e conclud e tha t fo r an y 7 c { l , 2 , . . . , / c } ,
k
(8.88) A < JJ(|a,-| + la^p-V) (4k\og3£P) .
STEP VI I (estimatio n o f exponentia l sums) .
We no w assum e tha t f(x) i s a rea l smoot h functio n o n [iV , 2iV] suc h tha t fo r
all j ^ 1
j3
(8.89) a~ F< °^\f U)(x)\ ^ a 3"F
on [TV , 2iV], wher e F ^ i V ^ 2 and a ^ 1 . Henc e fo r a ^ n ) = f {j)(n)/j\ w e ge t
a - j 3 ( 2 i V ) - j F < la^n) ! ^ a j3
N~jF
and
A ^ J^(FAT-^ ' + F - 1 ^ ' p - 2 j > f e 4 ( 8 A ; l o g 3 P ) f c .
jei
We assum e tha t P 4 ^ T V ^ F 3 an d fc ^ 21 ogP/logiV . W e choos e
f. log P log P1
V ' logiY <J^log(7V/P)J
getting
A < Y[(FN- J)(2kek)k2(log3N)k < 7V"i jr(J-1 )(2A:e/c)fc2(log3Ar)/c
where
lQ
7 = m > ^ ^ gF 1 > fog*1) fog ^) , 1
11 2+
" log(N/P) lo gTV " 2(log7V)
Finally
(8.90) A ^ exp(-(logF) 2 (logP) 2 (21 og7V)- 3 )a /c4 (^log3AT) fc .
Now settin g P = iVi , k = [ 4 log P/ lo g TV] and m = 8f c w e infe r fro m (8.70) ,
(8.87) an d (8.90 ) th e followin g
T H E O R E M 8.25 . Let fix) be a smooth function on [TV , 27V] which satisfies
(8.89) with F ^ TV 4 and a ^ 1 . We then have
1 8
(8.91) S f(a,b) < aA^exp(-2- (log7V) 3 (logP)- 2 ),
where the constant implied in < C is absolute.
APPLICATIONS: W e conclud e th e presentatio n o f Vinogradov' s metho d b y
drawing fro m (8.91 ) a fe w basi c applications .
226 8. EXPONENTIA L SUM S
COROLLARY 8.26 . For t^N^2we have
3
(8.92) Yl ^<7Vexp(-/3(log7V) (logt)- 2 ),
l^n^N
where (3 and the constant implied in < C are absolute.
THEOREM 8.27 . There exists an absolute constant a > 0 such that for s = a+it
with t ^ 2 and | ^ a ^ 1 ,
a
(8.93) C ( 5 ) < t ^-^f(logt)i.
The implied constant is absolute.
PROOF. B y (8.21 ) usin g partia l summatio n an d (8.92 ) w e hav e
C(s)= Yl ri- s
+ 0(l)<^ ( x~ a
exp(-^(logx) 3(logt)-2)^
pi pi
f
= ( l o g t ) / t^-^-^du^ilogt) t W-Ku-vfdu
Jo Jo
where f(u) = ( 1 - cr) u - / ^ 3 an d v - ^/( l - cr)/3/ 3 s o /(v ) = 2( 1 - a)?/3y/3@-
This give s (8.93 ) wit h a = 2/3>/3/? . •
COROLLARY 8.2 8 (VINOGRADOV-KOROBOV , 1 957) . There exist absolute con-
stants 7 > 0 , 6 > 0 such that
(8.94) |C(< 7 + zt)|^ 7(log*)§
m the region
(8.95) £ ^ 2, a > l-5(\ogt)-i.
There ar e severa l interestin g device s b y mean s o f whic h on e ca n transfor m
an uppe r boun d fo r £(s ) int o a zero-fre e regio n an d deriv e estimate s fo r l/£(s) ,
(f(s)/((s) i n thi s region . Th e origina l argument s o f Hadamar d an d d e l a Valle e
Poussin (presente d i n Chapte r 5 ) produc e onl y a zero-fre e regio n o f th e typ e a ^
1 — c(logt) - 1 , regardles s o f ho w goo d th e estimat e fo r £(s ) tha t on e has . Howeve r
combining thes e argument s wit h a metho d o f Landa u on e ca n establis h deepe r
relationships (se e Theore m 3.1 0 an d Theore m 3.1 1 o f [T2]) .
LEMMA. Let (j)(t), ip(i) be positive increasing functions for t ^ 0 such that
(f)(t)ip(t) = o(exp((/)(t))) as t - + +oo . / /
C ( 5 ) « e x p ( ^ ) ) for a>l-^(t)-\
then
1 1
COO^O for a^l-cxp(2t + 3)- (/){2t^3)- .
Moreover in this region
1/C00, C'(s)/((s) « 0(2 t + 3)^(2 i + 3) .
Hence usin g Corollar y 8.2 8 w e obtai n
8. E X P O N E N T I A L SUM S 22 7
THEOREM 8.29 . We have ((s ) ^ 0 and
— <(log£)i(loglogt)* ,
Cw
^<(log*)i(loglog*)*
C(«)
for c r ^ l — c(logt) / (loglog^) _ 1 / 3 ; £ ^ 3 7 where c > 0 is an absolute constant.
_2 3
Now the standar d contou r integra l argumen t (se e Chapte r 5 ) yield s th e follow -
ing stronges t know n Prim e Numbe r Theorem .
COROLLARY 8.30 . We have
ip(x) — x + 0(xexp(—c(logx) s (loglo g x ) - 5 ) )
for x ^ 3, where c > 0 is some absolute constant.
Let Tk(n) stan d fo r th e numbe r o f way s o f factorin g n int o k positiv e integers ,
so the Dirichle t generatin g serie s fo r 7>(n ) i s ( k(s).
THEOREM 8.31 . For any e > 0 and x ^ 2 we have
r
(8.96) D k(x) = J2 * W = xPk(logx) + 0(x^ +£),
where Pk is a polynomial of degree k — 1 , 5k = 1 — 7A: -2 / 3 , ^ is a positive absolute
constant and the constant implied in O depends on e and k only.
PROOF. B y Perron' s formul a (5.1 1 1 ) w e obtai n
k
Dk(x) = — / C (s)-ds + Oix^T- 1 ),
2
™ Ji+e-iT S
where T i s an y numbe r wit h 2 ^ T < x. Mov e th e integratio n t o R e (s) = cr ,
\ < a < 1 , getting b y (8.93 )
Dfc(x) = re s C fc(s)— + 0((x ( 7 T a ; c ( 1 - < j ) § + x T " 1 ) ^ ) .
3=1 5
The residue i s equal t o xPk(logx) an d th e erro r ter m become s 0(x 6k+2s) b y settin g
T = x k'2/3 an d a = 1 - ( 2 a / ^ ) ~ 2 . •
Recall tha t fo r smal l k on e get s stronge r result s usin g va n de r Corpu t method .
"i-i
It i s conjecture d tha t th e bes t exponen t i n (8.96 ) i s 5k = ^iT2k
f° r a n y ^ -
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/10
CHAPTER 9
THE DIRICHLE T POLYNOMIAL S
9.1. Introduction .
A Dirichle t polynomia l i s a finite Dirichle t serie s
anTl
(9.1) D(s)= Yl ~S
with comple x coefficient s a n. Thi s i s a specia l cas e o f sum s o f typ e ^ a n e _ A ^ n ^ s
where A(n ) ar e distinc t rea l number s calle d "frequencies. " Ou r specia l cas e X(n) —
log n i s distinguished b y th e followin g features :
- X(n) i s smoot h an d slowl y increasing ,
- X(n) i s additive .
The derivativ e o f D(s) i s als o a Dirichle t polynomia l o f lengt h TV,
a
D'(S)= Yl 'nn~S>
its coefficient s o! n — —a^logn ar e change d onl y slightly . Th e additiv e propert y o f
our frequencie s implie s tha t th e produc t o f two Dirichlet polynomial s i s a Dirichle t
polynomial, namel y w e hav e
( E a «n_s)( ^ b ™m-s) = X > r s
with L = MN an d th e coefficient s o f th e produc t ar e
(9.2) a= ^2 a nbm.
nm=£
The main objectiv e o f the theor y o f Dirichlet polynomial s i s to estimate D(s) a t
special points. Sinc e the set of special points is not constructible i n practice (thin k of
the zero s p = / 3 + i7 o f ((s) wit h (3 > | ) i t i s necessary t o deal with arbitrar y points .
These ca n b e electe d b y combinatoria l argument s int o well-space d sets . Moreover ,
the coefficient s a n ar e quit e complicate d i n applications , therefor e on e ha s t o trea t
them a s arbitrar y comple x numbers . Thoug h i n suc h generalit y i t i s impossibl e t o
give a non-trivia l boun d fo r D(s) a t an y fixed 5 , i t i s tru e tha t |D(s) | canno t tak e
large value s fo r man y well-space d points . Henc e ou r fundamenta l questio n wil l b e
how ofte n \D(s)\ i s large r tha n th e mea n valu e ove r th e relevan t se t o f points .
If s = a + it ha s fixed cr , the n b y changin g th e coefficient s a n int o a nn~<7 w e
can assum e withou t los s o f generalit y tha t al l th e point s i n questio n ar e o n th e
229
230 9. T H E D I R I C H L E T P O L Y N O M I A L S
imaginary lin e s = it. Afte r establishin g result s fo r points on the imaginary lin e we
shall exten d the m fo r sets o f well-spaced point s i n vertical strips . Actuall y w e can
also modify th e Dirichlet polynomia l D(s) by twisting its terms by various function s
fs(n) dependin g o n s which chang e onl y slightl y i n n (hav e smal l derivative s i n n),
that i s to say, the theory extend s t o sums o f type
(9.3) ^cinfs(ri)n~
Such a twis t ca n be handle d usin g an y standard Fourie r analysi s (separatio n o f
variables i n / s ( n ) , se e Proposition 9.1 1 ) .
9.2. Th e integral mean-valu e estimates .
By Cauchy' s inequalit y w e ge t
(9.4)
l<n<N
E ani <GN
where
(9.5) G= > El a J 2
.
Of course , thi s boun d i s best possible , however , on e can do better o n average.
THEOREM 9.1 . For any complex numbers a n we have
rT, | 2
il
(9.6) /J ]T a nn dt = (T + 0(N))G
^ ° l<n<N
where the implied constant is absolute.
PROOF. Le t f(t) b e the following continuou s piecewis e linea r functio n
'0 if t ^ -TV ,
1 + -*- if - N < t ^ 0 ,
1
^ N
/(*) 1 if 0 < ^ T ,
1
N if T < t ^ T + JV,
<0 if t > T + N.
Then ou r integral i s majorized b y
u 2
f f(t)\ E a nn \ dt= Yl a manF{^\
l<n<N l^m,n^AT
with
F(x)= f f(t)x ltdt.
We hav e F ( l ) = T + N an d F(x) < iV - 1 (logx)~ 2 i f x ^ 1 . Henc e fo r m ^ n we
have
mfm\A T i A rn\-1
2
/ r a + n \2 N
F — «CA^ _1 log — < C— < T r^ .
\nJ V nJ N\m — nJ (m — n) z
9. TH E DIRICHLE T POLYNOMIAL S 231
Therefore ou r integra l i s bounde d fro m abov e b y
(T + N)G + O^N Yl Wma n\(m - n)~ 2 ) = ( T + 0(N))G
l^m/n^iV
by using th e inequalit y 2\a rnan\ ^ |a m | 2 + |a n | 2 . Similarl y w e derive the sam e lowe r
bound. Combinin g bot h estimate s w e complet e th e proo f o f (9.6) . •
This resul t shoul d b e compare d wit h th e discussio n o f bilinea r form s an d larg e
sieve i n Chapte r 7 .
Theorem 9. 1 implie s tha t G? i s th e mean-valu e o f \D{ii)\ i n th e followin g
asymptotic sens e
(9.7) li m i / \D(t)\ 2dt = G.
T->oo T J 0
Note tha t th e mean-valu e o f th e produc t o f Dirichle t polynomial s i s essentiall y
bounded b y th e produc t o f mean-values . Precisel y w e deriv e fo r th e coefficient s
(9.2) th e followin g inequalities :
X ^ l 2 ^ ^2Y1 \ ant>ma>nibmi\
£ nm=nimi
a
^Y1^2 \ nbm\2r(nm)
nm
<(£M2TW)(£M2TM).
nm
Hence usin g th e boun d r(n) < C n£ w e obtai n
(9-8) ] > > , | « (MJV) £ (5> n | 2 )(£|& m | 2 ).
For th e produc t o f k polynomial s w e hav e
( E <4V* ) • • • (E ^ fc)»-J) = E <•«» -
n^iVi n^N k n^N
with N = Ni ... Nk an d th e coefficient s ar e
««=E--E<--e
ni---nk=n
The mean-valu e o f thes e coefficient s satisfie s
(fe)
E i ° n i 2 < E " - E K ) - " a ^ ) W n i •••"* ) ^ ^ - - - ^A;
n ni n k
where fo r a give n sequenc e A — (a n) w e pu t
(9.9) G k = £
n
This follow s b y th e inequalit y Tfc(ni---rifc ) ^ ^ ( n i ) • • • Tfc(rifc). Sinc e Tjt(n ) i s
multiplicative, i t suffice s t o chec k thi s inequalit y fo r rij — paj. I n thi s cas e w e
have Tk(p a) — ( 1 + a ) ( l + a/2 ) • • • ( ! + a/(f c - 1 )) , s o i t suffice s t o chec k tha t
232 9. TH E DIRICHLE T POLYNOMIAL S
1+ OiH h ak)/£ ^ ( 1 + OL\jf) • • • (! + a k/tj fo r 1 ^ £ < k, whic h i s obvious .
Note tha t
£
Gk < G7V
by Tk(n) < C ne , wher e th e implie d constan t depend s o n s an d fc; however, w e shal l
often i n practic e hav e a bette r estimat e G k < C G(\og2N)K wit h som e K ^ k.
The approximat e formul a (9.6 ) i s th e bes t possibl e o f it s kin d i n general . I t i s
best use d fo r polynomial s o f lengt hT V whic h i s nea r T i n th e logarithmi c scale . I f
the polynomia l D(s) i s quit e shor t w e ca n bette r appl y (9.6 ) t o a suitabl e powe r
D(s)k gettin g
COROLLARY 9.2 . For any integer k ^ 1 we have
(9.10) /
J0
/ V| Yla a -n \ alt
n
lt
< ( T + N k)Gkk
l<n<N
where G k is given by (9.9) and the implied constant is absolute.
Still w e ma y no t b e abl e t o matc h T an d N k fo r k a n integer , s o (9.1 0 ) i s
not perfec t i n som e ranges . I n thi s connectio n H . Montgomer y [Mo2 ] mad e th e
following
CONJECTURE M k. Suppose \a n\ < 1 . Then for any real k with 1 ^ k ^ 2 we
have
u k
(9.11) / ] Ta nn \ dt<^(T + N )Nk+£
where the implied constant depends only on e.
It i s no t difficul t t o prov e (9.1 1 ) fo r som e specia l polynomials , fo r example ,
when a n — 1 for al l n. However , th e boun d (9.1 0 ) fo r genera l comple x number s a n ,
with N k+£ replace d b y G kN£, wa s show n b y J . Bourgai n [Bou ] t o b e fals e i f k i s
not a n integer. Hi s counter-example i s given by a Dirichlet polynomia l supporte d i n
a shor t interval . Bu t suc h case s ar e no t commo n i n applications , s o the Conjectur e
Mfe, i f true, i s still important . I t yields , amon g othe r things , th e densit y conjectur e
for zero s o f th e Rieman n zet a functio n (se e Chapte r 1 0) .
9.3. Th e discret e mean-valu e estimates .
Next w e establis h discret e version s o f Theore m 9. 1 an d som e o f thei r conse -
quences.
DEFINITION. A set T o f real numbers t r i s said to be well-spaced if \t ri —t r2 \ ^ 1
for an y r\ ^ r<i.
LEMMA 9. 3 (GALLAGHER) . Let T be a set of points t r with \ ^ t r < T - \
which is well-spaced. Let F(t) be a smooth function on [0,T] . Then we have
(9.12) J2 \ F^)\2 <
Jo
[ T(\F(t)\2 + \F(t)F'(t)\)dt.
trer
9. T H E D I R I C H L E T P O L Y N O M I A L S 23 3
P R O O F . Fo r a smoot h functio n f(x) o n [0,1 ] w e get th e identit y
f(x)= J f(t)dt+ C tf{t)dt + [ (t-l)f(t)dt
JO JO Jx
by partia l integration . Henc e
l/(|)K f\\f{i)\ + \\f\t)\)dt.
Jo
Applying thi s fo r f (t) w e infe r
i/(|)i 2 < / (\f(t)\ 2 + \f(t)f(t)\ dt
Jo
This give s
\F(U)\2^ r +2
(\F(t)\2 + \F(t)F'(t)\)dt,
Jtr-i
and summin g ove r t r i n T w e deduc e (9.1 2 ) becaus e th e interval s (t r — \,tT + | )
do no t overlap . D
Notice tha t b y th e Cauchy-Schwar z inequalit y w e hav e
2
(9.13) f \F(t)nt)\dt^{J \F(t)\ dtY(J \F'(t)\ 2dty.
Applying (9.1 2 ) togethe r wit h (9.1 3 ) fo r F(t) = D(it) on e get s
THEOREM 9.4 . Let T be a set of well-spaced points in the segments [0,T ] with
T ^ 1 , and let a n be any complex numbers. Then we have
a
(9.14) ] T | J2 nnltr\ <^{T + N)Glog2N
tr£T l^n^N
where G is given by (9.5) and the implied constant is absolute.
COROLLARY 9.5 . Let the conditions be as in Theorem 9.4- For any integer
k ^ 1 we have
lU
(9.15) ] T j Y^ ann \< C (T + N k)Gkk \og2N k
where Gk is given by (9.9) and the implied constant is absolute.
REMARKS. Gallagher' s lemm a yield s a transitio n fro m a n individua l valu e
D(it) t o a n integral . I t employ s th e derivativ e D'(it) whic h i s no t acceptabl e
in som e applications . However , w e ca n mak e a transitio n withou t differentiatio n
of D(s). T o thi s en d tak e a smoot h functio n f(x) > 0 supporte d o n [|,2iV ] wit h
f(x) = 1 on [1 ,7V ] suc h tha t x af{a\x) < 1 for 0 ^ a ^ 2 . B y Melli n inversio n
f(x) = —J F{t)x«db
where
Fit) = f f(x)x it-1 dx < C (1 + |i|)" 2 log 2N
234 9 . TH E DIRICHLE T POLYNOMIAL S
by partia l integration . W e multipl y a n b y / ( n ) , whic h i s a redundan t facto r i f
1 ^ n ^ TV , and appl y th e abov e integra l representatio n o f f(n) gettin g
1 f°°
D(s) = — D(s + it)F(t)dt.
2?r J_ 00
Hence
oo
2
\D(it)\(l + \t-t r\)- dt
/ -OO
where th e implie d constan t i s absolute . Th e sam e argument s appl y t o D(s) k wit h
k a positiv e intege r givin g
oo
\D(it)\k(l + \t-t r)-
2
dt
/ -oo
where the implied constan t i s absolute. Henc e the estimates (9.1 4 ) an d (9.1 5 ) follo w
from (9.6 ) an d (9.1 0 ) respectively .
The bound (9.1 4 ) has deficiency o f not dependin g on the cardinality of the set T ;
it i s relatively weake r fo r smalle r sets . Montgomer y [Mon ] succeede d i n improvin g
(9.14) i f R = \T\ i s smalle r tha n T* (not e tha t an y well-space d se t T £ [0,T ] ha s
no mor e tha n T + 1 points). Th e ide a i s based o n th e dualit y principl e fo r bilinea r
forms:
Let X — (xmn) b e a comple x matrix . The n th e followin g statement s abou t X
and a numbe r D ar e equivalent :
(A) Fo r an y comple x number s a nj
I, 2
2
^2 \^2 anXrnn\ ^ D^|a n | .
mn n
(B) Fo r an y comple x number s 6 m ,
.| 2
2
^2\^2brnXrnn\ < ^ ^ | 6 m | .
nm
For th e proof , se e Chapte r 7 .
THEOREM 9. 6 (MONTGOMERY) . Let the condition be as in Theorem 94- Then
we have
a
(9.18) ] T | J2 nnitr\2 < G(N + RT*) log TT
treT l^n^N
where the implied constant is absolute.
PROOF. B y th e dualit y principl e th e proble m i s equivalent t o showin g tha t fo r
any comple x number s c r on e ha s
( 9 - 19 ) J2 \J2 c
r ^ f < (Yl \cr\ 2
)(N + RT*)log2T.
l^n^N t reT t r£T
9. TH E DIRICHLE T POLYNOMIAL S 235
Squaring ou t an d changin g th e orde r o f summation o n the lef t sid e o f (9.1 9 ) w e ge t
C
/j / j r\ c r2 ^ \*ri ~ tr 2 )
r\ r 2
where
nlt
z(t)= Yl -
By (8.20) , (8.26 ) an d (8.34 ) w e deriv e tha t
1
(9.20) Z(t) < Nlt]' + \t\ 2 lo g |£|, i f \t\ ^ 1 .
This yield s (9.1 8) . •
By th e Lindelo f Hypothesi s £( ^ + it) < C \t\ £ fo r \t\ > 1 , on e shoul d expec t a
better boun d fo r Z(t) tha n (9.20) , namel y tha t
1
(9.21) Z(t) < N^- + N* \t\ £, fo r |t | ^ 1 .
Hence w e get
lU £
(9.22) J2 Yl ctnn \ <^G(N + RN2)T
in plac e o f (9.1 8) . Bu t a n eve n stronge r boun d i s predicte d b y Montgomer y i n th e
following
CONJECTURE M. Suppose \a n\ ^ 1 . Let T be a set of R well-spaced points t r
in the segment [0,T ] with T ^ 1 . Then
a
(9.23) V | J2 nniu\ <^N(N + R)T £.
EXERCISE. Sho w tha t Conjectur e M implie s Conjectur e Mk fo r an y rea l 1 ^
k < 2 (wit h extr a facto r T £).
9.4. Larg e value s o f Dirichle t polynomials .
The discret e mean-valu e estimate s offe r som e answers to the questio n ho w ofte n
a Dirichle t polynomia l assume s larg e value s a t a give n se t o f well-space d points .
Thus (9.1 4 ) yield s
THEOREM 9.7 . Let D(s) be a Dirichlet polynomial of length N > 1 and let T
be a set of well-spaced points t r in the segment [0 , T] with T ^ 1 such that
(9.24) \D(it r)\ ^ V
for some V > 0 . Then the cardinality ofT, say R= \T\, satisfies
(9.25) R<z:(T + N)GV~ 2 lo g 2N
where the implied constant is absolute.
Notice tha t th e boun d (9.25 ) i s non-trivia l i f V ^> G? \og2N. Fo r V muc h
larger tha n thi s Montgomer y go t a bette r resul t b y applyin g (9.1 8) .
23 6 9. T H E D I R I C H L E T P O L Y N O M I A L S
THEOREM 9.8 . Let the conditions be as in Theorem 9.7 with
(9.26) V^G?T*log2T.
Then
2
(9.27) # < c G W " log2T
where the implied constant is absolute.
P R O O F . W e ca n assum e tha t T i s large r tha n N b y a larg e constan t factor ,
because otherwis e (9.27 ) follow s b y (9.25) . Insertin g (9.24 ) int o (9.1 8 ) w e ge t
RV2 <z:G(N + RT^) log2T .
This implie s (9.27 ) unde r th e restrictio n (9.26) . •
The restrictio n (9.26 ) ca n b e removed a t a cos t o f introducing a n extr a ter m i n
the boun d (9.27) .
COROLLARY 9. 9 ( H U X L E Y ) . Let the conditions be as in Theorem 9.7. Then
(9.28) R < {GNV~ 2
+ G 3 iV7V- 6 }(log2T) 6
where the implied constant is absolute.
P R O O F . Huxley' s ide a (th e subdivisio n method ) i s a simpl e tric k o f dividin g
the se t T t o mee t th e conditio n (9.26) . Firs t w e can assum e tha t
(9.29) V^G*log2T
1 i
because otherwis e th e assertio n (9.28 ) i s trivial . The n w e hav e V ^ G*T 04 lo g 2T0
with T 0 = m i n { T , G - 2 y 4 ( l o g 2 T ) - 4 } . Th e conditio n (9.29 ) ensure s u s tha t 1 <
T0 ^ T . Le t Te b e th e subse t o f point s o f T i n th e interva l [IT, (£ + 1 )T 0] wit h
0 ^ I ^ TT 0 _1 . Fo r eac h subse t Tt> Theore m 9. 8 i s applicabl e givin g R £ = \Te\ <
GNV~2\og2T. Henc e
l
R = S£RI4Z TT^ GNV~2 log2 T
t
which yield s (9.28) . •
Applying th e abov e result s fo r th e polynomia l D{it) k wit h k a positiv e intege r
one get s
k
(9.30) i J < (T + N )(GkV-2)k\og2Nk,
(9.31) R < {{G kNV~2)k +T(G 3
kNV-
6 k
) }{log2Tf
where th e implie d constan t i s absolut e (se e (9.25 ) an d (9.28 ) respectively) .
In 1 975 , M . Jutil a cam e u p wit h severa l innovation s t o improv e th e abov e
estimates i n som e ranges . Hi s bes t resul t (whic h w e stat e her e withou t proof ,
see [Ju3] ) i s
9. TH E DIRICHLE T POLYNOMIAL S 23 7
THEOREM 9.1 0 ( J U T I L A ) . For any positive integer k we have
„3 2 , ^{-.(^-^(f)^},^
for any e > 0, the implied constant depending only on e and k.
REMARKS. Notic e tha t i f
e
(9.33) V ^G*N*(NT)
then Jutila' s boun d (9.32 ) become s essentiall y Huxley' s boun d (9.31 ) b y lettin g k
be sufficientl y larg e i n term s o f e.
The bes t (conditional ) estimate s fo r th e numbe r o f larg e value s o f a Dirichle t
polynomial wit h bounde d coefficient s come s fro m Montgomery' s conjectures . B y
Conjecture M/ ~ one derive s
k
(9.34) R^(T + N )NkV-2k(NT)£
for an y rea l k ^ 1 an d e > 0 , th e implie d constan t dependin g o n e an d k. In
applications (9.34 ) i s best use d fo r k wit h N k = T . B y Conjectur e M on e derive s
(9.35) R < N 2V~2T£, i fV > N^T £
.
Actually th e abov e estimate s fo r R ar e equivalen t t o th e Montgomer y conjectures .
All th e discret e mean-valu e estimate s wit h respec t t o a well-space d se t T —
{ti,... , IR} C [0 , T] ca n b e generalize d slightl y t o sum s o f typ e
(9.36) ] £a nfr(n)n^
where f r(n) i s a nic e functio n whic h doe s no t var y i n n to o much .
PROPOSITION 9.1 1 . Suppose that each f r{x) satisfies
a
(9.37) x \tia\x)\^2
for 1 ^ x ^ N and 0 ^ a ^ 2 . Suppose that
(9.38) I Y,a nfr(n)n^ ^v
l^n<N
for 1 ^ r< R. Then the bounds (9.30) and (9.31 ) hold true with Gk replaced by
Gk(log2N)2. In particular, / r ( n ) = n~ Gr with 0 ^ a r ^ 1 qualifies.
P R O O F . W e ma y assum e tha t f r(x) i s supporte d i n [|,2iV ] wher e i t satisfie s
(9.37). The n w e writ e
lt
fAn) = — / f r(it)n- dt
where f r{s) i s th e Melli n transfor m o f f r(x),
S l
fr{s) = / fr{x)x ~ dx.
JO
238 9. T H E D I R I C H L E T P O L Y N O M I A L S
This satisfie s f r(it) < C (|t | + 1 ) 2 log2A r b y partia l integratio n tw o times . Henc e
we ge t
oo
\D(itr
l<n<N /
« (lo g2AT) ]T \D(it r(m))\(\m\ + 1)"
where t r{rn) = t r — t i s th e poin t a t whic h \D(it r — it)\ i s maxima l wit h m < t ^
m + 1 . Notic e tha t |£ r (ra)| < T + \m\ + 1 . B y th e hypothesi s (9.38 ) i t follow s tha t
1
\D(itr(m))\ > (|ra | + l ^ ^ k ^ A O "
for som e m G Z. Le t 7^ i b e th e se t o f suc h point s t r(m) an d Rm it s cardinality ,
so we hav e R ^ ]T m i? m . Th e se t Tm ca n b e divide d int o tw o well-space d subsets .
Applying (9.30 ) an d (9.31 ) fo r each of these subsets we obtain correspondin g bound s
for R rni an d summin g ove r m w e get th e asserte d bound s fo r R. •
9.5. Dirichle t polynomial s wit h characters .
A larg e par t o f th e theor y o f Dirichle t polynomial s (9.1 ) extend s t o sum s o f
type
a
(9.39) D(s, X)= J2 nX(n)n-s
where % is som e sor t o f arithmeti c harmonic , fo r example , v(n ) ca n b e a Dirichle t
character, o r th e Fourie r coefficien t o f a modula r for m (se e Chapte r 1 4) . However ,
in th e latte r exampl e on e i s face d wit h technica l difficultie s cause d b y th e lac k o f
complete multiplicativit y (th e generatin g L-functio n ha s Eule r produc t o f degre e
two) whil e th e orthogonalit y propert y i s onl y approximate .
In thi s sectio n w e dea l wit h th e Dirichle t character s t o variou s moduli . Le t
k ^ 1 and Q ^ 1 . W e conside r th e se t H(k,Q) o f character s x ( m o d kq) suc h tha t
X =£V > where £ is an y characte r t o modulu s k an d ip is an y primitiv e characte r t o
modulus q with 1 ^ q ^ Q , (q, k) = 1 . Throughou t w e assum e tha t T ^ 3 , iV ^ 1 ,
and w e denot e
(9.40) H = kQ 2T, C = lo g HN.
First w e establis h th e followin g extensio n o f Theore m 9.1 .
THEOREM 9.1 2 . For any complex numbers a n we have
lt 2 3
(9.41) ] Tf \ Yl anX(n)n \ dt^:G(N + H)C
where the implied constant is absolute.
9. TH E DIRICHLE T POLYNOMIAL S 239
REMARKS. Ou r estimat e (9.41 ) i s not a s strong a s it coul d possibl y be , namel y
the logarithmi c facto r C? could b e remove d b y takin g a traditiona l approac h base d
on th e larg e siev e inequalit y fo r additiv e characters . Th e transitio n fro m multi -
plicative t o additiv e character s bein g performe d b y Gaus s sum s (se e Chapte r 3) ,
this approac h ha s the extr a advantag e o f working well for sequence s (a n) supporte d
in shor t segment s M < n < M + N. Nevertheless , w e have chose n a direc t metho d
simply t o sho w ne w ideas . Her e th e quantit y H = kQ 2T i s abou t th e numbe r o f
harmonics x( n)ntt bein g employe d (thin k o f t a s a discret e variabl e rangin g ove r a
well-spaced se t o f point s wit h |£ | < T) .
P R O O F . Fo r th e proof o f (9.41 ) w e firs t assum e tha t (a n ) i s supporte d i n a
dyadic segmen t X ^ n ^ 2X wit h X ^ 1 . B y th e dualit y principl e th e proble m
reduces t o th e estimatio n o f
2
(9.42) E | E / c x(t)X(n)n^dt
for an y comple x number s c x(t). W e enlarg e th e oute r summatio n b y introducin g a
smoothing functio n f(n) ^ 0 with f(n) ^ 1 for X ^ n ^ 2X. The n squarin g ou t
we se e tha t th e dua l for m (9.42 ) i s bounde d b y
(9.43) E E // \c xAh)cxAh)B(t1 -t2,XiX2)\dtidt2
i 0 i 0
Xi X2 ''
where
(9.44) 5 ( t , x ) = £ / ( n ) x ( n ) n".
Here \ = X1 X 2 is a character o f modulus £ = k[qi,q2] < kQ 2 an d \t\ = \t\ — ^1 ^ T.
By Poisson' s summatio n
s(«^E<K)
where G i s the Gaus s su m
x7
a(mo d t)
lt
and F i s the Fourie r transfor m o f f(x)x 1
•>o o
it
F(y) = / f(x)x e(xy)dx.
Jo
Have i n min d tha t F(y) i s als o a functio n o f t. W e choos e f(x) suc h tha t
(9.45) F ( 2 / ) « X e x p ( 1*
L1 l/yX\\-
- ( ^ r ) 5 ) ,i f y > 0.
For this , ther e ar e man y goo d choices , fo r example ,
(9.46) /(I ) = „„(»_£_!)
24 0 9. TH E DIRICHLE T POLYNOMIAL S
satisfies al l th e requirements . Fo r th e proo f o f (9.45) , w e mov e th e pat h o f in -
tegration i n th e Fourie r integra l o f f(x)x lt fro m R + t o e 2^lR+ wit h 0 < 0 < \
getting
f°° / 5\
\F(y)\ ^X exp(- + 0\t\ - (x + x~ l) cos6 - 2irxyX sm0)dx.
Since x~ l co s 0 + 2irxyX si n 0 ^ {nyX si n 20) *, thi s yield s
\F(y)\ ^ X e x p ( ^ + 0\t\ - (TH/ X sin 20) * ) (costf)" 1 .
We choose 0 = T~ l gettin g (9.45) .
By (9.45 ) w e deriv e tha t (b y th e trivia l boun d \G(h/t)\ < £)
4
(9.47) B(t, X) = S X^1 F(0) + 0(fTexp(| - ( * ) ))
where
/•OO
(9.48) F(0 ) = / f(x)x u
dx < C X(|t| + l ) " 2 ,
£x = 1 if x i s principal , an d 5\ — 0 otherwise . Not e tha t % = X1 X 2 i s principa l
if an d onl y i f \i = X 2 (thi s i s th e plac e wher e ou r hypothesi s tha t th e character s
^(mod q) ar e primitiv e i s essential) . Insertin g (9.47 ) an d (9.48 ) int o (9.43 ) w e
estimate (9.42 ) b y
T
{X + H 2eM-(X/H)l>)} £ [ T c ,~dt.
\ x(t)\
0
xen(k,Q)'
Since thi s hold s fo r an y comple x number s c x(t), i t follow s b y dualit y tha t fo r an y
complex number s a n ,
(9.49)
a
E/ | E a nx(n)n
u
\\t<£{X + H 2eM-(X/H)")} E l -| 2 '
xeH(fc,Q) ° X<n^2X X<n^2X
This resul t i s a s goo d a s (9.41 ) provide d X ^ H(logH) 2, bu t i t i s ver y poo r
for smalle r X. Not e tha t w e establishe d (9.49 ) onl y fo r sum s ove r n i n dyadi c
segments. T o deriv e a resul t fo r shor t sum s w e shal l increas e X artificially , whil e
preserving th e dyadi c shap e o f th e rang e o f th e characte r su m (thi s ide a i s du e t o
E. Bombieri) . W e begi n wit h th e inequalit y
] T logp ^ ^2 logp-log/e g ^ \P
P<p^2P P<p^2P
p\kq
which hold s fo r P ^ 9 log kq. Henc e th e lef t sid e o f (9.49 ) i s bounde d b y
2 lo u 2
~P E( SP) E / I E a nX(pn)(pn) dt.
tJU
p<P^2P xen(k,Q) x<n<:2X
9. T H E D I R I C H L E T P O L Y N O M I A L S 241
Here we used th e multiplicativit y o f our harmonic s x( n)n%t t o replace n b y m = pn.
Now m range s ove r th e dyadi c segmen t pX < m ^ 2pX, s o applying (9.49 ) wit h X
replaced b y pX w e get a ne w estimat e
2
{PX + H 2 exp(-(PX/H)i)} ^ |a n|
X<n^2X
where P i s an y numbe r ^ 9logkQ. W e choos e P = ( 9 + HX~ 1 )(logH)2 showin g
that
u] 2
E/ | E *nX{n)n \dt^{X + H){\ogH) 2 £ |a n| .
xeH(k,Q)^° X<n^2X X<n^2X
Finally we replace the dyadic segment X < n ^ 2X b y the whole interval I ^ n ^ N
by subdividin g th e latte r an d applyin g Cauchy-Schwar z inequality . Thi s produce s
an extr a facto r log2iV , an d i t complete s th e proo f o f Theore m 9.1 2 . •
To ever y x G W ft Q) w e associate severa l point s (possibl y none )
(9.50) S r{x) = <Tr(x) + MX) w i t h 0 < <T r(X) ^ 1 , | * r ( * ) | ^ T.
Let S(k,Q,T) b e th e se t o f al l suc h point s counte d wit h relevan t multiplicity .
We sa y tha t <S(fc , Q,T) i s well-space d i f fo r an y s ri(xi), s r2(\2) h i 5(fc , Q,T) wit h
Oi,Xi) ¥= ( r2,X2) w e have either x i 7^X2 , or x i = X 2 = X and \t ri(x)-tr2(x)\ ^ 1 -
Clearly, i f 5(fe , Q,T) i s well-spaced, the n it s cardinalit y satisfie s
(9.51) R = \S{k, Q,T)| < 3A;Q 2r = 3ff .
Nowhere in the proof o f Theorem 9.1 2 did we use essentially th e continuous measur e
in t , therefor e th e sam e estimate s hol d tru e i f on e replace d th e integratio n b y a
summation ove r an y se t o f well-space d point s £ r (x)- The n on e ca n chang e n ltr^
into n~ Sr^ a t th e cos t o f a n extr a facto r (lo g 27V)2 i n (9.41 ) b y th e argument s
from th e proo f o f Propositio n 9.1 1 . Thi s wa y Theore m 9.1 2 become s
THEOREM 9.1 3 . Let S = S(k,Q,T) be a well-spaced set of points (9.50). For
any complex numbers a n we have
(9.52) Yl I E a nX(rc)rc~ ar(x) | 2 <G(7 V + # ) £ 5
sr(x)es I ^ J V
where the implied constant is absolute.
Theorem 9.1 3 i s a n extensio n o f Theore m 9.4 . Nex t w e establis h a n extensio n
of Theore m 9.6 , namel y
THEOREM 9.1 4 . Let S = S(k,Q,T) be a well-spaced set of points (9.50) of
cardinality R. For any complex numbers a n we have
3
(9.53) Yl I E anX(n)n- 'M\2 <&G(N + RHi)£*
where the implied constant is absolute.
242 9. T H E D I R I C H L E T P O L Y N O M I A L S
P R O O F . Thi s goe s b y modification s o f ou r argument s fro m th e proo f o f Theo -
rem 9. 6 an d Theore m 9.1 2 . Firs t w e assum e tha t cr r(x) — 0> i- e-> s r(x) = ^r(x)-
Then b y dualit y w e reduce th e proble m t o th e estimatio n o f
(9.54) V = ]T / ( „ ) |£ ]T cx(tr)X(n)n^\2
r
nX
for an y comple x number s c x (t r ), wher e f(n) is any non-negativ e functio n o n R +
such tha t / ( n ) ^ 1 if 1 ^ n ^T V (in wha t follow s w e assum eT V ^ 2 , otherwise
(9.53) i s trivial). Th e dua l for m satisfie s
(9.55) V < X i E E E h x . ^ i K ^ r J i ?^ -tr 2,XlX2)\
Xi Xi n r 2
where B(t, x ) i s given b y (9.44) . A t thi s poin t w e are goin g to refin e th e estimatio n
(9.47). W e writ e
L
(9.56) B(t, X) = TT: f (s- it , X)f(s)ds
27TZ J ( 2 )
where L(s,x ) i s the Dirichle t L-functio n an d f(s) is the Melli n transfor m o f /
/•OO
f(s) = / f(x)x 1 s
- dx.
Jo
Moving th e integratio n t o th e imaginar y lin e w e deriv e
(9.57) B(t, X) = Sx^f-f(l + zt) + 0(£H\t\ + l)*(logJW(|t| + l)) 2 ).
Here th e leadin g ter m come s fro m th e pol e o f L(s — it, x) a t s = 1 + it if x i s
principal (o f course , i t agrees wit h tha t i n (9.47)) an d th e erro r ter m i s obtaine d
from tw o estimate s
(9.58) L(s,x ) < ^ ( W + 1)* log*(|s| + 2)
(see Exercis e 9 in Sectio n 5.9 ) an d
ls|
(9.59) / ( 5 ) < e " logTV
on R e s = 0. T o se e th e latte r tak e
(9.60) f(x) = h[e-x'N - e~ x)
so f(s) = 5T(s)(TV s - 1 ) . Henc e w e als o ge t / ( l + it) < TVe"'*' . No w (9.57 ) show s
that th e dua l for m satisfie s
Xr
and thi s prove s (9.53 ) wit h th e logarithmi c facto r C 2 i n plac e o f £ 4 , bu t onl y i f al l
the point s s r(x) are imaginary . The n w e change n lt int o n~ a~lt a t th e cos t o f tw o
extra logarithm s completin g th e proof . •
There ar e tw o essentia l ingredient s i n ou r proo f o f Theore m 9.1 4 , namel y th e
reduction t o th e dua l for m (9.54 ) followe d b y the estimatio n (9.57 ) o f the characte r
9. T H E D I R I C H L E T P O L Y N O M I A L S 24 3
sum B(x,t). However , thi s estimatio n i s no t th e bes t possible . Assumin g th e
Lindelof hypothesi s
(9.61) L(s,x)<&Ws\Y
for R e s = \ an d x(nio d £) one deduce s (b y movin g th e integratio n t o th e critica l
line rathe r tha n t o th e imaginar y one ) tha t
<p]f)
(9.62) B(t, x) = tfx^V/(I + it) + 0(N*e°(\t\ + l) £ ).
This yield s
SrM S
(9.63) J2 I Yl anX(n)n- \< C G(iV + RN*H )C4
in plac e o f (9.53) . Th e result s (9.53 ) an d (9.63 ) ar e du e t o H . Montgomer y [Mo2] .
9.6. Th e reflectio n method .
The Lindelo f hypothesi s ma y be an open problem fo r a long time. Nevertheless ,
M. N . Huxle y [Hu2 ] ha s succeede d t o establis h unconditionall y a resul t whic h i s
almost a s goo d a s (9.63 ) i n som e importan t ranges . Firs t h e di d i t fo r polynomial s
D(s) i n Corollar y 9. 9 b y th e subdivisio n trick . However , thi s tric k doe s no t mak e
sense fo r polynomial s D(s,x) twiste d b y characters . Fo r thes e polynomial s Huxle y
invented a completel y differen t brillian t tric k whic h h e call s th e reflectio n metho d
for reason s soo n t o b e clear . Usin g Huxley' s idea s w e ar e goin g t o prov e
THEOREM 9.1 5 . Let the conditions of Theorem 9.1 4 hold true. Then
sAx)
(9.64) ] TI Y, a nX(n)n- \< G(N + iJ§ H*N*)£ 6
where the implied constant is absolute.
For th e proo f w e can assum e tha t th e coefficient s a n ar e supporte d i n a dyadi c
segment N ^ n ^ 2N ', an d a s before , w e ca n als o assum e tha t al l th e point s s r (x)
are o n th e imaginar y line . Th e latte r simplificatio n cost s a los s of factor (lo g 27V)2,
so w e must sho w (9.64 ) wit h C 4 i n plac e o f £ 6 .
The proble m reduce s t o th e estimatio n o f th e dua l for m V. I n th e proo f o f
Theorem 9.1 4 w e have estimate d V fo r an y comple x coefficient s c x (t r ), however , i t
is sufficient t o dea l wit h
a
(9.65) c x(tr) = D(s r(X),x) = ^2 nX(n)n-s^K
n
Precisely, lettin g C be th e lef t sid e o f (9.64) , w e hav e
2
(9.66) C ^ VG
(see the derivatio n o f the dualit y principl e i n Chapte r 7) . No w we are goin g to tak e
advantage o f these specia l coefficient s (9.65) . B y (9.55 ) w e are agai n face d wit h th e
character su m B(x,t). W e shal l no t improv e th e erro r ter m i n (9.57 ) bu t rathe r
replace i t b y anothe r characte r sum ; i n othe r words , w e us e a kin d o f summatio n
24 4 9. T H E D I R I C H L E T P O L Y N O M I A L S
formula (se e Chapter 4 , i n particula r (4.26)) . W e derive suc h a formul a fro m a
functional equatio n fo r the corresponding L-function .
Without los s of generality, w e can assume tha t al l the characters i n C have the
same parit y (b y dividing C into tw o sums i f necessary), s o the character \ — X1 X 2
in B(x,t) i s alway s even . I f x ( m o d £) i s primitive , the n w e have th e functiona l
equation (4.73 )
s
(9.67) L(s, X)=ex"f(s)el- L(l-s,x)
where \e x\ = 1 an d
(9.68) 7(s )= ^ - l r ( ^ ) / r ( | ) .
Suppose x(mod £) is induced by the primitive character x*(mo d £*) with £*\£. Then
L( 5 , X ) = L ( 5 , x * ) n ( 1 - X * ( p ) P ~ s ) .
p\£
Hence th e functiona l equatio n (9.67 ) fo r L(s,x* ) yield s th e followin g functiona l
equation fo r L(s, x),
(9.69) L{s,x) = e x^(s)P(s)L(l-s,x)
where e x = s x* and
X
(9.70) P(s) = {t)^ s I t( - X*(P)P~*)( l ~ X'WP*- 1
1
)- .
p\£
Notice tha t 7(5) , P(s) ar e holomorphic i n Re s < 1 , that |P(s) | = 1 on Re s = \,
whence b y the convexity boun d o r Phragmen-Lindelof principl e
(9.71) \P(s)\ ^£^~ a i f cr = R e (s) ^ \.
Moreover, L(s , x) is holomorphic on C except fo r a simple pole at s = 1 with residu e
(f(£)/£ i f x ( m ° d £) i s principal. Therefor e b y movin g th e integration i n (9.56 ) t o
the lin e R e s = — e an d applying th e functional equatio n (9.69 ) w e obtain
(9.72) B(t, x) = S x^f-f(l + it) + exB*(t, X )
where
(9.73) B*(t, x)
= - L / " l{s~ it)P(s - it)L(l - s + it, x)f(s)ds.
2
™ J(-e)
Expanding L( l — s + zt, x) int o th e Dirichlet serie s and integrating termwis e w e get
00
(9.74) B*(t, X) = ^ 3Hx(m)m-5-1 (
where
s
(9.75) g(y) = -^/ 7 (s - it)P(s - it)f{s)y -^ds.
Notice w e moved th e integration fro m R e s = — e i n (9.73 ) bac k t o R e s = ^ in
(9.75) a s we can because 7(5 ) and P(s) hav e n o poles i n Re s < 1.
9. T H E D I R I C H L E T P O L Y N O M I A L S 245
The formul a (9.72 ) i s the on e w e were talking about , bu t i t i s not ye t read y fo r
applications becaus e th e dua l functio n g(y) i s no t evaluate d explicitl y enough . T o
see it s propertie s w e tak e agai n th e majoran t functio n f(x) give n b y (9.46 ) wit h
X = iV , s o it s Melli n transfor m i s
l 5 2
(9.76) f{s) = J exp(^----y- dx = 2e / Ks(2)Ns
where K s(y) stand s fo r th e Bessel-Macdonal d function . Henc e f(s) ha s exponentia l
decay o n an y vertica l lin e s = a + it. Usin g analyti c propertie s o f / ( s ) , 7(5 ) an d
P(s) on e shows , b y movin g th e integratio n i n (9.75 ) sufficientl y fa r t o th e lef t (a s
in (1 0.64)) , tha t
(9.77 ) ^)«^M-K^T))' )
where th e implie d constan t i s absolut e (compar e thi s estimat e wit h (9.45)) . Henc e
^(ra) i s very smal l i f mN > £(\t\ + l)(logiW(|t | + l)) 3 .
In ou r cas e \t\ ^ T and £ < kQ2, s o th e term s o f B*(t,x ) wit h mN ^ HC 3
contribute ver y little , precisel y w e ge t b y (9.77 )
1
(9.78) B*(t, X)= ^ s H x H r n ^ - '+ Oir )
where
3
(9.79) MN = HC
and th e implie d constan t i s absolute .
an
To mak e (9.78 ) read y fo r application s i t i s desire d t o separat e m fro m \ dt
involved i n g(m). W e accomplis h thi s goa l quickl y b y introducin g (9.75 )
1
(9.80) B*(t, X) = ^-. I1 {s)P{s)f{s + it){ Y. x t m K ^ H O r ) .
Since \K 3(2)\ ^ 8|T(s)| < C e~^ fo r s = \ + iu, w e ge t
oo . .
/ +i(t
J2 x(m)m-* +u)le-^du + H- 1
where th e implie d constan t i s absolute .
This resul t i s mor e precis e tha n (9.57) . Indeed , estimatin g triviall y b y (9.81 )
one get s
(9.82) B * ( i )X)<ifi£§
which recover s (9.57) . Bu t on e ca n d o bette r b y puttin g th e su m
(9.83) Yl Xi&Mm-i+W-i-t-j+t. )
in th e origina l polynomia l
u
(9.84) Yl ^nXi(n)n ^
N<n^2N
246 9 . T H E DIRICHLE T POLYNOMIAL S
which appear s a s a coefficien t i n (9.55 ) (her e an d hereafte r w e use th e abbreviate d
notation t r = t r(x)). B y (9.55) , (9.72 ) an d (9.81 ) w e deriv e
Xr lr 2
oo
T> _ H
/ EE E E \ 1 , Xl
(^r2,X2)V(itr ! »*r 2 , X2 5 « ) | e d«
~°° X i X 2 r i r 2
+tf~1EEEEip^,xi)z>(^2,x2)i
Xi X 2r ir 2
where V(it ri, Xi ? ^r2 > X2; ^) denote s th e produc t o f the tw o sums (9.83 ) an d (9.84) .
For fixed X2,tr 2iu thi s produc t i s a ne w polynomia l o f typ e (9.39) . Th e lengt h o f
P ( i t r i , X i ) = V(it ri,xi'->ttr2iX2',u) bein g 2MN = 2HC S i s perfec t fo r a n applica -
tion o f Theore m 9.1 3 . W e obtai n
E E i m - ^ x i ) ! < ( ^ E E ip ( ^ ^ i ) i 2 ) " «(i?GF)i£ 6.
r
Xi ^ iX i i
After thi s bound , whic h doe s no t depen d o n X2^2,^ 5 w e estimat e
EEi^^'^i^^EEi25^'^)!2)^^)^
X2 r 2 X 2r 2
Integrating i n u, w e find that th e contributio n o f these polynomial s t o D i s at mos t
0(R(CGHN)zC5). Th e polynomial s fro m th e diagona l x i = X 2 = X contribute
r
xn 2
and th e remainin g one s contribut e
jff_1
EEEEi I) ^'^) I3 ^'^)i <<F " liiC<<c -
Xi X 2r ir 2
Adding u p thes e contribution s w e obtai n P < C NC + R(CGHN)z £ 6 . Henc e w e
derive b y (9.66 ) tha t C ^ GN + GRi{HN)^C A. Thi s complete s th e proo f o f
Theorem 9.1 5 .
9.7. Larg e value s o f D(s,x)>
The notatio n fro m previou s sectio n hold s here , i n particular , w e recal l tha t
H = kQ 2T an d C = lo g HN. I n thi s sectio n w e deduc e a fe w bound s fo r th e
number o f larg e value s o f th e polynomial s D(s,x)- First , a s i n Theore m 9.7 , w e
deduce fro m (9.52 ) th e followin g
na
THEOREM 9.1 6 . Suppose for any s r (x) ^ well-spaced set S(k, Q , T) we have
(9.85) \D(s r(X),x)\>V.
Then the cardinality of this set, say R = \S(k,Q,T)\, satisfies
(9.86) i ? « ( i f + iV)Gr 2£5
where the implied constant is absolute.
9. T H E D I R I C H L E T P O L Y N O M I A L S 24 7
If V i s relatively large , the n (9.53 ) yield s a bette r boun d (fo r details se e th e
proof o f Theorem 9.8) , namel y th e following
THEOREM 9.1 7 . Let the conditions be as in Theorem 9.1 6 with
3
(9.87) V^G^H^C .
Then
2 4
(9.88) R < GNV~ £
where the implied constant is absolute.
The estimat e (9.88 ) i s idea l fo r applications , unfortunatel y i t i s establishe d
only for very larg e V relativ e to the "conductor" H. On e can remove the condition
(9.87) a t a cost o f introducing a n extra ter m i n the bound (9.88) . T o this en d w e
apply (9.64 ) an d get the following resul t
THEOREM 9.1 8 . Let the conditions be as in Theorem 9.1 6. Then
2 3
(9.89) R < (GNV~ + G NHV-6)C
1 S
where the implied constant is absolute.
The las t estimat e (9.89 ) i s the celebrated resul t o f Huxley [Hux] . I t is surpris-
ing t o see that i n the special cas e o f the Riemann zet a function , hi s subdivisio n
argument yield s a bound (9.28 ) whic h agree s wit h (9.89 ) obtaine d b y very differen t
arguments (reflectio n method) .
The assumption tha t th e ordinates t r(x) o f the points s r(x) — 0"r(x)+^r(x ) a r e
well-spaced ca n be relaxed, i t suffice s tha t the y d o not cluster to o much. Suppos e
that fo r any sr (x) £ S{k,Q,T) w e have
(9.90) \{n ; sri(X) G S(k,Q,T), \t ri - t r\ < 1 } | ^ L.
Then clearl y al l Theorem s 9.1 3 to 9.1 8 are true fo r th e set S(k,Q,T) wit h th e
relevant uppe r bound s multiplie d b y L (t o see this divid e th e set S(k, (J , T) int o
at mos t 2L subset s o f well-spaced point s an d apply th e results fo r eac h o f thes e
subsets separately) .
In particular , w e know tha t th e number o f zeroes p — ji + ry of L(s, x) f° r a
given characte r x ( m ° d kq) with | 7 — t\ ^ 1 is bounded b y 0(\ogkq(\t\ + 3) ) (see
Chapter 5) . Therefore , b y virtue o f the above remark , al l Theorems 9.1 3 to 9.18
hold fo r any subset <S(fc , Q, T) o f zeros of
L
(9.9i) n n n ^O
q^Q ^ ( r a o d q) £(mo d k)
(q,k) = l i\) primitiv e
in a rectangle a ^ a , \t\ ^ T (counte d wit h multiplicity ) wit h jus t on e extr a facto r
£, an d G replaced by
(9.92) G(a) = J2 K\ 2n~2a
in the relevant uppe r bounds .
In th e next chapte r w e apply thes e theorem s fo r selecte d point s s r(x) whic h
are zero s o f (9.91).
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/11
CHAPTER 1 0
ZERO-DENSITY ESTIMATE S
10.1. Introduction .
In the absenc e of a proof o f the Gran d Rieman n Hypothesis , i t i s natural t o as k
how many zeros of a given L-function ca n lie off the critical line. Sinc e we know there
are n o zero s o n th e lin e R e (s) = 1 , a fe w suc h zero s woul d no t normall y influenc e
applications, bu t a larg e numbe r o f these , especiall y whe n the y ar e nea r thi s line ,
may distor t substantiall y th e result s (a n exampl e bein g th e distributio n o f prime s
in shor t intervals , se e Sectio n 1 0.5) . Therefore , w e shoul d as k ho w man y zero s
can possibl y b e i n a give n region , an d ou r estimate s shoul d revea l tha t th e furthe r
the regio n i s fro m th e critica l line , th e smalle r th e numbe r o f zero s i t contains . A
quantitative statemen t o f thi s kin d i s called a zero-densit y theorem .
The region s w e need mos t i n practic e ar e th e rectangle s
(10.1) R(a,T) = {s = a + it; a ^ a, \t\^T}
for \ ^ a ^ 1 and T ^ 3 . Le tT V (a, T) denot e th e numbe r o f zero s o f th e Rieman n
zeta function i n R(a, T) (counte d wit h corresponding multiplicity) , tha t i s we count
the zero s p = (3 + ij wit h
(10.2) p > a, | 7| < T.
In thi s notatio n th e Rieman n hypothesi s assert s tha t N(a, T) = 0 fo r an y a > \
and T ^ 3 , while the bes t know n zero-fre e regio n (du e to Vinogradov an d Korobov )
says tha t
2/3
(10.3) N(a,T) - 0 i fa > 1 - c(logT)~ (loglogT)1 /3
where c i s a n absolut e positiv e constan t (se e Chapte r 8) . Recal l tha t th e numbe r
of al l zero s p = ft - h 27 with \j\ ^ T satisfie s (se e Chapte r 5 )
(10.4) N(T) = - lo g -^ + O(logT) .
7r Zire
By analog y t o th e convexit y principl e concernin g th e orde r o f £(s ) o n vertica l line s
one ma y expec t th e followin g boun d fo rA T (a, T) t o b e tru e
DENSITY CONJECTURE . For \ ^ a ^ 1 and T ^ 3 we have
2(1
(10.5) 7V(a , T) < T - a ) lo g T
where the implied constant is absolute.
Of course , th e convexit y principl e fo r zero s i n rectangle s doe s no t hol d fo r gen -
eral analyti c functions ; nevertheless , th e densit y conjectur e fo r £(s ) ha s a reliabl e
249
25 0 10. ZERO-DENSIT Y ESTIMATE S
backup - th e Rieman n Hypothesis . Thi s conjectur e i s particularl y attractiv e be -
cause i t ha s a chance t o be proved b y th e technolog y whic h i s available today , whil e
it replace s th e Rieman n hypothesi s i n variou s application s t o th e distributio n o f
primes.
A grea t dea l o f effor t wa s mad e t o establis h estimate s o f typ e
c
(10.6) 7V(a , T) < T ^ ) (1 " a ) ( l o g T ) A
with c(a) a s smal l a s possibl e (th e logarithmi c facto r i s of secondar y concer n here ,
although i t wil l b e importan t t o eliminat e i t i n Chapte r 1 8) . Th e firs t zero-densit y
results wer e prove d b y Boh r an d Landa u [BL ] i n 1 91 4 . Thei r ide a wa s basi c i n
many work s ove r thre e decades , notabl y i n th e work s b y F . Carlso n [Car ] an d A .
E. Ingha m [Ing] . Tw o ne w device s wer e introduce d b y P . Tura n i n 1 949 , an d i n
collaboration wit h G . Halas z i n 1 958 . Grea t refinement s wer e mad e b y H . Mont -
gomery i n 1 969 . Th e othe r significan t contributor s t o thi s fascinatin g theor y ar e
M. N . Huxley , M . Jutil a an d D . R . Heath-Brown .
In this book we shall try to give a glimpse of the current stat e of knowledge. Ou r
results will no t b e a s stron g a s th e bes t know n ones , thoug h nearl y so . Moreover ,
we treat i n considerabl e detai l th e zero s o f Dirichlet L- functions. Le t N(a,T, \) b e
the numbe r o f zero s p x = f3 x + i^ x o f L(s , x) wit h fi x > a an d |7 X | ^ T (counte d
with multiplicity) . Th e ultimat e goa l woul d b e t o prov e th e followin g
GRAND DENSIT Y CONJECTURE . Let / c ^ l , Q ^ l , T ^ 3 and \ ^ a ^ 1 .
Then
a
(10-7) E E E N(a^^)^H^- \logH)A
q^Q -0(mo d q) £(mo d k)
(q,k) = l -^primitiv e
where H = kQ 2T and A is an absolute constant, the implied constant being also
absolute.
Our mai n result , th e Gran d Densit y Theore m 1 0.4 , will give (1 0.7 ) i n the rang e
| < a ^ 1 . Moreover , i t yield s (1 0.7 ) i n th e whol e segmen t \ < a ^ 1 , bu t wit h
the constan t c = 1 2/ 5 i n plac e o f 2 .
10.2. Zero-detectin g polynomials .
First w e outline th e ol d and ne w strategies. Give n a holomorphic functio n L(s)
in a regio n D C C ther e i s a variet y o f tool s fo r countin g it s zero s insid e D. A
typical on e i s th e integra l formul a o f Jense n
R
,f
/ A n , ,dr I f , \L(s)\ds
10-8 / n(r)— = — / l o g - ) - (-
Jo r 2<KI J lsl=R IL(0) Is
where n(r) denote s th e numbe r o f zero s o f L(s) i n th e dis c |s | ^ r (assumin g
L(s) ^ 0 fo r s = 0 an d fo r s o n |s | = R). Ther e ar e simila r formula s fo r othe r
domains. Not e tha t on e need s bot h uppe r an d lowe r bound s fo r \L(s)\ o n th e
boundary t o deduc e a n estimat e fo r th e numbe r o f zero s insid e th e region . O f th e
two, th e lowe r boun d i s th e hardes t t o find. I n vie w o f thes e requirement s an y
integral formul a i s practical onl y fo r a regio n sufficientl y wid e s o that it s boundar y
is distanced fro m th e area in which the zeros are concentrated (otherwis e one canno t
10. Z E R O - D E N S I T Y E S T I M A T E S 251
claim a reasonable lowe r boun d fo r \L(s)\ o n dD). Bu t thi s i s not the cas e fo r the
narrow rectangle s R(a, T).
To avoi d th e abov e barrier s on e get s th e following idea : multipl y L(s) by an -
other holomorphi c function , sa y M(s), whic h i s suspected t o be large whe n L(s)
is small , an d coun t th e zero s of the produc t N(s) = L(s)M(s) rathe r tha n o f L(s)
alone. O f course , th e extr a zero s of M(s) will occur , bu t hopefully no t s o many as
to chang e th e orde r o f magnitude o f the origina l numbe r ( a positive proportio n o f
the extr a zero s is acceptable).
In realit y th e ide a work s onl y for counting "fictitious " zero s because N(s) doe s
vanish at the true zeros of L(s). Thu s the points at which we want th e mollifier M(s)
to be large ar e onl y hypothetical . Havin g n o possibilit y t o inspect th e distributio n
of thes e hypothetica l point s on e ma y tr y M(s) whic h mimic s 1 /L{s) a t almost all
points, s o N(s) is expected t o be close to one almos t everywher e i n a given region .
It woul d b e tautological t o tak e jus t 1 /L(s) fo r the mollifier M(s) , o r naiv e t o
deform 1 /L{s) smoothl y t o get a useful holomorphi c function . Bu t suppos e 1 /L(s)
is give n b y an absolutely convergen t serie s i n a certain region . The n th e chances
are that it s partial sum s d o well approximate t o \/L(s) uniforml y i n a wider regio n
which doe s no t contain th e true zero s o f L(s). Therefore , a finite bu t sufficientl y
long partia l su m o f such kin d offer s a good candidat e fo r M(s).
The abov e strateg y serve s wel l fo r counting fictitious zero s o f Dirichlet serie s
with multiplicativ e coefficients .
First t o illustrate th e ideas w e giv e a quick treatmen t o f L(s) = C( 5)- I* 1 this
case
1 /i(ra )
c(s) Alt ps) z ^
but onl y if Re s > 1. Nevertheless, on e expect s tha t th e partia l su m
(10..) M W . £ <g >
approximates t o l/((s) i f Re s = a > | . Indeed , th e Riemann hypothesi s ensure s
that
£
(10.10) - ^ = M(s) + 0{(\s\M) Mi-a).
REMARKS. Th e partia l su m M(s) was firs t use d i n th e contex t o f zero-density
estimates b y Carlson [Car ] whil e Boh r an d Landa u [BL ] wer e working earlie r wit h
the partia l produc t
i
(Ian) ^)=n( -i)-
Either devic e produce s non-trivia l results , yet M(s) is more efficien t tha n P(s).
It help s to keep hol d o n th e siz e of the points , s o we restrict ou r analysi s to the
region
(10.12) 5 = c r + zt, c r ^ a , T<\t\^2T,
25 2 10. Z E R O - D E N S I T Y E S T I M A T E S
and w e assume tha t T i s large. I n thi s regio n
n
(10.13) C ( s ) = E ~ S+ °( T
~a)
where th e implie d constan t i s absolute . Henc e
s 1a
(10.14) C(s)M(s)= Y^ a nn~ ^0(T~ M -a\og2M)
n^TM
where th e coefficient s ar e
(10.15) a n= ^ /x(m) , |a n|^r(n),
dm=n
m^M,d^T
and th e erro r ter m i s obtained b y th e trivia l estimatio n
1 _a
(10.16) M(s) < M lo g 2M.
Assuming H M ^ T w e hav e a\ = 1 and a n = 0 i f 1 < n ^ M o r n > MT. W e
cover th e interva l M < n ^ MT b y dyadi c segment s N < n ^ 2N wit hT V = 2 £M,
0 ^ £ < L = [lo g T/ lo g 2]. Fo r eac h o f thes e segment s denot e
a
(10.17) D £(s)= Yl -
N<n^2N
Thus w e hav e
(10.18) C(s)M(s) = l+ Yl De{s) + E(s)
O^KL
where th e erro r ter m satisfie s \E(s)\ < \ fo r s i n th e regio n (1 0.1 2) , provide d
a
Mi-a ^ T (logT)"2.
An importan t featur e o f th e identit y (1 0.1 8 ) i s th e absenc e o f term s wit h 1 <
n ^ M o n th e righ t side . Th e lef t sid e a t a zer o o f £(s) , sa y a t s = p , vanishes s o
we mus t hav e
D
(10.19) | J2 ^P) >k.
Hence
1
(10.20) \D e(P)\ > (2L)-
for som e 0 ^ £ < L. Thi s lowe r boun d i s rathe r larg e b y compariso n t o th e mean -
value o f \Di(s)\ 2 o n th e lin e R e s = a. Indee d w e hav e
2a
(10.21) G £(a) = J2 K\ 2
n~2a « A^- (log2A0 3 .
N<n^2N
For thi s reaso n w e cal l Di{s) zero-detectin g polynomials . Wha t w e showe d i s tha t
every zer o o f £(s ) i n th e regio n (1 0.1 2 ) i s detecte d b y a t leas t on e o f a fe w fixed
polynomials b y wa y o f producin g unusuall y larg e value .
10. Z E R O - D E N S I T Y E S T I M A T E S 253
R E M A R K S . O f course, th e lower boun d (1 0.20 ) ma y not hold a t point s othe r
than th e zeros o f £(s). I n fact, on e should expec t tha t
(10.22) D < N* -ot+e
£(s)
for an y s in the region (1 0.1 2) . Comparin g (1 0.22 ) wit h (1 0.20 ) on e could ge t a
contradiction whic h prove s th e Riemann Hypothesis . Well , thi s argumen t i s noth-
ing bu t wishfu l thinkin g becaus e on e needs th e Riemann Hypothesi s t o establis h
(10.22). Thes e remark s revea l tha t th e concept o f a zero-detecting polynomia l is
somewhat superficial , i t will loos e it s meaning whe n th e GRH i s fully established .
In realit y Dirichle t polynomial s d o not assume larg e value s ver y often .
Now the theory of Dirichlet polynomial s as developed in Chapter 9 tells us that
(10.20) canno t happe n to o often, i n particular, producin g a n interesting estimat e
for th e number o f zeros o f the zeta function . Precisely , le t Ri be the number o f
zeros of ((s) i n (10.12) whic h are detected b y the polynomial Dt(s). The n Theore m
9.7 (o r Theorem 9.1 6 ) togethe r wit h th e closing remark s o f Chapter 9 yield
(10.23) R £ < ( T + A O A ^ - ^ l o g T )11
by (1 0.20 ) an d (10.21). Sinc e M ^N < MT, thi s give s
(10.24) R £ < {TM 1 -2* + ( T M ) 2 - 2 a } ( l o g T ) n
for an y 0 < £ < L. Choosin g M = T2^1 w e ge t
4a(1
(10.25) R e < T -a)(logT)n.
Adding u p the Ri we get an upper boun d fo r all zeros o f ((s) i n (1 0.1 2 ) (som e of
these ma y be detected b y several polynomial s D^(s)) , the n w e extend th e result t o
the zero s i n (10.2), an d conclude
THEOREM 1 0.1 . For \ ^ a ^ 1 and T > 2 we have
(10.26) N(a,T) < T 4 c * ( 1 - a )(logT) 1 3 .
This estimat e (apar t fro m th e logarithmic factor ) wa s obtained b y F. Carlso n
[Car] i n 1920 (se e also E. Landau [La3 ] of 1921). Twent y year s later A . E. Ingham
[Ing] prove d
(10.27) N(a,T) < T 3^(logT)5.
Either (1 0.26 ) o r (10.27) impl y tha t th e Riemann hypothesi s i s "almos t true " in
a statistica l sense , namel y tha t almos t al l zeros o f £(s) lie arbitrarily clos e t o the
critical line . I t follows b y (10.27) tha t th e number o f zeros p — / ? + ry with J7 J ^ T
and
+ 3
(10-28) ^ > i ^
is bounde d b y 0(T(logT) 5-4A) fo r A > 1.
Concerning th e zeros nea r th e critical lin e in 1942 A. Selberg [S4 ] prove d tha t
(10.29) N(a, T)<^{a-\)- lT, i f a > \,
25 4 10. Z E R O - D E N S I T Y E S T I M A T E S
where th e implie d constan t i s absolute . Hence , give n a functio n $(T )— > co a s
T—> • co, i t follow s tha t almos t al l zero s o f th e Rieman n zet a functio n li e i n th e
region
do.30 ) ^-il<^ -
10.3. Breakin g th e zero-densit y conjecture .
It i s amazin g ho w lon g th e estimat e (1 0.27 ) o f Ingha m resiste d improvements .
After sixt y year s i t i s still th e bes t know n resul t (apar t fro m th e logarithms ) i n th e
range | < a < | . Th e ne w methods , whic h emerge d fro m th e wor k o f Halas z an d
Turan [HaTu ] i n th e lat e sixtie s an d flourished i n th e seventies , produce d a ne w
estimate fo r th e numbe r o f larg e value s o f Dirichle t polynomials , namel y (9.28 ) i n
place o f (9.25) . Thi s ne w estimat e turne d ou t t o b e bette r tha n th e ol d on e fo r
treating zero s nea r th e lin e R e s = 1 . Firs t th e estimat e (1 0.27 ) wa s improve d i n
the rang e | < a < l b y H . L . Montgomer y [Mo3 ] i n 1 969 , an d shortl y afte r i n
the rang e | < a < l b y M . N . Huxle y [Hu2 ] i n 1 97 2 (se e a mor e genera l Theore m
10.4).
For th e zero s o f ((s) nea r th e lin e R e s = 1 on e i s als o helpe d b y havin g
shorter partia l sum s whic h approximat e t o £(s) . I n thi s sectio n w e us e a suitabl e
short polynomia l t o establis h a zero-densit y estimat e whic h i s even bette r tha n th e
zero-density conjectur e fo r a > | .
THEOREM 1 0. 2 ( H U X L E Y ) . For any a > | and T ^ 2 we have
(10.31) N(a, T) < T 3-^ (log T) A
where A = 300( a — | ) ~ 2 and the implied constant depends only on a.
REMARKS. Actuall y Huxle y prove d (1 0.31 ) fo r an y a ^ \ wit h A an d th e
implied constan t bein g absolute . W e coul d refin e ou r argument s t o ge t (1 0.31 ) i n
wider rectangle s an d wit h absolut e constants , bu t i t woul d obscur e th e presenta -
tion. Beside s provin g (1 0.31 ) ou r missio n i s also to introduc e neatl y som e technica l
novelties du e t o M . Jutil a [Ju4] . H e realize d tha t i t wa s sufficien t t o tak e th e
mollifier M(s) quit e short . Then , t o achiev e bes t results , h e make s a n adjustmen t
by raisin g th e zero-detectin g polynomial s t o appropriat e hig h powers . Thi s ide a of
Jutila i s somewha t simila r t o th e wor k o f D . R . Heath-Brow n o n hi s identit y fo r
primes (1 3.37) .
By WeyP s boun d (8.1 8 ) (th e va n de r Corpu t boun d (8.53 ) woul d als o d o th e
job) w e deriv e
(10.32)CO O = Yl n
~S + °(x*~aT* l o £ T )
for s = a + it wit h \ < a < 1 , T < \t\ < 2T , wher e 1 ^ X ^ T an d th e implie d
constant i s absolute . Henc e
s
(10.33) ((s)M(s) = 1 + ] Ta nn~ + E(s)
M<n^MX
10. Z E R O - D E N S I T Y E S T I M A T E S 255
where 1 < M ^ X, \a n\ < r(n) (see (1 0.1 5 ) wit h T replaced b y X) an d
2
(10.34) E(s) < M^'Xi-'Ti (logT) .
We requir e
(10.35) |E( S )| < \
for 5 in th e rectangl e (1 0.1 2) , an d thi s hold s i f
(10.36) X a
~i ^ M 1
-^(logT)3
because T is assume d t o b e large .
Now th e situatio n i s th e sam e a s in Section 1 0. 2 excep t tha t w e emplo y fewe r
zero-detecting polynomial s (1 0.1 7) ; thei r lengt h N = 2lM is restricted b y M ^
TV < MX (i.e. , 0 < £ < L = [log Xj log2]) . Recal l tha t R t denote s th e numbe r
of zero s o f ((s) in (10.12) satisfyin g (1 0.20) . W e shal l estimat e Ri by an appeal
to (9.31 ) wit h a suitable k ^ 2 rather tha n (9.30 ) whic h wa s use d wit h k = 1 fo r
deriving (1 0.23) . I n other words , w e appl y (9.28 ) fo r th e polynomia l Di(s) k. W e
choose k ^ 2 depending o nT V suc h tha t P = N h fall s int o th e segmen t
2
(10.37) Z <P^ (MX) + Zi,
where Z is a given numbe r t o be specifie d late r subjec t t o Z ^ MX. T o se e tha t
(10.37) i s possible tak e k ^ 2 such tha t Z 1 ^ < M < Z 1 / ^ " 1 ) , an d i n case fc =2
use als o ] V ^ MX. Th e mean-valu e o f \Di(s)\2k o n th e lin e R e s = a is bounded
by
G 2
(10.38) fc=( Z ) l«n| n - 2 a T , ( n ) ) / C «1
P - 2 - ( l o g P ) 4 f c ^1+ )
where th e implie d constan t depend s onl y o n k. Moreover , th e lowe r boun d V =
(2L)" 1 o f \Dt(s)\ a t the zer o i s raised t o the powe r V k = (2L)~ k > (logT)" fc .
Therefore (9.31 ) yield s
(10.39) R £ < (P 2^-*) + TP4-6a)(log£)^
where Ak = 6(f c + l)(12fc + l) an d th e implie d constan t depend s o n k. Her e it would
be perfec t t o choos e P = T 1 ^2(^2OL~1 \ producin g th e bes t boun d
rp(l-a)/(2a-l) Q rp\ A k
but i t is not alway s possibl e becaus e P i s not precisel y localize d (sinc e k had t o b e
an integer!) . However , w e have reasonabl y goo d contro l o f P give n b y th e interva l
(10.37) wher e Z is at our disposal . Insertin g th e uppe r boun d fo r P t o the firs t
term an d th e lowe r boun d fo r P t o th e secon d ter m i n th e righ t sid e of (10.40) w e
get
4(1
(10.40) Ri < {(MI) " a ) + Z 3 < 1 - a ) +TZ 4 6a
- }(logT)Ak.
Now i t is clear tha t th e bes t choic e i s Z = T 1 ^ 3 * - 1 ) . W e als o tak e MX =
Zi(logT)9 whic h yield s
(10.41) Ri <T^^(logT) Afc+6.
256 10. ZERO-DENSIT Y ESTIMATE S
We stil l hav e som e freedo m t o choos e M an d X subjec t t o th e conditio n (1 0.36) .
For th e valu e MX = A' 3 / 4 ( 3 Q - 1 ) (logT) 9 thi s conditio n i s satisfie d wit h
(10.42) M = j 1 ( 6 a - 5 )/ 1 2 ( 3 o ; - 1 )
(7 3a)/6(3a 1 }
(10.43) X = T - - (logT)9.
Finally, summin g (1 0.41 ) w e get (1 0.31 ) wit h A = A k + 8 = 6( K + 1 )(1 2K + 1 ) + 8,
where K i s th e maxima l k whic h i s neede d t o pu t P = N k int o (1 0.37 ) fo r M ^
N < M X , i.e. , K = 1 + [lo g Z/log M] ^ 1 + 2( a - f ) " 1 . Henc e A < 300( a - | ) " 2 .
Having sufficientl y stron g estimate s fo r exponentia l sum s on e ca n mak e a goo d
approximation t o £(s ) b y partia l sum s whic h ar e ver y shor t fro m whic h on e ca n
derive relatively shar p zero-densit y estimates . Thus , using the Vinogradov estimat e
(8.92) i t wa s show n b y Halas z an d Tura n [HaTu ] tha t fo r an y a nea r on e
(1
(10.44) iV(a,T ) <cT -a)i|los^-a^3.
EXERCISE 1 . Prov e alon g th e abov e line s tha t th e Lindelo f Hypothesi s implie s
12 ( + £
(10.45) % T ) « T ^
for \ ^ a < 1 and an y e > 0, th e implie d constan t dependin g onl y o n s.
EXERCISE 2 . Assumin g th e Lindelo f Hypothesi s an d th e Montgomer y Conjec -
ture M (se e (9.23)) , prov e alon g th e abov e line s tha t
£
(10.46) N{a,T)<^T
if a > \ fo r an y e > 0, th e implie d constan t dependin g o n a an d e.
10.4. Gran d zero-densit y theorem .
Now we are goin g to exploi t th e idea s o f the previou s sectio n an d th e result s of
Chapter 9 for polynomials with characters in their ful l force . W e consider character s
x(mod kq) wit h (k,q) — 1 suc h tha t \ — ^ wit h £ an y characte r t o modulu s k
and I/J any primitiv e characte r o f conducto r q. Her e k i s give n whil e q varies ove r
the segmen t 1 ^ q ^ Q. Moreover , w e kee p th e point s s — a - f it i n th e rectangl e
a ^ a , \t\ < T , wher e T ^ 3 . Le t N(a, k, Q, T) denot e th e tota l numbe r o f zero s of
all I/(s , x) counte d wit h multiplicit y subjec t t o th e abov e restrictions , i.e. ,
N
N(a,k,Q,T)= J2 E m
Y, MW)-
q^Q V( ocl q) £(mo d k)
(q,k) = l ^primitiv e
Put D = kQT, H = kQ 2T an d £ = 2 log D. B y th e classica l boun d
N(a,T,x)<£Tlog(kqT)
(see Theore m 5.24 ) i t follow s tha t 7V(a , fc,Q, T) < C HC, whic h w e shal l refe r t o a s
the trivia l bound .
In thi s sectio n w e shal l establis h variou s non-trivia l bound s fo r N(a, fc, Q, T)
which ar e generalization s rathe r tha n improvement s o f th e result s b y Ingha m an d
Huxley (compar e (1 0.72 ) wit h (1 0.27 ) an d (1 0.31 )) .
10. Z E R O - D E N S I T Y E S T I M A T E S 257
From the previous section it is clear that i f one has shorter Dirichlet polynomial s
approximating L(s,x) , the n a stronge r zero-densit y theore m ca n b e deduced . Th e
best unconditiona l approximatio n i s given b y th e approximat e functiona l equation ,
which, roughly speaking , represents L(s, x) a s a s u m o f two polynomials of length X
and Y respectively , wit h XY bein g th e conductor . Classica l formula s o f suc h kin d
(see e.g . [Lav] ) ar e quit e messy , s o w e rathe r develo p her e a differen t expressio n
for L(s , x) o f the sam e nature , bu t mor e friendl y fo r application s i n mind . Se e als o
Theorem 5. 3 fo r a genera l version .
The qualit y o f a n approximat e functiona l equatio n depend s o n th e choic e o f a
particular cut-of f function . W e hav e alread y experimente d wit h suc h approximat e
equations whe n studyin g th e reflectio n metho d i n Sectio n 9.6 . Ou r presen t settin g
is clos e t o tha t particula r one . On e ma y star t fro m an y smoot h functio n / o n R +
such tha t
(10.47) /(0 )- 1 and li m f(x) = 0
with f'(x) havin g rapi d deca y a s x tend s t o 0 o r oo. The n it s Melli n transfor m
/»0 0
1s
(10.48) f(s) = / f(x)x ~ dx
Jo
has a simpl e pol e a t s — 0 wit h residu e 1 and sf(s) i s a n entir e function . Indee d
we hav e
rOC
s
(10.49) sf(s) = - / f{x)x dx
Jo
which show s wha t w e claimed . Normall y w e woul d no t recommen d workin g wit h
any specific cut-of f function , bu t w e do so here for clarit y t o keep track of uniformit y
in severa l variable s sid e b y side . A nic e choic e i s
(10.50) /(* )= «/ exp(-y-- -\-H
^
where K is th e normalizin g constan t suc h tha t /(0 ) = 1 . W e shall se e fro m compu -
tations belo w tha t n~ l = 2K 0(2) = 2^2T F f(k)r-3(k) = 0.2277.. . . This functio n
satisfies
x
(10.51) 0 < f(x) < ne~
because e~ l/y < y, an d
1 /x
(10.52) 0 < 1 - f(x) < Ke~
because
(10.53) /(:r )+ / ( i ) = l .
By (1 0.49 ) fo r / give n b y (1 0.50 ) w e ge t
1s
(10.54) sf'(s) = Kf exp(-y--)y - dy = 2KK s(2)
25 8 10. Z E R O - D E N S I T Y E S T I M A T E S
where K s(z) i s th e Bessel-Macdonal d function . I n particular , b y th e powe r serie s
expansion o f K s(z) a t z = 2 we ge t
(10.55) sf(s) = - ^£ — \T(k=7) ~ f(kT7)J •
Hence w e see tha t f(s) i s od d
(10.56) f(s) = -f(-s).
Moreover, w e ge t th e unifor m boun d fo r s — a + it G C,
1|ff|
(10.57) / ( 5 ) < | s | - e-*|t|
where th e implie d constan t i s absolute .
Now w e ar e read y t o deriv e th e approximat e formul a fo r L(s,x) m question .
We assum e tha t s = a + it wit h \ < a < 1 and x i s a characte r o f modulu s £ ^ 1 .
To refres h you r memory , w e recommen d readin g agai n th e argument s fro m (9.67 )
to (9.75 ) i n th e previou s chapter . W e begi n b y evaluatin g th e su m
oo
(10-58) B(«,x ) = E x ( n ) » " ' / g ).
1
By contou r integratio n an d th e functiona l equatio n (9.69 ) w e ge t
u
B(s,x) = ^~ f L(s + u, X)X f(u)du
= S^Xi-ftl - s) + L(s, x ) + ^ / _ L(s + u, X )Xuf(u)du.
Then b y th e functiona l equatio n (9.69 ) an d b y (1 0.56 ) w e find tha t th e integra l o n
the lin e R e u — — 1 is equa l t o — exB*(s, x) wher e
u
£*(*, X) = ^ / l(s - u)P{s - u)L{\ -s + u, x)X- f(u)du.
2iri J {1 )
Here we expand L( l — s + u, x) m ^ ° Dirichle t serie s an d integrat e eac h ter m gettin g
oo
(10.59) B*(s, X) = Y1 xMm^gimX)
l
where
u
(10.60) g(y) = ^- [ 7 ( s - u)P(s - u)y- f(u)du.
Gathering th e abov e result s w e obtai n th e desire d expressio n
1
(10.61) L(s,x) = B(s,x)+e xB*(s,X) - 6 x^f(l - s)X ~s
where B(s, x) a n d -B*(s , x) a r e th e reduce d serie s given by (1 0.58 ) an d (1 0.59 ) wit h
arbitrary X > 0.
10. Z E R O - D E N S I T Y E S T I M A T E S 259
It i s clear tha t B(s, \) r u n s essentiall y ove r n < C X, an d we show tha t B*(s, x)
runs essentiall y ove r m < 7 , wher e Y i s somewha t large r tha n -£|s|X _1 . T o thi s
end recal l th e estimate s
R e s
(10.62) | P ( s ) | ^ - ,
Res
(10.63) 7 ( s ) < N * -
which hol d uniforml y i n th e half-plan e R e s ^ | , an d th e estimat e (1 0.57 ) whic h
holds fo r al l s. Hence , movin g th e integratio n (1 0.60 ) sufficientl y fa r t o th e right ,
say t o th e vertical lin e
5
(10.64) R e W = max(l,i(^) )
one derive s
(10.65) 9 ( s ) «a e x p (_l(JL)*)
where th e implied constan t i s absolute. Henc e g(mX) i s very smal l i f mX i s some-
what large r tha n £\s\. Precisely , w e get by (1 0.65 )
1
(10.66) B*(s, X) = £ x(m)m'- g(mX) + 0 ( ^ )
provided
(10.67) XY ^£\s\(log£\s\f.
Next w e write (1 0.60 ) a s
(10.68) g(y) = — ( l(s - 2a + u)P{s - 2a + u)y u-2af(2a - u)du
by changin g u int o 2a — u an d movin g th e integration t o th e line R e u = 77 , where
1 < 7 7 < 2a. The n w e insert (1 0.68 ) int o (1 0.66 ) gettin g
(10.69) B*(8, X) =^/ ( E x W ^ S " + U _ 1 ) ^ W ^ + 0 ( l y)
where
W(u) = 7(5 - 2c r + u)P(s -2a + u)X u~2(Tf(2a - u).
By (1 0.63) , (1 0.62) , (1 0.57 ) w e get fo r u = rj + iv,
W{u) < ((|s | + |^|K)* +CT - r? X r? - 2<7 (2a - r y ) - 1 e - * H
«*-„)-(^"V-.-".
We assum e X 2 ^ £\s\ and7 7 = a + | , gettin g fo r a ^ a ,
(10.70) W(u )< ( a- iJ-^i^e-H .
Hence (1 0.69 ) yield s
oo
m s +<*~i+™ e d v +
/ 00 ' -
- OO m < -y\s
~ xy "
26 0 10. Z E R O - D E N S I T Y E S T I M A T E S
Before goin g furthe r le t u s stat e wha t w e hav e proved .
LEMMA 1 0.3 . Let x be a character modulo £ and s a complex number with
Re s ^ a > | . Choose X ^ (^H) ^ andY satisfying (1 0.67). Then we have (1 0.61 )
where B(s, x) i> s given by (1 0.58) with f as in (1 0.50) and B*(s, x) satisfies (1 0.71 )
with the implied constant being absolute.
Now w e proceed t o th e proo f o f ou r mai n resul t whic h i s
GRAND DENSIT Y THEORE M 1 0.4 . Let\ < a ^ 1 and e > 0 . Then we have
(10.72) N(a, fe, Q , T ) < ( £ > ( 2 + 0 ( l - a ) + H c(a)(l-a)^CA
where
(10.73) c(a )= n ^_i__J_).
Here D = kQT\ H = kQ 2T and C = 2\ogD. The exponent A in the power of
logarithm and the implied constant depend only on a and e.
REMARK. On e could remove e and asser t tha t bot h A an d th e implie d constan t
are absolute , however , suc h a refinemen t i s technicall y ver y complicate d t o mak e
along ou r line s (tr y th e origina l arrangement s b y H . L . Montgomer y [Mo2]) .
Note tha t (1 0.72 ) i s essentially a s stron g a s (1 0.7 ) i n th e rang e | < a < 1 , an d
considerably stronge r wit h respec t t o Q. I n particular , comparin g wit h th e numbe r
of terms i n th e sum , i t show s tha t fo r T fixe d an d an y a > | , ver y fe w L- functions
of primitiv e character s modul o q ^ Q hav e a zer o i n th e rectangl e (1 0.1 ) .
In wha t follow s % r u n s o v e r character s o f modulu s £ = kq < Q an d s = a +1 i s
a comple x variabl e restricte d t o th e rectangl e a ^ < r ^ l , | £ | ^ T . Furthermore , i f
X is principa l w e assum e tha t \t\ ^ £ 2 , becaus e otherwis e th e resul t i s trivial . Fo r
the proo f o f Theore m 1 0. 4 w e use (1 0.61 ) wit h
(10.74) X = D*C, Y = D X2C2.
The residua l ter m i n (1 0.61 ) i s smal l
5x^ff(l-s)X^«D-'
by virtu e o f (1 0.57) , an d th e serie s (1 0.58 ) ca n b e reduce d t o n ^ Y u p t o a
correction ter m 0(D~ 1 ) b y virtu e o f (1 0.51 ) . W e obtai n
(10.75) L ( S,X)= YJ X (n)n-s/(-|)
+ 0(x^- a / \Y^ x(n)n- s+a
-*+lv\e-^dv + D" 1 ).
Multiply thi s b y th e polynomia l
(10.76) M(s, X)= Y, M(™ M
10. Z E R O - D E N S I T Y E S T I M A T E S 26 1
where 1 < M ^ D* gettin g
(10.77) L(s, X)M(slX)= X )a nX(n)n-s
n^MY
/•OO
/ f°° I I \
s
+ 0(C Y, a n(v)X(n)n- \e~Mdv + D^Mi)
where th e coefficient s ar e give n b y
(10.78) a n= Y. ^H/(^) >
dm=n
d<Y,m<M
d \<*-%+iv
m
(10.79) a n(v)= X ^ )(y)
dm=n
d^Y,m^M
so \a n\ ^ T(TI) an d |a n (^)| ^ r ( n )« Fo r n ^ M w e hav e mor e precis e estimate s
an= J2 M(m){ l + 0 ( e - x / d ) } = ^ M ( m ) + 0( J D- 2 ),
dm=n
v ot—±+iv
The first yield s a i = 1 + 0(£>~ 2 ) an d a n < D " 2
i f 1 < n < M , whil e th e latte r
gives
s r
IXa n(^)x(n)n- | < Y*" * ^ (n)n~* < Y ^ M * log2M .
We wan t thi s t o b e bounde d b y C~ 2 whic h i s th e cas e b y assumin g
a
(10.80) M ^D ~i£-6.
By th e abov e estimate s w e reduce (1 0.77 ) t o
s
(10.81) L(s,x)M(*, X) = l+X *nX(n)n-
M<n^MY
+0 C 1
( I E ^Wxtnln-'je-Wdnr ).
«'—OO -K/r^^s-Tk/r v
M<n^MY
We wan t t o trea t th e first su m an d th e integra l o f th e secon d su m i n on e way . T o
this en d w e combine th e su m an d th e integra l int o on e integra l wit h respec t t o th e
measure
(10.82) dp = \e-Wdv + \6(y)
where dv i s the Lebesqu e measur e o n R an d S(v) i s the poin t measur e a t v = 0 , th e
factor | bein g introduce d fo r th e normalizatio n
(10.83) / d/j, = 1 .
26 2 10. Z E R O - D E N S I T Y E S T I M A T E S
In thi s notatio n w e write (1 0.81 ) a s on e inequalit y
oo . .
/ s
I Y, a n(v)x(n)n- \dfi(v) + C-1
~°° M<n^MY
where fo r v = 0 w e redefin e a n (0) t o b e a n. Pro m no w o n w e onl y nee d t o kno w
that a n(v) doe s no t depen d o n s, x an d tha t |o n (v)| ^ T~{n). For convenienc e w e
also se t a n(v) = 0 i f n ^ M o r n> MY.
Recall that (1 0.84 ) holds for s in the rectangle R(a, T) an d a character x modul o
kq wit h q ^ Q, an d i n cas e \ i s principa l w e requir e \t\ > £ 2 , wher e C — 21ogD
and Z ) = kQT. I n particular , i f s = p is a zero of L(s, x) m thi s region, w e get fro m
(10.84)
roo .
p
J ] Ta n(v)x(n)n- \dn(v) > C' 1
J —c
M<n^MY
provided D is sufficientl y larg e (whic h conditio n ca n b e assumed , o r els e th e goa l
is trivial) . A s i n Sectio n 1 0. 3 w e brea k th e summatio n int o dyadi c segment s N <
n^2N wit hT V - 2 £M, 0^£<L= [lo g Y/ lo g 2]. Denot e
oo . .
/ s
IY a n{v)X{n)n- \dii{v).
"°° N<n^2N
For ever y zer o p bein g counte d ther e exist s £ such tha t
(10.86) Dt(p, X)>L-\
Let Ri denote th e numbe r o f zero s satisfyin g (1 0.86) . The n th e tota l numbe r o f
zeros i s
R^YRe^ LmaxR £.
Raising Dz{s,x) t o a suitable powe r 2k wit h k ^ 2 (dependin g o n N) w e obtai n
2
°° i i
/ s
Yb n(v)x(n)n- \ dp(v)
_0
° P<n<2* P
k
with P = N whic h fall s int o th e segmen t
(10.87) Z < P < ( M F ) 2 + Z*
where Z is an y fixed numbe r t o b e chose n later , subjec t t o MY ^ Z ^ H. A t thi s
point w e requir e M ^ D £/4, s o / c is bounde d i n term s o f e. Th e coefficient s b n(v)
are bounde d b y a power o f th e diviso r function , s o we hav e
2 2a
5 3 \b n(v)\ n- < pi-2"£^
fc
P<n^2 P
where A depends onl y o n k. Addin g u p thes e inequalitie s fo r th e point s (p,x)
satisfying (1 0.86 ) w e obtai n
oo . 2
dp(v).
/ -OO / _ \ , - or > ^ / n
(p,x)eSt P<n^2 k
tn
P
10. Z E R O - D E N S I T Y E S T I M A T E S 263
Now we are ready t o us e two results fro m Chapte r 9 , namely Theore m 9.1 3 an d
Theorem 9.1 5 (se e als o the closin g remark s i n Chapte r 9 concerning th e spacin g o f
points p). Applyin g Theore m 9.1 3 w e get
(10.88) Ri < C P1~2a(P + H)C A
,
and b y Theore m 9.1 5
R£ ^ P 1 '2"^ + R]H^P*)£ A
,
the latte r givin g
(10.89) R £ < P 1 ~2a(P + H 3 4a
- )CA.
(here H — kQ 2T, an d A i s a constan t dependin g o n k whic h ma y b e differen t i n
each appearance) . A direc t compariso n o f (1 0.88 ) an d (1 0.89 ) show s tha t on e i s
stronger tha n th e othe r i f \ < a ^ | an d | ^ a < 1 respectively.
Suppose \ +s ^ a < | . The n introducin g P fro m (1 0.87 ) int o (1 0.88 ) on e get s
Rt < ((My) 4 *1 -"* + Z^ 1 -^ + HZ l 2a
~ )CA.
Clearly th e bes t choic e i s Z = #1 /(2-00 . W e als o assum e MY ^ Z± gettin g
A
(10.90) R t^iH^=^C .
It remain s t o verif y whethe r th e restrictio n MY ^ Z* doe s no t contradic t th e
conditions previousl y imposed . Fo r Y = D^C 2, an d sinc e D ^ H , thi s restrictio n
is verified wit h
2a-lo
2
M = H^^^C~ .
£/4
Note tha t M satisfie s (1 0.80) , M ^ £> an d obviousl y MY ^ Z . Th e exponen t A
and th e implied constan t i n (1 0.90 ) depen d onl y on e. Sinc e e is arbitrary, Theore m
10.4 is established i n the rang e \ < a ^ | (actuall y i n this rang e (1 0.72 ) hold s tru e
without th e first ter m becaus e i t i s absorbe d b y th e secon d one) .
Suppose | ^ a < 1 . The n introducin g P fro m (1 0.87 ) int o (1 0.89 ) on e get s
4 (1
Rt < ((M7) -Q) + Z 3(i-a) + HZ A Q(X
~ )CA.
Now th e bes t choic e i s Z — H1^3a~1^ givin g
2
Re « ( ( M ) 2 (1 _a)
+ H^^)C A
.
For M — D£^ thi s establishe s Theore m 1 0. 4 i n th e rang e | ^ a < 1 .
EXERCISE 3 . Sho w tha t fo r \ < a < 1 and e > 0,
(10.91) JV(a , jfe, Q,T) < (jD(3+e)(i-a ) + # 6 ( a ) ( i - a ) )£^
where
(10.92) 6(a )= n ^ _ ^ , _ i _ ) .
[Hint: Rais e th e polynomia l D^(s,x ) t o a powe r 2f c with k ^ 3 so tha t Z ^ P ^
(MY)3 + Z 4/3.}
26 4 10. Z E R O - D E N S I T Y E S T I M A T E S
EXERCISE 4 . Sho w tha t fo r \ < a ^ 1 and e > 0,
£){1
(10.93) N(a, k, Q, T) < (D^ ~a) + H *(*)^-*))cA
where
(10.94) a ( a ) = min (_l_ > _L_).
[Hint: Rais e th e polynomia l Di(s,x) t o a power 2/ c with k ^ 4 so tha t Z ^ P ^
( M 7 ) 4 + Z 5 / 4 .]
REMARKS. Fo r a nea r 1 , th e estimate s (1 0.91 ) an d (1 0.93 ) ar e stronge r tha n
(10.72) i n terms o f Q2 , bu t not in terms o f kT. I n particular, i f kT ^ Q £, the n
(10.93) yield s
Ar(a,fe,Q,T)<Q4(1-^+£
in th e rang e ~ ^ a ^ 1 , which i s essentially th e densit y conjecture . B y usin g hi s
Theorem 9.1 0 , M . Jutil a [Jut ] establishe d th e densit y conjectur e fo r zeros o f ((s)
in th e sam e range . Ever y exponen t a(a),fe(a),c(a ) take s it s maximu m a t a = | ,
being o ( | ) = f ,6(f ) = f, c(f ) = f . Fo r a = 1 we ge t o(l ) = f , 6(a ) = | ,
c(l) = |, an d fo r a = | al l thre e exponent s tak e valu e 2.
10.5. Th e gap s betwee n primes .
This sectio n i s written fo r the purpos e o f giving a n example o f how th e zero -
density theorem s ca n b e applied. W e hav e chose n a few question s abou t prime s
in shor t intervals , becaus e the y wer e inspirationa l fo r the developmen t o f density
theorems i n th e firs t place , an d fo r othe r statistica l studie s of zeros of £(s). B y n o
means d o w e attemp t t o show th e bes t result s fro m th e vas t literature .
By th e Prim e Numbe r Theorem ,
^{x) = ^2 A ( n ) = x + ^(x )'
where E(x) is a suitable erro r term , i t follows directl y tha t
(10.95) ^{x + y) - I/J(X) ~y
as x —> oo , provided y = y(x) i s somewhat large r tha n E(x). Henc e there i s a prime
number i n the shor t interva l (x , x + y] for al l sufficientl y larg e x.
Without improvin g th e existin g erro r ter m i n the Prim e Numbe r Theorem , G.
Hoheisel [Ho ] succeeded , nevertheless , i n showing i n 1930 tha t (1 0.95 ) hold s for
y = x6 with som e absolut e constan t 9 < 1. This i s an impressiv e achievemen t give n
the fac t tha t th e erro r ter m a s good a s E(x) = 0(x e) seem s t o be fa r beyond th e
current technology . Recal l that th e latter boun d translate s int o the non-vanishing of
£(s) i n R e s > 0. However , Hoheise l require d onl y tw o result s whic h wer e availabl e
in hi s time . Firs t i s a zero-free regio n of £(cr + it) o f th e typ e
l l0g T
(10.96) \t\^ T , G>\- °f > B
logT
for som e constan t B > 0 and al l T sufficientl y large . Th e secon d ingredien t i s the
zero-density estimat e o f type
c(1
(10.97) N(a,T) < T -a)(logT)A
10. Z E R O - D E N S I T Y E S T I M A T E S 265
for al l \ ^ a ^ 1 with som e constant s c ^ 2 and A ^ 1 . Pro m thes e result s one
derives
THEOREM 1 0.5 . Put 0 = 1 - ( c + (A + I)/BY1. Then
(10.98) i/>(x + y) - i/>{x) = y + O ( ^ - )
V log x /
/or all y with x ^ l o g x ) 3 ^ y ^ x.
PROOF. On e use s the approximate "explici t formula " (se e Sectio n 5.9)
#*) = *- Y, f+og(logx) 2)
w i t h T = x1~e. Henc e
2/ ^ T w Vlogx /
Here the sum over th e zeros is bounded by
1 1
a
] T x^" ^2 / x ~ dN{a,T)
1
OL
<2aT*W(±,T) + 2(loga: ) f x - N(a,T)da
i
A c
< ^ - i T l o g T + ( l o g x ) ( l o g T ) f\T /xYda
J 7)
< x - i T l o g T + (TVx^OogT) ^
where7 7 = i?(lo g log T)/ lo g T. Sinc e
(c
(TV*)" - ( l o g T ) - ^ ) B = (\ogT)- A
-\
the su m over zero s i s 0(1/log x) completin g th e proof o f Theorem 1 0.5 . •
Hoheisel use d th e zero-free regio n (1 0.96 ) wit h a positive bu t very smal l con -
stant B du e to J. E . Littlewood [Lit] , an d the zero-density estimat e (1 0.97 ) wit h
exponent c = 4 due to F. Carlso n [Car ] (see (10.26)). Afte r Vinogradov' s widen -
ing th e regio n (1 0.96 ) (se e Corollary 8.28 ) on e can tak e B arbitraril y larg e s o
0 = 1 — c - 1 + e satisfie s th e conditions o f Theore m 1 0.5 . Pro m no w on the Ho -
heisel exponen t 0 depends onl y o n c in the zero-density estimate . Not e tha t on e
needs (1 0.97 ) t o hold throughou t th e segment \ ^ a ^ 1 with th e same c , so an
improvement i n a subrange doe s no t help. I n other words , al l one gets fro m (1 0.6 )
is c = maxc(a) . B y the Gran d Densit y Theore m 1 0. 4 for £(s ) one gets (1 0.97 )
with c = 1 2/ 5 (the maximum o f c(a) i s attained a t a = | ) , which yield s (1 0.98 )
for x e ^ y ^ x wit h 6 = ^ + e. Thi s i s the best resul t o f its kind obtaine d s o far
(in 1 97 2 by M. N. Huxley [Hu3]) . Th e density conjectur e c = 2 yields 0 = \ + e.
Various combination s o f the abov e analyti c argument s wit h siev e methods pro -
duced estimate s
(10.99) y < ip(x + y) - i/>(x) < y
26 6 10. Z E R O - D E N S I T Y E S T I M A T E S
in plac e o f th e asymptoti c formul a (1 0.95) , however , fo r shorte r intervals . Fo r
example, R . Bake r an d G . Harma n [BH ] go t (1 0.99 ) fo r y = x e wit h 0 = 0.534 .
From (1 0.99 ) wit h y = x 6 i t follow s tha t th e differenc e betwee n consecutiv e
primes satisfie s
(10.100) d n = P n + l - P n < P n -
Therefore w e hav e (1 0.1 00 ) wit h 6 = 0.53 4 a s a consequenc e o f th e Baker-Harma n
i
work. Recal l tha t th e Rieman n hypothesi s yield s d n < C Pn logpn , bu t eve n o n th e
Pair Correlatio n Conjectur e (se e Chapte r 25) , th e bes t tha t ha s bee n achieve d i s
(due t o D . Goldsto n an d D . R . Heath-Brow n [GHB] )
(10.101) d n < (p nlogpn)i.
Creating a probabilisti c mode l fo r prime s i n 1 93 7 H . Crame r [Cra ] wa s le d t o a
conjecture tha t
2
(10.102) d n < (logpj .
In th e othe r directio n R . Ranki n [Ra2 ] showe d b y constructin g specia l com -
posite number s tha t
(10.103) d n ^ (e 7 - £)(logp n )(loglogp n )(loglogloglogp n )(logloglogp n )~ 2
infinitely often , wher e 7 is the Eule r constant . Pau l Erdo s offered a price of $1 0,00 0
to anyon e wh o ca n replac e e 7 i n (1 0.1 03 ) b y a functio n increasin g t o infinit y (th e
largest pric e eve r offere d b y Erdo s fo r a solutio n t o a mathematica l problem) .
It i s conjecture d tha t th e normalize d gap s betwee n consecutiv e prime s d n* —
{pn+i — Pn)(logPn)- 1 ha s a Poisso n distribution , tha t i s fo r an y t > 0,
(10.104) li m -\{n ^ x; d n* ^ t}\ = 1 - e~K
x—>oo X
Many estimate s fo r ip(x + y) — ip(x) wer e establishe d o n averag e wit h respec t
to x.
EXERCISE 5 . Assumin g (1 0.97 ) wit h c ^ 2 prove tha t
2
(10.105) / (0( x + y) - ^J(X) - yfdx « y X{\ogX)~A
Jx
for X e ^ y < X wit h 0 — 1 — 2c _ 1 + e, fo r an y e > 0 an d an y A > 0 , th e implie d
constant dependin g onl y o n e and A. I n particular , b y taking c = 1 2/ 5 deduce tha t
(10.95) hold s tru e wit h y = x e fo r almos t al l x , wher e 0 > | i s a fixed number .
We close thi s sectio n b y pointin g ou t a forma l similarit y betwee n th e problem s
of prime s i n shor t interval s an d prime s i n arithmeti c progression s t o larg e moduli .
10. Z E R O - D E N S I T Y E S T I M A T E S 26 7
EXERCISE 6 . Assum e th e followin g estimate s fo r th e zero s o f Dirichle t L-
functions wit h character s x ( m ° d q):
(1) L(s , x) 7 ^ 0 in th e regio n s = a + it wit h \t\ ^ T an d
l
(10.106) a * 1 - B ^ f
log qT
where B i s a positiv e constant , provide d qT i s sufficiently large ,
(2) Th e numbe r N(a,q,T) o f zero s o f al l L(s,x ) w ^ h x ( m ° d q) i n th e rec -
tangle \t\ < T , a ^ a satisfie s
(10.107) N(a,q,T) < (gT^-^log ^
for ^ ^ a < 1 , T ^ 3 , where c > 2 and A ^ 1 are suitabl e constants .
Prove tha t
(10.108) ^ ; , , 0 )= ^ ( 1+ 0 (_i_))
uniformly fo r x ^ ^ ( l o g g ) 3 wit h 6 — c -f ( A + l ) / £ . Se e Chapte r 1 7 for uncondi -
tional results .
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/12
CHAPTER 1 1
SUMS OVE R FINIT E FIELD S
11.1. Introduction .
In thi s chapte r w e conside r a specia l typ e o f exponentia l an d characte r sums ,
called sometime s "complet e sums" , whic h ca n b e see n a s sums ove r th e element s of
a finite field. Althoug h th e method s o f Chapte r 8 can stil l b e applie d t o th e stud y
of suc h sums , disregardin g thi s specia l feature , th e deepes t understandin g an d th e
strongest result s ar e obtaine d whe n th e finite field aspec t i s taken int o accoun t an d
the powerfu l technique s o f algebrai c geometr y ar e brough t t o bear .
We have alread y encountere d i n th e previou s chapter s som e example s o f expo -
nential sum s whic h ca n b e interprete d a s sum s ove r finite fields, fo r example , th e
quadratic Gaus s sum s
<™= E (fMf)
x mo d p
or th e Kloosterma n sum s (1 .56 )
S(„,M= r . ^ ) .
x mo d p
In thi s chapte r w e wil l stud y thes e sum s i n particular . Th e culminatin g poin t o f
our presentatio n i s the elementar y metho d o f Stepano v whic h w e apply fo r provin g
Weil's boun d fo r Kloosterma n sum s
\S(a,b]P)\^2y/p
and Hasse' s boun d fo r th e numbe r o f point s o f a n ellipti c curv e ove r a finite field.
Then we survey briefly, withou t proofs , the powerful formalis m o f £-adic cohomolog y
developed b y Grothendieck , Deligne , Katz , Laumo n an d others , hopin g t o conve y
a flavor o f th e tool s involve d an d t o giv e th e reade r enoug h knowledg e t o mak e
at leas t a preliminar y analysi s o f an y exponentia l su m h e o r sh e ma y encounte r i n
analytic numbe r theory .
11.2. Finit e fields.
We first recal l briefl y som e fact s abou t finite fields, an d establis h th e notation s
used i n thi s chapter . Fo r ever y prim e p , th e finite rin g Z/pZ o f residu e classe s
modulo p i s a field, whic h w e denot e F p . Th e Galoi s theor y o f ¥ p i s ver y eas y t o
describe: fo r an y n ^ 1 , ther e exist s a uniqu e (u p t o isomorphism ) field extensio n
of F p o f degre e n , writte n ¥ pn. Conversely , an y finite field F wit h q element s i s
isomorphic (bu t no t canonically ) t o a unique field F pd, s o q = p d, an d F admit s als o
a uniqu e finite extensio n o f degre e n fo r an y n ^ 1 , namely ¥ pdn.
Let no w F = ¥ q b e a finite field wit h q—p d elements . I n mos t o f th e chapter ,
p is fixed an d w e chang e notatio n slightly , denotin g b y F a n algebrai c closur e o f F
269
270 11. SUM S OVE R FINIT E FIELD S
and b y F n C F th e uniqu e extensio n o f degre e n o f F fo r n ^ 1 . Th e contex t wil l
always indicat e clearl y tha t th e cardinalit y o f F n i s q n an d no t n .
The extensio n F n / F i s a Galoi s extension , wit h Galoi s grou p G n canonicall y
isomorphic to Z/nZ, th e isomorphism bein g the map Z/nZ—> • G n define d b y 1 H> a,
where a i s th e Frobeniu s automorphis m o f F n give n b y a(x) = x q.
Let F b e a give n algebrai c closur e o f F , s o by th e above ,
n^l
By Galoi s theory , fo r an y x G F, w e hav e X E F <<=4> <J(X) = x i f an d onl y i f
xq — x an d mor e generall y
(11.1) x £ ¥ n <=^> a n{x) = x ^ > x qn
= x.
Prom thi s w e can deduc e tha t F n i s the splittin g fiel d o f the polynomia l X q — XG
¥[X]. Mor e precisely , on e ca n stat e th e followin g resul t o f Gauss :
LEMMA 1 1 .1 . For any integer n ^ 1 , we have
qn
(11.2) Y[ Y[ P = X -X
d\n deg(P)= d
where the product ranges over all irreducible monic polynomials P of degree d di-
viding n.
P R O O F . Thi s i s a n immediat e consequenc e o f th e descriptio n o f finit e fields :
the roots of the polynomial o n the right sid e (i n an algebraic closure) ar e exactly th e
elements x G F n wit h multiplicit y on e and , conversely , ever y suc h x ha s a minima l
polynomial whic h mus t occur , exactl y once , amon g th e polynomial s P o n th e lef t
side. •
Associated t o the extension F n / F ar e the trace map and the norm map. Becaus e
of the abov e descriptio n o f the Galoi s grou p o f F n / F , th e trac e ma p T r = TvY n/w :
F n— » F i s give n b y
(11.3) Tt(a: )= V J <T \x)= £ x**
while th e nor m ma pT V = iV Fn / F : F *— > F * i s similarl y
a
(11.4) N(x) = Yl ^x) = Yl x q
= x 3
^ .
The equation s T r (x) = y an d N(x) — y, fo r a fixe d y G F ar e ver y important ,
Because th e extensio n F n / F i s separable , th e equatio n T r (x) = y alway s ha s a
solution. I f #o is a given solution, then all solutions are in one-to-one correspondenc e
with solution s o f T r (a) = 0 , b y x = Xo + a . Moreover , an y solutio n o f T r (a) = 0
is o f th e for m a = a(b) — b = b q — b for som e b £ F n , uniqu e u p t o additio n o f a n
element i n F .
Similarly, fo r an y y G F*, th e equatio n N(x) = y ha s a solution , an d i f xo i s a
given solution , th e se t o f solution s i s i n one-to-on e correspondenc e wit h solution s
of N(a) = 1 , whic h b y Hilbert' s Theore m 9 0 (o r b y direc t proof ) ar e al l give n b y
11. SUM S O V E R F I N I T E F I E L D S 271
a = a(b)b l — b q x fo r some 6GF*, uniqu e u p to multiplication b y an element i n
F*.
As the additive grou p o f F is finite, the general theor y o f characters o f a finit e
abelian grou p (se e Chapter 3 ) can be applied. Character s o f F are called additiv e
characters, an d the y ar e all of the for m x—
i > > ip(ax) fo r some a G F, where tp is some
fixed non-trivia l additiv e character . Fo r instance, le t Tr : F —> TLjpZ be the trace
map t o the base-field, the n
(11.5) ip(x) = e(Tr(x)/p)
is a non-trivial additiv e characte r o f F. Fo r a given additiv e characte r ty and a G F,
we denote b y ipa the character x *-> I/J(CLX).
Applying th e general theor y o f characters o f finite abelia n groups , w e get the
orthogonality relation s
5>(*) = {q0 iotherwis
fx— 1,
e
(which i s used t o "solve " th e equation x = 0 in F) an d
f q i f ^ — 1 is the trivial character ,
E I/J(X) — <
1 0 otherwise .
The descriptio n o f characters o f the multiplicativ e grou p F * (als o calle d multi -
plicative character s of F) is not so explicit. Th e grou p structur e o f F* is well-known
(dating bac k t o Gauss) : i t i s a cycli c grou p o f order q — 1 . Generator s o f F* are
called primitiv e roots , an d there ar e <p(q — 1 ) of them, bu t no useful formul a fo r a
primitive roo t exists . Fixin g one , sa y z G F *, on e has an isomorphism
f F * ~ Z/(q - 1 ) Z
log : <
i >• n suc h tha t z n = x
I x—
and al l multiplicative character s o f F are expressed as
'alog(x)s
/alog(x)\
Q
for som e a £ Z/( g — 1 )Z , bu t suc h a descriptio n i s usually o f no use in analyti c
number theory .
As example s o f multiplicative characters , suppos e F = Z/p Z an d p^ 2 . The n
the Legendr e symbo l
'2N
X \->
\p, • )
is a non-trivial quadrati c character . I n general, i f 5 | (q—1), ther e i s a cyclic grou p
of orde r S consisting o f characters \ ° f ^* ° f order 5.
The orthogonalit y relation s becom e
^—^ y 0 otherwise ,
272 11. SUM S OVE R FINIT E FIELD S
(the su m ove r al l multiplicativ e characters) , an d
q—1 i f x = 1 is the trivia l character ,
£ *(*) = {0 otherwise .
xeF* ^
It i s usual to extend multiplicative character s to F by defining x(0 ) = 0 if x ^ 1 ,
and x(0 ) = 1 if x = 1 . Notic e the n tha t fo r an y b\q—\ th e formul a
(n.6) Y. x(* ) = \{ye F I ^ = x >i
(also a particula r cas e o f th e orthogonalit y relation s fo r th e grou p F*/(F*) d , a s
described i n Chapte r 3 ) i s tru e fo r al l x G F .
11.3. Exponentia l sums .
Let F = ¥ q b e a finite field wit h q = p m elements , p a prime . Exponentia l
sums ove r F ca n b e o f variou s kinds . Fo r th e simples t case , conside r a polynomia l
P G ¥[X] an d a n additiv e characte r -0 , and defin e th e su m
S(P) = £ > ( P ( * ) ) .
x£F
Slightly mor e generally , tak e a non-zer o rationa l functio n / = P/Q G ¥(X) an d
consider
S(f)= £ 1 >(f(x));
xeF
Q(x)^0
for instance , takin g q = p an d f(x) = ax + bx~ l, w e hav e S(f) = 5(a , 6;p) .
Multiplicative character s ca n als o b e used , gettin g sum s o f th e typ e
s*(/) = £**(/(*))
(where th e sta r i n J^ * mean s her e an d hencefort h tha t th e summatio n extend s
to al l x whic h ar e no t pole s o f / ) . Fo r g = p , X = ( " ) ( t n e Legendr e symbol )
and f(x) G Z[X] a cubi c polynomia l withou t multipl e root s modul o p , w e see tha t
—Sx(f) i s the p-th coefficien t a p o f the Hasse-Weil zet a function o f the ellipti c curv e
with equatio n y 2 — f(x) (se e Sectio n 1 4.4) .
Still mor e generally , on e ca n mi x additiv e an d multiplicativ e characters , an d
define sum s suc h a s
(11-7) S x(f,g) = ^2\(f(x))^(g(x)),
XEF
an exampl e o f whic h i s the Sali e su m T(a, b;p) define d b y
x N
mt , x v^ *( \ (ax ' ax+- f bx
bx\
r e
^p/ \ p
<".M= E Q x mo d p (^-)
which occurs in the Fourier expansio n of half-integral weigh t modula r forms ; se e [16]
for instance . I n contras t wit h th e seemingl y simple r Kloosterma n sum s 5(a , 6;p) ,
the Sali e sums T(a, b\p) ca n be explicitly compute d (se e Lemma 1 2.4 , and Corollar y
21.9 fo r th e unifor m distributio n o f th e "angles " o f th e Sali e sums) .
11. SUM S OVE R FINIT E FIELD S 27 3
To end this list, we mention that al l these definitions ca n again be generalized t o
sums in more than on e variable, an d tha t th e summatio n variable s ca n be restricte d
to th e rationa l point s o f a n algebrai c variet y define d ove r F : som e example s wil l
appear i n th e surve y section s o f thi s chapter .
The exponentia l sum s whic h directl y aris e i n analyti c numbe r theor y ar e sum s
over th e prim e field Z/pZ . However , th e deepe r understandin g naturall y require s
considering sum s ove r th e extensio n fields F p n. Indeed , th e ver y reaso n fo r th e
success of algebraic method s lie s in the fact tha t a n exponential su m over F p doesn' t
really com e alone , bu t ha s natura l "companions " ove r al l th e extensio n fields ¥ pn,
and i t i s reall y th e whol e famil y whic h i s investigate d an d whic h i s th e natura l
object o f study . Thos e companio n sum s ar e easil y defined : tak e th e mos t genera l
sum S = S x(f,g) w e have introduced , the n fo r n ^ 1 let
(11.8) S n = Y? x(tfF n /F(/(aO)MltF n /F(s(z)))
x£Fn
where we use the multiplicativ e characte r x o iV and th e additiv e characte r i\> o Tr o f
F n . Al l th e sum s S n ar e incorporate d int o a singl e object , th e zet a functio n o f th e
exponential sum , whic h i s the forma l powe r serie s Z = Z x(f, g) G C[[T]] define d b y
the formul a
*—p(EM
n
n^l
Justification fo r th e introductio n o f th e zet a functio n come s fro m th e followin g
rationality theorem , conjecture d b y Weil , an d prove d b y Dwork .
T H E O R E M 1 1 . 2 ( D W O R K ) . The zeta function Z is the power series expansion
of a rational function; more precisely, there exist coprime polynomials P and Q in
C[T], with P(0) = Q(0 ) = 1 , such that Z = g .
As a corollary , denot e b y (c^ ) (resp . (/3j) ) th e invers e o f th e root s (wit h mul -
tiplicity) o f P (resp . Q) , s o
P = JJ(l-aiT), Q = JJ(l- i 8 i T).
Then usin g th e power-serie s expansio n
n^l
we find tha t th e formul a Z = P/Q i s equivalent t o th e formul a
for an y n ^ 1 , whic h show s ho w th e variou s sum s S n ar e related . I n particular ,
note tha t the y satisf y a linea r recurrenc e relatio n o f orde r d equa l t o th e numbe r
deg P + de g Q o f root s a^ , fij .
27 4 11. SUM S OVE R FINIT E FIELD S
COROLLARY 1 1 .3 . We have for any n ^ 1 the upper bound
3i
In particular,
(11.9) |S|<I>; l + I>i| .
3i
A commo n abus e o f language i s to spea k o f the c ^ an d (3j a s the root s o f the
exponential su m S. W e will describ e i n Section 1 1 .1 1 a number o f general facts
about thes e roots . I n the intervening sections , w e will prove Dwork' s Theore m and
estimate th e modulu s o f th e root s i n the importan t specia l case s o f Gauss sums ,
Kloosterman sum s an d for the local zet a functio n o f elliptic curves .
REMARKS. W e have calle d th e sums S n,77 - ^ 1 , "companions " o f the origina l
exponential su m S. However , on e can consider othe r companion s o f S as well. I f S
involves an additive character , it can also be very useful sometime s to consider 5 as
just on e element o f the family o f sums obtaine d b y varying th e additive character ,
specifically i f
s = £*x(/(*Ms(s))>
we als o introduc e fo r a G F,
Sa = £ * x ( / ( s ) W a ( s ( * )) = £ * x ( / ( * ) M a c , ( z ) ) .
x£F xe¥
Estimates o n average ove r a for the first fe w power moment s o f Sa are often easil y
derived b y elementar y means , an d they ca n be of great us e in estimatin g 5 , eve n
in additio n t o the method s o f algebraic geometry . Se e the proo f o f Weil' s boun d
for Kloosterma n sum s i n Section 1 1 . 7 and th e example s i n Section 1 1 .1 1 . Mor e
general type s o f families hav e bee n (an d still are ) extensively studie d b y Katz; see
for instanc e [Kl].
11.4. Th e Hasse-Davenpor t relation .
We consider genera l Gaus s sum s ove r a finite field. Le t F = ¥q be a finite field
with q — p m elements , an d le t ip be a n additiv e characte r an d \ a multiplicativ e
character o f F. Th e Gauss su m G(x, V J) ls
(11.10) G( Xi)=ExWW'
xeF
ls
(recall tha t \ extende d t o F by x(0) = 1 , 0 accordin g t o whethe r \ i s trivial o r
not). Whe n \ i s the Legendre symbol , on e recovers quadrati c Gaus s sums .
The associate d sum s ove r th e extensions fields are
G „ ( x , ^ ) = Y. x(N ¥n/¥(x))iP(Tr¥n/¥(x))
xe¥n
and th e zeta functio n is
qA Tn
(11.11) Z( X^)=e^(J2 ^ )-
n^l
11. SUM S O V E R F I N I T E F I E L D S 275
In thi s case , Dwork' s Theore m wa s prove d b y Hass e an d Davenpor t an d i s
known a s th e Hasse-Davenpor t Relation .
T H E O R E M 1 1 . 4 (HASSE-DAVENPORT) . Assume x and ip are non-trivial. Then
we have for any n ^ 1 ,
-Gn(X,rl>) = {-G( X1, >))n
or equivalently the zeta function is a linear polynomial
Z(X,1>) = l + G{x,1 >)T.
Hence the only "root " fo r the Gauss sum is G{\, ip) itself. Thi s can be estimate d
elementarily, a s wa s don e fo r th e Gaus s sum s considere d i n Chapte r 3 .
PROPOSITION 1 1 .5 . We have
\G(x,1>)\ = y/Q
nor 7 s
if neither \ V ^ trivial, while
,/ N ,( 0 if tb non-trivial,
[q ifip = l
,, x ,f 0 if X non-trivial,
\G(XA)\=\ r ,
{ q ifx = 1.
nor ls
PROOF. Th e las t tw o statement s ar e immediate , s o assum e neithe r % ^
trivial. W e hav e
|G(x,^)l2= £ x(x)x(yMxM-y)
x,y£W*
= n writin z = x
^2^( z) Yl ^(( z ~ ^ (° § v~l)
— q (b y orthogonality , applie d twice. )
•
We no w tur n t o th e proo f o f th e Hasse-Davenpor t Relation . W e conside r th e
field F = ¥(X) o f rationa l function s o n F an d th e rin g R = ¥[X] o f polynomials .
Recall tha t R i s a principa l idea l domain . Fo r h £ R o f degre e d ^ 0 , w e define th e
norm
N(h) = q d.
The zet a functio n o f F i s th e Dirichle t serie s (analogou s t o th e Rieman n zet a
function)
CF(S)= £ N{h)->.
heR
h moni c
REMARK. Thi s coul d als o be writte n a s a su m ove r th e non-zer o ideal s a in R,
Ms) = X>(°)~s
a
whereA T (a) = \R/a\ = N(h) fo r an y polynomia l h suc h tha t a = (h). Bu t w e wil l
work wit h polynomial s t o emphasiz e th e elementar y spiri t here .
27 6 11. SUM S OVE R FINIT E FIELD S
The Dirichle t serie s CF(S) converges absolutel y fo r R e (s) > 1 . Indeed , puttin g
n(d) = {h G R | deg(h) — d, h i s monic} — q d
we obtai n immediatel y
cF(s) = J2n(d)i'ds = E<?(1~ s)d = c1 - ^y 1
-
On th e othe r hand , uniqu e factorizatio n int o irreducibl e polynomial s yield s a n
expression o f £ p a s a n Eule r produc t
1
<F(S)= n (i-A^rT
PeR
P moni c irreducibl e
which i s convergen t fo r R e (s) > 1 .
The firs t ste p i n th e proo f o f th e Hasse-Davenpor t Relatio n consist s o f writin g
the zet a functio n o f Gaus s sum s a s a n L- function fo r th e field F. Le t H C F * b e
the subgrou p o f rationa l function s whic h ar e quotient s o f moni c polynomials , an d
G C H a subgrou p wit h th e propert y
hih2 e G => hi, h 2 e G.
Then i f a : G - > C * i s a characte r o f th e grou p G, i t ca n b e extende d t o a totall y
multiplicative functio n o f th e se t o f moni c polynomial s h G R b y puttin g a(h) = 0
if h $L G. Th e correspondin g L- function i s define d analogousl y t o th e classica l
L-functions b y th e Dirichle t serie s
s 1s
L{s,a)= J2 a(h)N(h)- = l[(l-a(P)N(P)- )-
heR P
h moni c
for R e (s) > 1 .
For dealin g wit h Gaus s sum s w e conside r th e subgrou p G C H o f rationa l
functions / define d an d non-vanishin g a t 0 . Defin e a characte r A on G b y
X(h) = x ( a d ) ^ ( a i )
for h = X d — a\X d~l + • • • + (— l)ddd G R. Clearl y A is multiplicativ e o n moni c
polynomials, an d extend s t o a characte r o f G. I n thi s cas e w e ge t th e following :
LEMMA 1 1 .6 . We have L(s, A ) = 1 + G(x, *P)q~ s-
PROOF. W e arrang e th e Dirichle t serie s fo r L(s,A ) accordin g t o th e degre e o f
h:
£(*,A) = £ ( E A00)<r *
d^O deg(h)=d
and evaluat e eac h ter m i n turn . Fo r d — 0 , th e onl y moni c polynomia l occurrin g i s
h — 1, and A(l ) = 1 . Fo r d — 1, we hav e h — X — a so tha t
£ \(h) = Y,MX-a)=Y,x(aMa) = G(x,i>).
deg(fr) = l aE¥ a£F
11. SUM S OVE R FINIT E FIELD S 27 7
For an y d ^ 2 we hav e
d
Y^ X(h)= Yl A(X -a
1 X1
d
" + -. . + ( - l ) d a d )
deg(h)=d ai,...,a d €F
d 2
=q ~ £ xMV-(ai)= 0
ai,a d GF
by orthogonality , becaus e a t leas t on e o f th e character s x > ^ i s non-trivial . •
On th e othe r hand , appealin g t o th e Eule r produc t w e will prove :
LEMMA 1 1 .7 . We have L(s , A) = Z(q~ s), where Z = Z(xii>) i> s
^e zeta func-
tion (1 1 .1 1 ) associated with Gauss sums.
Theorem 1 1 . 4 follow s fro m Lemma s 1 1 . 6 an d 1 1 .7 .
P R O O F O F LEMMA 1 1 .7 . Takin g th e logarithmi c derivativ e o f the Eule r prod -
uct, w e ge t
n ^ l rd=n p
deg(P)=d
while, o n th e othe r hand ,
1 Z'(q- S)
= ^G„(xi)f
logq Z(q-°) n>i
It therefor e suffice s t o prov e th e formul a
n d
(11.12) J2 d\(P) / = G n(X,iP)
P
d=deg(P)\n
for n > 1 , th e equalit y o f th e logarithmi c derivative s bein g sufficien t t o impl y
Lemma 1 1 . 7 since bot h side s ar e Dirichle t serie s wit h leadin g coefficien t 1 .
To prove (1 1 .1 2) , le t P b e on e o f the irreducibl e polynomial s appearin g o n th e
left side , o f degre e d \ n. It s roots , sa y a?i,... , x^ ar e i n F n . Fi x on e roo t x — Xj
and writ e
1
P = X d- a i X ^ " + • • • + (-l) d
ad.
We ge t
N(x) = (N Wd/F(x)r/d = a n /d
d ,
Tr(x) = - T r F d / F ( x ) = - a x
hence
HP)n/d = ( x ( a ^ ( a i ) ) " / d = xW*)*^) = x (N(x))^{Tr(x)).
27 8 11. SUM S OVE R FINIT E FIELD S
Summing ove r al l root s of P we deriv e
d
i=l
and summin g ove r al l P with deg(P ) | n, we get (1 1 .1 2 ) b y Lemma 1 1 . 1 since ever y
element i n ¥n wil l appea r exactl y onc e as one of the root s xi for some P. •
11.5. Th e zet a functio n fo r Kloosterman sums .
Next w e consider Kloosterma n sums . Le t F b e a finite field wit h q = p m
elements an d thi s time conside r additiv e character s ip and (p. We define th e Kloost -
erman su m associate d t o ip and <p by
1
(11.13) S^,ip) = -^2tl;(xMx- ),
xGF*
(the minu s facto r i s only fo r cosmetic reasons) . Whe n q — p is prime, an d I/J(X) =
e(ax/p), <p(x) — e(bx/p), w e hav e therefor e S(ip,ip) = —S(a,6;p) .
The companio n sum s ove r th e extensio n fields F n ar e
r1
S n f t M = - E ^(Tr(x)M rr(x- ))
and th e Kloosterma n zet a functio n is
We wil l prov e Dwork' s Theore m i n this case , whic h i s due to Carlitz.
THEOREM 1 1 .8 . Assume that ip and ip are both non-trivial Then
m<p) =
i-s(n,,l)T +^-
The proof i s very similar t o that o f Theorem 1 1 .4 . W e put R = F[X], F = ¥(X)
as before , an d conside r th e sam e grou p G C F* o f quotients o f monic polynomial s
defined an d non-vanishin g a t 0. We defin e a character7 7 : G -» C* by putting
77(h) = ip{ai)(p(a d-i/ad)
for a monic polynomia l h G G, where w e write (compar e th e previou s section )
ft = X d + a i X ^ " 1 + • • - + a d _xX + a d
(with ad 7^ 0 since h e G). Th e followin g computatio n verifie s tha t7 7 i s indeed a
character o f G: le t ft' = Xe + b\X&~x + • • • + be-\X + o e wit h 6 e 7^ 0, the n
ftft' = X d+e + (a x + ^ X ^ 6 " 1 + • • • + {a d-ibe + a A - i )* + a dbe
and
/,,A , ,. , N /a d-ibe + a dbe-i\
77(ftft ) = ^(a i + bi)(p{ — J
= ^{ai)ip{ad-ila d)^{bl)ip{be-i/be)
= n(h)r 1 (hf).
11. SUM S O V E R F I N I T E F I E L D S 27 9
Recall tha t w e extend7 7 t o all h G R by putting 77 (h) = 0 for h $_ G.
LEMMA 1 1 .9 . For ip and ip non-trivial, the L-function associated to 77 is given
by
L(s,rj) = ^riWNih)-3 = 1 - S^^)q~ s + q 1 '28.
h
PROOF. B y arranging term s accordin g t o the degree o f h, we writ e
-ds
£ M = £( E »K>0
l\ ) )H y IL
d^O deg(h)=d
and evaluat e th e inner sums . Fo r d — 0, we have onl y h = 1 and 77(1) = 1 . For
d = 1, we have / i = X + a with a ^ 0 , hence
£ v(h) = E rt* + a) = E V-faM O = -s(V, *>)•
deg(/i) = l aGF * aGF *
For d = 2 , w e ge t
aX
E*? W = E ^ + + 6) = E ^Mafe" 1 )
d e g ( / i ) = 2 aG F aG F
6GF* 6GF *
= 9 - l + ( £ > ( « ) ) ( $ > ( & ) ) =9
aGF* &GF *
by applyin g twic e th e orthogonality o f characters, sinc e neithe r i\) no r ip is trivial.
Finally, fo r d ^ 3 , we get
d e g ( h ) = d aGF * ai,.. . ,a d_iGF
= / ~3 Y, V^iM«d-i O = 0
ai,ad_i€F
aGF*
since ther e i s free summatio n ove r 0 1 G F. •
LEMMA 1 1 .1 0 . For ip and p non-trivial, we have the identity
This lemm a complete s th e proof o f Theorem 1 1 .8 .
PROOF. Th e L-functio n ha s a n Euler produc t
1
L(sJr,) = l[(l-T,(P)N(P)-'r .
P
Taking th e logarithmic derivativ e w e get
1 L'{s,ri)
\ogq L(s,rj)
= Edeg(P)E 7 ?( P )V r d e s ( P ) s
r>l
d
E ( E E nipyy
n ^ l rd—n deg(P)= r
28 0 11. SUM S OVE R FINIT E FIELD S
and a s befor e i t suffices t o prove th e formul a
n d
(11.14) Yl dn{P) l = -S ntyt<p)
d=deg(P)\n
for n ^ 1 . Le t
P = Xd + aiXd~l + • • • + ad^X + a d
be on e of the irreducibl e polynomial s o n th e lef t sid e of (11.14), of degree d | n, and
#i,..., Xd its roots , whic h li e i n F^ . W e hav e fo r eac h z,
_ n _n
T r ( ^ i ) = ^ T r F d / F ( Z i ) = - ^ ai
(since a ^ X ^ X " 1 ) = X d + ^ ^ d _ 1
+ • • • + a'1) an d
•&(*, ) = -Tr ¥d/¥(Xl ) = - 5 — •
Hence
,(P)»« = v ,(= 01 ) v (^)^(- I ,w-,-. )
and summin g ove r th e root s a^ , then ove r th e polynomial s P of degree d | n, w e
obtain (1 1 .1 4 ) b y Gauss' s Lemm a again . •
Theorem 1 1 . 8 allow s u s to factor th e Kloosterma n zet a functio n
Z(V,^) = ( l - S ( ^ ^ ) T + g r 2 ) - 1 = ( l - a T ) - 1 ( l - / 8 T ) - 1
where a an d / ? are comple x numbers , an d o f course a -f- /? = 5(^> , ip), a/3 = q. In
sharp contras t t o the cas e o f Gauss sums , however , th e root s a an d / ? cannot be
explicitly computed .
T H E O R E M 1 1 .1 1 ( W E I L ) . Assume that ip and <p are non-trivial and p ^ 2 .
Then the roots a and f3 for the Kloosterman sum 5( /0,^>) satisfy \a\ = |/3| = y/q,
and therefore we have
1
(11.15) 5 ( ^ ) ^ 2 ^ .
We will prov e Theore m 1 1 .1 1 in the nex t tw o sections .
COROLLARY 1 1 .1 2 . Let a, b, c be integers, c positive. We have
2
(11.16) \S(a,b]c)\ < r(c)(a,b,c)^ c^2.
P R O O F . B y the twisted multiplicativit y (1 .59 ) for Kloosterman sums , it suffices
to conside r c = pv wit h p prime an d v ^ 1 . If p j nra, w e hav e Ramanuja n sum s
for whic h th e resul t i s easy (se e (3.2) , (3.3)) . Otherwise , th e case v = 1 follows
from Theore m 1 1 .1 1 fo r p ^ 3 , and fo r p = 2 one check s immediatel y tha t the
Kloosterman sum s modul o 2 satisfy Theore m 1 1 .1 1 : w e hav e 5(1 ,1 ; 2) = 1 , an d
the associated zet a function i s therefore Z(T) = 1 +T+2T 2 , wit h roots (—l±iy/7)/4
of modulu s l / \ / 2 .
The cas e p \ nm and / ? > 2 can b e dealt wit h elementarily ; se e Exercis e 1 o f
Chapter 1 2 . •
11. SUM S O V E R F I N I T E F I E L D S 281
EXERCISE 1 . Conside r a general Kloosterman-Sali e su m
and it s associate d companion s S n an d zet a functio n Z , wher e ip and <p ar e additiv e
characters o f F and \ ^ s multiplicative (s o \ = 1 is the cas e of Kloosterman sums) .
Show tha t
Z = (1 - 5( X ; ^ ^)<T S + x(-a) l 2s l
X{b)q - r
where
«,)-.(5M), *)-.(») .
11.6. Stepanov' s metho d fo r hyperelliptic curves .
We will prove Theore m 1 1 .1 1 by deducin g it from th e Rieman n Hypothesi s fo r
certain algebrai c curve s ove r finite fields. However , w e us e Stepanov' s elementar y
method (se e [Ste] , [Sch] , [Bo3] ) instea d o f Weil's arguments .
Let F be a finite field with q element, o f characteristic p. W e will only conside r
algebraic curve s Cf ove r F given b y equation s o f the typ e
(11.17) C f : y2 = f{x)
for som e polynomia l / G ¥[X] o f degree m ^ 3 . W e assum e moreove r th e followin g
condition
(11.18) Th e polynomia l Y 2 - f(X) G F[X, Y] i s absolutely irreducibl e
(i.e. i t is irreducible ove r th e algebrai c closur e o f F). Thi s i s a minimal regularit y
assumption o n th e curv e Cf. I t is easily see n to be equivalen t t o the conditio n tha t
/ i s not a square i n F[X], an d w e will us e it in this form .
REMARK. Stepanov' s metho d ha s been refined b y Schmidt [Sch ] an d Bombier i
[Bo3] an d i s capable o f handling th e genera l cas e o f the Rieman n Hypothesi s for
curves; th e cas e of curves wit h equatio n o f the typ e y d = f(x) i s not muc h harde r
than th e on e treate d here . W e limi t ourselve s t o the curve s Cf fo r simplicity, an d
because i t suffice s fo r the application t o Kloosterma n sum s an d elliptic curves .
Note tha t curve s o f the type y 2 = f(x) ar e instances o f so-called hyperellipti c
curves, which ar e quit e naturall y distinguishe d amon g algebrai c curve s (bu t no t all
hyperelliptic curves are of this form; se e for instanc e elliptic curves in characteristic s
2 an d 3) .
The proble m w e consider i s that o f estimating th e numbe r |C/(F) | o f F-rationa l
points of C/, i.e. , th e numbe r N of solutions (x , y) G F 2 t o the equatio n y 2 = / ( # ).
We ar e especially intereste d i n this questio n whe n q is large (typically , a s with
exponential sums, the polynomial / G ¥[X] i s fixed, and we consider th e F n -rational
points fo r all n ^ 1 ) , althoug h w e will obtai n completel y explici t inequalities .
THEOREM 1 1 .1 3 . Assume that f G ¥[X] satisfies (1 1 .1 8), and m = deg(/) ^
3. Ifq > 4m2, then N = \C f(¥)\ satisfies
\N - q\ < 8rriy/q.
282 11. SUM S OVE R FINIT E FIELD S
Clearly w e ca n assum e tha t p > 2 , a s otherwis e th e ma p y \-> y 2 i s a n auto -
morphism o f F an d N — q.
Stepanov's idea , whic h wa s inspire d b y result s o f Thu e [Thu ] i n diophantin e
approximation, i s to construct a n auxiliary polynomial o f degree r, say , having zero s
of hig h multiplicit y (a t leas t £, say) a t th e x-coordinate s o f points o f Cf (F) . Henc e
one get s easil y th e inequalit y
N ^2r£~\
the facto r tw o being the highes t possibl e multiplicit y o f a given x-coordinate amon g
points i n C/(F) . Thi s inequalit y turn s ou t t o b e s o stron g tha t i t give s th e upper -
bound o f th e theore m (certainl y a surprisin g fact!) . A tric k the n deduce s th e
lower-bound fro m this .
We firs t distinguis h amon g th e point s (x , y) thes e wit h y = 0 . Le t iV o be th e
number o f distinct zero s of / i n F, which is also the number o f points (x , 0) G C/(F).
If (x , y) i s a poin t o f Cf wit h y =fc 0 , i t follow s tha t f(x) i s a squar e i n F , whic h i s
true i f an d onl y i f g(x) = 1 where
g = f c wit h c = |(«7-1 ) .
Conversely, give n x G F wit h g(x) = 1 , ther e ar e exactl y tw o element s y G F*
with y 2 — f(x). Hence , writin g
(11.19) 7V ! = | { x G F | g(x) = l}\
it follow s tha t
(11.20) N = No + 2Nx.
We will estimate N\ b y following the strategy sketche d above , but i n order to handl e
the lowe r boun d later , w e generalize slightl y an d conside r fo r an y a G F th e se t
(11.21) S a = {x G F | f(x) = 0 or g(x) = a}.
To produc e polynomial s vanishin g t o a larg e order , w e wis h t o us e derivative s
to characteriz e whe n thi s occurs . I n characteristi c 0 , a polynomia l P ha s a zer o of
order £ at x o i f an d onl y i f al l th e derivative s P^ wit h 0 < i < £ vanish a t XQ. I n
characteristic p > 0 , however , thi s i s n o longe r tru e i f £ > p, a s th e exampl e o f th e
polynomial P = X p shows , sinc e p( k) = 0 for al l k ^ 1 , in particular , P ^ ( 0 ) = 0 .
A satisfactor y solutio n follow s b y considerin g othe r differentia l operators .
DEFINITION. Le t K b e an y field . Fo r an y k ^ 0 , th e fc-th Hass e derivativ e i s
the linea r operato r E k : K[X] -> K[X] define d b y
n
Ekxn = f \Xn-k
1
for al l n ^ 0 , an d extende d t o K[X] b y linearity . W e als o writ e E = E (bu t
beware tha t E k ^ E o E o • • • o E).
REMARK. Fro m th e binomia l expansio n
Xn = (X - a + a)n - Y^ (l) aU~k(X - ^) fc5
and b y linearity , w e se e tha t th e valu e o f E kP a t a poin t a G K, fo r P G if[X] ,
is simpl y th e coefficien t o f (X — a) k i n th e Taylo r expansio n o f P aroun d a. Thi s
11. SUM S O V E R F I N I T E F I E L D S 283
explains th e properties o f the Hasse derivatives , bu t we cannot tak e thi s a s a defi -
nition, becaus e th e values of a polynomial ove r a finite field do not characterize the
polynomial.
Note tha t fo r K o f characteristi c p > 0 , w e ge t EX P = E 2XP = • • • =
E ~ Xp = 0 , bu t E PXP = 1 ^ 0 an d w e see that th e Hass e derivative s detec t
p l
the zer o o f X p o f order exactl y p at 0 . Thi s i s a general fact , a s Lemma 1 1 .1 6 will
show.
LEMMA 1 1 .1 4 . The Hasse derivatives satisfy
k
k EJ Ek 3
E {fg) = Y,^ ^ ~ d)
j=o
for all f,g€ K[X], and more generally,
k
(11.22) E {h---U)= Y. {E^h)---{E^f r)
/ o r A , ...,f r£K[X].
I t suffice s t o consider / = X m, g = X n, an d the first formul a follow s
PROOF.
from th e identity
m n
n + m\ ^ ( \(
k =
) f^\j]\k-j
which i s obvious fro m th e combinatorial interpretatio n o f the binomial coefficients .
Then th e second formul a follow s b y induction. •
COROLLARY 1 1 .1 5 . (1 ) For all k, r ^ 0, and all a e K, we have
Ek(X-a)r = ( r
\(X-a)r-k.
(2) For all k, r ^ 0 with k ^ r, and all f,gE K[X], we have
k r k
E (f9l = hg ~
for some polynomial h such that
deg(h)^deg(f) + kdeg(g)-k.
PROOF. Fo r (1), we apply (1 1 .22 ) t o /i = • • • = f r = X - a , gettin g
Ek(X - a) r = J2 EJl
(X-a)-- E Jr
(X - a)
j i H \-j r=k
and onl y term s wit h al l ji G {0,1}, 1 < i ^ r , giv e non-zer o contribution s sinc e
Ei(X — a) = 0 for j > 2 , from th e definition. Henc e (1 ) follows .
For (2) , we observe tha t i f k ^ r , w e have ji = 0 fo r a t leas t r — k indice s
in (1 1 .22) , whic h give s (2) . •
1 1 .1 6 . Let f e K[X] and a e K. Suppose that (E kf)(a) = 0 for all
LEMMA
k < L Then f has a zero of order ^ £ at a, i.e., is divisible by (X — a)*.
28 4 11. SUM S OVE R FINIT E FIELD S
P R O O F . Le t
be th e Taylo r expansio n o f / aroun d a. B y (1 ) o f Corollar y 1 1 .1 5 , we obtai n
Ek
f= E ^ V - " )4
and evaluatin g a t a we get a ^ = 0 for al l k < £, hence / i s divisible b y (X — a) e a s
claimed. •
We need anothe r technica l lemma .
LEMMA 1 1 .1 7 . Let K = ¥ be a finite field of characteristic p with q elements,
and let r = h(X, X q) G ¥[X], where h G F[X , Y]. Then
Ehr = (E k
xh)(X,X
q
)
for all k < q, where on the right side E xh denotes the Hasse derivative of h
performed with respect to X.
I t suffice s t o conside r h = X nYm, s
PROOF. o we must prov e that E kXn+mq —
k n rnq
(E X )X . Pro m Lemm a 1 1 .1 4 , we ge t
3=0
so i t suffice s t o sho w tha t E jXmq = 0 for 0 < j < q to prov e th e lemma . Bu t
^mq\ mq (mq — 1 N
v •1 1
3• /J •J \ J - • °
in characteristi c p , an d th e resul t follows . •
We com e t o th e hear t o f Stepanov' s method , th e constructio n o f th e auxiliar y
polynomial.
PROPOSITION 1 1 .1 8 . Assume that q > 8m, and let £ be an integer satisfying
m < £ ^ q/8. Then there exists a polynomial r G F[X] of degree
deg(r) < c£ + 2m£(£ - 1 ) + mq
which has a zero of order at least £ at all points x G S a (recall c= \{q— 1 )^ .
We wil l look , b y th e metho d o f indeterminat e coefficients , fo r a polynomia l r
of th e specia l for m
q
(11.23) r = f Yl (rj+s j9)yJ
for som e polynomial s r^ , Sj G F[X], t o b e constructed , eac h o f whic h ha s degre e
bounded b y c - m. Henc e suc h a polynomia l r ha s degre e bounde d b y
(11.24) deg(r ) ^ £m + c - m + cm + Jq ^ (J + m)q.
The nex t lemm a i s crucia l t o ensur e tha t r ^ 0 . Thi s i s wher e w e nee d th e
assumption (1 1 .1 8) .
11. SUM S OVE R FINIT E FIELD S 285
LEMMA 1 1 .1 9 . We have r = 0 e ¥[X] if and only if Tj = Sj = 0 £ ¥[X] for
all j.
P R O O F . W e ca n assum e (b y a shif t X \-> X + a i f necessary ) tha t /(0 ) ^ 0 .
Suppose tha t r = 0 but no t al l r^, Sj ar e zero ; let k b e th e smalles t inde x fo r whic h
one o f r*fc , Sk is non-zero. Dividin g b y f £Xkq, w e get fro m (1 1 .22 ) th e identit y
k^j<J
which w e write i n th e for m ho + h\g = 0 , wher e
s
h0 = ] T rjXU-V*, hi = J2 3xU~k)q-
We squar e thi s equation , the n multipl y bot h side s b y / , gettin g
hlf = h\f«.
Since / 6 F[X] , we hav e
f{Xy = f(X") = /(0 ) (mo d X« )
hence
rlf = s 2kf(0) ( m o d i ' ) .
However, th e degre e s o f th e polynomial s i n thi s congruenc e ar e bounde d b y
2deg(r/e) + m ^ 2( c — m ) + m < q, an d 2deg(s/c ) < 2( c — m ) < g , respectively .
So there mus t b e equalit y r\f — s\f(f$), whic h contradict s th e assumptio n (1 1 .1 8 )
that / i s no t a squar e i n ¥[X]. •
We no w evaluat e th e Hass e derivative s o f r .
LEMMA 1 1 .20 . Let k ^ £. Then there exist polynomials r K- J , s^ each one of
degree ^ c — m + k(m — 1 ) such that
Ekr = f~k J2 {rW + sfg)**.
0^3 < J
PROOF. W e ca n writ e r = h(X, X q) wher e h G F[X, Y] i s the polynomia l
Hence b y Lemm a 1 1 .1 7 , we hav e
Ekr = (E kxh)(X,Xq) = Y\ (E k
(fr3) + E k i+c
(f sj))Xjq.
o^j<J
By (2 ) of Corollary 1 1 .1 5 there exist polynomial s r^ an d s : ' satisfyin g E k(f£rj) =
fi-kr(k) an d E k(f+cSj) = f-k+c s(k) wit h deg ( r W ) < ; deg(rj) + kdeg(/ ) - if e <
c — m + fc(m — 1 ) an d deg(s ^ ^ ) ^ c — ra+fc(m — 1 ) . Thi s i s th e desire d result . •
Recall tha t w e wis h r t o hav e zero s o f orde r ^ £ at point s i n <S a (se e (1 1 .20)) .
If f(x) = 0 , clearly thi s i s the case . S o let x E <Sa, with f(x) 7 ^ 0. Applyin g Lemm a
286 . SUM S O V E R F I N I T E F I E L D S
11.21, w e evaluat e E kr a t a poin t x G <Sa, using g(x) = a, an d mos t importantl y
xq = x:
Ekr{x) = f{xY~ k ] T (rf\x)+asf\x))xi
l k
= f(x) - *W(x)
where a^ G ¥[X] i s th e polynomia l
0<j<J
We ca n no w prov e Propositio n 1 1 .1 8 : i f <T^ ) = o fo r al l k < £, Lemm a 1 1 .1 6
shows tha t r ha s a zer o o f orde r ^ £ at al l point s i n S a- Th e syste m o f equation s
{k)
(11.25) a = 0 , fo r al l k < £
is a homogeneous syste m o f linear equations , th e unknown s bein g the coefficient s o f
the polynomial s r*j , Sj, the equation s correspondin g t o th e coefficient s o f th e cr^) .
We observ e tha t
deg(a (/c) ) <c-m + k(m - 1 ) + J ,
so th e numbe r o f equation s doe s no t excee d B = £(c — m - f J ) + \£{£ — l)(r a — 1 )
while, o n th e othe r hand , th e numbe r o f coefficient s o f th e rj an d Sj i s a t leas t
A = 2( c — TO)J.B y choosin g J larg e enough , w e ca n mak e A > B. The n th e
system (1 1 .25 ) ha s a non-trivial solution , an d b y Lemm a 1 1 .1 9 this produce s r ^ O
such tha t r ha s zero s o f orde r > £ at al l point s x G <Sa. Takin g
J = - ( c + 2m(*-l) )
one ca n chec k tha t A > B (recal l tha t 2c — q — 1 and S£ ^ q). Th e degre e o f r i s
bounded b y (1 1 .24) , whic h give s Propositio n 1 1 .1 8 .
We no w prov e Stepanov' s Theore m 1 1 .1 3 . First , le t a b e arbitrary , an d appl y
Proposition 1 1 .1 8 . Sinc e th e auxiliar y polynomia l r i s non-zer o an d vanishe s t o
order ^ £ a t point s i n <S a, w e hav e ^|<S a| ^ deg(r ) ^ c£ + 2m£(£ — 1 ) -f - mq s o
|<Sa| ^ c + 2m( ^ — 1 ) 4- mqt~x. W e choos e £ = 1 + [^/g/2] , whic h give s th e boun d
(11.26) \S a\ < c + 4mVg .
To prov e Theore m 1 1 .1 3 , tak e firs t a = 1 getting
7V0 + A ri = |<S a|< f + 4 m V g
hence th e uppe r boun d
(11.27) N = N 0 + 2/V i < 2(iV 0 + iVi ) < q + S T O ^ .
To ge t a lowe r bound , b y th e factorizatio n X q — X = X(X C — l)(X c + 1 ) w e hav e
f(x)(g(x) - l)( 5 (z) + 1 ) = 0
for al l x € F , henc e N 0 + Ni + N 2 = q wher e iV 2 = |{ x G F | g(x) = - 1 } | .
By (1 1 .26 ) applie d t o <S_i , we hav e
iV0 + iV 2 = |<S_i | < | + 4 m v ^
11. SUM S O V E R F I N I T E F I E L D S 287
hence
Nx = q - N 0 - A/ i > | - 4mv^ ,
and finally ,
(11.28) N = N 0 + 2N 1 ^2N1 >q- Sm^/q.
Clearly (1 1 .27 ) an d (1 1 .28 ) prov e Theore m 1 1 .1 3 .
11.7. Proo f o f Weil' s boun d fo r Kloosterma n sums .
Let F b e a finit e field wit h q elements , o f characteristi c p ^ 2 . Le t ip be an y
fixed non-trivia l additiv e characte r o f F . Fo r an y additiv e characte r <p ther e exist s
a uniqu e a G F suc h tha t (p = ip a, henc e an y Kloosterma n su m S(ip,tp) i s o f th e
form
x£F*
for som e a , b G F. W e conside r a an d b as fixed an d writ e g = aX + bX -1 . W e
will prov e Weil' s boun d (1 1 .1 5 ) b y relatin g th e averag e o f th e Kloosterma n sum s
S(i/ja,^b) over ip t o th e numbe r o f point s o n a n hyperellipti c curve , wher e th e
contribution o f th e trivia l characte r ipo — 1 will b e th e mai n term .
LEMMA 1 1 .21 . For any n ^ 1 and any x G F n ; we have
(11.29) | { * G F n | y * - 2/ = * } | = ] [ > ( ! * ( * ) )
where the sum ranges over all additive characters of¥ and Tr is the trace F n— > F .
PROOF. I f T r (x) = 0 , the n th e equatio n y q — y = x ha s q solutions exactly , a s
recalled i n Sectio n 1 1 .2 , and i n thi s cas e w e have ^(T r (x)) = 1 for al l 0 , henc e th e
right sid e of (1 1 .29 ) i s also equal to q. O n the other hand , i f Tr (x) ^ 0 , the equatio n
yQ — y — x ha s n o solution , an d th e characte r su m i s zer o b y orthogonality . •
Prom thi s lemm a w e deduce tha t
ip ip xGF *
(11.30) =\{(x,y)eF* nx¥n \ y* - y = g(x)}\ = N n, say ,
for n ^ 1 . I f ip = ^o 5 th e trivia l character , w e hav e
For i/; ^ yyo , let a^ , fy b e th e "roots " o f th e Kloosterma n su m S ( ^ a , ^ ), s o b y
Theorem 1 1 . 8 we hav e a^f3^ — q an d
Sn(^a,^)=Q^+/^,
for al l n ^ 1 .
We ca n therefor e writ e
288 11. SUM S OVE R FINIT E FIELD S
The equatio n y q — y = g(x) doe s no t obviousl y describ e a curve , sinc e g i s no t a
polynomial, bu t multiplyin g b y x i t i s equivalent wit h
Ca}b : ax 2 -(y q-y)x + b=0
(note that x = 0 is not possibl e since b ^ 0) . Becaus e p ^ 2, th e number o f solutions
is equa l t o th e numbe r o f solution s o f th e discriminan t equatio n o f thi s quadrati c
equation
Da,b : (y q-y)2-4ab = v 2,
i.e., N n = \D a^{¥n)\. Thi s i s o f th e for m (1 1 .1 7 ) wit h deg(/ ) = 2q, an d becaus e
Aab ^ 0 it satisfie s (1 1 .1 8) . Henc e b y Theore m 1 1 .1 3 we hav e
1 +n/2
\Nn-qn\ 1
< 6g
if n i s larg e enough , s o that q n > 1 6q.
By (1 1 .30 ) w e ge t a shar p estimat e fo r th e root s ct^ , /J^ , o n averag e
n/2
(ii.3i) -\J2 q
K + 05) <16g"
</^V> o
for n larg e enough . Th e followin g simpl e lemm a show s tha t th e individua l root s
must b e o f modulu s ^ y/q:
LEMMA 1 1 .22 . Letuj\, ..., u r be complex numbers, A, B positive real numbers
and assume that
r
|Vw"| < AB n
holds for all integers n large enough. Then \UOJ\ ^ B for all j.
P R O O F . On e ca n d o thi s b y han d (usin g Dirichlet' s bo x principle) , bu t a nic e
trick give s th e resul t immediately : conside r th e comple x powe r serie s
/w-E (!>;)«"-Ei^; -
n^l j 3
The hypothesi s implie s tha t / converge s absolutel y i n th e dis c \z\ < B~ l, henc e /
is analyti c i n thi s region . I n particular , i t ha s n o pole s there , whic h mean s tha t w e
must hav e \WJ\~ 1 ^ B~ x fo r al l j . •
From thi s lemm a applie d wit h A — 16g, B = y/q, w e deduc e th e uppe r bound s
\ai>\ ^ y/Q^ I A/; | ^ \/q fo r al l ip = £ ipQ. Since a^/3^ — g, w e hav e i n fac t \a^\ =
\Pi/>\ — \/# > anc ^ s o Theore m 1 1 .1 1 is proved .
REMARKS. (1 ) We see here twice how crucial the introduction o f the companio n
sums K n is : first becaus e th e curv e D a^ ha s very hig h degree , s o Stepanov's boun d
\N — q\ < Srriy/q is trivial when applie d t o F itself, an d secondl y because onl y by th e
consideration o f all extension fields ca n w e determine th e exac t orde r o f magnitud e
of th e roots , an d obtai n Weil' s boun d S(ip,(p) ^ 2y/q wit h th e shar p constan t 2 .
(2) Th e constan t 2 is optima l i n Weil' s boun d fo r fixed a , b and q. Indee d w e
have
1
• a^z -fyz'
n^l
11. SUM S O V E R F I N I T E F I E L D S 289
This i s a non-zer o rationa l functio n wit h pole s o n th e circl e \z\ = 1 /y/q, henc e thi s
is it s radiu s o f convergence . Therefor e
limsup|Snty0)V6)r1/n = ^ = .
This mean s tha t fo r an y e > 0 there exis t infinitel y man y n suc h tha t
\Sn(il>aM>(2-£)qn/2.
It is conjectured (thi s follows from th e Sato-Tate Conjectur e fo r the angles of Kloost-
erman sum s describe d i n Chapte r 21 ) tha t th e Wei l boun d i s als o optima l whe n
a, b are fixed, n = 1 , an d q = p - » +oo . However , thi s remain s ver y muc h open .
See the remar k a t th e en d o f the introductio n t o Sectio n 1 1 . 8 for th e cas e o f ellipti c
curves.
(3) Usin g th e extensio n o f Stepanov' s metho d t o curve s o f th e typ e y d = f(x)
and a n analysi s o f th e correspondin g zet a function , on e ca n prov e th e followin g
estimate fo r complet e characte r sums :
THEOREM 1 1 .23 . Let ¥ be a finite field with q elements and let \ be a non-
trivial multiplicative character of F* of order d > 1 . Suppose f G ¥[X] has m
distinct roots and f is not a d-th power. Then for n ^ 1 we have
\^2x(N(f(x)))\^(m-l)qn/2.
x£Fn
This i s Theore m 2 C , p . 43 , o f [Sch] . I n particular , w e ge t th e followin g
corollary whic h wil l be use d i n provin g th e Burges s boun d fo r shor t characte r sum s
(Theorem 1 2.6) .
COROLLARY 1 1 .24 . Let x (mo d p) be a non-principal multiplicative character.
If one of the classes b v (mo d p), v = 1 , . . . , 2r is different from the remaining ones
then
| ] Tx ((x + 6i ) • • • (x + b r))x((x + 6 r + 1 ) . . . (x + bar) )^ 2rp 2
x ( m o d p)
PROOF. Observ e tha t
X((x + &!).. . (x + b r))x{(x + 6 r + 1 ) . . . (x + b 2r) = x(f(x))
with
Hx)= n (x + bj) Yl (x + b.y- 2.
Prom th e assumption , on e o f th e bi is a roo t o f / o f orde r eithe r 1 or p — 2 , whic h
is coprim e wit h th e orde r d \ (p — 1 ) o f x, s o w e c a n a PPly Theore m 1 1 .23 . •
290 11. SUM S OVE R FINIT E FIELD S
11.8. Th e Rieman n Hypothesi s fo r ellipti c curve s ove r finite fields.
A particularl y importan t case o f th e Rieman n Hypothesi s i s tha t o f ellipti c
curves. Historicall y thi s wa s first establishe d b y Hass e usin g globa l methods . I n
the notatio n o f Section 1 1 .6 , this mean s that w e consider curve s Cf wit h degf = 3 ,
so the equatio n i s of th e for m
(11.32) C : y 2 = x 3 + a 2x2 + a 4x + a 6
(the numbering reflect s th e traditional notatio n fo r elliptic curves, see Section 1 4.4) .
In contrast wit h that section , w e emphasize that w e are considering the affin e curve ,
without th e poin t a t infinity . Th e cubi c polynomia l f(x) — x3 + CL2X 2 + a 4x + a$
cannot b e a square , s o thi s curv e satisfie s th e assumptio n (1 1 .1 8) . Moreover , w e
assume tha t / doe s not hav e a doubl e root ; thi s mean s tha t th e curv e C i s smoot h
(see Sectio n 1 1 .9) , an d i t i s a necessar y conditio n fo r wha t follows .
In thi s case , Theore m 1 1 .1 3 implie s tha t fo r q > 3 6 th e numbe r N = |C(F) |
satisfies
\N-q\< 2 4 ^ .
In Sectio n 1 1 .1 0 w e wil l prove , a s befor e fo r Kloosterma n sums , th e rationalit y
and th e functiona l equatio n o f the correspondin g zet a function , fro m whic h w e will
deduce:
THEOREM 1 1 .25 . Let C be an elliptic curve overF given by
C : y 2 = x 3 + a<ix 2 + a 4x + a§
with cti E F. Then for all n ^ 1 we have
(11.33) | | C ( F n )|-gn|^2^/2.
REMARKS. Theore m 1 1 .2 5 i s optimal. Indee d lettin g n — > +00 , thi s follow s a s
for Kloosterma n sum s fro m Lemm a 1 1 .22 . However , i t i s also true i n the horizonta l
sense a s th e followin g exampl e shows : le t E/Q b e th e ellipti c curv e wit h equatio n
E : y 2 = x3 - x
which ha s comple x multiplicatio n b y Z[i}. A s befor e w e conside r th e affin e points ,
not th e projectiv e ones . Th e discriminan t o f E i s 6 4 s o E ca n b e reduce d modul o
p to a n ellipti c curv e ove r Z/p Z fo r an y od d prim e p. On e show s (fo r instanc e b y
relating E t o th e curv e y 2 — x4 + 4 b y changin g (x,y ) i- > (yx~ 1 ,2x — y 2x~2) fo r
(x,y) ^ (0,0) , se e e.g . [1 4 ] o r [IR] ) tha t \E(Z/pZ)\ = p i f p = 3 (mo d 4 ) an d
\E(Z/pZ)\ =p-2a pifp = l (mo d 4) , wher e
2 2
n= n + h
with ix = a p-\ribp = 1 (mo d 2( 1 +i)) (thi s congruenc e determine s n u p t o conjuga -
tion). Fo r an y e > 0, Theorem 5.3 6 (generalize d slightl y t o ad d th e congruenc e con -
dition) show s that ther e exist infinitel y man y Gaussia n prime s TT = 1 (mo d 2( 1 + 2))
such tha t |arg7r | < e. Henc e |Im(7r) | ^ £|7r | an d
2
\p-\E{Z/pZ)\\ = <2\a v\>2{l-e )^
for infinitel y man y p.
11. SUM S O V E R F I N I T E F I E L D S 291
Theorem 1 1 .2 5 will b e prove d i n Sectio n 1 1 .1 0 afte r som e geometri c an d alge -
braic preliminaries . Thi s goe s a bit furthe r awa y from th e hear t o f analytic numbe r
theory, ye t w e includ e ful l detail s becaus e th e Hass e boun d i s als o importan t a s
being the simples t cas e of the very important Delign e bound fo r Fourie r coefficient s
of modular forms . Th e reader will also certainly appreciat e th e elegance and beaut y
of th e geometr y involved .
11.9. Geometr y o f ellipti c curves .
In explainin g th e specia l geometri c feature s o f ellipti c curves , w e ma y a s wel l
consider a mor e genera l case . S o let k b e a n arbitrar y field , fc an algebrai c closure ,
and le t C b e th e curv e give n b y th e equatio n
C : y 2 = f{x), wit h / = X 3 + a 2X2 + a 4X + a 6 e k[X],
identified wit h th e se t o f solution s (x,y) G fc 2. W e assum e a s befor e tha t / doe s
not hav e a double root . I f k' jk i s any extension , w e let C(k f) b e the se t o f solution s
in (A/) 2.
The geometr y o f th e ellipti c curv e become s muc h cleare r i f w e wor k wit h th e
projective versio n o f the curv e C, namel y th e curv e E i n the projective plan e given ,
in homogeneou s coordinate s (x : y : z), b y th e equatio n
E : y z — x + a 2x z + a^xz + a§z .
Putting z — 1 gives bac k C ; o n th e othe r hand , "a t infinity" , w e ar e onl y addin g
one point : takin g z — 0 yield s x — 0, an d al l element s ( 0 : y : 0 ) (wit h y ^ 0 )
correspond t o a singl e poin t o o = ( 0 : 1 : 0) i n th e projectiv e plane . Notic e tha t
this point o o is rational ove r the bas e field fc, so that fo r al l extensions fc'jkw e have
£(fc') = C(fc')U{oo} .
The mai n propert y o f th e curv e E tha t w e will us e i s th e beautifu l fac t tha t it s
points for m a n abelia n group , wit h identit y elemen t oo. Throughout , p denote s
points o n E, no t th e characteristi c o f the field k. Th e grou p la w (denote d b y + ) i s
described b y th e geometri c conditio n tha t fo r an y thre e (distinct ) point s p\, p 2 an d
p 3 i n E, w e hav e p\ -j - p2 + Pz = 0 i f an d onl y i f th e thre e point s ar e collinea r (i n
the projectiv e plane) , an d th e opposit e o f a point (x : y : z) i s the poin t (x : — y : z)
(symmetry wit h respec t t o th e x-axis) . Thi s wa y on e ca n construc t th e su m o f an y
two distinct points , b y computing th e equatio n o f the lin e joining them , an d takin g
the opposit e (i n th e sens e above ) o f th e thir d intersectio n poin t wit h th e curve .
That ther e ar e exactl y thre e intersectio n point s follow s immediatel y fro m th e fac t
that th e polynomia l / i s o f degre e 3 . I n addition , t o comput e th e doubl e p + p o f
a poin t p , th e sam e constructio n i s don e wit h th e tangen t lin e a t p ; th e conditio n
that / ha s n o doubl e roo t ensure s tha t thi s tangen t lin e alway s exists .
Also, becaus e / G k[X], i t follow s easil y tha t th e ^'-rationa l point s E(k f), fo r
any extensio n k' /fc, for m a subgrou p o f E(k).
We d o no t prov e thos e fact s here ; completel y elementar y proofs , b y computin g
explicitly th e coordinate s o f th e su m p\ + P 2 o f tw o point s accordin g t o th e recip e
above an d checkin g th e abelia n grou p axiom s (associativit y i s the onl y difficulty) ,
are fairl y straightforwar d (se e fo r instanc e [IR] , ch. 1 8 , 1 9) .
292 11. SUM S OVE R FINIT E FIELD S
We no w introduc e som e furthe r geometri c object s relate d t o E or , mor e gen -
erally, t o an y smooth , projective , algebrai c curve 1 . Thu s conside r agai n a mor e
general case : le t k b e a n algebraicall y close d field , an d le t E b e a plan e algebrai c
curve ove r A; , i.e. give n b y a n equatio n
f(x,y,z) = 0
for som e homogeneou s / G k[X, Y , Z\. W e identif y E wit h th e se t o f point s i n th e
projective plane . W e assume that E i s smooth, whic h means here that fo r an y poin t
p = (x : y : z) o f E, no t al l partia l derivative s df/dX(p), df/dY(p), df/dZ(p),
are zero . I n thi s cas e th e lin e wit h equatio n
?L(p)(X-x) + ZL(p)(Y-y) + ^{p)(Z-z)=0
is well-defined an d i s the tangen t lin e t o E a t p. Fo r ellipti c curve s y 2 = / ( # ) , thi s
smoothness conditio n i s equivalent t o th e fac t tha t th e polynomia l / ha s n o doubl e
roots, b y a simpl e calculation .
Let C b e th e affin e curv e correspondin g t o E give n b y
C : f(x,y,l) = 0
in k\ Le t g(X, Y) = f(X, Y, 1 ) G k[X, Y]. W e define k[C] = k[X, Y]/(g). Element s
of k[C] can b e interprete d a s function s o n C. W e assum e tha t (g) i s a prim e idea l
(this i s easil y checke d i n th e cas e o f ellipti c curves ) s o tha t k[C] i s a n integra l
domain, an d w e le t k(C) o r k(E) b e it s quotien t field , calle d th e functio n field o f
C o r o f E. I t i s a finit e extensio n o f th e fiel d k(X) o f rationa l function s ove r k
(for ellipti c curve s y 2 = / ( # ) , i t i s a quadrati c extensio n k(X)(y/J)). W e interpre t
elements o f th e functio n fiel d a s rationa l function s o n E 1 , s o give n a poin t p G E
and a n elemen t p> G k(E), eithe r ip has a pol e a t p o r (p(p) G k i s defined .
Now th e importan t poin t i s tha t becaus e E i s smoot h i t i s possibl e t o defin e
the orde r o f ip at p for ever y p in E an d ever y non-zero rationa l functio n p G k(E)*.
As expected , thi s orde r behave s lik e it s analogu e fo r holomorphi c function s theor y
or rationa l functions . Precisely , fo r ever y p G R, ther e i s a discret e valuatio n
ord p : k(E) x - > Z ,
which give s th e orde r o f th e zer o (i f ^ 0 ) o r pol e (i f < 0 ) o f a rationa l functio n a t
p. A s a discret e valuation , i t satisfie s
ordp(c) = 0 , fo r c £k*,
ordp(<pV0 = ordp M + ord p(ip),
ordp((p + *p) ^ min(ord p ((/?),ordp(^)).
We sketc h a proo f (se e als o [Sil] , Prop . II . 1.1): conside r th e rin g O p = {p G
k(E) \ p i s define d a t p}. Thi s i s a noetheria n loca l domai n wit h maxima l idea l
vtip = {p E O p | p(p) = 0 } an d residu e fiel d O p/mp ~ k (b y evaluatio n a t p).
Because E i s smoot h a t p (ther e i s a tangen t line) , th e /c-vecto r spac e trip/m ^ i s of
dimension 1 (because th e curv e i s in th e plane , i t i s of dimension ^ 2 ; the equatio n
of th e tangen t lin e give s a relation , an d i t i s eas y t o se e tha t trip/m ^ ^ 0) . B y
Nakayama's Lemm a (se e for instanc e [AM] , p. 21 ) , it follow s tha t m p i s a principa l
x
The reade r ca n withou t damag e assum e tha t w e ar e jus t dealin g wit h th e ellipti c curve s
described before , wit h k = ¥. I n thi s cas e ever y incomplet e assertio n ca n b e checke d b y hand .
11. SUM S O V E R F I N I T E F I E L D S 29 3
ideal. Le t IT be a generator ; the n m ^ is generated b y 7rd for any d ^ 1 . Th e order
ordp ca n be defined fo r p i n Op, ip ^ 0 , b y
ordp((p) = max{d ^ 0 | (p G m^ }
and extende d t o a homomorphism k(E)* — > Z. Th e propertie s abov e ar e then quit e
easy t o check.
For ellipti c curve s y 2 = /(#) , on e can easily se e that i f p =fi oc , and p = (x,y)
with y 7^ 0, it is possible t o take n = X — x. Fo r p = oo , one can take 7 r = X/Y,
and on e finds tha t ordoo(:r ) = —2 , ordoo(y) = —3 .
Every non-zer o elemen t p G k(E) ha s finitely man y zero s an d poles, an d to
package the m convenientl y w e define a diviso r o n E t o b e a forma l finit e linea r
combination wit h coefficient s i n Z of symbols [p] , one for each poin t p G E. Divi -
sors for m a fre e abelia n grou p Div(E) . Tw o homomorphism s ar e important. On e
associates t o a non-zer o rationa l functio n p> th e divisor (denote d (cp)) of its zeros
and poles :
( k(E) x ->Div(E )
] ¥>•->(¥> ) = ] T o r d p ( y > )
I p£E
and th e second give s the degree o f a divisor :
,/ Div(£ ) - + Z,
I \p] »- > 1 .
As suggeste d b y the notation, divisor s o f the typ e (ip) are called principa l divisors .
Ordinary rationa l function s hav e a s man y zero s a s poles , wit h multiplicity ,
and th e same hold s fo r (p G k(E)*: thi s mean s tha t fo r all p> G k(E)x, w e have
deg((y>)) = 0 (se e for instanc e [Sil] , II-3) . Fo r elliptic curve s y 2 = / ( # ) , an eas y
proof ca n b e derived b y observing tha t k(E) i s a quadratic extensio n o f k(X). Th e
non-trivial elemen t i n the Galois grou p i s p i- > (p defined b y (p(p) = <p(—p). I t is
clear tha t ord p((p) = ord_ p ((^), henc e deg(cp) = deg(</?) . No w cptp i s in k(X). On e
can chec k th e following compatibility : i f ip G k(X), wit h diviso r (ip)i = Yl nixi
(as a n ordinary rationa l function) , the n it s divisor a s an element o f k(E) i s (V>) =
^2^i(\pi] + [ _Pi])> where pf is any point o f E wit h ^-coordinat e Xi. In particular ,
0 = deg(('0)i ) = 2deg((^)) . Applyin g thi s t o p(p gives
0 = deg((p<p) = deg(<p ) + deg((p) = 2deg(p).
D E F I N I T I O N . 1 . Tw o divisor s D i an d D2 suc h tha t D\ — D2 = (p) fo r som e
(/? G k(E)x ar e called linearl y equivalent . Thi s i s denote d D\ ~ D 2l an d is an
equivalence relatio n o n Div(f7).
2. Th e group Div(Z? ) carrie s a partia l ordering , compatibl e wit h th e grou p
structure, define d b y D ^ 0 if and only i f al l the coefficients i n the formal sum
giving D ar e > 0. Suc h divisor s ar e called effectiv e divisors .
3. Fo r a divisor D G Div(£7), let
L(D) = {0 } U {p G k(E) | foO+ D ^ O } ;
this i s a ^-vecto r space . Le t ^(-D) = dimL(D) , a n integer o r +oo .
If D = ni\pi} + ...+nk[pk}-mi[qi]-...-mj[qj] withn^ , mi ^ 0 , then p G L(D),
</? 7^ 0, means simpl y tha t p has
29 4 11. SUM S OVE R FINIT E FIELD S
(1) Pole s of order a t most n ^ at pi, 1 ^ i ^ k;
(2) Zero s of order a t least rrti at g^, 1 ^ i ^ j.
It follow s immediately tha t i f D2 ^ £>i , then L(D 2 ) C L{D\). Also , if Di ~ D2 ,
then writin g D\ = D2 + (VO? the map<£> H > ^><£ induces a n isomorphism L{D\) — >
L(£) 2 ), i n particular, £(Di ) = ^(^2 ) only depend s o n the linear equivalenc e clas s
of th e divisor .
The followin g interpretatio n i s also clear : ther e i s a bijectio n
( P(L(D)) —> {Effectiv e divisor s linearl y equivalen t t o D},
between th e projective spac e P(L(D)) o f L(D) an d the effective divisor s linearl y
equivalent t o D (b y definition o f L(D), th e map has image i n the set of effectiv e
divisors). Thi s als o require s th e important fac t tha t
(11.35) L(0 ) - k, £(0) = 1.
In othe r words , a n everywhere define d rationa l functio n o n E i s constant: thi s is
obvious for elliptic curves , sinc e regularity o n C forces suc h a <p to be a polynomia l
g{X) + Yh(X), an d regularity a t 00 then force s h = 0 , g G k.
We first notic e th e following simpl e lemma :
LEMMA 1 1 .26 . Let D be a divisor on E such that £{D) > 0 . Then either
deg(D) > 0 or D ~ 0.
PROOF. lt£{D) > 0, there is a non-zero element tp G L(D), so that (ip)+D ^ 0.
Taking th e degree w e find tha t deg(D) ^ 0 . S o we need onl y sho w no w that i f
deg(D) = 0 , the n D ~ 0 . Bu t ((p) + D i s then a n effectiv e diviso r o f degre e 0;
clearly i t must b e = 0 , so D = — (p) = {p~ x) ~ 0 . D
To prov e th e rationality o f the loca l zet a functio n o f ellipti c curves , w e will
need t o kno w th e valu e o f £{D) fo r effectiv e divisors . Thi s i s compute d b y the
Riemann-Roch Theorem .
T H E O R E M 1 1 .27 . Let E be an elliptic curve over an algebraically closed field
k. For any divisor D on E, £{D) is finite and we have the formula
(11.36) £(D)-£(-D)=degD.
Equivalently, by Lemma 1 1 .26, we can compute £(D) for any D by:
1. Ifdeg(D) ^ 0 , and D / 0, then £(D) = deg(D) .
2. IfD~0, then£(D) = 1.
3. Ifdeg{D) < 0, then £(D) = 0.
This i s the Riemann-Roch theore m specialize d fo r elliptic curves . Se e the re -
mark below fo r the general case .
The simpl e proo f fo r th e Riemann-Roc h theore m hinge s o n th e remarkabl e
interaction o f the group structur e o n E wit h th e divisor group . Indeed , conside r
the map
a : Div(E ) - • E
defined b y <j(ni[pi ] + .. . + rik[pk}) — n\Pi + •• • + ^kPk, the + o n the right sid e
corresponding t o the group la w on E.
11. SUM S O V E R F I N I T E F I E L D S 295
PROPOSITION 1 1 .28 . Let D be a divisor on E. Then we have
D~[a(D)] + (te%{D)-\)[oo].
P R O O F . I n essenc e thi s "is " th e grou p la w itself : b y induction , w e nee d onl y
consider D = [p] + [q] an d D — [p] — [q] fo r som e point s p an d q. I f p o r q i s th e
origin oo , the resul t i s obvious .
So conside r first th e cas e o f D = [p] + [q] wit h p , q ^ o o an d p ^ q. The n
the equatio n a X + 6 Y + c = 0 o f th e lin e joinin g p an d q define s a n elemen t
ip = aX + bY + c G &(£), whic h b y definitio n o f th e grou p la w satisfie s
(y>) - [p ] + [<? ] + [r ] - 3[oo ]
where —r — p + q = a(D). Sinc e (y? ) ~ 0 we ge t
D = [p ] + fo] ~ (y, ) - [ a(D)] + 3[oc ] - -[(7(D) ] + 3[oo] .
But similarl y fo r an y poin t p i n E, th e equatio n o f th e lin e joining p an d — p give s
a functio n wit h diviso r [p] + [—p ] — 2[oc] s o tha t fo r an y p
(11.37) -\p] + [oo ] - [-p ] - [oo ]
and henc e D ~ [c(^) ] + [oo] , as desired .
If p = g , th e equatio n o f th e tangen t lin e give s 2[p] - f [2p ] — 2[oo ] ~ 0 , an d th e
result agai n follows . Finall y i f D = [p ] — [g] , use (1 1 .37 ) t o reduc e t o th e previou s
case:
\P] ~ [q] = [P\ + {-[q] + [oo] ) - [oo ] - [p ] + [-q] - 2[oo ] - [ p + q] - [oo] .
D
P R O O F O F TH E RIEMANN-ROC H THEOREM . Notic e tha t (1 1 .36 ) fo r D o r -D
are equivalent . B y Propositio n 1 1 .28 , we hav e
e(D)=e{[v(D)] + (degD-l)[oo]),
£(-D) = £([-a(D)} + ( 1 - degZ?)[oo]) .
One of the two divisors on the right i s effective, s o we can assum e tha t D i s effectiv e
and o f th e for m D = [p ] -f n[oo ] wit h P G E an d n > 0 .
If p = oo , then D = ra[oo] wit h m > 1 . W e must prov e £(D) = m . An y elemen t
p of L(m[oo]) ha s n o poles on C , henc e i s a polynomial (p G fc[X,Y"], which satisfie s
3rdoo((^) ^ —m . Sinc e ordoo(X ) = — 2 and ordoo(F ) = —3 , w e hav e
ordoo(^) = max(-2deg(#) , - 3 - 2deg(ft) ) fo r ^ = g(X) + Yh(X).
Let V = {2deg(#), 3 + 2deg(h)} wher e g an d h ar e polynomial s i n X. On e
sees immediatel y tha t V i s th e se t o f al l positiv e integers , excep t 1 . No w usin g
monomials X a o r YX b a s basi s elements , w e chec k tha t
£(m[oo]) = \{n G V \ n ^ m} | = m .
Now we consider D = [p ] +n[oo] with n ^ 0 and p ^ oo . I f n = 0 , we can use the
automorphism g i-> g—p which sends p to o o to get a n isomorphism L([p\) ~ L([oo] )
which implie s £{[p}) — 1 .
29 6 11. SUM S OVE R FINIT E FIELD S
If n ^ 1 , we have D ^ n[oo ] hence L(n[oo]) C L(D). Thu s n < ^(£>) . Moreover ,
because onl y a simple pol e i s allowed a t p , we have £{D) ^ n + 1: i f 7r p is a functio n
with a simpl e zer o a t p , w e have a fc-linear ma p
L(D)/L(n[oo}) - > fc
ip*-± (7T p(f)(p)
which i s tautologicall y injective . Thu s w e mus t sho w tha t ther e i s i n L{D) on e
more fc-linear ly independen t elemen t no t i n L(n[oo]) .
Write p = (x,y ) i n affin e coordinates . W e hav e th e followin g divisors :
( X - * ) = [p ] + [-p]-2[oc] ,
(Y + y) = [-p] + \p'] + \p"]-3[oo]
for some p' and p". Le t< p = ( r + y ) / ( X - x ) , the n ((£ ) - -[p ] + [p'] + [p"]-[°°] ^ - # •
We claim tha t p ^ p' , p" . Indee d p ' an d p" , b y definition , ar e o f the for m (#i , — y),
(^2, —y). Thi s can be equal to p only if y — 0 , but fo r y = 0 by assumption ther e ar e
three distinc t root s o f f(x) = 0 . Henc e <p ha s a simpl e pol e a t p , s o (p $• L(n[oo]) ,
and w e obtain th e require d formul a £{D) — n + 1 = deg(D) . •
We mus t no w conside r som e rationalit y questions . W e assum e tha t w e hav e
an ellipti c curv e E give n b y y 2 — f(x) wit h / G k[X, Y]. Th e precedin g analysi s
applies t o a n algebrai c closur e fc of fc. There i s a natural actio n o f the Galoi s grou p
Gk o f k o n E (o n th e coordinates) , an d o n th e divisors . A poin t o r a diviso r i s
called fc-rational i f it i s G^-fixed. Notic e that fo r a divisor D — n\p\ +.. . -\-rtjPj thi s
does no t mea n tha t th e pi ar e i n E(k). Als o Gk act s o n k(E) an d th e fiel d fixe d b y
Gk i s k(E), th e fractio n fiel d o f k[X, Y]/(f). Th e diviso r o f a functio n (p G k(E) i s
obviously fc-rational.
If D i s define d ove r fc, we defin e
Lk(D) = {0 } U {ip G fc(£) | (y> ) + £ > > 0 }
and w e le t £ k{D) = dim fc L fe (D). I t i s clea r tha t £ k(D) ^ £(D) .
THEOREM 1 1 .29 . For any k-rational divisor D, we have
ek(D) = e(D).
In particular the Riemann-Roch formula holds with £k(D) instead of £(D).
P R O O F . Thi s i s a specia l cas e o f th e followin g theorem , whic h i s a formulatio n
of Hilbert' s Theore m 9 0 fo r GL(n): le t fc be a field , fc an algebrai c closure , V a fc-
vector space with an action of Gk • Then there is a basis of V mad e of elements which
are G^-fixe d (equivalently , le t Vk — VGk, the n V = Vk <S> fc,or dim/ - Vk = dim ^ V).
For a proof , se e fo r instanc e [Sil] , Lemm a II.5.8.1 . •
REMARK. Thi s theor y adapt s i n th e followin g wa y t o mor e genera l algebrai c
curves (se e fo r instanc e [Ha] , IV) : i f E i s smoot h an d projectiv e ove r fc, on e ca n
define a certai n diviso r clas s K (calle d th e canonica l class , an d relate d t o differen -
tials o n E). I t ha s degre e deg(K) = 2g — 2 for som e intege r g ^ 0 , calle d th e genu s
of E, an d th e Riemann-Roc h Theore m take s th e for m
(11.38) £(D) - £{K -D)= deg(D) + 1 - g.
11. SUM S OVE R FINIT E FIELD S 297
Elliptic curve s correspon d t o g = 1; i n thi s cas e th e canonica l clas s i s trivial , an d
this reduce s t o Theore m 1 1 .27 . Th e cas e g — 0 corresponds t o th e projectiv e line ,
and i s als o ver y easy . Th e proo f o f (1 1 .38 ) i s muc h mor e involve d tha n th e on e fo r
elliptic curves , sinc e ther e i s n o grou p la w o n th e curv e whic h woul d help .
11.10. Th e loca l zet a functio n o f ellipti c curves .
Let C b e a n ellipti c curv e ove r F give n b y (1 1 .32) . I t i s more convenien t t o us e
here th e correspondin g projectiv e curv e E, a s describe d i n Sectio n 1 1 .9 . Th e zet a
function o f E i s define d a s th e forma l powe r serie s
(11.39) Z (E)=exp(^2l-^^Tn).
n^l
We firs t relat e Z(E) t o point s o n th e curve , b y givin g it s Eule r produc t (com -
pare Lemm a 1 1 .7) . T o d o thi s w e introduc e som e terminology , whic h come s fro m
the languag e o f schemes .
DEFINITION. Le t E be a n ellipti c curv e ove r a finite field F wit h q elements .
A close d poin t o f E i s the Galoi s orbi t o f a poin t x$ G E(¥). Th e degre e deg(x ) o f
a close d poin t x is th e cardinalit y (necessaril y finite) o f th e orbi t an d it s nor m i s
Nx = q de^x\ Th e se t o f close d point s o f E i s denote d \E\.
This notio n i s analogu e t o tha t o f a n irreducibl e polynomia l i n ¥[X] use d fo r
the zet a function s o f Gaus s sum s an d Kloosterma n sums . T o ever y close d poin t
x G \E\ i s associated a n F-rationa l diviso r whic h i s simply th e forma l su m o f all th e
elements i n th e orbit . Th e degre e o f thi s diviso r i s the degre e o f x. Moreover , i t i s
easy t o se e that th e grou p o f F-rational divisor s i s the fre e abelia n grou p generate d
by th e divisor s associate d t o close d points .
LEMMA 1 1 .30 . We have the Euler product expansion
d e g
(11.40) Z(E)= Yl ( l - T ^)-\
xe\E\
where the product is over all closed points of E.
P R O O F . Thi s i s very clos e to Lemma s 1 1 . 7 and 1 1 .9 . Firs t b y decomposin g th e
points i n E(¥ n) i n Galoi s orbit s w e obtai n
\E(¥n)\ = J2d E !'
d\n xe\E\
deg(cc)=d
which i s the analogu e o f Lemm a 1 1 .1 . The n w e hav e
^' n^ l
and, o n th e othe r hand , thi s operato r applie d t o th e righ t sid e o f (1 1 .40 ) yield s
^ d e g ^ T - ^ ^ r ^ d£ l )
x£\E\ n ^ l n^ l d\n x€\E\
deg{x)=d
hence th e result . •
298 11. SUM S OVE R FINIT E FIELD S
Using the Riemann-Roch Theorem , w e now prove the rationality an d functiona l
equation o f th e zet a function .
T H E O R E M 1 1 .31 . The zeta function Z(E) of an elliptic curve is a rational
function. More precisely, it is of the form
where a G Z is defined by the relations \E(¥)\ = q + l — a or, in terms of C, |C(F) | =
q — a. The zeta function satisfies the functional equation Z{E, (qT)~ 1 ) = Z(E, T).
LEMMA 1 1 .32 . Let d ^ 0 and let hd(C) be the set of linear equivalence classes
ofF-rational divisors of degree d. Then hd{C) is finite and \hd(C)\ = \ho(C)\ ^
\E(¥)\.
PROOF. Fo r an y rationa l diviso r D , w e hav e b y Propositio n 1 1 .2 8 th e equiv -
alence D ~ [a(D)\ + (degD — l)[oc] , thu s th e linea r equivalenc e clas s o f D only
depends o n o~(D). I f D is F-rational , i t follows tha t cr(D) G E(W), an d therefor e
the inequalit y \h d(C)\ < \E(¥)\ holds .
Moreover, i t is clear tha t th e ma p D i->> D-f - d[oo] wit h invers e D—
i > > D — d[oo]
induces a bijection betwee n hd(C) an d ho{C). D
P R O O F O F THEORE M 1 1 .31 . Sinc e F-rationa l divisor s o n E are simpl y com -
binations wit h intege r coefficient s o f divisor s associate d t o close d points , th e Eule r
product (1 1 .40 ) give s th e forma l powe r serie s expressio n
Z(E) = Y^ Tdeg(D)
where th e su m i s over al l effectiv e F-rationa l divisor s o n E.
Split th e su m accordin g t o the degre e d of D; fo r d = 0, the onl y effectiv e
divisor i s D = 0 so
Z(E) = l + J2Td £ 1 .
deg(D)=d
For eac h d, spli t furthe r th e su m ove r divisor s o f degree d in linear equivalenc e
classes. B y Lemm a 1 1 .32 , ther e ar e ho(C) equivalenc e classe s fo r eac h d. Fo r a
given clas s (tha t o f D say) , th e contributio n i s the numbe r o f effective (F-rational )
divisors linearl y equivalen t t o D. B y (1 1 .34 ) an d Theore m 1 1 .29 , thi s i s equa l t o
Since d > 1 , the Riemann-Roc h theore m implie s £(D) — deg(D ) = d , so the
computation o f Z(E) i s now straightforwar d
z{E) = \ + Y,Td E i =i + ^ ^ x > d - i ) r d
deg(D)=d
= h 0(C)/ T dT N = h 0(C)T
q-l\l-qT l-TJ (l-T)(l-qT)
l b T
(11 42) - - + «T2
1}
(l-T)(l-qT)
11. SUM S OVE R FINIT E FIELD S 299
where b is define d b y ho(C) = q -f 1 — b.
This prove s th e rationality , an d give s th e precis e form , excep t tha t w e need t o
prove that a — 6 , where |i£(F) | = q + 1 — a, o r equivalently ho(C) = |£(F) | (actuall y
in Lemma 1 1 .32 , we have already shown \ho(C)\ ^ |-£(F)| , but w e do not nee d it an y
more). T o obtai n thi s equality , star t fro m th e origina l definitio n (1 1 .39 ) o f Z(E),
and compare with (1 1 .42) : th e latter is seen to imply that |i£(F) | = q+1 —b = HQ{C).
Finally th e functiona l equatio n o f Z(E) i s a forma l consequenc e o f (1 1 .42) . •
It i s worth recordin g separatel y on e o f th e las t step s o f th e proof .
PROPOSITION 1 1 .33 . Let E be an elliptic curve over a finite field F, let D be
a divisor on E. Then D is principal if and only if deg(D) = 0 and cr(D) = 0 G E.
More precisely, the map j : D i- > cr(D) is an isomorphism between the group of
divisor classes of degree 0 and E(¥).
PROOF. A diviso r D i s F n -rational fo r som e n ^ 1 ; lookin g a t E ove r F n , i t
suffices t o prov e th e isomorphis m betwee n classe s o f F-rational divisor s o f degre e 0
and F-rationa l points . Bu t j i s a surjectiv e (j([x\ — [oo] ) = x) ma p betwee n finit e
sets wit h th e sam e cardinalit y (\ho(C)\ = \E(¥)\). •
This i s th e specia l cas e o f th e so-calle d Abel-Jacob i Theorem , fo r a n ellipti c
curve ove r a finite field. I t actuall y hold s ove r an y field, an d a generalizatio n t o
all (smoot h projective ) curve s i s th e conten t o f th e theor y o f jacobia n varietie s
associated t o curves .
To conclud e th e proo f o f Theore m 1 1 .25 , we procee d a s i n th e cas e o f Kloost -
erman sums : fro m (1 1 .41 ) , w e deriv e
| E ( F n ) | - ( g n + l ) = a n + /3n
where 1 - aT + qT 2 = ( 1 - a T ) ( l - f3T). The n Lemm a 1 1 .22 , applie d wit h th e
input fro m Stepanov' s Theore m 1 1 .1 3 , show s tha t \a\ ^ y/q, |/? | ^ y /q1 an d sinc e
a/3 — q, this conclude s th e proof .
EXERCISE 2 . Assumin g th e genera l Riemann-Roc h formul a (1 1 .38) , prove tha t
for a smoot h projectiv e algebrai c curv e E o f genu s g ove r a finite field F wit h q
elements, th e zet a functio n
n^l
is a rationa l functio n o f th e for m
Z[E)
- (l-D(l-tfT )
for som e polynomia l P wit h integra l coefficient s an d degre e 2g.
[Hint: Th e questio n wil l arise whethe r ther e exis t F-rationa l diviso r classe s o n
E o f degre e 1 (which i s obvious fo r ellipti c curve s sinc e th e poin t o o i s F-rational) .
The imag e of the degre e map i s 5Z for som e 6 \ (2g — 2) (th e degre e o f the canonica l
class). Usin g thi s fact , find a preliminary for m o f the zet a functio n an d analyz e th e
poles t o sho w tha t actuall y 5 = 1 (see [Mor2] , 3.3). ]
30 0 11. SUM S OVE R FINIT E FIELD S
11.11. Surve y o f furthe r results : a cohomologica l primer .
The method s o f Stepano v ar e ver y usefu l and , i n certai n circumstance s the y
provide th e bes t tool s availabl e today , especiall y whe n th e genu s o f th e curv e i s
large compare d t o th e cardinalit y o f th e finite field (se e fo r instanc e th e proo f b y
Heath-Brown o f non-trivia l estimate s fo r Heilbron n sum s [HB2]) .
However, th e deepes t understandin g o f exponentia l sum s ove r finite fields an d
the greates t impac t o n classical problems o f analytic numbe r theor y come s from th e
sophisticated concept s o f algebrai c numbe r theory , especiall y th e £-adi c cohomol -
ogy theor y a s develope d b y Grothendiec k an d hi s collaborators , whic h giv e a ver y
powerful an d flexible framewor k fo r workin g wit h ver y genera l exponentia l sums .
The proo f o f the Rieman n Hypothesi s fo r varietie s b y Delign e [Del] , an d eve n
more hi s far-reachin g generalizatio n [De2] , ar e th e basi s fo r th e extensiv e wor k o f
Katz, Laumo n an d others . I t i s beyond th e scop e of this book to discuss this theor y
in great detail . Le t u s direct th e interested reade r t o the survey articles [Lau] , [K2].
Study o f th e foundationa l basi s o f th e £-adi c theor y ca n b e starte d i n [De3 ] an d
continued togethe r wit h application s i n the book s o f Katz, fo r instanc e [K3] , [K4].
We will limit thi s section to a short introductio n o f the basic vocabulary an d w e
will stat e a fe w o f th e mos t fundamenta l result s i n thi s language . W e the n includ e
examples t o sho w tha t suc h knowledg e ca n alread y b e ver y usefu l eve n whe n on e i s
not familia r wit h th e detail s an d backgroun d o f algebrai c geometry .
In Section s 1 1 . 4 an d 1 1 .5 , w e hav e show n tha t Gaus s sum s an d Kloosterma n
sums ca n b e relate d t o analogue s o f Dirichle t character s ove r finite fields. Th e
£-adic cohomologica l formalis m whic h w e no w discus s ca n b e though t a s relatin g
exponential sums , dually , t o object s whic h ar e Galois-theoreti c i n nature .
The exponentia l sum s S n define d b y (1 1 .8 ) ca n b e interpreted a s sums ove r th e
algebraic curv e Uf }9 consistin g o f ¥ p minu s the pole s of the rational function s / an d
g. Mor e generally , on e wishe s t o conside r exponentia l sum s no t onl y ove r curve s
but ove r mor e genera l varieties . W e wil l us e som e basi c vocabular y o f algebrai c
geometry t o describ e suc h situations , bu t wil l illustrate the m i n th e simple r cas e of
curves. Alread y th e cas e o f (1 1 .8 ) an d Uf, g ar e quit e interesting .
Let F b e a finite field an d U/¥ b e a smoot h algebrai c variet y o f dimensio n
d ^ 0 (technically , w e assum e a s par t o f th e smoothnes s assumptio n tha t U i s
geometrically connected , an d a s part o f being a variet y tha t U i s quasi-projective) .
The simples t example s i n dimensio n 1 ar e £//,$ , o r smoot h projectiv e curves . I n
dimension d > 1 , th e mos t importan t example s ar e th e affin e d-spac e A d , wit h se t
of point s A d(¥) = F d , an d th e projectiv e d-space . Th e exponentia l sum s ove r U
will b e o f th e typ e
(11.43) S n= J2 x(N{f(x)))4,(Tr(g(x)))
xeu(¥n)
where / an d g ar e F-rationa l function s define d o n U.
To U/¥ i s associate d th e so-calle d arithmeti c etal e fundamenta l grou p 7Ti(C7 )
which "classifies " etal e covering s V — > U o f U, an d i s th e analogu e bot h o f th e
Galois grou p o f a field, o r o f th e "ordinary " topologica l fundamenta l group . A
morphism o f algebrai c varietie s i s etal e i f i t i s flat an d unramified ; i f U i s a curve ,
this means V i s a curve, / i s non-constant an d unramified . Fo r the simpler purpose s
of exponential sums , the fundamenta l grou p ca n b e considered somewha t a s a black
11. SUM S O V E R F I N I T E F I E L D S 301
box i n wha t follows , bu t on e shoul d kee p i n min d tha t th e element s o f V i n K\{U)
act a s automorphism s o f an y etal e coverin g TT : V —> U (i.e . TT^X) = n(x) fo r
any 7 G TTI(U) an d x G V), an d tha t i t i s a functor : an y ma p E /— » V betwee n
varieties induce s a continuou s grou p homomorphis m IT\(U) —> TTI(V). (On e shoul d
fix a base-poin t i n definin g 7Ti(£/) , bu t a mor e o r les s canonica l choic e exists , th e
so-called "generi c point " o f the schem e U.)
EXAMPLES. (1 ) Le t U be a singl e point {x} define d ove r F . The n 7ri(U) i s th e
Galois grou p Gal(F/F) .
(2) Le t U/¥ b e a smoot h curve , no t necessaril y projective . Ther e i s a a n
associated smoot h projectiv e curv e C/¥ suc h tha t U C C wit h complemen t a finite
set T o f points . I f U — F fo r instance , the n C = F 1 i s th e projectiv e line , an d
T = {0,oo} .
The fundamental grou p can be described concretel y a s follows: le t K = ¥(U) —
¥{C) b e th e functio n field o f £/ , i.e . th e field o f rationa l function s o n U o r C (i f
C = P 1 , the n K = ¥{t) i s th e usua l field o f rationa l fractions) . W e hav e th e
Galois grou p GK — G&\(K / K) o f K. Fo r ever y close d poin t x o f C , ther e i s
the correspondin g discret e valuatio n ord x o f K. Thi s extend s t o th e separabl e
closure K o f K, an d give s ris e t o a decompositio n grou p D x < GK an d a n inerti a
group I x < D x a s i n classica l algebrai c numbe r theory , wit h th e propert y tha t
Dx/Ix ~ Gal(F g /F 9 ), wher e ¥ q i s th e residu e field o f x , a finite field wit h q = Nx
elements. The n TTI(U) "is " th e quotien t o f GK b y th e smalles t close d norma l
containing al l inerti a group s I x fo r x a close d poin t o f U.
Fix a prim e numbe r £ ^ p. Th e object s use d t o interpre t exponentia l sum s
over U ar e th e so-calle d £-adi c sheave s o n [/ . I n th e simple r cases , thos e wil l b e
"lisse", i n which cas e there i s a simpler alternat e Galois-theoreti c descriptio n whic h
we take a s definition .
DEFINITION. Le t U/¥ b e a smoot h variet y ove r a finite field. A liss e £-adi c
sheaf o n U is a continuou s representatio n p : TT\(U) — > GL(V) wher e V i s a finite
dimensional Q^-vecto r space . Continuit y refer s t o th e profinit e topolog y o n TTI(U)
and th e f-adi c topolog y o n V.
Note th e similarit y wit h th e definitio n o f Galoi s representation s o f numbe r
fields (se e Sectio n 5.1 3) . Becaus e o f th e origina l definitio n o f a sheaf , on e usuall y
denotes ^-adi c sheave s b y curl y letter s J 7, <? , etc. Notic e tha t on e ca n obviousl y
speak o f direct sums , tensor product , symmetri c powers , etc., of lisse £-adic sheave s
by performin g th e correspondin g operation s o n th e representations . Als o on e ca n
speak o f irreducibl e sheaves , etc .
An importan t f-adi c sheaf , denote d Q^(l) , i s obtained b y considering th e natu -
ral actio n o f 7Ti (U) o n £-powe r root s o f unity , whic h arise s fro m th e etal e covering s
where on e simpl y extend s th e bas e field fro m F t o it s extensio n b y root s o f unity .
This actio n i s given b y a certai n characte r xe : ^i(U) — > Q}. Usin g thi s sheaf , on e
defines Tat e twists : i f J 7 i s a liss e £-adi c shea f an d i e Z , the n on e denote s T(i) (J 7
twisted i times ) th e shea f whic h correspond s t o th e actio n p' o f 7Ti (U) o n th e sam e
vector spac e bu t wit h
P/(7) = XK7)P(7) ;
in othe r words , .F(l ) = J(8)Q^(1 ) fo r instance .
302 11. SUM S OVE R FINIT E FIELD S
Exponential sum s aris e b y lookin g a t th e actio n o f th e Frobeniu s element s a t
points o f U. Le t x b e a close d poin t x o f 1 7 , which ca n b e see n a s a Galoi s orbi t
of point s i n U(¥). Th e fundamenta l grou p o f th e "point " x i s th e Galoi s grou p
Dx o f th e residu e field o f U a t x , isomorphi c t o F n wher e n i s th e degre e o f x. B y
functoriality ther e is a map D x— > TTI(U). W e have D x ~ Gal(F n /F n ) an d th e latte r
is generate d (topologically ) b y th e Frobeniu s morphis m ex , s o takin g th e imag e
we ge t i n TTI(U) a well-define d conjugac y class , calle d th e arithmeti c Frobeniu s
conjugacy clas s a t x. I n particular , fo r an y ^-adi c shea f on e ca n spea k o f th e trac e
Trp(a x ) withou t ambiguity . However , i t turn s ou t tha t i t i s th e invers e F o f a
(the so-calle d geometri c Frobenius ) whic h appear s naturall y i n th e cohomologica l
description o f exponentia l sums . W e denot e b y F x th e correspondin g conjugac y
class; i t i s called simpl y th e Frobeniu s conjugac y clas s a t x (omittin g th e adjectiv e
geometric).
THEOREM 1 1 .34 . Let U /¥ be a smooth variety, let f ^ 0 and g be ¥-rational
functions on U, let ip be an additive character and \ & multiplicative character of
¥. Let S n — Sn(U, / , #,X>V0 be the associated exponential sums over U(¥ n) as in
(11.43). Then there exists a lisse l-adic sheaf T on U of degree 1 with the property
that for all n ^ 1 we have
(11.44) S n= Y. Tr
(F*\F)
xeu(¥n)
where we denote Tr(g \ J7) = Tr(p(g) \ V), T corresponding to the representation
p : 7Ti(i7)->GX(V) .
To compar e wit h th e character s use d t o describ e Gaus s sum s an d Klooster -
man sums , on e shoul d thin k o f th e latte r a s analogue s o f Dirichle t character s o r
Hecke characters , wherea s th e ^-adi c sheave s give n b y thi s theore m ar e analogue s
of Galoi s characters . Th e correspondenc e betwee n th e tw o concept s i s a n instanc e
of reciprocit y o r class-fiel d theory .
We sketc h th e constructio n o f T i n th e cas e wher e X = 1 and g i s a non-zer o
rational functio n o n U = A 1 — {poles o f #} , over F , whic h make s i t clea r tha t thi s
is very closel y relate d t o th e argumen t i n Sectio n 1 1 .7 . Conside r th e curv e
(11.45) C : y q-y = g(x)
and notic e tha t ther e i s a surjectiv e ma p TT : (x,y) \-> x fro m C t o U. Fo r an y
a e F , th e equatio n y q — y — a = 0 i s separable , henc e i t ha s q distinc t root s i n
F. I n fac t th e additiv e grou p o f F act s o n th e root s b y translation : i f y i s a roo t
and z G F, the n (y + z) q — {y + z) = y q — y = a. Moreover , TT : C — > U i s a n
etale coverin g (we'v e just see n i t i s everywhere unramifie d an d surjective) . I n othe r
words, 7 r is a n etal e Galoi s coverin g wit h Galoi s grou p isomorphi c t o th e additiv e
group F (covering s give n b y suc h equation s ar e calle d Artin-Shreie r coverings) .
The fundamenta l grou p TTI(U) act s o n C b y automorphism s o f th e covering ,
which mean s a s translation s b y element s o f F a s above . Thi s define s a surjectiv e
map (p : TTI(U) - > F suc h tha t ^(7 ) = 77 / — y fo r an y y G C. (Thi s doesn' t depen d
on th e choic e o f y becaus e th e actio n o f 7 o n C mus t b e a morphis m o f curves. )
Consider th e trivia l £-adi c shea f Qi o n C , o r equivalentl y th e trivia l represen -
tation o f 7Ti(C) . B y th e abov e w e ca n construc t th e induce d representatio n p fro m
11. SUM S O V E R F I N I T E F I E L D S 303
7Ti(C) t o 7Ti(L r), which ca n b e describe d a s th e spac e
V= { /: TTI(£ 0 -) • Q * | /(r 7) = / ( 7 ) for an y r e m(C)}
(where r G ^\{C) i s see n throug h th e ma p 7Ti(C )—> • ?ri(?> 0 comin g fro m 7r) , o n
which 7Ti(L r) act s b y translatio n o n th e righ t
p{l)fiT ) = /( T 7) -
The element s f £ V depen d onl y o n 7ri(C)\jri(U) ~ F (i.e . o n th e automorphism s
of the covering C —> U) , whic h implies that V ~ Q ^ is an ^-adic sheaf on C/ of degree
g. Th e representatio n spac e V ca n b e decompose d ove r th e additiv e character s ^
ofF,
where £ ^ i s th e ^-eigencomponen t o f V , namel y
£ ^ = { / € V | p(7) / = ^{f{l))f for al l 7 € *n([/)} .
It i s eas y t o se e tha t eac h £ ^ i s a n £-adi c shea f o n [7 , an d becaus e p i s induce d
from th e trivia l representation , eac h C^ i s o f degre e 1 .
Then fo r ever y additiv e characte r ^ , th e £-adi c shea f o n U corresponding t o C^
is the shea f satisfyin g (1 1 .44 ) fo r th e exponentia l sum s S n(U,g,ip).
Indeed, i f x G U(¥ n)1 an d y satisfie s y q — y = g(x), the n th e Frobeniu s o f x
acts o n y b y y qU = y + Tr ¥ri/¥(g(x)) sinc e
^ n - 1 / -y qn
- y ^ " 1 + y qn~l + yq - y
= (y q ~ y)^' 1 +'" + y q-y = Tr{y q-y)= Tvg(x).
Hence (f(cr x) — Trg(x ) an d b y definitio n o f C^ i t follow s tha t cr x act s o n C^
by multiplicatio n b y ^(Trp(x)) , henc e F x — a~l act s b y ij;(Trg(x)), whic h give s
(11.44).
In particular , not e tha t takin g th e trac e fo r Qi o n C w e deriv e
|C(F n )| = ^ 5 n ( f / , / , ^ ) ,
as i n (1 1 .30) .
EXERCISE 3 . (1 ) Le t S n b e th e characte r su m (1 1 .8 ) wit h g = 0 fo r som e
multiplicative characte r x oi F * an d som e non-zer o rationa l functio n / G F(x), o n
the variet y U = A 1 — {zeros an d pole s o f / } . Describ e a s abov e th e constructio n o f
the shea f C satisfyin g (1 1 .44 ) i n thi s case . [Hint : Us e th e cove r y d = / ( # ) , wher e
d i s the orde r o f th e multiplicativ e characte r x]
(2) Le t S n b e a s i n (1 1 .8) , U C A 1 th e complemen t o f th e zero s an d pole s o f
/ an d th e pole s o f g. I f C^ i s th e shea f satisfyin g (1 1 .44 ) fo r / = 1 and C x i s th e
sheaf satisfyin g (1 1 .44 ) fo r g — 0 , sho w tha t C = C^ ® Cx satisfie s (1 1 .44 ) fo r S n.
EXAMPLES. (1 ) Eve n th e case p — 1 is interesting whe n dealin g wit h a genera l
variety U. Thi s "trivial " ^-adi c shea f i s denote d Q^ , an d on e ha s S n = |L r (F n )|.
(2) Fo r th e shea f Q^(l) , notic e tha t a x act s b y £ \-> £ q fo r an y roo t o f unit y i f
Nx = q. Therefor e F x act s b y £ i—)• £ x / 9 an d i n particula r th e onl y eigenvalu e o f F x
is q~ l.
30 4 11. SUM S OVE R FINIT E FIELD S
Now i n additio n t o U /¥w e conside r it s "extensio n o f scalars " U/¥ ove r th e
algebraic closur e o f F . Ther e i s a correspondin g geometri c fundamenta l grou p
7Ti([7), which sit s i n a n exac t sequenc e
1
(11.46) -> m(U) - > m(U) - > Gal(F/F ) - > 1 .
To ever y ^-adi c shea f T o n U ar e associate d th e £-adi c cohomolog y group s wit h
compact suppor t o f U wit h coefficient s i n T. Thos e ar e finite-dimensiona l Q r
vector spaces , denoted , H lc(U,F) fo r i > 0 . Th e ke y poin t i s that th e Galoi s grou p
of F act s naturall y o n H lc(U, J7 ), an d i n particular , s o d o th e Probeniu s a an d it s
inverse F , th e geometri c Probenius . Th e ke y to th e cohomologica l interpretatio n o f
exponential sum s i s th e
GROTHENDIECK-LEFSCHETZ T R A C E FORMULA . Let U/¥ be a smooth variety
of dimension d ^ 0 , T an i-adic sheaf on U. We have H %C{U\F) = 0 if i > 2d and
for any n ^ 1
Tr
(11.47) J2 ^F* I F) = Tr(F n | H°C(U,F)) - Tr(F n \ Hlc(U\F)) + .. .
xeu(¥n)
n
- Tr(F | H 2cd~\U,F)) + Tr(F n \ H 2d
(U,F)).
Therefore t o evaluat e th e exponentia l sum s (1 1 .43 ) usin g th e associate d sheaf ,
we nee d t o kno w th e traces , o r equivalentl y th e eigenvalues , o f F (equivalently , o f
a — F~ l) actin g o n H lc for 0 ^ i < 2d . I t turn s ou t tha t i n mos t case s H® and H 2d
are eas y t o compute :
PROPOSITION 1 1 .35 . Let T be a lisse £-adic sheaf on a smooth variety U/F,
corresponding to the representation p of n\(U) on the Q^-vector space V. We have
y*i(u) if U is projective,
(11.48) HZ(U,?)~
0 if U is not projective,
and
2 d
(11.49) H c (U,T)^VVl{0)(-2d)
where V G denotes the space of vectors invariant under the action of a group G on
an abelian group, and VQ denotes the space of co-invariants, the largest quotient
of V on which G acts trivially. In both cases, the isomorphisms are canonical
isomorphisms of vector spaces with an action of the Galois group of¥.
Since V i s a representatio n o f 7Ti(£/) , th e exac t sequenc e (1 1 .46 ) show s tha t
y*i(u) a n ( j V 7ri(jj^ ar e acte d o n b y Gal(F/F) , "through " th e give n representatio n
P-
This propositio n show s tha t fo r a curv e C//F , th e onl y "difficult " cohomolog y
group i s Hl(U,F).
EXAMPLE. Le t U — E/¥p b e a n ellipti c curve , T — Q^ th e trivia l sheaf . B y
the propositio n on e ha s
(1) H®{E, Qt) = Q£ , with trivia l actio n o f F (sinc e Qt i s the trivia l sheaf) .
(2) Hl(E,Qt) = Q * ( - l ) , s o b y definitio n o f th e twist , F act s b y multi -
plication b y p (o n root s o f unity , i.e . o n Q^(l) , o~ act s b y £ i- > £ p , henc e F b y
multiplication b y p _ 1 ) .
11. SUM S O V E R F I N I T E F I E L D S 305
The Lefschet z trac e formul a (1 1 .47 ) give s
\E(Wn)\ =p n + l-Tr(F n\ H l
c{E,Q,))
(compare Theore m 1 1 .31 ) .
More generally , on e derive s th e rationalit y o f th e zet a functio n directl y fro m
the Trac e Formula .
COROLLARY 1 1 .36 . Let U, S n and T be as in Theorem 1 1 .34. For 0 < i < 2d,
let bi = di m if* (£7, T) and
bi
Pi{T) = det( l - FT | Hlc(U,F)) = JJ ( 1 - a^T).
i=i
We have
and for n^ 1 ,
(11.50) S n= J2 (-!)Xi -
Theorems 1 1 .4 , 1 1 . 8 an d th e resul t o f Exercis e 1 ar e al l specia l case s o f thi s
corollary, togethe r wit h suitabl e computation s o f cohomology groups . Th e number s
bi are calle d th e ^-adi c Bett i number s fo r T.
Of muc h greate r importance , however , i s Deligne' s vas t generalizatio n o f th e
Riemann Hypothesi s [De2] . On e start s wit h th e followin g "local " definition :
DEFINITION. Le t w e Z b e a n integer . A liss e ^-adi c shea f T o n U/¥ i s sai d
to b e pur e o f weigh t w i f fo r an y close d poin t x o f [/ , al l eigenvalue s o f F x actin g
on th e Q ^ vecto r spac e V associate d t o J- ar e algebrai c integer s al l conjugate s o f
which hav e th e sam e absolut e valu e equa l t o q w/2 wher e q = Nx i s the cardinalit y
of th e residu e field.
For instance , th e trivia l shea f Q ^ is pure o f weight 0 (all eigenvalues 1 ) . Fo r an y
i £ Z , Qi(i) i s pure o f weight —2z , and i f T i s pure o f weight w, the n F(i) i s pure of
weight w — 2i. Fo r an y exponentia l su m (1 1 .43) , th e associate d shea f T i s pur e o f
weight 0 because th e onl y eigenvalu e a t x i s the roo t o f unit y x(Nf(x))ip(Trg(x)).
THEOREM 1 1 .3 7 ( D E L I G N E ) . Let U/¥ be a smooth variety and J 7 a lisse l-adic
sheaf on U, pure of weight w. Let i ^ 0 and let £ be any eigenvalue of the geometric
Frobenius F acting on if*(£/ , J7 ). Then £ is an algebraic integer, and if a G C is a
conjugate of '£, we have
(11.51) |a | < </™+*)/ 2.
The conclusio n i s als o phrase d a s sayin g tha t H lc{U^!F) i s mixe d o f weight s
^i + w. I f ther e i s equality i n (1 1 .51 ) , the n H lc{U,T) i s said t o b e pur e (o f weigh t
w + i). I n certai n cases , on e ca n appl y dualit y theorem s (fo r instanc e Poincar e
duality) t o deduc e furthe r tha t (1 1 .51 ) i s a n equality .
30 6 11. SUM S OVE R FINIT E FIELD S
R E M A R K . Althoug h Deligne' s proof i s a monumental achievemen t o f very dee p
algebraic geometry, it is an interesting fact tha t a crucial use is made of a generaliza-
tion of the metho d o f Hadamard an d de la Vallee Poussin for proving non-vanishin g
of L-function s o n the line R e (s) = 1 (see Section 5.4) . Similarly , i n Deligne's firs t
proof [Del] , th e ideas o f the classical Rankin-Selber g metho d fo r modular form s
are essentia l (specifically , Delign e acknowledge s th e influence o f [Ra3]).
EXAMPLE. Le t C/¥ b e a smooth connecte d projectiv e curv e (fo r instance, an
elliptic curve) . B y Proposition 1 1 .3 5 as in the previous example , w e have easily :
(1) H®(C,Qt) = Qe, with F actin g trivially .
(2) Hf(C,Qi) = < M - 1), wit h F actin g b y multiplication b y p.
It i s more difficul t t o show tha t
(3) Hl(C,Qi) ~ Q ^ , as Q# vector space s (no t as Galois-modules!), wher e
g ^ 0 is the genus of C (for an elliptic curv e g = 1).
Moreover, ther e i s a Galois-invariant perfec t pairin g
It follow s tha t i f a is one of (the complex conjugate s of ) the eigenvalues o f F o n
iJi((7, Qg), the n p/a i s one also. Henc e fro m Theore m 1 1 .37 , sinc e Q# is pure of
weight 0 , one deduces tha t \a\ = yjp. Thu s
|C(Fn)|=p" + l - £ "?
2=1
where the c^ G Q are the eigenvalues of F on H\. Estimatin g triviall y now , w e get
|C(F„)|-(p n + l) ^2gp n/2
recovering the Riemann Hypothesis , an d in particular, Theore m 1 1 .2 5 for the case
<? = !•
In th e case of exponential sum s (1 1 .43) , the sheaf F i s pure o f weight 0 , hence
denoting
l
d(J-) = max{i | H c(U,F)^0},
we derive directl y fro m (1 1 .50 ) an d (11.51) th e bound
(11.52) \S n\^ Y, ^ ^
for n ^ 1 and, in particular,
nd 2
(11.53) \S n\^q ^ (j2bi)-
i
As in the case of Kioosterman sums , th e exponent d(F)/2 i s best possibl e i n this
inequality. Th e bound d{F) ^ 2d gives a trivia l estimat e (becaus e U is smooth
of dimensio n d , it has about q nd points , a s proved b y the Riemann Hypothesi s for
the trivia l shea f Qi). An y improvement o f this trivia l boun d i s equivalent wit h
Hcd(U,F) = 0 , and the square roo t cancellatio n ofte n expecte d fro m heuristi c
reasonings i s equivalent wit h H lc(U,F) = 0 for i > d. Althoug h no t always true ,
this turn s ou t to hol d "generically" , a s the analytic intuitio n suggest s (se e fo r
instance Theore m 1 1 .4 3 below).
11. SUM S O V E R F I N I T E F I E L D S 30 7
For th e exponential sum s (1 1 .43 ) w e have d(.F ) < 2d, unless T i s the trivia l
sheaf Q^ , s o there i s always a non-trivia l bound . Thi s follow s fro m (1 1 .49) , sinc e
T i s of degree 1 so the space o f co-invariants i s either th e whole spac e (meanin g
the representatio n i s trivial) o r 0. However , thi s smal l gai n is usually insufficien t i n
applications.
Another surprisin g consequenc e o f Deligne's resul t an d the discreteness o f in-
tegers i s the following "self-improving " statement :
COROLLARY 1 1 .38 . Let S n be an exponential sum as in (1 1 .43) and T the
associated sheaf. Suppose w ^ 0 is an integer such that
\Sn\^qw'2+5
for some S G [0, \[ and n ^ 1 . Then we have d{F) ^ w, hence | S n | <C qw^2.
A secon d issu e i n applyin g th e estimates (1 1 .52 ) o r (1 1 .53 ) i n the context of
applications t o analytic numbe r theor y i s that w e usually hav e F = Z/pZ , wit h the
prime numbe r p varying. I n this case , wherea s th e variety U can be defined ove r
Q (o r Z) so that th e sum is, for all p, ove r th e F p -points o f the reduction U p of
U modul o p , the sheaves T v genuinel y depen d o n p (se e the equation (1 1 .45)) , i.e.
there i s no theory o f sheaves ove r U/Z givin g eac h T v b y "reductio n modul o p".
(Katz ha s asked a number o f times for such a theory o f "exponential sum s ove r Z";
see e.g. [K2], but it remain s elusive. ) Thus , th e Betti number s
bz(p)= dim W C(UP,TP)
of th e cohomolog y group s ca n depend o n p, an d the applicabilit y o f the result s
above woul d b e ruined , eve n wit h th e Riemann Hypothesis , i f thes e dimension s
were not bounded i n a reasonable wa y in terms o f p.
This i s in fac t th e case . Th e first genera l resul t i n thi s directio n i s due to
Bombieri [Bo4 ] fo r additive characte r sum s (1 1 .43 ) wher e / = 1 , and was general -
ized b y Adolphson an d Sperber [AS1 ] , [AS2 ] fo r general sum s (thei r method s are
p-adic, base d o n Dwork's origina l ideas) . I n general, thos e result s boun d th e Euler
characteristic
2d
Xc(-F) = E ( -1) i d i m f f c ( ^ ^ ) = £ (- 1
)'6*'
i=0 ( K ^ 2 d
of a shea f T o n (7/F , but furthe r argument s o f Kat z [K5 ] show ho w to deduc e
bounds for
2d
b
tTc(JO = 53dimff*(C7,J0= E *>
i=0 0^i^2d
(hence fo r bi < cr^T)) fro m thos e fo r XcP7 )-
T H E O R E M 1 1 .39 . Let U/Q be a smooth variety over Q, f and g functions
on U with f invertible. Let £ be a prime number and for all p ^ £ such that the
reduction U p ofU modulo p is smooth, let \ andip be any multiplicative and additive
characters of¥ p. Let Tv be an £-adic sheaf on Up such that
E x(Nf(x))i,(Trg(x))= £ Tr{F x\7p)
xEU(¥pn) x<EU{F pn)
30 8 11. SUM S O V E R F I N I T E FIELD S
for n ^ 1 . We have cr^Pp) ^ C where C is a constant depending only on U, f and
9-
A simpl e explici t boun d i s give n i n [AS3 ] i f f(x) = 1 , s o onl y th e additiv e
characters occur , an d g is a Laurent polynomia l o n [ / = ( Q — {0}) d . Th e sum s ove r
Z/_pZ i n questio n ar e therefor e sum s i n d variable s o f th e typ e
(11.54) S ftP= J2 W(^i,...,^) )
xi,... ,x de(i/pzy
where / G Q[#i, x± 1 , . . . , Xd, x^1} i s a non-zer o Lauren t polynomial . Writin g
/ = Y2 a3xJ
for som e (finite ) se t J C Z d , th e Newto n polyhedro n W(f) o f / i s defined t o b e th e
convex hul l i n R d o f J U {0}.
PROPOSITION 1 1 .40 . With the above assumptions, denoting by Tf, v the asso-
ciated sheaf for the sums Sf iP, we have
IXc(^/ l P )|^d!Vol(W(/)),
d
vdFf*)
1 < 0 dWol(W(f))
for any p not dividing the denominator of any coefficient of f, where Vol (W(f)) is
the volume of the Newton polyhedron in the subspace spanned by W(f) in R d, with
respect to Lebesgue measure.
Note tha t b y usin g exclusion-inclusio n an d detectin g polynomial s equation s b y
means o f multiplicativ e characters , on e ca n us e combination s o f sum s o f th e typ e
(11.54) t o describe muc h more general ones. Also , in many cases , one can sho w tha t
only all the odd (o r even) cohomology groups vanish, i n which case |xc(^ r )| — o~ c(F).
See als o Theorem s 1 1 and 1 2 of [K5 ] fo r explici t estimate s i n quit e genera l cases .
We no w giv e example s o f computation s usin g thes e fundamenta l results . Fo r
exponential sums arising in analytic number theory, one often need s nothing more, if
one uses skillfull y som e othe r simpl e trick s suc h a s averaging ove r extr a parameter s
to analyz e th e weigh t o f th e roots .
EXAMPLE 1 . Th e Kloosterma n sum s 5(a , b;p) fo r ab ^ 0 can b e treate d usin g
Proposition 1 1 .4 0 wit h d = 1 an d f(x) = ax + bx~ x. The n W(f) i s th e interva l
[—1,1]. B y Propositio n 1 1 .35 , w e hav e H® — H% = 0 i n thi s cas e sinc e U =
P 1 — {0,oo } i s no t projective , s o a c = ~Xc B y Theore m 1 1 .37 , H^ i s mixe d o f
weight ^ 1 . Henc e w e recove r th e Wei l bound :
\S(a,b;p)\^acp1/2^2p1/2.
(Of course , i n fac t w e hav e b\ = 2 and th e las t inequalit y i s a n equality) .
EXAMPLE 2 . Th e previou s exampl e generalize s t o th e multipl e Kloosterma n
sums define d b y
(11.55) K r(a,q)= E e ( l K * i + ;••+*-) )
11. SUM S O V E R F I N I T E F I E L D S 309
for r ^ 2 and a j^ 0, so K 2(a,p) = S(a, l;p ) (se e [Bo4] , [Del]) . Withou t appealin g
to th e L- function, on e ca n nevertheles s ge t som e informatio n b y averagin g ove r a.
We ge t
2
(11.56) Yl \ K ^ «)l = I' ~ I1"' « -L
Hence \K r(a,q)\ < q r/2. T o improv e thi s elementar y boun d w e appea l t o the
following Lemm a
LEMMA 1 1 .41 . Given a finite set of distinct angles 6i modulo 2TT and complex
numbers a.j we have
] T \Y jale{nei)\ = N\\a\\ 2 + O(l)
where the implied constant does not depend on N. Hence
(11.57) limsu p \y^aie(n0j) > all .
n—>-+oo
PROOF. W e hav e
2
£ |^a,e(n^)| = iVEl^|2 + E Ea ^ E< n ^-^))-
The inner su m i s bounded b y a constant independen t o f TV, s o the first resul t follow s
and (1 1 .57 ) i s a n obviou s consequence . •
From (1 1 .56 ) an d (1 1 .57 ) it follows tha t amon g th e K r(a,p), ther e i s a t mos t
one roo t o f weigh t r, say fo r if r (ao,p), an d al l othe r root s ar e o f weigh t ^ r — 1 .
Notice that K r(ao,p) £ Q(/ip ), the cyclotomic field o f p-th root s of unity. Usin g
the Galoi s actio n o n Q(/x p), th e conjugate s o f K r(a0)p) ar e K r(aovr,p) fo r v e F*.
By th e Rieman n Hypothesis , thi s mean s tha t th e conjugat e o f th e roo t £ of weigh t
r i s stil l a root o f weigh t r for K r(aov2,p). Henc e v r = 1 for al l v £ Fp , whic h i s
only possibl e i f p — 1 | r. In particula r al l root s ar e o f weigh t < r — 1 if p > r + 1.
One therefor e get s b y Propositio n 1 1 .4 0
1
(11.58) K r(a,q)^q^ ^2
where th e implie d constan t depend s onl y o n r.
In th e cas e o f Kloosterma n su m th e Newto n polyhedro n i s th e simple x wit h
vertices ( 1 , 0 , . . . ,0) , . . . , ( 0 , . .. , 0,1), (—1 ,.. . , — 1) whos e volum e i s 1 /r! . More -
over, i t is know n tha t th e zet a functio n i s a polynomial s o \c — ~o~ c and w e ge t
the precis e estimat e
\Kr{a,q)\^rqW2.
This wa s firs t prove d b y Delign e [De3] , without an y assumptio n o n p an d r.
310 11. SUM S OVE R FINIT E FIELD S
EXAMPLE 3 . Her e i s anothe r higher-dimensiona l example . I n [CI1 ] , th e fol -
lowing exponentia l su m ove r finite fields appears :
(11.59) W( X,ip;p)= J2 X(xy(x + l)(y + l))i>(xy-l)
x,y mo d p
where p i s prime , x * s a non-trivia l quadrati c characte r modul o p an d ip i s an y
multiplicative characte r modul o p.
T H E O R E M 1 1 .4 2 (CONREY-IWANIEC) . There exists an absolute positive con-
stant C such that
(11.60) \W( X,t,p)\^Cp
for all p and all ip as above.
The first ste p o f th e proo f i s t o appl y Theore m 1 1 .3 4 t o sa y ther e exist s a n
£-adic shea f T o n th e algebrai c surfac e
U = {(x,y) | xy(x + l)(y + l)?0},
pure o f weigh t 0 for whic h w e have, fo r q — p n , n ^ 1 , the formul a
W(X,1>\q)= E x(N(xy(x + l)(y + l)))i>(N(xy-l))
x,y€U(¥q)
=£ T±(F {Xty)\f).
x,y£U{Fq)
The Lefschet z trac e formul a (1 1 .47 ) take s th e for m
4
W(X^;q) = Y.i-lYTriF" \ W C(U,T)).
2=0
By Theore m 1 1 .37 , eac h H lc(U ,F) i s mixe d o f weight s < i. Le t (0^,2^,1 ^ ) b e th e
family o f eigenvalue s o f F actin g o n th e whol e cohomolog y (wit h multiplicities) ,
together wit h thei r inde x an d weight . W e hav e \a u\ = p w»l2 an d
1/
By Theore m 1 1 .3 9 i n thi s case , th e tota l numbe r o f root s a v i s bounde d b y a
constant independen t o f p. Thus , w e gai n o n th e trivia l boun d W < C p2 i f w v ^ 3
(instead o f 4) , an d th e statemen t o f th e theore m i s that w v ^ 2 .
The second step is to show that ther e is at mos t on e root o f weight ^ 3 (actually ,
it mus t b e = 3 ) and , i f i t exists , the n ip = x i s th e non-trivia l quadrati c character .
This wil l b e derive d fro m th e followin g averag e formula :
(11.61) A = -±j ] T \W( X^q)\2 = q 2
-2q-2.
11. SUM S O V E R F I N I T E F I E L D S 311
To prove thi s formula , ope n th e square an d sum over ip first gettin g b y the orthog-
onality o f character s
A= Y1 Y, X(U V (U
11 1 + 1)(V 1 + 1))X(U2V 2(U2 + 1)(V2 + 1))
UiVi=U2V2
= Yl J2^ Ul + 1
)(t'l + 1 ))X((W2 + l)(«i«i + U2))X{U2)
U±,Vi,U2
(where w e shorten th e notation fro m x ° N t o %) . Nex t th e summation ove r u\ i s
performed givin g
if u 2 = vi,
S(vi,ix 2 ) = > X(^ i + l ) x ( ^ i ^ i + ^ 2 j = < , . _, .
if u 2 y^ vi.
^o { -xM ~ x(u2)
Then th e sum over u 2 i s performed givin g
^T x(u2 + i)x(^2)s(vi, w 2) = gx(^i +1 ) + xOi)
and finall y th e sum over v\ give s
A
=Y1 x(vi + X ) («(Vl + X ) + xM + X
) = ^ - 2 ) -2.
From (1 1 .61 ) , usin g Lemm a 1 1 .41 , i t i s clea r tha t fo r al l ip and v w e hav e
wy ^ 3 and moreover, w v ^ 2 except fo r at mos t on e root, fo r one character if;. I f
this cas e occur s fo r ip, it happen s fo r ^ too , so the only possibilit y i s ip being a real
character. Sinc e
W(x,i;q) = (^2x(u(u + i)))2 = l,
u
we must hav e ip = X-
The las t ste p i s to trea t th e cas e ip = x separately . Precisely , on e can sho w
that
\w(x,x;q)\^4q
n
for an y p and q = p . Thi s is done in a purely elementary manne r withou t appealin g
to the Riemann Hypothesis , an d we refer t o [CI1 ] for the details. I n fact, W . Duke
showed tha t W(x^XiP) = 2Re(J 2 (x,£))> wher e J(x>£ ) i s th e Jacob i su m an d £
is a quarti c characte r modul o p = 1 (mo d 4). Als o W(XIX'->P) i s th e P_ ^ n Fourier
coefficient o f the modular for m 77(4z) 6 o f weight 3 and level 12.
When U is not a curve, numerous geometric subtleties can be involved in dealing
with th e non-trivia l cohomolog y group s H lc with i ^ 0 , 2d. Her e ar e two genera l
bounds, amon g many : th e first on e is due to Deligne [Del] , an d the second i s the
recent versio n i n [FK ] of a general "stratification " theore m o f Katz an d Laumon .
T H E O R E M 1 1 .43 . (1 ) Let f e Z[A"i,.. . , X m ] be a non-zero polynomial of
degree d such that the hyper surface Hf in P m _ 1 defined by the equation
Hf : f d(xu... , x m) = 0,
312 11. SUM S OVE R FINIT E FIELD S
where fd is the homogeneous component of degree d of f', is non-singular. For any
p \ d such that the reduction of Hf modulo p is smooth, any non-trivial additive
character i\> modulo p and any n ^ 1 we have
m nm 2
(11.62) | £ - " 1E >(Tr(f( Xl,...,xm)))\^(d-l) q ' .
Xi,... ^XmeWpTi
(2) Let d ^ 1 , n ^ 1 be integers, V a locally closed subscheme of A^ of dimen-
sion di m V(C) < d and f G Z [ X i , . .. , Xn] a polynomial
Then there exists C = C(n , d, V, f) and closed subschemes Xj C Ag of relative
dimension ^ n — j such that
xnc...ax2cx1ahl
and for any rational function g non-zero on V, any h G (Z/pZ)n — Xj(Z/pZ), any
prime p, any non-trivial additive character ip and multiplicative character \ rnodulo
p, we have
^ X{9(x))^(f(x) + ftixi H h nxn) < Cp*~
xe
Note that i n (1), subject t o a geometric conditio n o n Hf, w e obtain squar e roo t
cancellation i n the exponential sum . I n (2) the assumptions ar e much less stringent ,
but th e conclusion i s weaker: w e have a family o f exponential sum s parameterize d
by h e A n , an d roughl y speakin g w e hav e squar e roo t cancellatio n fo r "generic "
sums (fo r h outside a n exceptional subvariet y X\ o f codimension ^ 1 in A n ), while
worse an d wors e bound s ca n occu r onl y o n smalle r an d smalle r subvarieties . W e
will giv e a n applicatio n o f (2 ) in Chapte r 21.
The ^-adi c theor y an d formalism ar e much mor e develope d tha n wha t w e have
surveyed here . I t ca n also deal wit h sum s ove r singula r varieties , bu t the necessar y
algebraic notion s become rather formidable , an d we concede being unable to discuss
the pervers e sheave s tha t aris e i n more advance d situations .
We wish to emphasize, however , tha t thi s theor y i s also particularly well-suite d
to the study of families o f exponential sums . Th e parameters definin g those , say S x,
are mos t naturall y themselve s point s o n som e algebrai c variet y X/F. I n favorabl e
circumstances ther e exist s a lisse £-adi c shea f T o n X (correspondin g t o a n actio n
of 7Ti(X) o n V) suc h tha t
(11-63) S x,n = Tr(FZ\V)
for ever y valu e o f th e paramete r x G X an d an y n ^ 1 . Fo r exampl e a non-zer o
polynomial / o f degre e ^ d over F ca n be describe d a s an F-rationa l poin t o f the
affme paramete r spac e X = A^ + 1 — {0 } by
d
f = ^ciiX 1 h- > ( a0 , . . . , a d ).
2= 0
There i s an ^-adic shea f T o n X suc h tha t
£>(/(*)) = T*(i? i n
xe¥n
11. SUM S O V E R F I N I T E F I E L D S 313
Purity o f a shea f satisfyin g (1 1 .63 ) depend s o n a firs t applicatio n o f th e Rieman n
Hypothesis. I f i t holds , th e applicatio n o f th e £-adi c theor y typicall y result s i n
equidistribution statement s (followin g fro m [De2] ) fo r th e argument s o f th e expo -
nential sums . Thi s equidistributio n i s i n som e spac e o f conjugac y classe s o f th e
"monodromy group " o f the situation . W e refer fo r instanc e t o [KS ] fo r a very luci d
introduction t o thes e profoun d aspects .
11.12. Comments .
In thi s closin g sectio n w e giv e som e impression s abou t ho w th e exponentia l
and characte r sum s ove r finite fields interac t wit h analyti c numbe r theory . Ther e
are mor e subtl e issue s betwee n th e tw o subject s tha n jus t application s o f result s
concerning the first on e for solvin g problems of the latter. W e could be quite specifi c
by covering completely a few representative examples , bu t i t woul d b e long and no t
transparent enough . Rathe r w e decide d t o discus s principles , idea s an d trick s i n
general term s an d guid e th e reade r t o particula r publications .
First o f al l som e exponentia l sum s appea r whe n on e use s Fourie r analysi s t o
get a hol d o n th e sequenc e unde r investigation . Ther e ar e n o finite fields i n th e
background, s o th e resultin g sum s ar e no t immediatel y relate d t o object s o f alge -
braic geometry . However , on e ca n complet e thes e sum s (b y anothe r us e o f Fourie r
analysis) an d the n facto r the m int o sum s o f prim e powe r moduli . Usuall y on e ca n
evaluate thes e loca l sum s explicitly , o r giv e stron g estimate s b y elementar y o r a d
hoc methods , excep t whe n th e modulu s i s prime . Bu t i n th e prim e cas e on e ma y
naturally conside r th e su m a s bein g ove r a finite field. Thi s schem e allow s u s t o
appeal t o th e powerfu l result s fro m algebrai c geometry . However , th e drawbac k o f
finishing b y estimates fo r every complete sum individually i s that on e cannot exploi t
a possibl e cancellatio n fro m extr a variable s offere d b y th e varyin g modul i (finit e
fields of different characteristic s d o no t interac t i n algebrai c geometry) . Sometime s
a kin d o f reciprocit y formul a ca n hel p tur n th e modulu s int o a variabl e (se e fo r
example [1 1 1 ] o r [M3]) . Anothe r scenari o i s tha t th e sum s ove r modulu s appea r
in th e spectra l resolutio n o f a differentia l operator , i n whic h cas e th e spectra l the -
ory produce s estimate s fa r stronge r tha n thos e derive d b y algebrai c geometry . Fo r
example, thi s i s th e cas e o f sum s o f Kloosterma n sums ; se e Chapte r 1 6 . On e ca n
also thin k thi s wa y abou t th e rea l characte r sum s wit h cubi c polynomials ; the y
are coefficient s o f a cus p for m associate d wit h a n ellipti c curves , s o th e modularit y
gives extr a cancellatio n i n summatio n ove r th e modulus .
As a rule the exponential/character su m o f a given modulus whic h comes out of
analytic numbe r theor y i s incomplete . Thi s itsel f i s not a proble m becaus e variou s
completing technique s ar e availabl e a s mentione d above . Completin g i s a natura l
step t o take , bu t i s i t usefu l o r wasteful ? A t thi s poin t on e shoul d realiz e tha t a
bound fo r a complet e su m hold s essentiall y th e sam e fo r th e origina l incomplet e
sum. Thi s mean s tha t th e resul t i s relativel y weake r fo r a shorte r sum . Stil l i t i s
non-trivial whe n th e lengt h o f the origina l su m i s larger tha n th e squar e roo t o f th e
modulus. Ver y shor t sum s canno t b e treated thi s way . W e do no t hav e a n absolut e
recommendation whe n t o complet e o r no t a give n incomplet e sum . Ou r experienc e
suggests executin g th e Fourie r metho d a s lon g a s th e resultin g summatio n ove r
the frequencie s i s shorte r tha n th e rang e o f th e origina l sum : a t leas t on e ca n fee l
that on e i s progressing . Bu t sometime s i t i s wort h actin g otherwise , acceptin g a
step backward s i n this respec t whil e opening a position fo r a stronger secon d move .
314 11. SUM S OVE R FINIT E FIELD S
For example , imagin e tha t th e amplitud e i n th e exponentia l su m i s no t a rationa l
function, bu t nevertheles s become s on e afte r a n applicatio n o f the stationar y phas e
method t o th e relevan t Fourie r transform . I n thi s cas e th e losse s fro m th e rang e
of summatio n ca n b e recovere d wit h extr a saving s b y application s o f algebrai c
arguments (se e Sectio n 8 o f [CI1 ] , wher e thi s gam e i s playe d i n severa l variable s
simultaneously).
Whatever th e argument s whic h lea d t o complet e sum s ma y be , i n the final ste p
one canno t bea t th e squar e roo t savin g factor . Therefor e t o receiv e a non-trivia l
result on e mus t first produc e someho w a su m wit h a numbe r o f term s large r tha n
the squar e roo t o f th e modulus . Ther e ar e severa l way s t o ge t started , dependin g
on th e shap e o f th e exponential/characte r sum . First , on e ca n tr y t o appl y a Wey l
shift wit h the effec t o f squaring th e numbe r o f terms. Similarl y on e may just squar e
the whol e sum , o r rais e i t t o a highe r powe r t o produc e eve n mor e points . Not e
that shiftin g th e variabl e an d squarin g th e su m ar e no t th e sam e things ; th e first
requires some additiv e feature s o f the variable while the latte r nothin g a t all . Thes e
operations see m t o b e quit e superficia l a t first glanc e (w e ar e takin g essentiall y
replicas o f the origina l sum) , ye t wit h ingenuit y on e can rearrang e th e point s s o th e
summation goe s i n a skewe d direction , th e consecutiv e term s repe l violentl y an d
randomly producin g a considerabl e cancellation . Thi s i s eas y t o say , muc h harde r
to execute . I n fac t on e need s man y othe r devices , suc h a s gluin g severa l variable s
with smal l multiplicit y i n orde r t o arriv e a t a singl e variabl e ove r a rang e large r
than th e squar e roo t o f th e modulus . On e als o mus t smoot h ou t thi s composit e
variable befor e applyin g algebrai c arguments . Usuall y a n applicatio n o f Cauchy' s
inequality an d enlargin g th e oute r summatio n (du e t o positivity ) doe s th e job . A
powerful exampl e ho w this work s i s given b y Burges s [Burl] . I n thi s pape r a shor t
character su m i s estimate d b y a n appea l t o th e Rieman n Hypothesi s fo r algebrai c
curves. A n interestin g poin t i s tha t afte r al l th e trick s on e comes t o a complet e
character su m fo r a curv e o f a larg e genus , althoug h th e origina l su m i s over a lin e
segment.
Different argument s fo r buildin g on e extr a larg e variabl e ar e applie d i n [FI4] ,
consequently th e final complete sum comes in three variables, or equivalently i n fou r
variables ove r a hypersurface . Her e th e Delign e theor y applie s (se e th e Appendi x
by Birc h an d Bombieri) , althoug h th e relate d variet y i s singular . On e shoul d no t
be surprised an d afrai d o f that singularity , because , afte r all , the process of creatin g
more point s o f summatio n a t th e star t i s quit e superficial . I n thi s gam e on e mus t
be experience d whe n mixin g th e point s t o b e sur e tha t i t i s quite random . Anothe r
interesting cas e o f creatin g an d estimatin g exponentia l sum s i n thre e variable s i s
given b y N . Pit t [Pi] .
Applications o f th e Rieman n Hypothesi s fo r curve s ove r finite fields ar e b y
now customary . Muc h les s successfu l ar e th e us e o f genuinel y higher-dimensiona l
varieties. Ther e ar e reason s fo r th e difficultie s involved . Firs t o f al l whe n mor e
variables appear , stronge r restriction s ar e impose d o n the m whic h ar e harde r t o
resolve ( a kin d o f uncertaint y principle) . Jus t imagin e havin g a n abundanc e o f
points t o wor k with , bu t whic h ar e no t fre e becaus e o f som e sid e conditions . Fo r
example ho w would yo u cop e with a requirement tha t th e determinant s o f a famil y
of ellipti c curve s matc h th e conductors ?
11. SUM S O V E R F I N I T E FIELD S 315
Of course , ther e ar e als o direc t application s o f th e Rieman n Hypothesi s fo r
varieties t o traditiona l problem s o f solvabilit y o f diophantin e equation s b y mean s
of th e circl e metho d (se e example s i n Chapte r 20) . I f th e numbe r o f variable s i s
sufficiently large , on e need s nothin g t o manipulate , excep t fo r completin g th e su m
by th e standar d Fourie r method . Som e ingenuity , however , i s require d t o appl y
the circl e metho d ( a variant o f Kloosterman) t o treat diophantin e equation s wit h a
relatively smal l number o f variables, a n excellen t exampl e bein g th e wor k o f Heath -
Brown o n cubi c form s [HB6] .
It i s possibl e i n som e circumstance s t o bea t th e boun d fo r exponentia l sum s
which i s derive d b y th e Rieman n Hypothesis . Thi s i s becaus e th e angle s o f th e
roots o f th e //-functio n themselve s var y s o tha t additiona l cancellatio n ma y occur .
Deligne an d Kat z hav e establishe d suc h occurrence s fo r familie s parameterize d b y
points o n curve s o r varieties . I n othe r words , i n thei r case s on e i s actuall y con -
sidering exponentia l sum s i n mor e variables . However , th e cancellatio n o f root s
can als o occu r fo r familie s parameterize d b y point s ove r smal l irregula r sets . Mor e
important fo r analyti c numbe r theor y i s tha t thes e set s ca n b e quit e general , n o
structure o f a subvariet y i s needed , bu t instea d a kin d o f a bilinea r for m struc -
ture woul d suffice . I n practic e i t i s no t clea r ho w t o wor k wit h th e roots , s o on e
returns t o th e correspondin g exponentia l sum s wher e manipulation s wit h th e pa -
rameters (grouping , gluing , etc. ) ca n b e performed properl y accordin g t o th e shap e
of th e involve d rationa l functio n whic h i s see n wit h th e nake d eye . I n thi s proces s
one mus t no t destro y th e complet e variable s sinc e i n ou r min d th e correspondin g
summations ar e alread y execute d i n term s o f th e roots . Therefor e whe n applyin g
Cauchy's inequalit y t o smoot h th e on e variabl e compose d ou t o f th e parameters ,
we put al l th e complet e variable s t o th e inne r summation , sa y n o f them , togethe r
with som e remainin g parameter s whic h wher e no t use d i n th e composition . Thes e
inner parameter s ar e critica l fo r enlargin g th e diagonal . After squarin g ou t w e ge t
a complet e exponentia l su m i n 2 n + 1 variables whic h depend s o n th e inne r pa -
rameters. Excep t fo r a fe w configuration s o f those , th e complet e exponentia l su m
satisfies th e bes t possibl e boun d derive d fro m th e Riemann Hypothesis , thu s savin g
the facto r o f squar e roo t o f th e modulu s pe r eac h variable . Sinc e th e numbe r o f
variables i s large r tha n twic e th e original , w e wi n th e game . Th e abov e recip e i s
somewhat oversimplified, ye t i t reveal s th e sourc e o f extra saving . On e ca n se e how
it works in the particular cas e of [CI1 ] . Speakin g of [CI1 ] we would like to ad d tha t
here th e exponential/characte r sum s i n severa l variable s emerg e afte r application s
of harmoni c analysi s wit h respec t t o th e hyperboli c Laplac e operato r rathe r tha n
by th e traditiona l Fourie r analysis .
The profoun d theor y o f Delign e an d othe r geometer s i s bein g use d i n analyti c
number theor y wit h spectacula r effects , ye t mor e idea s nee d t o b e invente d t o
fully exploi t it s potential . Perhap s on e shoul d g o beyond borrowin g estimate s an d
penetrate deepl y insid e th e theory . Thi s i s a grea t subjec t fo r futur e research . P .
Michel [M3 ] mad e th e firs t significan t step s (se e als o [FK ] an d [KS1 ]) .
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/13
CHAPTER 1 2
C H A R A C T E R SUM S
12.1. Introduction .
In analyti c numbe r theor y on e ofte n encounter s sum s o f typ e
(12.1) S="£F(x)
xev
n
where V C Z i s a finite se t an d F : V —> C i s a periodi c functio n o f perio d q.
Because V doe s no t matc h th e periodicity , S i s called a n incomplet e sum . Suppos e
(12.2) F(x) = X (f(x))iP(g(x))
where x , if; ar e multiplicativ e an d additiv e character s t o modulu s g , an d / , g ar e
rational function s wit h intege r coefficients . Precisely , w e assum e tha t
(12.3) / = /i//o , 9 = 9i/9o
where fo,fi,9o,9i€ %[x] an d
(12.4) (fo(x)go(x),q) = l if xeV.
Having fixed thes e polynomial s (whic h ar e no t unique ) w e defin e
(12.5) X (f(x)) = X (h(x)fo(x)),
(12.6) il>{g(x))= >(gi(x)g
1 0(x))
where, a s usual , a denote s th e multiplicativ e invers e o f a modul o q. The n th e
resulting su m
(12-7) S = Y?x(f(x))iP(g(x))
xeV
is calle d a n incomplet e characte r sum . Here , an d hereafter , th e sta r restrict s th e
summation t o the point s o f V satisfyin g (1 2.4) . Th e residue classes x(mod q) whic h
satisfy (1 2.4 ) wil l b e calle d admissible . A n importan t case , bu t b y n o mean s th e
only one , i s V bein g a box . Whe n th e bo x ha s siz e exactl y q we ge t
(12-8) 5 ( x , V ) = E * X(f(x))rl>(g(x))
x(mod q)
which i s calle d a complet e characte r sum .
In thi s chapte r w e giv e basi c technique s o f estimatin g incomplet e characte r
sums i n on e variabl e ove r a n interval . A s a n exercis e fo r th e reader , w e sugges t
generalizing som e o f th e forthcomin g result s t o severa l variables .
317
318 12. CHARACTE R SUM S
12.2. Completin g methods .
The mos t commo n treatmen t o f a n incomplet e su m
(12.9) S(M;N)= ]T * F(n )
M<n^M+N
goes b y expandin g i t int o complet e sum s (thi s i s calle d th e completin g technique) ,
(12 ,o) «(!) . r n*H-~)
x(mod q)
and treatin g th e latte r b y variou s arithmeti c means . T o d o s o we split th e summa -
tion int o th e *-admissibl e residu e classe s n = x(mo d q), an d detec t thes e classe s b y
the orthogonalit y o f additiv e character s gettin g
(12.11)
WW)-i £ *(§)5(! )
a(mod q)
where
S '(T)
(12.12) / an\
<$
M<n^M+N
Usually th e mai n contributio n t o S(M;N) come s fro m a — 0 i n (1 2.1 1 ) fo r whic h
A(0) = T V (we assum e tha t M an d N ^ 1 are integers) . Fo r 0 < \a\ ^ § , w e hav e
W f J K g l o l " 1 . Henc e
LEMMA 1 2.1 . Let F{x) be a complex-valued function defined on the residue
classes x (mo d q) which are •-admissible. Then the corresponding sums satisfy
1
(12.13) S(M;JV)--S(0)|< Yl N" ^-)
0<|a|^
Suppose tha t th e complet e sum s satisf y
(12.14) s(-)|<c(0)(a + M ) V
for som e b G Z an d 6 > | . Thi s boun d i s ofte n tru e wit h 0 = | + £ for an y £ > 0 .
Then (1 2.1 3 ) become s
e
(12.15) S(M]N) - —5(0) 1 < c(0)£(b,q)q
where
1
(12.16) e(b,q)= Yl H- (a + 6,^)i
0<|a|<§
Since £(b,q) i s usuall y smal l (fo r example , w e hav e £(0,q) ^ 2r(q)logq), th e in -
equality (1 2.1 5 ) show s tha t th e incomplet e su m S(M;N) satisfie s essentiall y th e
same boun d a s doe s th e correspondin g complet e sum s 5 ( - ) , u p t o th e mai n ter m
fS(0) .
12. C H A R A C T E R SUM S 319
Clearly th e abov e metho d o f completin g th e su m o f a periodi c functio n F(x)
works fo r mor e genera l sum s o f typ e
(12.17) S = Y^F(n)G(n)
n
where G{x) i s a nic e functio n o f analyti c characte r whic h decay s t o zer o rapidl y a s
\x\— > oo . No w w e presen t anothe r method . Suppos e G(x) i s o f Schwart z clas s o n
M. The n on e ca n appl y th e Poisso n summatio n formula , givin g
where G(y) i s the Fourie r transfor m o f F(x). I n principle bot h method s ar e equiva -
lent, howeve r th e latte r ma y be preferable i n case G(x) propagate s som e vibrations .
If the vibration s ar e regular , G(-) ca n b e evaluate d asymptoticall y b y th e station -
ary phas e metho d (se e for exampl e Corollar y 8.1 5 ) givin g a n asymptoti c expansio n
for S i n term s o f S(^). O n th e othe r hand , i f G(x) i s wild , on e shoul d hav e som e
reservation fo r completin g S i n term s o f additiv e characters . On e shoul d tr y t o
use "harmonics " whic h ar e mor e adequat e fo r th e particula r case . Th e Fourie r
coefficients o f cus p forms , holomorphi c o r non-holomorphic , ma y serv e wel l a s th e
relevant harmonic s i n man y cases .
12.3. Complet e characte r sums .
Let Xq a n d i>q b e multiplicativ e an d additiv e character s respectivel y t o th e
modulus q. Th e complet e characte r su m
S{xq,1>q)= E * X qU(x))%{g(x))
a?(mod q)
is multiplicativ e wit h respec t t o q. Indeed , le t q = rs wit h (r , s) = 1 . The n Xq
factors uniquel y int o XrXs wher e ;\> > Xs a r e multiplicativ e character s t o modul i r , s
respectively. Th e additiv e characte r ip q is give n b y
'ax\
(12.19) ^ ( x ) = e(— )
Q>
for som e a G Z. B y th e "reciprocity " formul a
(12.20) - + - = -(mo d 1 )
rs q
where ss = l(mo d r ) an d rf = l(mo d 5) , th e additiv e characte r factor s int o i/j q =
ip*ips' Hence i t follow s tha t
(12.21) S( Xq,A) = S(Xr,r r)S(Xs,rs)-
Therefore th e problem o f evaluating a complete character su m o f modulus q reduces
to tha t o f prim e powe r moduli .
The complet e su m S(Xi V O °f modulu s q = p@ shouldn' t b e confuse d wit h th e
character su m over the finite field ¥ q whic h was considered i n Chapter 1 1 , except fo r
q = p i n which cas e 5(x ,VO i s indeed on e o f such sums . Th e cas e o f prime modulu s
belongs t o th e theor y o f L- functions fo r curve s ove r finite fields, a s describe d i n
Chapter 1 1 . Th e rationalit y o f the relevan t L-functio n togethe r wit h th e Rieman n
320 2 . CHARACTER SUM S
Hypothesis (bot h prove d b y A . Wei l [Wei ] i n this case ) yiel d algebrai c number s
9\ > • • • , 9r wit h \g v \ — p, p 2,1 suc h tha t
(12.22) S(x,il>)=9i + '-+9r,
The number r is bounded independentl y o f the characteristic p. Moreover , assumin g
some non-singularit y condition s fo r th e rationa l function s / , 9 wit h respec t t o the
characters x^-> ther e ar e n o root s g v with \g v\ = p, s o (1 2.22 ) yield s
(12.23) \S( X,n^rpi.
See Chapter 1 1 and i n particular Sectio n 1 1 .1 1 for mor e complet e discussio n o f thi s
situation.
There is no need to algebrai c geometry fo r th e complet e characte r sum s S(x,ip)
to modulu s q = p$ with (3 ^ 2 , because i n this cas e elementar y argument s are
available.
LEMMA 1 2.2 . Let q = p2a with a ^ 1 . Then we have
(12.24) S( x,ip) = pa £ * x(f(y))Hg(y))
y(mod p Q )
%)E0(modpQ)
where h(y) is the rational function given by
(12.25) h(y) = ag'(y) + b^(y)
with the integers a , b depending on the characters ip, \ which are determined by
(12.19) and (1 2.27) below.
REMARK. Sinc e th e character s \,ty hav e modulu s p 2a i t is required fo r cor -
rectness o f the summatio n (1 2.24 ) t o know tha t x(/(2/))V ;(5f(y)) doe s no t depen d
on th e choic e o f the representativ e o f y(mod p a) o n th e curv e h(y) = 0(mod p a).
This propert y follow s i n th e cours e o f th e proof .
PROOF. Writ e x = y + zpa, where y and z run independentl y ove r an y fixed
systems o f residue classe s modul o p a , an d y is restricted b y the condition p \
fo{y)9o{y)- W e hav e
(12.26) f{x) = f(y) + f'(y)zp° (mo d p 2«)
Indeed, th e congruence (1 2.26 ) i s easily see n fo r monomials x n by the binomia l
formula
(y + zpa)n = yn + ny n-1 zp°l + • • • .
Then i t extends t o arbitrary polynomial s fi(x),fo(x) wit h integra l coefficient s b y
linearity. Moreover , i f /o(y) = 0(mo d p), the n (1 2.26 ) i s verified fo r f(x)/fo(x) a s
follows:
A(*)//„(*) = (My) + Mv)zPa)fo(y)(i - fo(y)My)zPa)
= Mv)Hv) + (My)fo(y) - R{y)f'0{v)Mv))*Pa-
By (1 2.26 ) w e ge t
a
x(f(x)) = x(f(y))x(i + j(y)z P ).
12. CHARACTE R SUM S 32 1
Clearly %( 1 - f zpa) i s an additiv e characte r t o modulu s p a , s o there exist s an intege r
b (uniquel y determine d modul o p a) suc h tha t
'bz\
(12-27) X X^
( l ++ zp-)
z ^ )== eey( ^ ) .
Hence
a
(12.28) *(/(*) ) = x(f(y))e(b^(y)zp- ).
Similarly w e get (1 2.26 ) fo r th e rationa l functio n g(x), whenc e
f
(12.29) ^(g(x)) = 4>(g(y))e(ag (y)zp-a).
a
Multiplying (1 2.28) , (1 2.29 ) an d summin g ove r th e residu e classe s y,z modul o p
we ge t
a
s(x,V0= E * x(f(y)M9(y)) £ e(h(y)z P- ).
y(mod p a) ^(mo d p a)
The inne r su m vanishe s unles s h(y) = 0(mo d p a) i n whic h cas e i t equal s p a. Thi s
completes th e proo f o f (1 2.24) . •
LEMMA 1 2.3 . Let q = p 2ot+l with a ^ 1 . Then we have
(12.30) 5(x , </0 = Pa E * x(f(y)Mg(y))G p(y)
y(mod p a)
h(y)=0(mod p a)
where G p(y) is the Gauss sum
2 a
(12.31) G p(y)= Y. e p(d(y)z + h(y)p- z).
z(mod p)
Here h(y) is the rational function (1 2.25) but with b given by (1 2.35) below, and
b f h
(12.32) d(y) = y\y) + - j{y) + (p - l) -(L{y)f.
REMARK. A S Z run s mo d p , s o does z 2/2p, therefor e th e Gaus s su m (1 2.31 ) i s
correctly defined .
PROOF. W e writ e x — y - f zp a wit h y runnin g modul o p a subjec t t o p \
fo(y)go(y), an d z runnin g modul o p a+1 . A s befor e w e argu e tha t
(12.33) f(x) = f(y) + f(y)zp a + ^f"(y)z 2p2a (mod p 2a+l
).
Note tha t th e rationa l functio n \f"{y) ha s integra l coefficients . Thi s i s clea r fo r
the monomia l y n becaus e n(n — 1) = 0(mo d 2) , the n fo r an y integra l polynomia l
by linearity , an d finally fo r a rationa l functio n f(y) = fi(y)/fo(y) b y th e identit y
1 £ti _ 1 f'f f-l f f f-2 I f £" f-2 | f( £f\2 £-3
— 1
2/ 2^ 1 1
^ 0 J J0J0 2l loJo T / U O J JO •
By (1 2.33 ) we get
2 a
(12.34) X (f(x)) = x(/(y))x(l + (£(y)z + \fr (y)z P°)P ).
32 2 12. CHARACTE R SUM S
Consider th e functio n
v2s
1 e 1
« + ^) = \p(^r + (P- 2pj
4)- a+1 v J
2a+1
This i s a characte r o f th e subgrou p o f residu e classe s x ( m o d p ) wit h x =
l ( m o d p a ) . Indee d
f ((1 + zp a){l + wp a)) = £(l + (z + w + zwp a
)pa)
2
(Z + W Z +W2\
= £( l + 2 p a ) £ ( l + w p a ) .
Since th e subgrou p ha s orde r p a + 1 an d £ 6 ar e al l differen t fo r distinc t b modul o
p a + 1 , i t follow s tha t ther e exist s a n intege r b (uniquely determine d modul o p a + 1 )
such tha t
bz bz \
(
Using (1 2.34) , (12.35) an d
(12.36) mx))=my))e (?mz+-m^
we deriv e tha t
2 a
S(X,1>)= E X{f(ym9{y)) £ e p(d(2/)z + h(y)p- z).
a 1a
y(mod p ) z ( m o d p + )
Here th e innermos t su m vanishe s unles s h(y) = 0 (mo d p a ) i n whic h cas e i t equal s
paGp(y). Thi s complete s th e proof o f (1 2.30) . •
The Gaus s sum s G p(y) wer e compute d i n Chapte r 3 . I f p\ 2d(y) , w e hav e
(12.37) G p ( ! ,)=£p!,s(^))e,(_Se(^))2).
The formulas (1 2.24 ) an d (1 2.30 ) represent trul y th e final stage of computation .
The onl y term s whic h ar e no t determine d o n th e righ t sid e ar e th e root s o f th e
congruence h(y) = 0(mo d p a). However , i n practic e ther e ar e no t man y root s
(essentially a bounde d number ) s o on e get s th e estimat e |5(x , VOI ^ cq 1 /2 wit h c
depending mildl y o n th e coefficient s o f th e rationa l function s / , g.
EXERCISE 1 . Usin g Lemm a 1 2. 2 an d Lemm a 1 2. 3 together wit h (1 2.37 ) eval -
uate th e Kloosterma n su m
(12.38) S(m,n;g) =^ e (J
x(mod q)
for q = pP wit h (3 ^ 2 . Suppos e p \ 2mn. Sho w tha t S(m,n\q) vanishes , unles s
( f ) = (^ ) i n whic h cas e
/ I \ i /2£\
(12.39) S(m,n;q) = 2 ( -)q*Re e qe( — J
12. C H A R A C T E R SUM S 32 3
where £ 2 = ran (mod q).
The Kloosterma n sum s t o modulu s q = p@ wit h / 3 ^ 2 were compute d firs t b y
H. Sali e [Sal] . H e als o compute d th e so-calle d Sali e sum s
J
/ ^ ^ x r^ x v
(12.40) T(m,n;q)= t
£ ^(-Je
f%\ fmx
(J rnx\
x ( m o d q)
where (- ) i s th e Jacobi-Legendr e symbol , includin g th e cas e o f prim e modulus .
One ca n d o i t fo r composit e modul i a s well .
LEMMA 1 2.4 . Suppose (q,2n) = 1 . Then
(12.41) T(m,n;q) = eqq$(^) ] T e(^) .
v2=mn(mod q)
PROOF. Conside r th e function F{u) = T(ra , nu 2; q) defined fo r u (mo d q). Th e
Fourier transfor m o f F(u) i s
u ( m o d q)
nxu2 — m? N
= E (fMv ) E « (
x(mod g ) u(mo dq )
I /n\ v-^ * fmx — 4nxv 2\
£ q
o \-a) Sc(mod<q ) ; )
by th e formul a (3.21 ) fo r quadrati c Gaus s sums . Notic e tha t th e Jacobi-Legendr e
symbol cancele d out , therefor e th e las t su m i s th e Ramanuja n su m
5(0, m - 4nv 2 ; q) = ^ dfi(q/d).
d\(4mn — v 2,q)
Hence b y Fourie r inversio n
m-\ E ^KfW(=)S> © E
f (mo d q ) d| q i>(mo
«(=) •
dq)
•L> 2 =4ran(mod d)
If (u , #) = 1 , this simplifie s t o
v2=?nn(mod g )
In particular , fo r u — 1 we ge t (1 2.41 ) . •
Suppose (q,2mn) = 1 . The n T(m,n;q) vanishe s unles s ther e exist s £ with
2
(12.42) £ = mn(mod q).
324 2 . CHARACTE R SUM S
Given £, al l th e solution s t o v 2 = run (mod q) ca n b e writte n explicitl y a s v =
(rf — ss)£, wher e r , s ru n ove r th e factorization s rs = q with (r , s) = 1. Henc e th e
formula (1 2.41 ) ca n b e writte n mor e explicitl y a s follows :
(12.43) T(m,n ]q)=eqqiQ £ c (^(j-j)).
(r,s) = l
The Kloosterma n sum s t o prim e modulu s canno t b e compute d i n elementar y
terms. Thi s shouldn' t b e surprisin g i n view of the result s (an d conjecture ) concern -
ing th e distributio n o f angle s o f Kloosterma n sum s (se e Sectio n 21 .2 , i n particula r
Theorem 21 . 7 an d th e Sato-Tat e Conjecture) .
12.4. Shor t characte r sums .
In thi s sectio n w e ar e dealin g wit h characte r sum s ove r a shor t interva l
(12-44) S X(N)= Yl *(n)>
M<n^M+N
where \ i s a non-principa l Dirichle t characte r o f modulu s q. B y Lemm a 1 2. 1 we
obtain
a
(12.45) \S X(N)\^2 Y. ~l\9x(")\
where
(12.46) g x(a)= £ x(*)e(y )
;r(mod q)
are th e classica l Gaus s sums . Le t x*(mo d q*) b e th e primitiv e characte r whic h
induces x- W e hav e
(12.47) g x(a)=gx.{l) £ d?(ald)n{q/dq*)
d\(a,q/q*)
by Lemm a 3.2 . Henc e
(12.48) \g x(a)\<cr((a,q/q*))V¥-
Inserting thi s boun d int o (1 2.45 ) w e deriv e
(12.49) \S x(N)\^3r(q/q*)V¥logq.
Since r(m) ^ 2y/m, thi s yield s
THEOREM 1 2.5 . For any non-principal character \ (mo d q) we have
(12.50) | J^ x(n)\<6y/Qlogq.
M<n^M+N
This inequalit y wa s prove d i n 1 91 8 independently b y G . Poly a [Pol ] an d I . M .
Vinogradov [V4 ] excep t fo r th e constan t 6 . B y th e Gran d Rieman n Hypothesi s fo r
Dirichlet //-function s on e ca n deriv e a slightl y stronge r estimat e
SX(N) <^ y/qloglogSq,
12. CHARACTE R SUM S 325
however, nothin g bette r tha n th e Polya-Vinogrado v inequalit y i s know n i n gen -
eral excep t fo r th e constan t factor . Fo r recen t improvement s o f thi s constan t se e
A. Hildebran d [Hi2] .
Next w e giv e anothe r derivatio n o f th e Polya-Vinogrado v inequalit y withou t
completing S X(N). W e hav e
where f(x) = min( x - M , 1 , M + N + 1 - x) i f M < x ^ M + N + 1 , and /(a: ) = 0 ,
otherwise. Thi s redundan t cut-of f functio n will hel p t o separat e th e variable s i n
the forthcomin g expressions . Fo r an y intege r a we hav e
Sx(N) = J2x(n + a)f(n + a).
n
Hence
ASX(N)= J2 E X(n + a)f(n + a).
0<a^A M-A<n^M+N
Now separat e th e variable s i n f(n + a) b y th e Fourie r transfor m
oo
f(t)e((n + a)t)dt
/ -oo
getting
OO
|/(t)|B(t)d*
/ -oo
where B(t) i s th e su m i n tw o independen t variable s
B
W= I E I E X( n + a)e(ni )
0<a^A M-A<n^M+i V
By Cauchy' s inequalit y
1
1
cc(mod q) n
^G + J)EE | E X(ni+x) x(n2+x)
n\ n 2 x(mo d g )
Suppose x(mo d q) i s primitive . The n
(12.51) ^2 X(ni + ^)x(^ 2 + a ) = 5(0 , ni - n 2 ; g)
x(mod qr )
is th e Ramanuja n sum . Henc e w e derive tha t
m2^(j + 1 . 1 \/A + N
] ) (^ + l)(A + N)R(q)
q)\ q
where
S
(12.52) R(q)= Y, \ (0>V><1 )
y(mod q)
32 6 12. C H A R A C T E R SUM S
Assuming tha t N ^ | (a s w e ca n b y virtu e o f periodicity) , an d choosin g A = N
the abov e boun d simplifie s t o 6R(q). Henc e
oo
\f(t)\dt.
/ -oo
By |/(0 I — ( T T 0 - 2 | s m ( 7 r 0 s m ( 7 r ^ ) l w e m
^ e r tha t th e integra l i s bounde d b y
3 log 2iV, an d conclud e tha t
|5x(JV)K8fl(g)hogg.
Note tha t i?(g ) = 2 W(«V(<7) ^ T
(?)9- therefor e
(12.53) |5 x (JV)|<8(T((7) 9 )ilogg.
For q prime this estimate give s the Polya-Vinogradov inequalit y whil e for composit e
q it i s only slightl y weaker .
REMARK. Th e facto r log g emerge d i n th e proces s o f separatio n o f variables .
Since thi s facto r canno t b e entirel y ignored , w e mak e a poin t tha t i n genera l th e
losses from separatio n o f variables ar e not exclusivel y o f technical nature , bu t occu r
inevitably fo r arithmeti c reason s a s well. Thi s comment sound s eve n stronger i n th e
context o f bilinea r forms , wher e factorizatio n b y superficia l separatio n o f variable s
at n o cos t woul d b e fata l (thin k o f Corollar y 7.3) .
The Polya-Vinogrado v inequalit y (1 2.50 ) i s trivial fo r characte r sum s S X(N) o f
length N < C #2, whil e th e tru e boun d i s expecte d t o b e
e
(12.54) S x(N)<&N*q ,
which i s non-trivia l i f N ^ > q 3e. I n a serie s o f paper s D . Burges s [Burl] , [Bur2 ]
established severa l result s fo r relativel y shor t characte r sums , on e o f the m bein g
THEOREM 1 2. 6 (D . BURGESS) . Let x be a primitive character of conductor
q > 1 . Then
1
(12.55) 5 x(A^)<7V -^S*+£
for r = 2 , 3, and for any r ^ 1 if q is cubefree, the implied constant depending only
on e and r.
The hypothesi s tha t q i s cubefre e ca n b e remove d a t th e cos t o f weakenin g
the result . Le t q = k£, wher e £ is th e maxima l cubefre e factor . B y factorin g th e
character x(mo d q) accordingl y an d splittin g th e summatio n int o classe s modul o k
one derive s fro m Theore m 1 2.6 ,
(12.56) S X(N) < Ni-rkh^+e
(if £ = 1 , then fc = q and (1 2.56 ) follow s b y th e periodicit y o f x) .
Burgess's boun d (1 2.55 ) fo r q cubefree i s non-trivia l i f i V ^> g 4 + 4?+e? an d b y
taking r sufficientl y large , on e derive s
(12.57) S X(N) < N 1 -6, ifN^ q + +vrs
for an y 0 < 5 ^ | , th e implie d constan t dependin g onl y o n 6.
12. C H A R A C T E R SUM S 32 7
We shal l giv e a proo f o f a slightl y stronge r estimat e
l
(12.58) \S X(N)\ ^ cN -rpri$(\ogp)r
but onl y fo r character s x ( m ° d p) t o prim e modulus . Her e c is an absolut e constan t
(c = 3 0 i s fine). Ou r argument s g o b y inductio n wit h respec t t o N. Firs t notic e
that th e assertio n (1 2.58 ) i s eithe r trivia l o r i t follow s fro m (1 2.50) , unles s
r
(12.59) c v\ + h log p ^ T V ^ p i + i i Q gp
which conditio n w e hencefort h assume . Applyin g a shif t n \-> n + h wit h 1 ^ h ^
H < N w e obtai n
(12.60) S X{N)= Yl x(n + h) + 26E(H)
M<m^.M+N
where |0 | < 1 and
(12.61) E(H) = ctf-rp&ilogp)*
by th e inductio n hypothesi s (1 2.58 ) fo r th e tw o characte r sum s o f lengt h h whic h
do no t overla p wit h th e origina l segment . Le t H = AB, wher e A,B ar e positiv e
integers. W e us e th e shift s o f typ e
(12.62) h = ab wit h 1 ^ a < A , 1 O< B.
Averaging (1 2.60 ) ove r a , b we get
s
* w = jj E* E *( n afc
+ )+ 20E w
l^a^A M<n^M+N
where \0\ ^ 1 . B y x( n + a fy = x( a )x(& n + &) > where a i s the multiplicativ e invers e
of a modulo p (her e we employ tw o properties o f characters, th e periodicity an d th e
multiplicativity) w e deduc e tha t
(12.63) \S X(N)\ < H^V + 2£(# )
where
^ = E " H E ^+ 6 )|
z(modp)
1 ^6^ £
and v(x) i s th e numbe r o f representation s o f x a s an ( mo d p ) wit h 1 ^ a ^ A an d
M < n ^ M -\- N. W e shal l estimat e V withou t usin g th e inductio n hypothesis , s o
the implie d constan t wil l b e independen t o f c .
As a and n vary over short segment s relative to q, many residue classes x(mod q)
are no t represente d b y an; i n othe r words , v(x) i s ofte n zero . Moreover , i/(x) i s
essentially bounded , bu t i t i s impossibl e t o analyz e it s rando m change s o f size .
Therefore w e relax i/(x) i n V b y applyin g th e Holde r inequalit y
328 12. C H A R A C T E R SUM S
where
v1= Y, "(*) • ^ = E ^
x ( m o d p) a;(mo d p)
w
= E | E x( * + &)f.
x ( m o d p) l^b^B
We could hav e restricted th e outer summatio n i n W t o the residu e classe s x(mod p)
for whic h u(x) ^ 0 , bu t w e are no t abl e t o tak e advantag e o f such a condition . Th e
extension t o th e complet e su m i s massive, nevertheles s i t i s nominal relativ e t o th e
length o f the characte r su m raise d t o power 2r. Thi s explain s ou r choic e of Holder' s
exponents.
Clearly, w e hav e V\ — AN. W e shal l sho w tha t V 2 i s als o o f thi s magnitud e
(essentially).
LEMMA 1 2.7 . We have
(12.64) V 2 < SANiANp- 1 + log3A) .
P R O O F . V 2 i s th e numbe r o f quadruple s {ai,a 2,ni,n2) wit h 1 ^ a\,a 2 < A
and M < ni,n 2 ^ M + N suc h tha t a\n 2 = a2ni(modp) . Fi x a\,a 2 an d pu t
a\n2 — a 2ri\ — kp. W e hav e
AN
K - (fl l — ^ 2 )
P V
and (ai , a2)|/c. Give n ai ,G&2 an d k as above we find that th e number o f pairs (ni,n 2)
satisfying th e equatio n a\n 2 — a 2ri\ = kp i s bounde d b y 2N(ai,a 2)/max(ai,a2).
Therefore
(01,02) / 2AN
iax(ai,a ) \(ai,a );
i ^ 2 a m a x ( f l l ' G 2 )2l ( a i ' f l 22) p 7
Hence (1 2.64 ) follow s afte r simpl e estimates . •
LEMMA 1 2.8 . We have
(12.65) W ^ (2rB) rp + 2rB 2r
p^.
This estimat e fo r W lie s a t th e hear t o f Burgess ' method . Assumin g (1 2.65 )
we complet e th e proo f o f (1 2.58 ) a s follows . W e choos e B = [rp^] givin g
2r
W < (2r) p*.
Then w e choos e A = [N/9rp^\. Not e tha t A ^ 1 by th e lef t sid e o f (1 2.59) , an d
AiV ^ N 2/9rp^ ^ p(logp) 2 b y th e righ t sid e o f (1 2.59) . Therefor e Lemm a 1 2. 7
gives
V2^AN(4\ogp)2.
Recalling tha t V\ — AN w e deriv e
V ^ 2r(AN) 1 ~^(4p* logp) ^ ^ N 2'^^2^ logp)- .
12. C H A R A C T E R SUM S 329
Next not e tha t H = AB < N/9 an d H > JV/1 0 . Therefor e w e deriv e b y (1 2.61 )
and (1 2.63 ) tha t
\SX(N)\ < (lO+^cJ^-ip^aogpji .
Taking c = 3 0 we obtai n (1 2.58) .
It remain s t o prov e Lemm a 1 2.8 , fo r whic h w e ca n assum e B < p. A s i n
Burgess' origina l proo f w e appeal t o th e Rieman n Hypothesi s fo r curve s ove r finite
fields. B y Corollar y 1 1 .24 , fo r an y \ modul o p whic h i s non-trivial , w e hav e
(12.66) | ] T X((x + h)--(x + b r))x{{x + b r+1 ).--(x + b 2r)) ^ 2rp 2
cc(mod p)
if one o f th e classe s b v (mod _p) , v — 1, . .. , 2r i s differen t fro m an y othe r one .
We hav e
W= ^2---Yl Yl X({x + b 1 )--(x + b r))x{{x + b r+1 )".{x + b 2r)).
l^bi,... for^B x(mod p)
The complete sum satisfies (1 2.66 ) except possibly for the (&i,.. . , b2r) whic h can b e
arranged i n r equa l pairs. Thu s the number o f exceptions i s bounded b y r ( 2 r )B r ^
(2rB)r. Henc e (1 2.65 ) follows .
REMARK. Choosin g th e involve d parameter s i n th e abov e proo f mor e econom -
ically on e ca n ge t (1 2.58 ) wit h th e facto r c(logp) ^ replace d b y c'(\ogp)^ fo r som e
absolute constan t d ^ 1 (fo r othe r comment s se e [Fri]) .
EXERCISE 2 . Prov e alon g th e abov e line s th e estimat e (1 2.55 ) wit h r = 2 fo r
any modulu s q > 1 .
The Burges s bound (1 2.55 ) with r = 2 implies a subconvexity boun d fo r Dirich -
let L- functions i n conducto r aspec t (compar e Sectio n 5.9) .
T H E O R E M 1 2.9 . Let x be a primitive Dirichlet character modulo q ^ 2, and
s = | + it. We have
+£
(12.67) L(s,x)<£\s\q^
for any e > 0, the implied constant depending only on e.
PROOF. B y Theore m 5. 3 w e hav e
where V s(y) i s a rapidl y decayin g functio n give n b y (5.1 3) . Precisel y i t satisfie s
Hence
33 0 12. C H A R A C T E R SUM S
By (1 2.55 ) wit h r = 2 we hav e
13
s
S(x) = 22x(n) <min(<7,x2gi6+ ).
Then b y partia l summatio n w e ge t
\L(s,X)\ ^ J" \s(x)(x-V a(x/y/q)) dx
and th e resul t follow s b y applyin g th e relevan t estimates .
•
12.5. Ver y shor t characte r sum s t o trul y composit e modulus .
It i s natura l tha t Burgess' s metho d fail s t o produc e a non-trivia l estimat e fo r
the characte r su m S X(N) o f length N < C q*, wher e q is the conducto r o f \. O n th e
other hand , w e do have effective treatment s o f exponential sum s of Weyl's type (se e
Chapter 8 ) which ar e very short relativ e t o the siz e of the amplitud e function . Her e
we appl y th e differencin g proces s t o reduc e th e amplitud e functio n severa l time s
until i t i s small enoug h (howeve r unbounded ) s o that a non-trivia l estimat e fo r th e
resulting su m i n th e fina l ste p i s availabl e fro m specia l sources . I n thi s respec t th e
case o f th e characte r su m S X(N) i s quit e differen t becaus e on e normall y canno t
reduce th e conducto r b y shifting th e argument ; absolutel y not , i f the conducto r i s a
prime number . Ther e is , however, a possibility t o imitat e th e Weyl-Va n de r Corpu t
differencing idea s fo r characte r sum s o f modulu s whic h facto r int o relativel y smal l
numbers; w e sa y tha t suc h a modulu s i s trul y composite . A n importan t estimat e
for characte r sum s o f specia l composit e modul i wa s give n b y S . W. Graha m an d J .
Ringrose [GRi ] (se e als o [H-B ] an d [FI3]) . I n thi s sectio n w e presen t ou r versio n
of th e Graham-Ringros e magnificen t result .
We shal l procee d b y inductio n wit h respec t t o th e numbe r o f factor s (no t nec -
essarily primes ) i n th e modulu s o f th e character . Fo r thi s reason , i n orde r t o mee t
the inductio n hypothesis , w e trea t mor e genera l sum s tha n necessar y fo r th e fina l
result.
Let £(mo d k) b e a multiplicativ e character . W e conside r
(12.68) St(N)= ] T £(/(n) )
M<m^M+N
where f(x) i s a rationa l functio n wit h al l zero s an d pole s bein g integers . W e writ e
771
(12.69) f(x) = Y[(x-a 1 /)d"
z/=l
where d v ar e non-zer o integer s an d a v ar e an y integers . W e do no t assum e tha t ai ,
. . . , CLfYi a r e distinct , s o th e representatio n (1 2.69 ) doe s no t defin e th e exponent s
d i , . . . , dm uniquely . Recal l that fo r a given representation / = fi/fo wit h / i , // Q G
Z[x] w e define d £(/(#) ) — £,(fi(x))£(fo(x))i therefor e th e su m (1 2.68 ) i s actuall y
restricted b y
(12.70) ( ( n - a i ) - . - ( n - a m),fc) = l.
12. C H A R A C T E R SUM S 331
THEOREM 1 2.1 0 . Let £ = xi'"Xr-iX> where \t ( m ° d qi) for 1 ^ £ < r
are any characters and x (mo d q) is a primitive character of conductor q > 1 , q
squarefree. Let f be given by (1 2.69) with
(12.71) ( d i , . . . ,dm,h ) = l
J
where h is the order of x- Put NQ — max{gi,... , qr-i, q }q*, and
(12.72) A=niI(S-S) -
Then for any N ^ NQ we have
1
(12.73) IS^N^ ^4N((Aq 1 ---qr.l,q)q- r(qy2Tm(q)2n2~r•
The powe r o f proo f stem s fro m estimatio n o f th e complet e characte r su m
(12.74) S( X)= £ x(/(*)) -
:r(mod q)
If q = p i s prime w e are dealin g wit h a sum ove r th e field ¥ p. I n thi s cas e w e deriv e
from [Sch ] th e followin g
PROPOSITION 1 2.1 1 . Let x (mo d p) be a non-principal character of order h
which satisfies (1 2.71 ). Then
(12.75) | S ( X)|<(m-l)(A,p)M-
P R O O F . Firs t not e tha t p ^ 2 becaus e ther e i s onl y th e trivia l characte r o f
modulus two . Fo r m = 1 we hav e f(x) = (x — a\) dl wit h (di , ft)=0 an d
dl
S(x)= E x 0*) = o,
a:(mod p)
so (1 2.75 ) hold s (i n thi s cas e A - 1 ) . Le t m ^ 2. I f A = 0(mod p) , the n (1 2.75 )
is trivial . No w suppos e al l th e number s a i , . . . , am ar e distinc t modul o p . I f / i s
a polynomia l (i.e. , al l th e exponent s c?i,.. . , dm ar e positive) , the n th e polynomia l
F(X, Y) — Yh — f(X) i s absolutel y irreducibl e (se e Lemm a 2C o f [Sch] ) an d ou r
assertion follow s fro m Theore m 1 1 .23 . I f f{x) — fi(x)/fo(x) wit h fi(x),fo(x) £
Z[x] having all distinct zeros , then we consider the polynomial g(x) = fi{x)fo(x) p~2
in place of the rationa l functio n f(x). Not e that x(f( x)) — x(d( x)) a n d (p~2 , ft)=
1, becaus e h\(p — 1). Therefor e th e conditio n (1 2.71 ) hold s fo r g(x), s o (1 2.75 ) i s
established. •
Proposition 1 2.1 1 extends b y multiplicativit y a s follow s
COROLLARY 1 2.1 2 . Let x (mo d q) be a primitive character of squarefree con-
ductor q and order ftwhich satisfies (1 2.71 ). Then
(12.76) |S( X )K(m-ir^(A,g)M
where uo{q) is the number of prime divisors of q.
332 12. CHARACTE R SUM S
PROOF. I f q = p\ • • -p t, the n x — Xi ' * * Xt> wher e Xi > • • • >Xt ar e primitiv e
characters o f orde r / i 1 ? . . . , ^ respectivel y whic h ar e divisor s o f h. Therefor e th e
condition (1 2.71 ) i s satisfied fo r eac h character Xs(mo d p s). Multiplyin g (1 2.75 ) fo r
X i , . . . ,X t w e ge t (1 2.76) . •
Now w e ar e read y t o prov e tha t fo r N ^ TVQ ,
1
(12.77) \S Xl...Xr.lX(N)\ < c r(m)((Aqi • • • 9 r_ 1} q)q~ )2'rN
by inductio n wit h respec t t o r . Th e factor s c r(m) ^ 1 will b e determine d i n th e
induction process ; the y depen d o n g , but i t i s not necessar y t o displa y thi s becaus e
q wil l no t chang e throughou t th e process . Moreover , w e d o no t chang e th e se t
of exponent s { d i , . . . ,<i m} i n th e representatio n (1 2.69) , althoug h ever y inductio n
step wil l doubl e th e numbe r o f zero s an d pole s o f th e resultin g rationa l functions .
an
For r = 1 we hav e £ = x d
W =E x(f(n)) = ^s(x) + eq
where 5(x ) i s th e complet e su m (1 2.74 ) an d |0 | ^ 1 . Henc e w e obtai n b y (1 2.76 )
that (becaus e q? ^ N)
w (,9/) (A,g)^
(12.78) | S x(A0|<m (^^)T./V.
Now le t r ^ 2 . W e appl y th e shif t n »-> • n + /M7 1 and averag e ove r 1 ^ h ^ H
getting
n
"%W = £ E *i(/( )) E l(/( n + ^i)) + 2 ^9 l
M<n^M+N l^h^H
where £ = X 2 ' " X r - i X a n d |0 | ^ 1 - Her e th e erro r ter m i s obtaine d b y trivia l
estimation o f the shor t characte r sum s whic h d o not overla p wit h th e origina l sum .
Hence \S^{N)\ ^ T + 2Hq 1 , wher e
iE | E «/( n + ^i))
M<n^M+N l^h^H
By Cauchy' s inequalit y
T 2
<^ E *| E £(/( " + Mi)//(n + Mi))
l^InM^H M<n^M+N
Here th e inne r su m i s o f typ e (1 2.68 ) fo r th e reduce d characte r £ and th e quotien t
function /(# ) = / ( # + h\qi)/f(x + / ^ l ) - Thi s ha s 2 m zero s an d poles . Fo r f(x)
written a s (1 2.69 ) w e introduc e th e polynomia l
A
0 ) = 1 1 I I ^+ a
^ ~ a ^ ) > degA(x ) =m(m-1 ) ,
so A = A(0) . The n th e correspondin g polynomia l fo r f(x) i s
A(x) = A 2(x)A2(x + y)(x + y) 2rn
12. C H A R A C T E R SUM S 33 3
where y = (hi - h 2)qi- Henc e A = A(0 ) = A 2A2(y)y2m an d
(Aq2 - - • q r-i,q) < (A«?i • • • q r.1 ,q)(yA(y),q/(q, qi)).
By th e induction hypothesi s w e ge t
| £ l ( / ( " ) ) | < c r _ 1 (2m) 5 (ft 1 - h 2)((Aqi • • • qr_uq)q-1f~' >
n
where g(h) = (hqiA(hqi)^q/(q Jqi))2 . W e choose H = q and estimate a s follow s
ft
jp X ) * 9(hi-h 2) = - J2 ( 9iA(ftgi),g/( 9 ,gi)) 2 l " r
l ^ / u , / i 2 ^ # ft (mo d <? )
2/(mod g )
5
i^ (J/A(J/), 9))
2/(mod 9 )
For th e inner su m we have
- J2 (j/A(ft),g)<^;|{y(modd);yA(j/)=0(modd)} |
y(mod q) d\q
< ] T ( 1 + d e g A )w ( d ) = ( 2 + d e g A ) " ^ .
We hav e deg A = m(m - 1 ) , so 2 + degA ^ 2m 2 and (2 + deg A )^ ^ r(g)r^(g) .
Gathering th e above estimate s w e obtain
\St(N)\ s C cl1(2m)(r(q)Tl(q)(Aq1 • • • qr_1,q)/q)2'"N + 2q Ql.
Since qiq% < A/" , this yield s (1 2.77 ) fo r r ^ 2 , provided
C-(m) > ci 1 (2m)(r( g )T^(g)) 2 "" + 2 .
By (1 2.78 ) w e get (1 2.77 ) fo r r = 1 , provided ci(ra ) ^ r m (g). On e can check tha t
the number s
cr(m) = A(T(qY 2Tm{q)2r)2~r
satisfy th e above (recurrence ) inequalit y (us e r2rn(q) = r(q)r rn(q)) givin g (1 2.73) .
Our mai n interes t i n Theorem 1 2.1 0 is for / ( x) = x (s o m = 1 , A = 1 ) in which
case i t yield s
THEOREM 1 2.1 3 . Let \i (mo d qi) for 1 ^ £ < r be any characters and
let x (mo d q) be a primitive character of conductor q > 1 , q squarefree with
(q^qi ' * *^r-i) = 1 - Then for N ^ No = max{qi,... ,q r-i,q±}q± we have
T((?r 2
(12.79) Yl Xi'--Xr-ix(rc ) <4iV| ~
M<m<^M+N
33 4 12. C H A R A C T E R SUM S
COROLLARY 1 2.1 4 . Let x be a primitive character of conductor q > 1 , q
squarefree. Suppose all prime factors of q are $J NQ . Then
r/2
1 /r2r
(12.80) £ x(n)k4JM*) V
M<n^M+N
where r is any integer with N r ^ q 3.
PROOF. Le t p b e th e larges t prim e facto r o f q. W e ca n writ e q = qi - — q r wit h
qi < pq* fo r 1 ^ £ < r (som e o f th e factor s ca n b e one) . W e hav e qi < TV's .
Moreover, on e o f th e factors , sa y q r, satisfie s q r ^ qr. W e ca n als o requir e tha t
uj{qr) ^ cj(q)/r. T o mee t thes e condition s defin e q r a s th e smalles t produc t o f
the larges t prim e factor s o f q suc h tha t q r ^ qr. The n defin e qg successively fo r
1 ^ £ < r a s th e larges t produc t o f th e larges t prim e factor s o f q/q r • • • qi+\ suc h
that qi ^ pqr (i f q = q v • • • #g+i, the n pu t ^ = • • • = q\ = 1 ) . Clearly , ever y prim e
factor o f q is used onc e i n thes e r steps , s o q = q\ • • • q r. Accordingl y x — X i ' '' Xr,
where Xi a r e character s modul o qt an d Xr i s primitive . Therefor e (1 2.80 ) follow s
from (1 2.79) . •
The diviso r functio n i n th e estimate s (1 2.79) , (1 2.80 ) i s a nuisance . I n genera l
we hav e r(q) = (a x + 1 ) • • • (a n + 1 ) ^ 2 a i + - " + a - = 2 QM i f q = p? 1 • • - p ^, wher e
H(g) i s th e numbe r o f prim e factor s o f q counte d wit h multiplicity . Thi s ca n b e
quite large . W e estimat e Q(q) a s follows ,
CZ
^-—' lnc
logdr r\ o g 2l n a :7.f^Vlog
^ l ^-^ ^-—' V lno *d/7lo l nga 2/ /
7. log lnczr log
? 2
z
d\q d\q d<z °
where z > 1 and c is a n absolut e constant . Choosin g z = logq fo r q ^ 3 we ge t
(1181 ) ( . ( . K ^ f ! . <
log log q V lo g log q
Hence
1 +C//loglog(? log(? loglog(?
(12.82) T(Q ) ^ 2( ) / .
EXERCISE 3 . Prov e tha t r(q) ^ g V log log 3? for al l q ^ 1
Using thi s crud e estimat e w e deriv e fro m (1 2.80 )
COROLLARY 1 2.1 5 . Let x be a primitive character of conductor q ^ 3,q square-
free. Let p be the largest prime factor of q. Then, for
(12.83)T V > q £{q) + p9 with e(q) = 4(loglogg)" 2
we have
(12.84) Yl X(n) < iVexp(-V /lo^)
M<n<^M+N
where the implied constant is absolute.
12. CHARACTE R SUM S 335
EXERCISE 4 . Le t th e condition s b e a s i n Corollar y 1 2.1 5 . Sho w tha t
(12.85) L(l , x) < e(q) log q + 9 logp + b
where b is a n absolut e constant .
12.6. Character s t o powerfu l modulus .
The trul y composit e modul i (not e tha t th e number s havin g n o larg e prim e
divisors, th e so-calle d "smoot h numbers" , o r "entier s friables " i n French , ar e trul y
composite) appea r i n practic e no t a s ofte n a s th e prim e moduli . Anothe r extrem e
are th e modul i whic h ar e hig h power s o f a fixed prime . Thes e ar e example s o f
powerful numbers . W e sa y tha t q is powerfu l i f it s kerne l
(12.86) * = n ^
p\q
is smal l relativ e t o q in th e logarithmi c scale .
Let x b e a primitiv e characte r o f conducto r q which i s a powerfu l number . I n
this cas e a larg e par t o f characte r value s ca n b e describe d a s a n exponentia l o f a
polynomial, s o the characte r su m S X(N) ca n b e reduced t o Wey l sums. T o this en d
consider th e followin g polynomial :
(12.87) L(X) = X~Y + -..±^- .
One ca n sho w tha t (se e [Ga2] , p . 1 92 )
a b
(12.88) L(x + y + xy) = L(x) + L(y) + ] T * c(a,b)x y
d<a+b^2d
with som e rationa l coefficient s e(a , b) such tha t c(a, b) G Z[a, 6].
Suppose 4 { q (fo r a technica l reason) . Le t d be sufficientl y larg e s o tha t q 2\kd.
Let D b e th e produc t o f al l m ^ d , (m,q) = 1 . Clearl y DL(kx) G Z[x], an d i t
follows fro m (1 2.88 ) tha t
DL(kx + ky + k 2xy) = DL(kx) + DL(ky) (mod q).
Hence th e functio n £( 1 + kx) = e(DL(kx)/q) i s a characte r o f th e multiplicativ e
group o f residu e classe s mo d q congruen t t o 1 mod k. I t i s eas y t o se e tha t th e
order o f £ is q/k, whic h i s the orde r o f th e group . Therefor e ther e exist s a n intege r
B suc h tha t
(12.89) x ( l + kx) = e(BDL(kx)/q).
Since q is the conducto r o f x i t follow s tha t B i s prime t o q/k. O n th e othe r hand ,
b is determine d onl y mo d q/k s o we ca n assum e tha t B i s prim e t o q.
By (1 2.89 ) w e arrang e th e characte r su m (1 2.44 ) a s follows :
SX(N)= J2 X(a)j2x(l + kan)= £ X («) £ > ( F ( a n ) / 9 )
0<a</c n 0<a<k n
where n run s ove r th e interva l (M — a)k_1 < n ^ (M- f N — a)/c_1, ad = l(mo d g) ,
and F(x) = BDL(kx) i s a polynomia l o f degre e d wit h intege r coefficients . Th e
33 6 12. C H A R A C T E R SUM S
last su m i s o f Weyl' s typ e whic h ca n b e estimate d b y Vinogradov' s method . Thi s
has bee n don e i n detai l i n [1 2] , gettin g
THEOREM 1 2.1 6 . Let x be a primitive character of conductor q, 2 \ q. Let k be
the product of prime factors of q. Then, for k 1 00 < N < N ; ^ 2N we have
1 6
(12-90) | Yl * ( ^ jN '
N<n^N'
where
7 = exp(200rlog 2 (1 200r)), S = (1 800r)- 2 (log3600r)~ 1 , r = | ^ | .
REMARKS. Th e first resul t o f thi s typ e fo r q = p £ wa s establishe d b y A . G .
Postnikov [Pos] . Ou r generalizatio n o f th e Postniko v formul a (1 2.89 ) fo r th e pow -
erful modulu s i s base d o n th e formul a (1 2.88 ) whic h i s du e t o P . X . Gallaghe r
[Ga2]. Ther e ar e application s o f th e result s t o boundin g th e L(s,x) , t o widenin g
the zero-fre e region , an d t o estimatin g th e leas t prim e p = a (mod q) (se e [Ga2 ]
and [1 2]) .
http://dx.doi.org/10.1090/coll/053/14
CHAPTER 1 3
SUMS OVE R P R I M E S
13.1. Genera l principles .
Given / : N—> C w e ar e intereste d i n estimatio n o f th e su m
(13.1) V = J2fiP)
v
where p run s ove r prim e numbers , o r o f th e closel y relate d su m
(13.2) S = $>(n)/(n )
n
where A(n ) i s the vo n Mangold t function . Not e tha t S i s obtained fro m V fo r / ( n )
replaced b y f(n) logn u p t o a smal l contributio n o f term s n — p v wit h v ^ 2 . O f
course, th e problem s o f estimatin g S an d V ar e equivalen t b y partia l summation ,
but i t i s ofte n simple r i n practic e t o dea l wit h S.
If / i s a nice , smoot h functio n wit h compac t suppor t o n R + on e ca n expres s S
by the zero s of ((s) b y integrating —f(s)( ,(s)/((s)1 wher e / i s the Melli n transfor m
of / . On e get s
(13-3) S = /(l) - J2 /(~2n) - J2 fiP)-
n>0 p
However, eve n assumin g th e Rieman n Hypothesis , thi s expressio n (th e so-calle d
explicit formula , se e Exercis e 4 in Chapte r 5 ) i s no t alway s useful . Fo r example , i f
/ : R+— > C i s a n oscillator y function , suc h a s
(13.4,) f{x) = e{ax)g(x)
(13.5) f[x) = e{a^c)g{x)
with a e R* an d g(x) a bum p function , the n th e Melli n transfor m f(s) i s sprea d
over a large range of zeros so that th e expansion (1 3.3 ) fail s to produce goo d results .
As wit h an y summatio n typ e formul a (se e Chapte r 4 ) th e decisio n t o appl y i t o r
not ca n b e reasonabl y guide d accordin g t o whethe r th e origina l su m ha s mor e
terms ove r prime s tha n th e resultin g su m ove r th e zeros . Recently , equippe d wit h
powerful computers , researcher s d o no t hesitat e t o downloa d fro m th e internet 1
a lis t o f numerica l value s o f th e zeros , an d ar e no t afrai d o f usin g the m t o tes t
sums ove r prime s indirectl y vi a the explici t formula . Prime s an d zer o ar e n o longe r
different beasts , i t i s a matte r o f choic e t o wor k wit h on e o r th e othe r sets .
See Odlyzko's site : http://www.dtc.umn.edu/~odlyzko/zeta_tables /
337
33 8 13. SUM S OVE R PRIME S
Many interestin g arithmeti c function s / : N —> C ar e no t eve n define d o n th e
real numbers , i n whic h cas e th e explici t formul a (1 3.3 ) i s no t applicabl e (true , on e
can tak e smoot h extensio n o f / t o R + , bu t i t doe s no t wor k efficientl y i f th e se t o f
values f(n) fo r n G N is erratic). I n such a case we prefer t o us e a sequence notatio n
A = (a n) i n plac e o f th e functio n f(n) = a n. Sinc e A i s infinite , w e conside r th e
finite sum s
(13.6) S(A,x) = ^A(n)a n
n^.x
in plac e o f (1 3.2 ) wit h th e ai m o f estimatin g thes e fo r al l sufficientl y larg e x.
It i s not difficul t t o predic t th e asymptoti c behavio r o f S(A, x) b y appealin g t o
a randomnes s o f th e Mobiu s function :
MOBIUS RANDOMNES S LAW . The Mobius function n(m) changes sign ran-
domly so that for any "reasonable" sequence of complex numbers A — (a m ) the
twisted sum
(13.7) M(A,x) = ] P / i(m)am
m^x
is relatively small due to the cancellation of its terms.
Of cours e a reasonabl e sequenc e mean s chose n wit h n o bias . I n practic e i t i s
often th e case , excep t whe n playin g wit h siev e weight s whic h d o conspir e wit h th e
Mobius function .
To appl y thi s la w t o prim e number s w e write A = \i * L, o r
(13.8) A(n ) = Y, ^ d) lo S \ = ~ E M< 0 log d,
d\n d\n
and inser t th e latte r int o (1 3.6 ) gettin g
(13.9) S(A,x) = -^2fi(d)(logd)A d(x),
d
where
a
(13.10) A d(x)= Yl -
n=0(mod d)
The quantitie s Ad(x), s o also A d(x) logd , ar e "reasonable " i f the origina l arithmeti c
terms a n ar e themselve s "reasonable" . Therefore , accordin g t o th e principle , w e
believe tha t th e contributio n o f larg e d' s i n (1 3.9 ) i s negligible , s o 5(^4 , x) i s wel l
approximated b y
(13.11) S(A,D,x) = - J ] ) fjL(d)(logd)A d(x)
d^D
for som e D whic h i s relativel y small . No w fo r d ^ D th e quantitie s A d(x) ma y
satisfy (du e t o larg e averaging ) a simple , ye t stron g approximat e formul a
(13.12) Ad(x)=g(d)A(x) + r d(x)
13. SUM S O V E R P R I M E S 339
where g(d) i s a nic e multiplicativ e function , A(x) = A\{x), an d rd(x) i s a smal l
error term . Ignorin g th e erro r ter m w e arriv e a t S(A, J9 , x) ~ H(D)A(x), wher e
ff(D) = - ] T > ( d ) ( l o g d ) 5 ( d ) .
Usually g is regularly distribute d ove r prime s s o there i s a limit o f H(D) a s D tend s
to oo , sa y H(D) ~ H(oo) — H wit h
(13.13) H = ->T>(d)(logd)0(d ) = I I ( 1 ~5(P))( 1 " ^ ) _ 1 -
Therefore, th e abov e transformation s lea d u s t o th e followin g asymptoti c formul a
(13.14) S(A, x) ~ HA(x), a s x - • oo .
Of course , provin g thi s rigorousl y i s a differen t matter . Wha t need s t o b e said ,
however, i s tha t th e heuristi c formul a (1 3.1 4 ) ha s neve r faile d t o hol d i n ever y
natural cas e i n whic h S(A, x) i s evaluated rigorousl y b y specia l means .
In 1 934 , I . M.Vinogrado v (se e [V5] , [V6] ) gav e a ver y goo d estimat e fo r th e
exponential su m
(13.15) V(a\x) = ^2e(ap)
p^x
which was needed fo r solvin g the Goldbac h proble m wit h three primes (se e Chapte r
19). I n th e absenc e o f th e Rieman n Hypothesis , thi s wa s a surprisin g accomplish -
ment. Hi s metho d applie s t o th e su m S(A,x) fo r a grea t variet y o f oscillatin g se -
quences A = (a n) whic h ar e no t multiplicative , suc h a s a n = e(an), a n = e(ay / n),
an — \( n + 1 ) ) wher e x i s a multiplicativ e non-principa l character .
Here i s how on e ca n explai n th e principle s o f Vinogradov' s metho d i n abstrac t
settings. On e start s b y repeate d application s o f th e exclusion-inclusio n procedur e
to arrang e S(A,x) int o sum s o f two type s
(13.16) Si = 2 J z J a dadn,
d^.D dn^x
(13.17) S 2
mn^x
M<m^N
where a^ , /3 m ,7 n ar e independen t rea l number s (essentiall y bounded ) an d D, M , N
are suitabl e parameter s (neithe r smal l no r larg e relativ e t o x). Th e su m o f type S\
can b e writte n a s
(13.18) S! = J2 a dAd(x)
where Ad(x) i s given b y (1 3.1 0) . Her e th e term s a n o f Ad{x) ar e cleare d o f compli -
cated coefficient s excep t fo r th e conditio n n = 0 (mo d d) whic h i s eas y t o handl e
for smal l d. Assumin g som e regularit y i n th e variatio n o f th e argumen t o f a n on e
shows tha t Ad(x) i s smal l fo r ever y d ^ D du e t o th e cancellatio n o f terms .
340 13. SUM S OVE R PRIME S
The su m o f type S 2 is considered a s a bilinear for m i n the coefficient s /3 m , 7n
(see Chapte r 7 for mor e genera l considerations) . Fo r estimatio n i t is convenient to
split S 2 int o bilinea r form s ove r dyadi c segment s
(13.19) B{M,N)= Y, P™ J2 W™-
M<m^2M N<n^2N
Applying Cauchy' s inequalit y on e remove s th e coefficient s /3 m ,7 n a s follows :
2 2
(13.20) \B(M,N)\ ^ (Y,\Pm\ )Y,J2\^ln2\C(nun2)
n\ ri2
where
(13.21) C(n un2) = V a ,mn\ Uj mri2 •
If ni = n 2 w e ca n d o nothing bette r tha n th e trivia l bound . However , sinc e N
is quit e larg e th e number o f such case s (th e diagona l terms ) i s relatively small .
On th e othe r hand , fo r a majority o f n\ / n 2 th e su m C{rii,n 2) i s small becaus e
of cancellatio n o f terms (th e variatio n o f the argumen t o f amrll i s not completel y
annihilated b y tha t o f am n 2 ) . I n this wa y on e arrive s a t a non-trivial estimat e for
S(M, N) an d als o fo r th e sum s o f typ e S 2.
Combining th e results fo r sums o f type Si an d £2 one deduce s a boun d for
S(A, x) whic h i s often remarkabl y strong . I n the forthcomin g section s w e provid e
details of the Vinogradov metho d i n particular contexts . W e shall als o present som e
other methods .
13.2. A variant o f Vinogradov's method .
In thi s sectio n w e develo p a formula fo r sums ove r prime s usin g som e modifi -
cations o f the origina l idea s o f Vinogradov. Th e argument s ar e of combinatorial
nature, the y hav e muc h i n common wit h siev e methods .
For 1 < n ^ x we put f3 n(x) = ( 1 +a;(n, y/x)) -1 , wher e cj(n, y/x) i s the numbe r
of prime s p\n wit h p ^ y/x. If n is squarefree, w e hav e
Y^ n , x j -Li f n has a prime facto r ^ y/x,
Pn/P(x) = J Q0 i f n is prime wit h y/x < n ^ x.
This nice formula i s due to O . Ramare. Henc e for an y sequenc e o f complex number s
A = (a n) we hav e
Yl a P = ^2 an
~ ^2^2 P™(x)amp
^/x<p^x Kn^x p^v ^ mp^x
(rn-,p) = l
where ^ restrict s th e summatio n t o squarefree numbers . Le t P(z) be the prod -
uct o f all prime s p ^ z wit h 2 ^ z ^ y/x. W e appl y th e above identit y t o th e
subsequence o f numbers a n wit h (n , P(z)) = 1 gettin g
Yl a P= ^2 an
~ ^2^2 Pm(x)am P.
^<p^x Kn^x Z<P^T/X mp^x
(n,P(z)) = l p\m,(m,P(z)) = l
13. SUM S OVE R PRIME S 341
Now w e remov e th e restrictio n t o squarefre e number s n , an d th e conditio n p \ m.
The correctio n quantit y doe s no t excee d
E W < ( Ep>zk l f f E ^ 'f < \\A(x)\\xh-$
n^x n^x
Vp>z,p \n
where ||^4(x)| | i s define d b y
2
(13.22) \\A(x)\\=(j2\a n\ y.
n^.x
To thi s w e ad d th e term s a\ and a p wit h p ^ ^fx an d estimat e thei r contributio n
by
E \a n\ ^ \\A(x)\\x l<.
n^y/x
Next w e remov e th e conditio n (n,P(z)) = 1 by the exac t siev e (th e Legendr e
formula)
an=
^2 Yl ^( d)Ad(x).
n^x d\P{z)
(n,P(z)) = l
We kee p th e term s Ad(x) wit h d ^ yfx an d estimat e th e remainin g ones . Firs t b y
Cauchy's inequalit y w e get
T( dn
E \M*)\<
d\P(z) dn^x
E Edn^x
K«KM(*)II( E E - )f
d>^/x~ d\P( Z),d>y^C d\P( Z),d>y^
1
^\\A(x)\\(xlog3x)±( ] T ^(djd- )*.
d\P{z),d>V^
To th e las t su m w e appl y Rankin' s tric k a s follow s
1
J2 r(d)d~ ^ x~ £ Y, r{d)d 2e 1
~ ^ x~ £ Y[(l + 2p 2£ 1
-)
d\P(z) d\P(z) p^z
d>y/x
< x~ £ J ] ( l + p - 1 " 5 ) 2 2 ' 6 ^ x" e C(l + e) 2z3e
^+ if"=eXp(-g£)(1 + 1 o g ^
by choosin g e = 1/logz. Collectin g th e abov e estimate s w e get
PROPOSITION 1 3.1 . Let 2 ^ z ^ yfx. For any sequence of complex numbers
A — (an) we have
(13.23)
Y a P =Y K d A
) d(x) ~ Y Prn(x) ^ a
mp + s(x, z)y/x\\A(x)\\
p^x d\P(z) (m,P(z)) = l mp^x
d^y/x z<p^y/x
342 13. SUM S O V E R P R I M E S
where P(z) is the product of all primes p ^ z, ^2 restricts the summation to square-
free numbers, A^{x) is given by (1 3.1 0), \\A(x)\\ is given by (1 3.22), \fi m(x)\ ^ 1
and e(x,z) (which also depends on A) satisfies
21
(13.24) \e(x,z)\ < -^+exp(-i^)(log3x) .
y/z V 2 log z J
REMARKS. Mor e precisel y w e hav e (1 3.23 ) wit h f3 m(x) = ( 1 + uj(m, y/x)) 1- ,
where cj(ra , y/x) i s th e numbe r o f prime s p\m,p ^ y/x, bu t w e didn' t specif y thi s
to emphasiz e tha t on e doesn' t nee d anythin g explici t abou t thes e coefficients . Th e
first su m o n th e righ t sid e o f (1 3.23 ) i s o f typ e Si wit h D = y/x (se e (1 3.1 6 ) an d
(13.18)). On e ca n als o spli t thi s int o tw o sum s Si -f - S 2 accordin g t o d ^ z an d
z < d ^ y/x. The n Si i s o f typ e (1 3.1 6 ) wit h D = z whil e S 2 i s o f typ e (1 3.1 7 )
with M = z an dT V = y/x. Th e doubl e su m o n th e righ t sid e o f (1 3.23 ) i s als o o f
type (1 3.1 7 ) wit h M = z an d N = y/x. Th e estimatio n (1 3.24 ) i s fine fo r
(13.25) z = exp(V'logx) .
For thi s choic e o f z w e hav e
/
(13.26) Hx,z)\ ^ 3exp(-| v i o i ^ ) ( l o g 3 a : ) 21 .
If A — (a n) i s an oscillatory sequenc e (s o the mai n ter m i n (1 3.1 2 ) i s expected t o b e
small), the n withou t grea t los s on e ca n simplif y (1 3.23 ) b y writin g th e inequalit y
a
(13.27) \J2 p\^ E \M*)\ + Y1\ E
z)y/x\\A(x)\\.
Z<p^.y/x
We shal l retur n t o th e abov e construction s i n Sectio n 1 3.6 .
13.3. Linnik' s identity .
In 1 96 0 Yu.V . Linni k [Li2 ] gav e a marvelou s expressio n fo r
fn
A(n) _ f a _ 1 i = P aia > °»
(13.28) A'(n )
lognn (~ \ 00 otherwis e
in term s o f the stric t diviso r function s
(13.29) T' k(n) = \{n 1 ,...ink>2; m...n k = n}\
To this en d h e compute s th e logarith m o f
c(*) = E n " f l = 1 ft 1 -*>"')"
1P
in tw o way s usin g th e powe r serie s expansio n
00 / j
Iog(l-x) = - X ; y -
fe=l
First b y th e Eule r produc t fo r ((s) on e get s
(13.30) logC(s ) = 5]A ,(n)n-fi
n^2
13. SUM S O V E R P R I M E S 34 3
On th e othe r hand , b y th e Dirichle t serie s fo r C( 5)>
(13.31)
logC(s) = l o g ( l - ( l - C ( s ) ) )
fc=l n^2 k=l
Comparing th e coefficient s i n bot h expansion s Linni k get s th e identit y
(13.32) A '(n) = - £ L l ^ ( n ) .
Note tha t r k(n) = 0 i f 2 k > n , s o (1 3.32 ) i s a finit e su m ove r 1 < k ^ log njlog2.
In application s Linni k use s (1 3.32 ) fo r number s n havin g n o smal l prim e factors ,
say fo r (n,P(z)) = 1 with z ^ 2 . The n (1 3.32 ) run s ove r k ^ lo g nj lo g z. Thi s
yields th e followin g
PROPOSITION 1 3.2 . Let 2 ^ z ^ y/x. For any sequence of complex numbers
A = (a n) we have
a
(13.33) E < V = - £ ^X ) «r^n)
p ^ r r k^K n^x
P>z (n,P(z) )=l
where K = lo g #/ lo g z and r fk(n) is the strict divisor function 1( 3.29).
If on e prefer s th e exac t diviso r function s r^{n) on e ca n replac e th e stric t one s
rk{n) i n th e Linni k formul a b y
(13.34) r' k{n)= T ( -1 ) * " ' ( * W ) -
O^i^k
Moreover, th e conditio n (n , P(z)) = 1 can b e relaxe d a t smal l cos t b y applyin g th e
exact siev e i n th e sam e wa y a s i n th e Vinogrado v formula .
On th e right-han d sid e o f (1 3.33 ) th e firs t ter m (k = 1 ) i s a su m o f typ e Si ,
precisely
(13.35) 5 i = ] Ta n,
(n,P(*)) = l
while an y othe r su m
(13.36) S k= J2
n^x
(n,P(z)) = l
for k ^ 2 can b e regarde d a s a bilinea r for m ( a su m o f typ e S2) . However , Linni k
gave a specia l treatmen t fo r Sk fo r a fe w smal l k (i n th e solutio n o f th e Hardy -
Littlewood equatio n p + x 2 + y 2 — iV) , an d onl y fo r large r k h e arrange d Sk int o a
suitable bilinea r form . B y extendin g th e clas s o f special form s involvin g th e diviso r
functions o f degre e k > 1 one gain s mor e flexibilit y i n th e remainin g part s o f th e
identity (1 3.33 ) fro m whic h t o arrang e bilinea r forms . O f course, no t ever y for m Sk
can be treated b y special means. Usuall y on e treats S i b y classical Fourier analysis ,
34 4 13. SUM S OVE R PRIME S
and on e ca n ofte n appl y t o # 2 th e spectra l theor y o f automorphi c forms . On e ma y
hope tha t S k wit h k ^ 3 can b e treate d b y Fourie r analysi s i n GL^ , bu t s o fa r thi s
has no t bee n successful .
In 1 98 1 D . R . Heath-Brow n [HB3 ] develope d a formula fo r A(n ) whic h i s
reminiscent o f tha t o f Linnik . Le t
M(s) = Y, M Mm
Then s
as)M(s) = l + J2an(z)n- .
n>z
Consider th e identit y
k1
£ ( S ) ( 1 _ C ( S ) M ( S ) ) * = £ ( * )+ J2 (-l) (Kk)C(s)Ck- (s)Mk(s).
Comparing th e Dirichle t coefficient s o n bot h side s w e obtai n
PROPOSITION 1 3.3 . Let K ^ 1 , z ^ 1 . Then for any n < 2z K we have
(13.37) A(n ) = - ] T (-^ k(Kk) J2'"H l i{ml)...^mk)\ognk.
k
v 7
l^k^K rn 1 ...m
1 kn ...nk=n
mi,... ,m k^z
This identity o f Heath-Brown ha s a few advantages ove r that o f Linnik. Mos t of
all, the numbe r o f terms K ca n b e chose n smalle r b y regulating wit h th e paramete r
z mor e efficiently .
EXERCISE 1 . Sho w tha t fo r n ^ zK we hav e
(13.38) M W = -E ( " W * ) Y.---Y. Kmi)...Kmk).
l^k^K ^ ' m ...mkn1
11 ...nk- =n
mi,... ,m k^.z
13.4. Vaughan' s identity .
The mos t popula r formul a fo r A(n ) i s du e t o R . C . Vaugha n [Va] . W e deriv e
it fro m
A(n) = £ > ( & ) lo g £ .
b\n
Here w e keep th e term s wit h b ^ y an d transfor m th e remainin g su m a s follow s
£M(fr)log^ = £ & ( & ) A ( c ) .
b\n bc\n
b>y b>y
Next w e keep th e term s wit h c > z and transfor m th e remainin g su m a s follow s
A
£ £A*(6)A(c ) =Y,H M&)A(c) - J]E^ (°)-
6c|n 6c| n 6c| n
b>y,c^iz c^.z b^.y,c^.z
Here th e complet e su m ove r al l b dividing n/c vanishe s unles s c = n , whic h i s no t
possible i f n > z. Addin g u p th e abov e expression s on e obtain s
13. SUM S OVE R PRIME S 345
PROPOSITION 1 3.4 . Let y, z ^ 1 . Then for any n > z we have
lo A
(13.39) A(n ) = J2 »(*>) §\ " £ £ ^) (C)+ £ £ ^ 6 A C
) ( )'
6|n 6c| n 6c| n
b^y b^y,c^z b>y,c>z
Similarly w e deriv e a formul a fo r th e Mobiu s function .
PROPOSITION 1 3.5 . Let y, z ^ 1 . Then for any m > max(y, z) we have
(13.40) n(m) = - £ E M*>)M< 0 + E £ M%(c) .
6c|m frc|ra
b^.y,c^.z b>y,c>z
PROOF. W e star t fro m
fi(m) = ^^/i(6)/x(c )
bc\m
and spli t th e summatio n int o fou r range s accordin g t o b ^ y, b > y, c ^ z, c > z.
Then w e transfor m th e tw o middl e sum s int o
£ X>(% M = - £ £ M%(c),
bc\m bc\m
b^y,c>z b^.y,c^z
££/i(6)/x(c) = -££/x(%(c).
bc\m bc\m
b>y,c^z b^.y,c^z
Adding u p th e abov e expression s on e obtain s (1 3.40) . •
Vaughan's identit y (1 3.39 ) is not a s flexible fo r arrangin g bilinear form s a s those
of Linni k o r Heath-Brown , bu t i t prove d t o b e sufficien t fo r basi c application s an d
it i s the simples t o f al l (read y t o us e withou t re-groupin g it s terms) .
EXERCISE. Deriv e a formul a a l a Vaugha n fo r Ak(n).
13.5. Exponentia l sum s ove r primes .
As a n exampl e o f numerou s application s o f th e method s describe d i n th e pre -
vious section s w e prov e Vinogradov' s estimat e fo r th e exponentia l su m (1 3.1 5) .
Assuming
(13.41) | a - - | ^ — with(a,g )= l ,
z
I q\ q
Vinogradov [V6 ] showe d tha t
(13.42) V(a;x) < qi x% + q~^x + xexp(-^^/loga;) .
Using Vaughan' s identit y (1 3.39 ) w e deriv e practicall y th e sam e boun d fo r
(13.43) 5(a ; x) = ] T e(cm)A(n) .
346 3 . SUM S OVE R PRIME S
THEOREM 1 3.6 . Suppose a satisfies (1 3.41 )- Then for x ^ 2 we have
(13.44) S(a,x) < (qix* + q~^x + x^)(logx) 3
where the implied constant is absolute.
We begin th e proo f o f (1 3.44 ) b y estimating specia l exponential sums . W e have
e
| 52 ( an )|< min ( 7V -2M)-
Hence fo r an y number s x(m) > 0 we ge t
min iV a;(TO)
52 | 52 e(«™«)| < 52 ( ' '^u^)-
n^x(m)
As m varie s ove r a segmen t o f lengt h q/2, th e point s 1 1 am| | ar e al l distinc t an d
spaced b y l/2q a t least . B y thi s observatio n w e deriv e
£ min ( iV '^)^(1 +4M
^ ( i V + E «< _1 )
<€i(M + N + MNq- 1 + q) log 2g.
Similarly fo r x(m) = x/ m w e derive (conside r separatel y th e rang e 1 ^ m ^ q/2)
Y\ m i n f - ' ^ i\) <:(M + xq- 1
+q)log2qx.
^—' \m 2 \\am /
From thes e estimate s w e ge t
LEMMA 1 3.7 = For any numbers x{m) > 0 we have
e
(13.45) Y^ I ]L ( a m n ) | < ( M + N + M A ^ - 1 + g)log2g ,
n^rr(m)
(13.46) ^ I ] P e(amn)\ < ( M + x g - 1 +g)log2gx .
Next w e deriv e a boun d fo r genera l bilinea r form s
(13.47) B(x;N)= ^ | £7 m e(amn)
N<n^2N mn^x
where 7 m ar e comple x number s wit h | 7 m | ^ 1 . Applyin g Cauchy' s inequalit y an d
(13.45) w e ge t
B2(x;N)^2N J^Jl | E <a(™i - m 2)n)
< ( ^ + A ^ - h - + g+
)xlog2g .
9
From thi s boun d w e deduc e
13. SUM S OVE R PRIME S 34 7
LEMMA 1 3.8 . For any complex numbers a m,/3n with \a m\ ^ l,|/3 n | ^ 1 we
have
(13.48)
1 1 Y Y amPne(amn) <z: ( ^ + ^ + - ^ q) 2 x^ (lo g x) 2.
mn^x
m>M,n>N
P R O O F O F THEORE M 1 3.6 . B y th e identit y (1 3.39 ) w e hav e
S(a;x) = / J /] n(m)(log£)e(alm) — V V V /i(m ) A(n)e(afmn)
£m^.x Imn^x
+
H Y1 Y1 M(^)A(n)e(a^mn ) + O(iV) .
imn^x
m^M,n^N
We choose M = N = x$. T o the first an d th e secon d sum s we apply (1 3.46 ) gettin g
0((xs + xq~ l + q)(\ogx) 2. I n th e las t su m w e consider in = fc as one variable wit h
coefficient
c(k) = ^ A(n ) ^ lo g A;.
n\k,n^N
Then w e appl y (1 3.48 ) gettin g 0((xs + x g - 1 - f <?)^ 2 (logo:) 3. Addin g u p thes e
estimates w e get (1 3.44) . •
By th e sam e arguments , bu t usin g th e identit y (1 3.40 ) i n plac e o f (1 3.39 ) (als o
apply Cauchy' s inequalit y alon g th e wa y t o remov e th e correspondin g coefficient s
c(k) whic h ar e bounde d b y r(k) J no t b y lo g k a s above) , on e show s
THEOREM 1 3.9 . Suppose a satisfies (1 3.41 )- Then for x ^ 2 we have
m
(13.49) ^^ ^( )e( a m
) < ( < p £ 2 _| _ q-\ x + x5)^2(logx) 4
.
m^.x
If a i s a rationa l numbe r wit h smal l denominator , the n th e su m (1 3.49 ) ca n b e
well estimate d b y a n appea l t o th e zero-fre e regio n o f L-functions . Indeed , fo r an y
Dirichlet characte r %(mo d k) w e have (se e (5.80) )
(13.50) ] P /i(m)x(m ) < k^x(logx)"
for an y x ^ 2 an d A ^ 0 , th e implie d constan t dependin g onl y o n A. Henc e w e
derive b y mean s o f Gaus s sum s tha t
A
(13.51) Yl A*Me ( —) < qx(logx)~ .
m^.x
Now le t a b e an y rea l number . Give n Q ^ 1 there exist s a rationa l numbe r a/q
with (a , g) = 1 , 1 ^ q ^ Q suc h tha t
a 1
(13.52) a -- — < —7Z
<7 qQ
34 8 13. SUM S OVE R PRIME S
Hence w e deriv e b y partia l summatio n tha t
/\ / J ( 27T£\| V^ /am\
E /i(m)e(am)| ^ (^ 1 + —J| ^ /^Weh r)
for som e 1 ^ y ^ x. Usin g (1 3.51 ) on e get s
5A
(13.53) Yl ^{rn)e{am) < C (q + ^ ) x ( l o g x ) ~
m^x
for an y x ^ 2 an d A ^ 0 , th e implie d constan t dependin g onl y o n A W e appl y
(13.53) i f q < x Q - 1 an d (1 3.49 ) i f x Q " 1 <q^Q gettin g
_1
y ^ fjb(m)e(am) < Q ^2(logx)-5A + (5^4(logx) 4 + x^(logx) 4.
Finally, choosin g ( J = x(logx) _ 4 A , w e conclud e th e followin g
THEOREM 1 3.1 0 . Fo r any real number a and x ^ 2 we have
(13.54) ^ fjb(m)e(am) < C x(logx)"A
ra^x
/or am / A ^ 0 ; where the implied constant depends only on A.
This estimat e wa s firs t prove d i n 1 93 7 b y H . Davenpor t [Dal ] b y applyin g
Vinogradov's method . Th e complet e uniformit y i n a o f thi s boun d i s it s pleasan t
feature. W e shal l exploi t (1 3.54 ) i n ou r treatmen t o f th e Goldbac h problem s i n
Chapter 1 9 .
EXERCISE 2 . Usin g th e identit y (1 3.39 ) prov e tha t
(13.55) ^ e(oiy/n)A{n) < x§ (log x)4
n^x
for an y rea l numbe r a ^ 0 , wher e th e implie d constan t depend s o n a .
13.6. Bac k t o th e sieve .
We elaborat e furthe r th e idea s o f Sectio n 1 3. 2 t o dra w result s whic h ca n b e
useful i n mor e delicat e situation s tha n tha t i n th e las t section . Th e goa l i s t o
produce th e boun d
a £7r x
(13.56) \52 p\ < ()
p^x
for an y e > 0 an d x ^ XQ(S) subject t o hypothese s o n sum s o f typ e (1 3.1 8 ) an d
(13.19) whic h ca n b e verifie d b y existin g technology . On e o f suc h technolog y wil l
be th e spectra l theor y o f automorphi c form s whic h w e presen t i n Chapte r 2 1 fo r
the proble m o f equidistributio n o f quadrati c root s t o prim e moduli .
Recall tha t P(z) denote s th e produc t o f al l prime s p < z. Give n x w e conside r
the sum s
(13.57) Q(A,z)= Y,
n^x
(n,P(*)) = l
13. SUM S OVE R PRIME S 349
as a function o f z for th e sequenc e A = (an) an d it s subsequence s Ad = {cbmd)-
(See als o Chapte r 6 for a n elementar y introductio n t o siev e methods) . Suppos e
that \a n\ < r(n).
Let x s < z ^ x*. The n ou r su m over prime s i s approximate d b y Q(A,z) a s
follows:
Y a P = Q(^ z ) -^2^2apq + Ylap + ai
p^x pq^% p<z
z^p^q
= Q(A,z) + Ol log 1 2
v logx1
\2 ogz y (logx) y
One can reduce th e "sievin g level " z by applying th e Buchstab identity , namel y for
any sequenc e A and any w < z we have
A
(13.58) Q(A,z) = Q(A,w)- Y Q( P^P)-
w^.p<z
Applying thi s identit y twic e w e get
(13.59) Q(A,z) = Q(A,w)- Y ^p^)+EE^^) -
w^.p<z w^~q<p<z
Opening Q(A, w) an d Q(A P, w) b y the exact siev e o f Legendre w e get
Q(A,w)- Y Q(Ap,w) = Y^ KtfMx)
w^p<z d
where th e das h restrict s th e summatio n t o th e divisor s d of P(z) whic h hav e at
most on e prime facto r p^ w. W e fixy^ z and estimate th e partial su m over d > y
by applyin g Rankin' s tric k a s follows :
\Y fJ>{d)A d(x)\ ^Y z L T( m c 0 <• x(logx)Y^ r 1
d d
()~
d>y d>y md^.x d>y
and
£'r(d)d- 1 <!/- e £ , ' r ( d ) (f " 1
d>y d
<y~s(1+ E ^br 1 )Il( 1 + r ^" 1 )-
w^p<z p<w
Choosing e = (logw)" 1 w e get
E^>^(j)'n(-rM;P'w
d>y y p<z F
^ y/
Hence
8
Y M M ^ W « (- ) x(logx) .
yJ
d>y ^
The remainin g su m over d ^ y is estimated b y
(13.60) R{x) = Y\Mx)\-
d^y
35 0 13. SUM S OVE R PRIME S
Now w e procee d t o th e doubl e su m o f Q(A pq, q) ove r prime s p, q with w ^ q <
p < z. Le t x± < v ^ xz. W e estimat e th e partia l su m ove r q ^ v trivially as
follows:
/~ \2
A ( 1 +
E Q( P^ I)«T^ZlogEx ^-^ i'\lo gz
x , / log x \ x
< - logo;
lo 1\3 gogi
—^ — +
y (logx) 2
It remain s t o evaluate th e su m o f Q(Apq,q) ove r prime s p , q with w ^ q < v
and q < p < z. W e writ e thi s i n th e for m
(i3.6i) EE«A>„9 )= E E mp
w^q<v mq^x 7m a.
q<p<z w^q<v
q<Pm<z
where p m is th e leas t prim e facto r o f m and 7 m i s th e numbe r o f prim e divisor s o f
m whic h ar e < z, so 7 m ^ uj(m). W e want t o mak e it a bilinear form . Thi s require s
us to relax th e relatio n q < pm. T o thi s en d w e us e th e followin g lemm a whic h is
convenient fo r separatio n o f variables :
LEMMA 1 3.1 1 . There exists a function h(t) depending only on z such that
oo
\h{t)\dt <\ogQz
/ -oo
and for all integers 1 ^ a , b ^ z,
f°° f ,(^\ lt f 1 ifa^b,
(1363) (,)
L" ( i ) * = ( , ,/„> ,
PROOF. Pu t g{u) = mm{uz, 1 , 1 + (1 - u)z} fo r 0 < u < 1 + z_ 1 an d g{u) = 0
otherwise. The n fo r positiv e integer s a , b ^ z we hav e g(a/b) = 1 if a ^ 6 , and
g(a/b) = 0 if a > 6. W e als o hav e
^-el/Kir* /(0 )
where f(s) i s th e Melli n transfor m o f g{u), i.e. ,
f(8)=Jo 9 (U)U° du= - (-^? .
For s = it with t > 0 w e hav e three estimates \f(s)\ ^ min{log62; , 2t_ 1, Azt~ 2} fro m
which it follows tha t th e functio n h(t) = ^f(-it) ha s th e asserte d properties . •
Applying (1 3.63 ) t o (13.61 ) an d changin g th e notatio n q to p we obtai n th e
integral o f bilinea r form s
oo
lt
] T ] P lmCLmp{Pmlp) h{t)dt
/ -oo _mp^x
„^ „
w^.p<v
13. SUM S OVE R PRIME S 351
which w e estimat e b y B(x)(logx) 2 wit h
a
(13.64) B(x)= ] T| Yl mPPlt
(m,P(w)) = l mp^.x
w^~p<v
for som e rea l numbe r t (us e u(m) < log 2m ^ logx , (1 3.62 ) an d 6z ^ x).
Collecting th e abov e estimate s w e obtai n
a
(13.65) \Yl p\ ^ R(x) + B(x)(\ogx) 2 + E(x)
p^x
where
1/ lo g w
(13.66) E(x) « ^ - lo g ( ( l o g * } ) + — ^ + (-) x(logxr
2
logx \6(\ogv)(logz)J (logx) \yj
and th e implie d constan t i s absolute .
Let 2 ^ A ^ #1 2 . W e choos e z — A~ 2x?, y = A _ 1 £ 2 5 v = A _ 1 £ 3 7 an d w =
^l/ioiogioga: gettin g E(x ) < C x(logx)- 2 lo g A. Henc e w e conclud e (tak e A = x £)
THEOREM 1 3.1 2 . Let A = (a n) be a sequence of complex numbers with \a n\ ^
r(n). Let x ^ e 1 2 an d ( l o g x ) - 1 < e ^ ^ . Suppose that
(13.67) V lV a d m k-^-,
2
^ ^ I ^- ^ I (logx)
X
(13.68) ] T IE? 'i4a.pra ^ (logx) 4
(m,P(iu)) = l pm^x
w^.p<v
/or an? / t G l, where the parameters y, w, v are given in terms of x and e by
(13.69) y = x \ '\ v = x\~ £, w =x ^/^^^x
and P(w) is the product of primes < w. Then
(13-70) E«,«i ^
p^x
where the implied constant is absolute.
REMARKS. Th e conditio n (1 3.68 ) i s regarde d a s a n estimat e fo r genera l bi -
linear forms . Th e facto r p lt (whic h comes o n ou r wa y o f separatio n o f variables )
contaminates th e coefficients , bu t i t ha s n o effec t i n practice . On e coul d remov e
this factor , however , a precis e statemen t woul d involv e anothe r parameter , whic h
compromises th e clarity . Som e o f th e restriction s i n th e bilinea r for m (1 3.68 ) ca n
also b e modified . Fo r example , on e ca n remov e th e coprimalit y conditio n i n th e
outer sum , bu t ou r experienc e show s tha t i t i s convenien t t o hav e i t (not e tha t
(m, P(w)) = 1 implies (m,p ) = 1 ) . A concept o f "prim e producin g exponents " wa s
discussed i n [DFI5] ; Theore m 1 3.1 2 assert s tha t th e pai r (^ , ^ ) i s prim e produc -
ing. Th e presenc e o f e i n th e parameter s (1 3.69 ) i s ver y important ; withou t i t th e
requirement (1 3.68 ) woul d b e impossibl e t o verif y fo r mos t interestin g sequence s
A = (a n ). A well balanced choic e i s e — e(x) — (log log x )- 1 givin g a bound fo r th e
352 13. SUM S O V E R P R I M E S
sum ove r prime s whic h show s onl y littl e cancellation , nevertheles s i t i s a satisfac -
tory result , particularl y i n application s t o th e equidistributio n problem s o f degre e
two object s relate d t o th e GL2 theor y (se e Chapte r 21 ) .
http://dx.doi.org/10.1090/coll/053/15
CHAPTER 1 4
HOLOMORPHIC MODULA R FORM S
14.1. Quotient s o f th e uppe r half-plan e an d modula r forms .
Holomorphic modula r form s wer e historicall y discovere d an d studie d fo r pur -
poses o f complex analysi s an d algebrai c geometry , i n particular, i n connectio n wit h
elliptic functions . The n the y wer e introduce d i n algebrai c numbe r theory , particu -
larly for th e developmen t o f class field theory, Galoi s representations an d arithmeti c
of ellipti c curves . Mor e recentl y modula r form s hav e entere d th e territor y o f ana -
lytic numbe r theory . Beside s providin g ne w tools , the y ar e als o a sourc e o f dee p
problems wit h numerou s connection s t o arithmeti c geometry , notabl y throug h th e
theory o f //-functions .
See als o Chapte r 1 5 fo r a surve y o f th e theor y o f non-holomorphi c forms , a s
well a s Chapte r 1 6 for application s t o Kloosterma n sum s an d Chapte r 2 6 fo r som e
results abou t L- functions. Ther e ar e man y fine book s o n variou s aspect s o f thi s
vast subject , fo r instanc e [Bu] , [BG] , [Mi] , [Sh3] , [1 4] , [Sel] .
We star t wit h H , th e Poincar e uppe r half-plan e
(14.1) M = {zeC | y = I m (z) > 0}
r
w hich i s a n ope n subse t o f C (i n th e nex t chapter , i t wil l b e see n a s a riemannia n
manifold o f constan t negativ e curvatur e wit h th e Poincar e metric) .
There i s a n actio n o f th e grou p G = SX(2,R ) o n H I by linea r fractiona l trans -
formations
riAn\ az + b fa b
14.2) gz = - , fo r g = ,
^J * cz + d' u \c d
because
Im(z)
(14.3) lm(gz)
tcz + d\ 2
which i s positive i f I m (z) > 0.
The cente r o f G , namel y {±1 } , act s trivially , an d on e show s tha t th e grou p
of holomorphi c automorphism s o f H i s th e quotien t PSX(2,R ) = SL(2,R)/{±1 } .
Note that thi s actio n i s also isometric fo r th e hyperboli c Poincar e metric ; se e Chap -
ter 1 5 .
The actio n o f thi s larg e grou p give s E I a larg e amoun t o f symmetry . A usefu l
manifestation o f thi s i s th e followin g property : defin e first th e actio n o f G o n th e
extended uppe r half-plan e M * = l U l U o o , b y th e sam e formul a (1 4.2 ) i n additio n
to th e rule s go o = a/c i f c ^ 0 an d goo = o o i f c = 0 , the n fo r an y thre e distinc t
points v, w an d z i n R U o o , ther e exist s g G G suc h tha t
gv = 0 , gw = 1 , gz = oo.
353
35 4 14. H O L O M O R P H I C MODULA R FORM S
Associated t o a Riemannia n metri c i s a natura l invarian t measur e <i/i , which i n
the cas e o f H I i s th e hyperboli c measur e
(14.4) dfi(z) = y~ 2dxdy i f z = x + iy
which i s invarian t b y G , i.e . fo r an y / integrabl e o n H I we hav e
/ f{z)d^z) = f f{gz)dn(z).
Arithmetic come s whe n w e conside r discret e subgroup s o f G actin g o n H I and
the correspondin g quotien t spaces . On e shoul d kee p i n min d th e analog y o f Z c l
(or Z[i] C C) . Thi s i s a ver y ric h theory ; ther e ar e man y "different " discret e
subgroups o f G , an d t o classif y the m i s almos t th e sam e a s classifyin g al l Rie -
mann surfaces . Fo r arithmeti c applications , th e mos t importan t example s ar e th e
congruence subgroups , particularl y thos e whic h contai n on e o f th e so-calle d Heck e
congruence subgroup s To(q) define d b y
The group s To(q) hav e finite inde x i n SX(2,Z) , precisel y
[To(l):r 0 ( g )]= g n( 1 +P" 1 )-
p\q
For q varying on e obtain s ver y interestin g quotient s r 0 (g)\H[ (thei r genu s in -
creases) .
One prefer s t o dea l wit h compac t Rieman n surfaces , however , quotient s o f HI
are no t alway s compact . I t i s eve n possibl e t o hav e Vo l (F\M) = +oo , wher e th e
volume refer s t o th e natura l measur e induce d fro m (1 4.4) . Fo r example , th e grou p
of translation s b y integer s
r„-{(J \) ]k.z}
has th e vertica l stri p 0 < R e (z) ^ 1 "representing " th e quotien t r ^ H I , an d i t ha s
infinite area .
More generally , a fundamenta l domai n fo r th e actio n o f a discret e subgrou p
T C G o n H I is a nic e subse t F C H I which "almost " contain s on e poin t pe r orbit .
Precisely, F mus t satisf y th e followin g conditions :
(1) F i s a n ope n subse t i n HI.
(2) Th e closur e F o f F i n C intersect s ever y orbi t o f T , i.e . fo r al l z G H I
there exist s 7 G T wit h 72 : G F.
(3) N o tw o point s i n F ar e T-equivalent .
A fundamenta l domai n alway s exists , bu t i t i s certainl y no t unique . On e ca n
even choos e F t o b e a polygo n wit h (hyperbolic ) geodesi c sides .
For th e congruenc e subgroup s To(^) , on e ca n first conside r th e usua l funda -
mental domai n fo r SX(2,Z) ,
F 1 = {^GH I I | R e ( z ) | < | , \z\>\]
14. H O L O M O R P H I C M O D U L A R F O R M S 355
and the n tak e th e unio n o f it s image s unde r cose t representative s i n 5L(2,Z) :
Fq= | J 7* 1
7er 0 (q)\r 0 (i)
is a fundamenta l domai n fo r T 0(q). (I t i s no t necessaril y connected ; fo r othe r
choices, se e [1 5] , Section 2.2. )
On a picture of F\, th e non-compactness appear s because Fi touche s the bound -
ary MUo o at infinity , s o (z , 2i, 3z,...) i s a sequence without convergin g subsequence .
However, b y the Gauss-Bonne t formul a o r a simple direc t computation , th e volum e
of SX(2,Z)\H I (whic h i s th e volum e o f an y fundamenta l domain ) remain s finite,
namely Vo l (SX(2, Z)\Bf) = | , an d th e sam e i s tru e fo r To(q) wit h
Vol(r 0 (, ? )\H) = | [ r o ( l ) : r 0 ( g ) ] .
For r suc h tha t Vo l ( r \ H) < +oo , th e point s a t whic h F touche s th e boundar y
are calle d cusp s of T (thi s intuitive descriptio n wil l be made mor e precise i n Sectio n
15.7 whe n w e discus s th e classificatio n o f element s i n G) , an d the y ar e i n finite
number (becaus e t o each cus p on e can attac h non-overlappin g smal l neighborhood s
in M , whic h hav e fixed positiv e volume) . Fro m th e above , o o i s th e onl y cus p
for SX(2,Z) . S o fo r To(q), the y ar e easil y classifie d b y considerin g th e actio n o f
r 0 ( g ) \ r o ( l ) o n oo. On e finds tha t ever y cus p ha s a uniqu e representativ e ^ wit h
(a, c) = 1 , c ^ 1 , c | g, an d a is modulo (c , | ). Thei r numbe r i s therefor e
h=Y,<P((c,d)).
cd—q
This i s equal t o r(q) i f g is squarefree .
For an y cus p a , it s stabilit y grou p T a = { 7 G T \ 7 a = a } i s infinit e cycli c (u p
to ±1 ) , generate d b y 7 a say , an d ther e i s a a a £ G , calle d a scalin g matrix , suc h
that cr a 00 = a an d
1
(14.5) ^ 7a^ a =fJ \Y
Using a a, mos t computation s an d propertie s relate d t o th e variou s cusp s o f
T ca n b e studie d fo r th e cus p a t infinity , wit h stabilit y grou p generate d b y th e
translation z y-> z - f 1 , fo r th e conjugat e grou p G~ lTcra (i t i s a n advantag e o f th e
Poincare mode l ove r othe r model s o f th e hyperboli c plane , fo r instanc e th e uni t
disc, tha t i t ha s suc h a distinguishe d cusp) .
We can no w defin e modula r form s a s holomorphic function s o n HI transforming
in a simpl e wa y unde r th e actio n o f a discret e subgroup . Le t k ^ 1 be a n integer .
Then w e hav e a n actio n (th e "weigh t k action" ) o f G = SX(2,IR ) o n function s
/ : H -> C by
ab
(14.6) ( /| fc g)(z) = j(g, z)~ kf(gz), wher e j(g, z) - cz + d for g =
cd
a
Let q ^ 1 be a n integer , an d \ Dirichle t characte r modul o q (no t necessaril y
primitive). Clearl y \ induce s a character o f r 0 (g) b y x(d) = x(d) f° r 9 —
356 14. HOLOMORPHI C MODULA R FORM S
A modula r for m o f weight k, level q, and nebentypu s (o r character ) \ i s a holomor-
phic functio n / o n H I which satisfie s
(14.7) / I * 7 = X(7)/, for all 7 G r 0 (g)
and i s holomorphi c a t all cusp s o f To(^) . Th e holomorph y a t cusps i s explained as
follows. First , fo r a cusp a of To(g), the functio n f a = f \k cra i s seen b y modularit y
to b e periodic o f period one , f a(z + 1 ) = f a(z). Thi s implie s no w tha t f a i s a
function o f th e paramete r q — e(z), namel y f a(z) = g a(q), wher e g a i s holomorphi c
in a punctured dis c { z G C | 0 < | 2 : | < r } . W e the n sa y tha t / i s meromorphic at
a if ga i s meromorphic a t 0. Therefor e f a ha s a Laurent serie s expansio n
AW = Yl fa(n)e(nz)
n^na
for som e intege r n a , and f a(na) ^ 0 . W e sa y tha t / i s holomorphic a t a if it i s
meromorphic an d n a ^ 0 . If , moreover, w e hav e n a > 0 , w e sa y tha t / vanishe s at
a. I f / vanishe s a t all cusps , it is called a cusp form .
EXERCISE 1 . Sho w fro m th e definitio n tha t a modular for m / i s a cusp for m
if an d onl y if the T-invarian t functio n
k/2
(14.8) 9(z) = y \m\
is bounde d o n H I (this criterio n i s very convenien t i n practice).
EXERCISE 2 . Sho w tha t i f we tak e th e sam e definitio n bu t k = 0 , th e corre -
sponding spac e o f modula r form s i s reduce d t o 0.
Clearly modula r form s of fixed level, weigh t an d characte r for m a vector space ,
which is denoted Mk(q, x) ? a n d th e cus p form s a subspace denote d S&(# , x)- Takin g
7 = — 1, w e see tha t / i s identicall y zer o unles s x ( —1) — (~l)fc? s o we assume tha t
X satisfies thi s consistenc y condition .
The firs t fundamenta l resul t i s that M^(g,x ) i s a finite dimensiona l vecto r
space. Thi s ca n b e proved i n general usin g th e Riemann-Roc h theore m fo r compac t
Riemann surfaces . I n the case o f To(g), a simpler proo f i s possible. Fo r k ^ 2,
the dimensio n o f Mk(q,x) c a n ^e compute d exactl y (se e e.g . [Sh3]) , an d the n
estimated ver y precisel y (th e cas e o f forms o f weight on e i s a tantalizing proble m
of deep arithmetica l significanc e becaus e o f links with Arti n L- functions, se e [Se2] ,
[Dul]). On e show s fo r instanc e tha t
dim M k(q, x ) = " i 2 " ^ + 0(y/qk)
where v q = [T0 (l) : T 0(q)].
For an y modula r for m / w e denot e b y df(n) its Fourier coefficient s a t oo, so
that
(14.9) f(z) = Y,af(n)e(nz).
Those coefficient s ar e ver y importan t object s (the y ar e arithmeti c harmonics) ,
and thei r orde r o f magnitude, i n particular, i s the subjec t o f many inquiries . For
14. H O L O M O R P H I C M O D U L A R F O R M S 35 7
cusp forms , th e criterio n tha t y k^2\f(z)\ i s bounded o n HI implies quickl y th e trivia l
bound
k/2
(14.10) a f(n)<^n
which i s goo d enoug h t o star t wit h (se e below (1 4.54 ) fo r th e muc h deepe r Delign e
bound).
One als o define s th e Petersso n inne r produc t o n Mk{q, x) b y
(14.11) {f,g) = J f{z)W)y kd^(z).
This i s well-define d becaus e th e integran d i s ro(g)-invariant , a s a simpl e compu -
tation reveal s (compar e (1 4.8)) , an d th e integra l i s finite a s soo n a s on e o f / o r g
is a cus p form . I n particular , Sk{q,x) 1 S a (finit e dimensional ) Hilber t spac e wit h
this inne r product , an d thi s analyti c fac t i s very importan t i n what follow s (se e th e
definition o f "newforms " i n Sectio n 1 4.7) .
Notice als o tha t i t i s possibl e t o multipl y modula r form s o f th e sam e level : i f
/ e Mk(q,x), 9 £ M k>{q,Xf), w e n a v e fg € ^4+/c'(<2SXX') - I n particular , th e direc t
sum o f al l Mk(q), k ^ 0 , i s a grade d C-algebra . Thi s importan t algebrai c fac t wil l
not b e o f muc h us e i n thi s book , however .
It shoul d b e emphasize d tha t i n provin g tha t a functio n i s modular , th e mod -
ularity formul a (1 4.7 ) nee d onl y b e prove d fo r 7 i n a famil y o f generator s o f ro(g) .
Such group s ar e alway s finitely generate d s o thi s i s a finite se t o f condition s o n / .
For instanc e 5L(2 , Z) i s generate d b y th e tw o element s
(01 ) an d
( 1 "o 1 )
acting a s th e translatio n z \-> z + 1 and a s inversio n z M > — z~l respectively , an d
the tw o condition s fo r / t o b e modula r becom e
k
f(z) = f(z + l) an d f(^.)=z f(z).
We no w giv e example s o f modula r form s an d i n particula r o f cus p forms .
14.2. Eisenstei n an d Poincar e series .
The basic examples of modular form s ar e the Eisenstein an d the Poincare series .
We onl y defin e thos e serie s associate d t o th e cus p 0 0 fo r simplicity , bu t simila r
constructions ca n be done at ever y cusp. Le t k > 2 (to ensure absolute convergence )
and pu t
fc
Ek(z)= J2 X(7)j(7,*)- .
Clearly, Ek i s i n Mk(q,x)i a s ^ i s constructe d b y th e averagin g technique ; holo -
morphy a t th e cusp s i s checke d b y explici t computatio n o f th e Fourie r expansion ,
which als o show s tha t Ek i s no t a cus p form . I t i s calle d a n Eisenstei n series .
More generally let m ^ 0 . Th e ra-th Poincar e series is obtained b y the averagin g
technique applie d t o e(mz):
p k
m{z) = Yl X(7)j(7 , z)- e(mjz)
ler^xroiq)
358 14. HOLOMORPHI C MODULA R FORM S
for k ^ 2 (the convergenc e fo r k — 2 is not absolut e bu t ca n b e see n fo r m ^ 1 from
the Fourie r expansio n (1 4.1 2 ) below , usin g th e Wei l boun d fo r Kloosterma n sum s
(1.60)). W e d o no t conside r m < 0 as th e serie s diverge s then .
The importan t fac t i s that fo r m ^ 1 , we get cus p forms .
PROPOSITION 1 4.1 . (1 ) J/r a = 0 , then P 0 = E k.
(2) If m ^ 1 ; then P m is a cusp form in Sk(q, x)-
We sketc h par t o f th e proof , namel y th e followin g lemma .
LEMMA 1 4.2 . The Poincare series P rn has Fourier expansion
mn r
(14.12) P m(z) = 6(m,0) + J2p( ' Wnz
with coefficients
/?ir\k n 1
k
~ * .^
P ( 0 , n ) = ( - ) Ytkj E c-*5 x (0,n;c)
c>0
c=0 (mod q)
p ( m 5 n ) = f — j 2 f(5(m,n) + 27r i fc ^ c 1
S' x (m, n; c) Jfc _i f J J
c>0
c=0 (mod q)
if m ^ 1 . ifer e S x (ra, n; c) is the Kloosterman sum with character x,
(14.13) S x (m,n;c) = ^ *(d)e (
d (mod c)
and J/c- i « 5 the Bessel function of order k — 1 .
PROOF. W e firs t mak e a convenien t parametrizatio n o f th e coset s r ^ r o ^ ) ,
namely usin g th e actio n o f a n elemen t o f To o o n th e righ t an d o n th e left , w e se e
that To(q) i s the disjoin t unio n
(14.14) r 0(g) = ro o | j ( | J( J roof " ^ W )
0 0 d(modc )
c=0(modq) ( c,d) = l
(where fo r give n c an d d wit h (c , d) = 1 , a an d 6 are an y tw o integer s suc h tha t
ad — be = 1 ) .
Thus w e hav e correspondingl y
d J c
P m (z) = e(raz ) + 5 Z X T ^( ) ( '* *)
c>0 d (mod c)
c=0 (mo d g)
where
I(c,d;z) = J2(c(z + n) + dy ke(m(«c-c{c{z+1 n) + d) )).
14. H O L O M O R P H I C M O D U L A R F O R M S 359
Next, b y th e Poisso n summatio n formul a
i z
^ ^]2jm(^+v)+dyH~-c{c{zZ)+d)-nv)dv
/am nd\
ii /
-oo+iy
Here w e observ e tha t th e inne r integra l vanishe s fo r n < 0 (mov e th e lin e
of integratio n upwards ; th e cas e n = 0 i s th e mos t important , s o th e reade r i s
encouraged t o chec k carefully) . The n puttin g togethe r th e differen t terms , an d
recognizing th e Besse l functio n i n th e integra l (se e [GR , 8.31 5.1 , 8.41 2.2]) , w e
obtain th e lemma . •
EXERCISE 3 . Prov e th e decompositio n (1 4.1 4) .
Notice tha t ther e ar e infinitel y man y Poincar e serie s P m , bu t the y al l liv e i n
the finite dimensiona l spac e Sk(q,x)- Thi s mean s tha t ther e mus t b e man y linea r
relations betwee n th e P m 's. O n th e othe r hand , suc h abundanc e tend s t o sugges t
that th e Poincar e serie s shoul d spa n th e whol e space . Thi s i s indeed th e case .
LEMMA 1 4.3 . Let f e Mk{q,x) be a modular form with expansion 1( \.9).
Then for any m > 1 ;
COROLLARY 1 4.4 . The Poincare series with ra ^ 0 span Sk(q,x)-
Indeed, an y cus p for m / orthogona l t o ever y P m mus t hav e al l it s Fourie r
coefficients a t o o equal to zero , an d henc e mus t b e zero . Becaus e th e spac e spanne d
by Poincare series is closed (thi s uses crucially the finite-dimensionality o f the space )
the resul t follows .
P R O O F O F LEMM A 1 4.3 . B y definitio n o f P m w e hav e
k k
(f,Pm) = [ /(*) { J2 x{l)Wi7z)~ ^m^z)}y d^z)
k
= Yl Im(<yz) f(-yz)e(m~fz)dfi{z)
jF
ieroo\r0(q)
(by modularit y o f / an d I m (jz) = \j(g, z ) | - 2 I m (z))
pi p + OO
k 2
=/ / y ~ f(z)e(-mz)dxdy
Jo Jo
r+oc pi
k 2
Taf{n) / y - e-2^n+m^dy e(( (n
r — m)x)dx
„ ^ n JO JO
r(fc-i) a m
(A.ixm)xkf e _ T / ( )
•
360 14. HOLOMORPHI C MODULA R FORM S
Taken together , th e tw o previou s lemma s impl y a formul a whic h show s ho w
the sequence s o f Fourie r coefficient s o f a basi s o f Sk(q,x) a r e almos t orthogonal ,
thereby givin g anothe r decompositio n i n arithmeti c harmonic s o f the diagona l sym -
bol (5(m , n). Le t T b e an y orthonorma l basi s o f Sk(q,x) (distinguishe d basi s wil l
be describe d i n Sectio n 1 4.7) . Sinc e P m i s in Sk(q,x)i w e c a n expan d i t a s a linea r
combination o f th e / G J , namely w e hav e
Pm=Yt(f,Pm)f
so that b y taking the n-th Fourie r coefficien t o f both side s (o n the right wit h Lemm a
14.3, o n th e lef t wit h Lemm a 1 4.2 ) w e obtain th e Petersso n formul a
PROPOSITION 1 4. 5 ( P E T E R S S O N ) . For any n ^ 1 and ra ^ 1 ,
(1415) a n
7 I ^ f e ^ E / ( ) ^ M= 5(m,n )
1
+ 2m * 2 ^c 5x(m,n;c)J/c_if— J .
c>0
c=0 (mod q)
EXAMPLE. Th e "first " cus p for m occur s fo r leve l 1 an d weigh t 1 2 ; i t i s th e
famous Ramanuja n A function , whic h ha s th e elegan t produc t expansio n
A(*) = QY[0--024> w i t h Q = < z)
(see e.g . [Sel ] fo r on e o f man y direc t proof s o f th e modularity) .
The existenc e o f a non-zer o / G £1 2(1 ) ca n b e predicte d b y considerin g th e
function / = E% — E%, where E± an d EQ are th e Eisenstei n serie s o f leve l 1 an d
weight 4 and 6 . Thes e hav e th e Fourie r expansion s (a t th e onl y cus p oo )
EA(z) = 1 + 24 0 Y^ °z(n)e(nz)
n^l
E6(z) = 1 - 50 4 ^2 °s{n)e(nz)
so tha t a direc t computatio n o f th e smal l orde r term s give s
El-El = 1 72Sq-{-0{q 2)
whence / i s non-zer o an d i s a cus p form . Sinc e di m 5i2(l ) = 1 , th e equalit y
/ = 1 728 A mus t follo w onc e we know that A i s indeed modular . Th e coefficient s i n
the expansio n o f A ar e denoted r(n) (no t t o be mistaken wit h th e diviso r function) .
These ar e integer s havin g man y fascinatin g properties , fo r example ,
m o d 61
9
r(n) = Y^ dU -
d\n
This kin d o f congruence s belong s properl y t o th e theor y o f Galoi s representation s
(see e.g . [Se3]) .
Evidently, A must als o be proportional to the non-zero Poincare series in 612(1),
but i t doe s no t aris e naturall y i n thi s way . Also , th e constructio n o f / exploit s th e
14. H O L O M O R P H I C M O D U L A R F O R M S 361
fact tha t holomorphi c modula r form s ca n be multiplied. Actually , i t i s not ver y
difficult t o show tha t th e graded rin g 0fc^o-^fc(l ) i s isomorphic t o the polynomial
ring C[E4,EQ], an d E± and EQ ar e algebraically independent .
14.3. T h e t a functions .
The Eisenstei n an d the Poincare serie s ar e constructed b y averaging ove r the
group cosets . Thet a function s provid e anothe r importan t clas s of examples of mod-
ular form s whic h com e ou t in quite a different way . Th e most basi c thet a serie s is
Jacobi's 0 function an d equation (1 .53 ) i s indeed a modularit y propert y o f weight
\. However , w e hav e not defined half-integra l weigh t form s (se e for instance [Shi] ,
[14] for this theory ) s o we square 6 and obtain a weight 1 form
02(z) = J2r(n)e(nz).
More generall y w e consider firs t thet a serie s attache d t o binar y quadrati c forms .
Definite form s ar e related t o holomorphic thet a series , whil e indefinit e one s woul d
give non-holomorphi c forms , whic h wer e firs t constructe d b y Maass [Ma] .
Recall the notatio n o f Section 3.8 : K = Q(\/~D) is an imaginary quadrati c field,
D < 0 being a fundamental discriminant , 0 = Z © LOZ is the ring o f integers o f K,
w
~ 2
\{D + V~D), w is the number o f units o f O and H i s the ideal clas s group .
Let A E % be an ideal class . Th e theta serie s attache d t o A i s defined t o be
(14.16) 0 A(z) = - + y e{zNa)
aeA
where a runs ove r the non-zero integral ideals belonging to the clas s A. A s shown in
Section 22.2 , there i s a one-to-one correspondenc e betwee n idea l classe s and classes
of definit e binar y quadrati c form s o f discriminan t D. Le t $A — [a,b,c] b e the
quadratic for m associate d t o A. I f b E A, w e have b = (a) a for som e a E a - 1 ,
a = m + riz^. Then Nb = ^ ^ ( m , n) so taking th e units int o accoun t w e obtain the
formula
(14.17) 0A{Z) =-T e($ A(m,nj).
772,Tl G Z
Poisson summatio n (i n two variables) show s again tha t Qj± is modular, precisel y
9A £ MI(\D\,XD)I wher e XD i s th e Kronecke r symbo l (th e Dirichlet characte r
associated t o the field K). I t als o follow s tha t
OA(Z) =
From (1 4.1 6 ) o r (1 4.1 7 ) w e see that 6A is not a cus p form . Fo r a class grou p
character x E ?^ we define th e theta serie s
/x(Z) = ^ X (o)e(*)
a
where th e sum is over al l integral ideal s o f O thi s time . Splittin g int o classe s we
derive
/*(*) = E x(A)d A(z)
A<EH
362 14. H O L O M O R P H I C MODULA R FORM S
for x 7 ^ 1 , s o / x E MI(\D\,XD) a l s o - Computin g th e Fourie r expansio n a t othe r
cusps, one proves that f x i s a cusp form i f y2 ^ 1 . Recal l that th e characters of order
2 o f H ar e calle d genu s characters . The y wer e determine d (essentially ) b y Gauss ,
who showe d tha t th e subgrou p o f genu s character s i s isomorphi c t o (Z/2Z) t _ 1 ,
where t = u(\D\) i s th e numbe r o f distinc t prim e divisor s o f \D\.
For x n ° t a genu s character , w e hav e f x G SI(\D\,XD), a n d i t als o satisfie s
MIDI J
All clas s grou p character s ar e particula r case s o f Heck e character s o f K (se e
Section 3.8) , s o the y hav e a n L-functio n (ove r K) attache d t o the m
£*(*,*) = J>(a)(Wa)-*
a
and b y Melli n inversio n th e complete d L-functio n
AK(s,x)=(^)Sr(s)LK(s,X)
is expressed b y th e formul a
1
A*(5,x) = J^ (sT +iT 8)/x(-^L=)dy.
A fe w othe r representation s o f A^(s , x) a r e give n i n Sectio n 22.3 . Fro m th e abov e
integral representatio n w e derive, a s fo r th e Rieman n zet a functio n o r th e Dirichle t
L-function, tha t LK(S,X) n a s meromorphi c continuatio n t o th e whol e comple x
plane an d i s entire i f x ¥" 1 - Moreover , w e have th e functiona l equatio n
A*(s,x) = A i r ( l - s , x ) .
Much more general theta series are obtained b y considering any integral positiv e
definite quadrati c form , an d associate d harmoni c polynomial s i n n variable s instea d
of only binary quadrati c forms . However , ther e is no interpretation i n terms o f ideal
classes of number fields in the genera l cas e (fo r 4 variables, o f course, ther e ar e link s
with quaternio n algebras) .
Let A = (dij) b e a real , symmetric , positiv e definit e matri x o f ran k r =
0 (mod 2), an d
J\\X\ — XJ~±X — y CLi jX^Xj
the correspondin g quadrati c for m (th e theor y o f form s i n od d numbe r o f variable s
is somewhat harde r becaus e o f a complicated multiplie r syste m fo r th e correspond -
ing modula r forms) . A homogeneou s polynomia l P G C[X] i s calle d a harmoni c
polynomial i f A ^ P = 0 , wher e
1
dxidxj
i,3
is the Laplac e operato r associate d wit h th e matri x A 1 — (a* •). Wit h thi s dat a w e
define th e thet a serie s
I9(2) = Y, P(.m)e(^A[m]).
meZr
14. H O L O M O R P H I C M O D U L A R F O R M S 363
Suppose A an d NA~ l hav e integra l coefficients , an d eve n diagonal , fo r som e
suitable intege r T V ^ 1 . The n 0(z) G Mk{N,XD) wher e k = \ + d e g P an d x D{d) =
( ^ ) i s the Kronecke r symbo l wit h D = (—1 )%\A\. Moreover , i f d e gP > 0, then
0(z) = 6(z; A, P) i s a cusp for m whic h satisfie s
k
0(z;A1P) = z%\A\-^z- 0(--;A-\l
where P*(x) = P{A~ lx) i s a harmonic polynomia l associate d wit h A~ l (se e als o
Section 20. 4 and , fo r instance , [1 4] , Chapter s 9 and 1 0 fo r complet e proofs) .
14.4. M o d u l a r form s associate d t o ellipti c curves .
Cusp form s associate d t o elliptic curve s hav e b o t h arithmeti c an d analyti c
flavors. W e begi n b y introducin g th e basi c invariant s o f elliptic curve s ove r an
arbitrary fiel d (se e [Sil ] an d als o Sectio n 1 1 .9) . An y ellipti c curv e E, defined ove r
a fiel d fc, has a plane mode l give n b y a Weierstrass equatio n
(14.18) E : y2 + a\xy + a^y = x3 + <i2X2 + a^x + CIQ
with coefficient s a,j G k. Associate d wit h th e a-coefficient s i s th e se t o f coefficient s
62, 64, be, 6 8 give n b y b 2 = a\ + 4a 2 , 6 4 = a xa^ + 2a 4 , 6 6 = a 2 + 4a 6 , b 8 = a2a6 -
aia^a^^ Aa 2a§ + a 2al — a\. Th e 6-coefficient s ar e relate d b y 46 g = b 2bQ — b2. Havin g
these quantitie s w e expres s th e discriminan t o f (1 4.1 8 ) b y
3
(14.19) A = -b\bs - Sb 4 - 21 b\ + 96 2 6 4 6 6 .
The cubi c equatio n (1 4.1 8 ) define s a n elliptic curv e ove r k i f an d onl y i f it i s
non-singular, whic h mean s t h a t it s discriminant A doe s no t vanish . T o sim-
plify equation s w e associat e wit h (1 4.1 8 ) anothe r se t o f coefficients C4,C 6 give n
by C 4 = b\ — 24&4, CQ = — b3 + 366264 — 21 665 . I n thes e term s w e hav e
3
(14.20) 2 A = c\-c\.
R E M A R K S . T h e firs t se t o f 6-coefficients arise s fro m completin g th e square .
Assuming th e characteristi c o f k is no t 2 , w e ca n chang e th e coordinate s (x,y) i n
(14.18) int o (x,y + ^x + ^) gettin g
(14.21) E :y2=x* + ^x2 + ^ x + ^ .
7
4 2 4
The secon d se t o f c-coefficients arise s fro m completin g th e cube . I f further th e
characteristic i s not 3 , we ca n chang e th e coordinate s (x , y) i n (1 4.21 ) int o (x— j | , y)
getting
(14.22) E : y 2 = x 3 _ | x _ | L = x 3+ a x +6 5
say. Thi s ha s th e discriminan t A = I2~ 3(c\ - c§ ) = - 1 6 ( 4 a 3 + 276 2 ). I n this
equation w e chang e furthe r (x , y) int o (x , |) an d multipl y throug h b y 4 getting
(14.23) E : y2 = Ax3 + Ax + B
with A = - j | = 4a an d i ? = — ^ = 46. No w th e discriminan t o f (1 4.23 ) i s simpl y
given b y
(14.24) A = -(A3 + 27B2) = (c3 - c 2
6)/1 728
36 4 14. H O L O M O R P H I C MODULA R FORM S
(notice tha t 1 72 8 = 1 2 3 ).
We retur n t o th e genera l equatio n (1 4.1 8 ) ove r an y field k. W e define th e
so-called j-invarian t by
(14.25) j = 4A-1 = (12c 4)3(c34 - eg)" 1 ;
this make s sens e becaus e w e assume A / 0 . Th e j-invariant play s a n importan t
role i n classification o f elliptic curves . Firs t w e single ou t two special cases .
CASE j = 0 . Thi s i s equivalent t o c4 = 0 , therefore i f E i s given b y (14.23), the n
A = 0 and A = - 2 7 £ 2 .
CASE j = 1 728 . Thi s i s equivalent t o c 6 = 0 , therefore i f E i s given b y (1 4.23) ,
then B = 0 and A = -A 3,
EXAMPLE 1 . Suppos e j ^ 0,1 728 . The n th e equation
(14.26) E
vJ
: y2 + xy = x3 ^ - x - -—^— —
y y - _ 1 7 2 8 • _ 1 7 2 8
gives an elliptic curv e ove r any field k which contain s j ; it s j-invariant is equal to j ,
the c-coefficient s ar e C4 = j(j — 1 728) _1 , CQ = —j(j — 1 728) _1 an d the discriminant
is A = j 2(j — 1 728) _3 . Als o the cubic equatio n
/1 A 0 7 \ T? 2 , • 3 J U~ 1 )2 ^ 33
(14.27) Ej : y + ixy = x x : T^T^X : —- —
vJ
3 y J y 4 J _ 1 7 2 g3 _ 1 7 2 g
(which is the quadratic twis t o f (14.26) b y j) give s an elliptic curv e ove r the field fc;
its invariants ar e c 4 = j 3 ( j - 1 7 2 8 ) " \ c 6 = - j 4 ( j - 1 7 2 8 ) _ 1 an d A = j 8 ( j - 1 7 2 8 ) ~ 3 .
EXAMPLE 2 . Suppos e j ^ 0,1 72 8 an d char.k ^ 2,3 . The n th e cubic equatio n
(14.28)
v yE : y2 = 4x 3 - 2
\ { x- , 2 ^{no
' j - 1 72 8 j - 1 72 8
with j G k give s a n ellipti c curv e ove r k whos e j-invarian t i s equal t o j an d the
discriminant i s A = 2 6 • 3 1 2 • j2{j - 1 728) - 3 .
From no w on we consider ellipti c curve s E define d ove r Q. W e can assum e
that E i s given b y the Weierstrass equatio n (1 4.1 8 ) wit h intege r coefficient s aj (if
necessary, chang e th e coordinates (x , y) int o (c _ 2 x, c~ 3y) and multiply throug h by
c6 t o kil l th e denominator s o f aj b y choosin g a suitabl e c G N). Ever y ellipti c
curve E/Q ha s a minima l model , tha t i s E i s given b y the equation (1 4.1 8 ) wit h
integer coefficient s suc h that ord p A is minimal for every p among all such equation s
defining E (thi s specia l equatio n i s possibl e t o arrang e b y a rationa l chang e of
variables becaus e Z has unique factorizatio n property) . Th e following conditio n
(14.29) min{ord p A, 3ord p c 4 } < 12
for al l p is necessary an d sufficient fo r the model (1 4.1 8 ) wit h aj G Z to be minimal.
Given a minimal equatio n (1 4.1 8 ) fo r E w e look a t its reduction modul o p. If
p \ A, then th e reduced cubi c i s nonsingular, s o it defines a n elliptic curv e ove r the
finite field F p . I n thi s cas e i t i s sai d E ha s good reductio n a t p . I f p \ A, the n
the reduce d curv e i s singular, an d it i s said E ha s bad reduction a t p. Th e ba d
reduction divide s int o tw o types:
14. H O L O M O R P H I C M O D U L A R F O R M S 365
(1) Multiplicativ e reductio n (o r semistable) i f p | A and p \ c^ (thi s mean s
E/Fp ha s a node).
(2) Additiv e reductio n (o r unstable ) i f p | A and p | C4 (this mean s E/¥ p has
a cusp) .
The multiplicativ e reductio n i s said t o be split o r non-split i f the slope s of the
tangent line s a t the nod e ar e rationa l o r not, respectively .
For ever y p we defin e th e conducto r exponen t f p a s follows:
(1) f p = 0 i f E ha s goo d reductio n a t p.
(2) f p = 1 i f E ha s multiplicativ e reductio n a t p.
(3) f P = 2 i f E ha s additiv e reductio n a t p ^ 2 , 3.
When £ ha s additiv e reductio n a t p = 2 or p = 3 , then th e definitio n o f fp i s
quite involved (se e [Sil]) , but w e always have f p^2 an d f p — 2 is a certain "measur e
of wil d ramification" . I n these case s on e ca n sho w tha t $1 ^ 8 and / 3 ^ 5 . Having
the exponent s / p fo r all p the conducto r o f E/Q i s set to be
(14.30) N = Y[p fp-
v
Note tha t N i s squarefree i f and onl y i f E ha s n o additive reductio n a t all places.
In thi s cas e E is called semistable .
Hasse define d th e L-functio n o f E as an Euler produc t o f local factor s define d
by looking at the reductio n E p of E modulo p for al l primes p, whic h i s a curve ove r
the finite field ¥ p. Th e loca l zet a functio n i s defined b y the forma l powe r serie s
Zp(E) = exp{j2\E(^)\~
n^O
where |J£(F pn)| is the numbe r o f points of Ep in a finite field of cardinality p n (wit h
the poin t a t infinity included) .
Let a{p) b e the intege r define d b y | ^ ( Fp ) | = p + 1 — a(p). Notic e tha t i f E is
given by a simpler Weierstras s equatio n
y2 — x 3 + ax + 1
(to which one can reduce for p ^ 2 or 3), then a(p) ca n be expressed b y the quadrati c
character su m
<p) = - xL
mo d p
(— ~v—J-
For p I A , the coefficien t a(p) i s given by
0i f E ha s additiv e reduction ,
a(p) = 1i f E ha s spli t multiplicativ e reduction ,
— 1 i f E ha s non-spli t multiplicativ e reduction .
Hasse prove d tha t Z P(E) is a rational functio n fo r every p , an d fo r p \ A (th e
reduced curv e i s then smooth )
l a{p)X+pX2
(14.31) Z p{E)- -
(l-X)(l- PX)-
36 6 14. H O L O M O R P H I C MODULA R FORM S
Moreover, Hass e proved the Riemann Hypothesis for Ev i n the form o f the inequalit y
(14.32) \a(p)\^2^p
which implie s tha t bot h root s of the numerato r o f Zp hav e modulu s equa l to 1 /y/p.
See Sectio n 1 1 . 8 for elementary proof s o f (14.31 ) an d (14.32).
The Hasse-Weil zeta function o f E, already mentioned in Section 5.1 4 , is defined
by th e Euler produc t
(14.33) L(E, s) = I J ( 1 - a(p)p-T 1 1 ^1 " a
(P)P~° + P1" 2 *)" 1-
p\A pf A
Define a(n) for all n ^ 1 by expanding (1 4.33 ) int o the Dirichlet serie s
oo
s
(14.34) L(E, s) = Y^ a{n)n- .
l
This L- function i s holomorphic i n the region R e (s) > | b y the Riemann hy-
pothesis (1 4.32) . Hass e furthe r conjecture d tha t th e L-function thu s define d fro m
local data coul d be extended to an entire functio n satisfyin g th e functiona l equatio n
A(E,s) = wA(E,2-s)
where w = ± 1 an d
S
/\/N\
A ( E , S ) = ( ^ - ) T(s)L(E,s),
N bein g th e conductor o f E.
This functiona l equatio n i s the same as that satisfie d b y the Hecke L-function s
of cus p form s o f weight 2 and level N wit h trivia l characte r (se e Section 1 4.7) .
Around 1 950 , Shimur a an d Taniyama mad e th e conjecture tha t fo r any E/Q,
there i s a cus p for m whos e L- function i s equa l t o L(E,s). Late r Wei l refine d
the conjectur e b y postulating tha t th e level o f this for m shoul d equa l exactl y the
conductor o f E. Thi s refine d conjectur e ca n be tested computationall y sinc e the
space o f cusp form s o f a given level , weigh t an d character ha s finite dimension .
This dee p modularit y conjectur e wa s proved i n 1 99 5 by Wile s [W] , Wiles-
Taylor [TW ] for semistable curve s (wit h Fermat's Grea t Theore m as a consequence,
as ha d been show n befor e b y Ribet), an d is now known fo r all elliptic curve s ove r
Q, th e final work bein g du e to Breuil, Conrad , Diamon d an d Taylor [BDCT] .
T H E O R E M 1 4.6 . Let E/Q be any elliptic curve of conductor N. There exists
a primitive cusp form
oo
(14.35) f(z) = ^\{n)n*e{nz) G S 2(T0(N))
l
such that a(n) — X(n)n^, that is
oo
(14.36) L(E, s + \) = L(f, s) = J2 A W™" 5 -
l
Since the cusp for m / associate d t o E is primitive o f level A ^ it is an eigenfunc -
tion o f the Fricke involutio n (Wf)(z) = N'lz~2 f(-l/Nz), i.e. ,
(14.37) Wf = -wf, wit h w = ± 1.
14. H O L O M O R P H I C M O D U L A R F O R M S 36 7
Hence th e functiona l equatio n o f th e complete d L-functio n is
(14.38) A ( / , s ) = w A ( / , l - s ) .
The sig n w — ±1 in this equatio n i s also calle d th e roo t numbe r o f E. I t i s
not difficul t t o comput e w computers fo r an y concret e curve , ye t ther e i s no simpl e
formula fo r w in general. I n the importan t specia l cas e o f semistable curves , tha t
is whenT V i s squarefree , w e obtain b y th e theor y o f Heck e operator s
(14.39) w = -n(N)a(N) = -n(N)\(N)N* = - ( - l ) m
where m is the numbe r o f place s wit h spli t multiplicativ e reduction .
The celebrate d Birc h an d Swinnerton-Dyer Conjectur e assert s tha t L(E, s)
(resp. L(f,s)) vanishe s a t s = 1 (resp. s = | ) t o order exactl y equa l t o th e
rank o f th e grou p E(Q) o f rational point s o n E. Thoug h r = rankE(Q) i s not eas y
to compute , it s parit y i s conjecture d t o b e determine d b y
(14.40) w= (-l) r,
and thi s ha s bee n prove d b y Nekovar [Ne ] unde r th e assumptio n tha t th e Tate -
Shafarevitch grou p o f E is finite (whic h is known i f the order o f vanishing o f L(E, s)
at s = 1 is ^ 1 ) .
Proofs o f the modularit y o f special ellipti c curve s existe d before , notabl y for
the curve s wit h comple x multiplication , du e to Deuring (fo r a self-contained proo f
in the cas e of the "congruen t number " curves , se e for instanc e [1 4] , Chapter 8) . A n
early exampl e (du e t o Shimura ) o f a modular curv e whic h i s not a CM-curve i s th e
following ellipti c curv e
E : y2 + y = x 3 - x 2
whose j-invariant i s j — —4096/11 an d discriminan t i s A = —1 1 .
It follow s fro m th e propertie s o f the conducto r tha t th e leve l o f the weigh t 2
cusp for m / correspondin g t o thi s particula r E must b e 1 1 . Bu t i t is easy to show
that 52(1 1 ) i s a 1 -dimensiona l space , an d actuall y i t is not to o har d t o construct
an elemen t i n this spac e fro m th e knowledg e o f the Ramanuja n A function, namel y
f(z) = {A(z)A(llz))1/12 = ? n ( l - <Z") 2(1 " <? Un ) 2
(q = e(z) as before). I n other words , th e modularit y o f E is equivalent wit h th e
identity
(14.41) g J I a - 9") 2(1 - Q lln)2 = £ a(n)q n.
No elementary proo f o f this seems to be known. On e proceeds in two steps. Th e
first on e consists o f showing tha t Yo(ll ) = r 0 (ll)\IH[ (which , suitabl y compactified ,
is a compac t Rieman n surface ) i s an algebraic curv e isomorphi c t o the (affine )
elliptic curv e E; thi s i s done b y constructing meromorphi c function s g\ an d gi
on X 0 ( l l ) (a s quotient s o f modular form s o f the sam e weight ) whic h satisf y th e
equation o f E, an d goes bac k t o Fricke an d Klein . Th e secon d step , whic h is
arithmetic an d muc h les s elementary , i s to deduce fro m thi s th e require d identit y
of //-functions . Thi s las t argumen t i s an exampl e o f the so-calle d Eichler-Shimur a
theory.
36 8 14. H O L O M O R P H I C MODULA R FORM S
Another interestin g exampl e i s the Gross-Zagie r curv e give n b y th e twiste d
Weierstrass equatio n
-139y 2 = x3 + 10x2 - 20 x + 8.
This curv e ha s conducto r N = 37 -1392 an d the sign of the functiona l equatio n
is e = — 1. Moreover , th e rank o f the group o f rational point s o n this curv e i s 3
while on e can show usin g the Gross-Zagier formul a tha t th e Hasse-Weil L-functio n
vanishes a t s = 1 to orde r exactl y 3 (see Appendix t o Chapte r 2 3 for a sketch).
Hence thi s exampl e confirm s th e conjectur e o f Birch an d Swinnerton-Dyer . I n
combination wit h a n earlier wor k o f Goldfeld i t yields a non-trivial effectiv e lower -
bound fo r the class numbe r o f imaginary quadrati c fields , a s will b e explained i n
Chapter 23.
14.5. Heck e L-functions .
Hecke showe d ho w t o produce Dirichle t serie s ou t of modular form s satisfyin g
many o f the properties o f Dirichlet L-functions . Conside r / G M/c(g,x), s o / ha s
the Fourie r expansio n a t oo
/(*) = ^2 af(n)e(nz)-
One defines it s Hecke L-function t o be the Dirichle t generatin g serie s for the Fourie r
coefficients
(14.42) L(f,s) = Y,af(n)n- s.
This L-functio n an d the completed L-functio n
(14-43) A ( S ,/)=(^)Sr(S)L(/,S)
are holomorphi c i n the region R e (s) > 1 + | b y the trivial boun d (1 4.1 0) .
We als o introduc e a n operator W (sometimes calle d th e Fricke involution ) o n
Mk{q,x) b y
k/2
(14.44) Wf{z) = q- z-kf(—)
V qz J
which, becaus e th e elemen t H q1
=( normalizes To(g) , i.e . H qTo(q) —
M
T0(q)Hq, induce s map s W : M k(q,x) ~+ k{q,X) an d W : S k(q,\) ~ >S k(q,x)-
T H E O R E M 1 4. 7 ( H E C K E ) . With notation as above, the L-function of f has a
meromorphic continuation to the whole complex plane and the completed L-function
satisfies the functional equation
k
(14.45) A(f,s) = i A(Wf,k-s).
Moreover, L(f,s) is entire if f is a cusp form, and otherwise it has only a simple
pole at s = k.
14. H O L O M O R P H I C M O D U L A R F O R M S 369
P R O O F . Th e proof is very much like Riemann's (second ) proo f o f the functiona l
equation o f C( 5)- B y th e definitio n o f th e gamm a functio n w e hav e fo r al l n ^ 1 ,
so i n th e regio n o f absolut e convergenc e w e have th e representatio n
Here we split th e integra l int o th e par t fro m 1 to +o o an d th e par t fro m 0 to 1 and
we transform th e latte r a s follow s
by applyin g (1 4.44) . Addin g bot h part s w e obtai n th e integra l representatio n
Since f — cto(f) decay s exponentiall y fas t a t infinity , th e meromorphi c continuatio n
follows, an d th e functio n equatio n i s then clear . •
If / = Ek i s a n Eisenstei n series , on e ca n se e fro m th e Fourie r expansio n tha t
L(Ek,s) i s a produc t o f tw o Dirichie t L-functions , s o w e d o no t ge t an y reall y
new L-functions . Bu t whe n / i s a cus p form , th e associate d Heck e L-functio n i s a
genuine GL(2) object .
REMARK. Lookin g bac k a t th e thet a serie s i n Sectio n 1 4.3 , on e ca n observ e
that i t provide s a n equalit y
LK(S,X) = L(f x,s)
where o n th e lef t sid e i s th e L-functio n attache d t o a clas s grou p characte r o f th e
quadratic extensio n K, whic h ha s a n Eule r produc t
1
s
LK(s,x) = \[(l-x{p)Np- )-
p
of degre e 1 over prim e ideal s i n (9 , wherea s o n th e righ t sid e w e hav e th e Heck e
L-function o f th e modula r for m f x G M I ( | D | , X D) whic h ha s a n Eule r produc t o f
degree 2 over rationa l primes . Thi s i s a n earl y sig n o f th e existenc e o f a n instanc e
of Langland s functoriality . Similarly , L-function s define d ove r numbe r fields ar e
expected t o b e identica l wit h particula r higher-degre e L-function s ove r Q . Anothe r
case o f thi s phenomeno n appear s i n th e theor y o f ellipti c curves , wher e th e Hasse -
Weil zet a functio n o f a n ellipti c curv e E/Q wit h comple x multiplicatio n wa s als o
shown b y Deurin g t o coincid e wit h th e L-functio n o f a Hecke characte r o f weight 2
of a n imaginar y quadrati c field.
37 0 14. HOLOMORPHI C MODULA R FORM S
14.6. Heck e operator s an d automorphi c L-functions .
We no w sketc h th e theor y o f Heck e operator s an d th e Atkin-Lehne r theor y o f
primitive forms , whic h establishes a link between Dirichle t character s an d automor -
phic forms , showin g clearl y ho w th e latte r generaliz e th e former .
The Heck e operator s ar e certai n linea r operator s actin g o n space s o f modula r
forms, whos e existenc e an d propertie s ca n b e see n t o b e intimatel y relate d t o th e
arithmeticity o f the subgroups To(g ) o f 5L(2, M), although thi s will not b e apparen t
in thi s surve y (se e [Mar]) .
Fix integer s k ^ 1 , q > 1 and a character \ modul o q satisfying th e consistenc y
property x{~ 1 ) = ( — l)fe- Th e operato r T{n) i s define d b y th e formul a
(14.46) T(n)f(z) = \ £ X (a)ak £ /(^ )
ad—n 0^b<d
for n ^ 1 an intege r (notic e tha t T(n) als o depend s o n \ an d s o implicitl y o n q\
this ca n b e sometime s significant) . Th e followin g propositio n i s fundamental .
PROPOSITION 1 4.8 . For any n ^ 1 , T(n) acts on modular forms and on cusp
forms: in other words, it induces linear maps
T(n) : M fc (g,x)->M fc ((7,x),
T{n) : S k(q,x) ~ > S k(q,x)-
The proof o f this proposition require s identifying th e formula (1 4.46 ) a s anothe r
instance o f a n averagin g operator , thi s tim e ove r th e (finite ) orbi t spac e A n =
r 0 ( g ) \ r n , wher e T n i s the se t o f integral matrice s o f order 2 and determinan t n , o n
which 5L(2 , Z) act s naturally .
From th e definition , w e comput e th e actio n o f T(n) o n th e Fourie r expansio n
(at oo ) o f a modula r for m (o r eve n o f an y 1 -periodi c function) , namel y
T k
(n)f(z) = \ £ X(a)a £ £ a / ( m ) e (m££±*)
ad—n 0^.b<d m
•iE-<(-)E«c^)E'(?)
m ad=n 0^b<d
= £ ( £ X{a)a k-la}{di))e{mz).
771
ad=n
a£=m
Therefore
(14.47) T(n)f(z) = ^2( £ X (d)dk-'af{^))e(mz).
m d\(n,m)
This formul a coul d als o be take n a s a definitio n o f T(n), bu t th e modularit y o f
T(n)f woul d b e harde r t o prove . However , sinc e th e Fourie r expansio n determine s
a modula r form , i t ca n b e used t o giv e quick proof s o f many propertie s o f the Heck e
operators.
14. HOLOMORPHI C MODULA R FORM S 371
PROPOSITION 1 4.9 . The Hecke operators commute. More precisely, for any
n ^ 1 and m ^ 1 we have
T{m)T(n)= £ x (d)^r(^)
d\(n,m)
or equivalently
T(mn)= £ M ( d ) X ( d ) d f1
e
- T(^)T(^).
d|(n,ra)
In particular , th e T(n) ar e multiplicative : T(mn) = T(m)T(n) i f m an d n ar e
coprime. Therefor e an y T(n) i s a product o f Hecke operators T(p u) fo r som e prime s
p an d integer s v. Thos e operator s obe y a secon d orde r linea r recurrenc e relatio n
(showing tha t the y ar e determine d b y T(p)):
T(p»+l) = T(p)T( Pn - X
1
(p)P k 1
- T(pv- )
which ca n b e summarize d b y th e identit y o f generatin g function s
v
£ T{p )Xv = ( 1 - T(p)X + X (p)pk-lX2)-\
so the Dirichle t generatin g serie s o f all the T{n) ha s a n Eule r produc t (o f degree 2 )
J2 T(n)n- S = J ] ( 1 - T (P)P~° + X O V " 1 - 2 5 ) - 1 -
n^>l p
For p prime , w e have th e formul a
T(p)f(z)=X(p)pk-1f(pz) + l£ /(—) •
If p divide s q (th e level) , thi s simplifie s t o
W1-;EO
The operator s T(p) fo r p no t dividin g q turn ou t t o hav e ver y differen t prop -
erties tha n thos e wit h p dividin g q\ th e latte r ar e ofte n referre d t o a s th e "bad "
Hecke operators, an d th e correspondin g p a s th e ba d o r ramifie d primes . Th e mai n
difference betwee n those two kinds of operators, fro m th e analyti c point o f view, lie s
in their relatio n wit h th e inne r product . Indeed , th e nex t lemm a fail s fo r (n , q) > 1 .
LEMMA 1 4.1 0 . Let n be coprime with q. Then the operator T(n) acting on the
space of cusp forms Sk(q, x) 2 5 normal with respect to the Petersson inner product;
more precisely, its adjoint is
(14.48) T(nY =x(n)T(n).
The lemm a mean s tha t fo r / an d g tw o cus p form s w e hav e
(T(n)f,g) = X (n){f,T(n)g);
note tha t indee d thi s formul a canno t hol d fo r n no t coprim e wit h g , i.e . fo r n wit h
x(n) — 0 ? because i t woul d impl y tha t T(n) = 0 , which i s certainly no t alway s true .
37 2 14. H O L O M O R P H I C MODULA R FORM S
The proof of this lemma (du e to Hecke) is quite involved (se e [1 4] , pages 1 04-1 0 6
for instance) .
From the normality of the T(n) wit h (n , q) = 1 and the fact tha t the y commute ,
we deduc e b y standard linea r algebr a th e following
PROPOSITION 1 4.1 1 . There is an orthonormal basis of the space Sk(q,x) °f
cusp forms which consists of eigenfunctions of all the Hecke operators T(n) for
(n,q) = 1.
Any non-zer o modula r for m / G Mk(q, x ) f° r which ther e exis t comple x num -
bers X(n) such tha t
T(n)f = X(n)f
for al l n coprim e wit h q is called a Hecke form . I f / i s a cuspidal Heck e form , the
adjointness formul a (1 4.48 ) show s tha t X(n) = x(^)A(n) , fo r (n,q) = 1 .
What doe s i t mea n tha t / i s a Hecke for m fo r the Fourier coefficient s aj(n) o f
/ ? Fro m (1 4.47 ) applie d t o / w e obtain b y taking th e first Fourie r coefficien t o f
both side s tha t
(14.49) X(n)a f(l) = a f(n), fo r all n wit h (n,q) = 1
so i f a / ( l) 7^ 0, the Hecke eigenvalue s an d the Fourier coefficient s ar e the same up
to a constant factor ! Thi s alread y prove s tha t th e Ramanujan r functio n i n Section
14.2 is multiplicative, a s already conjecture d b y Ramanujan himsel f (an d prove d by
Mordell). Indeed , sinc e $1 2(1 ) i s one-dimensional an d spanned b y the A function ,
perforce A is an eigenfunction o f all Hecke operator s (ther e ar e no ramified prime s
here), an d since r ( l ) = 1 , the eigenvalue i s r(n) an d the multiplicativity follow s
from Propositio n 1 4.9 .
14,7, P r i m i t i v e form s a n d specia l basis ,
Proposition 1 4.1 1 i s not exactl y wha t w e wish t o be true. I t woul d b e mor e
interesting t o hav e a basi s o f eigenfunction s o f al l th e Heck e operator s withou t
exceptions. The n (1 4.49 ) hold s for all n, and so a / ( l) must b e non-zero or otherwise
/ = 0 . Afte r normalization , th e Fourier coefficient s wil l be multiplicative, an d more
importantly th e Hecke L-functio n o f / wil l hav e a n Euler produc t
L(f, s) = fj (1 " af(p)p-s + X {p)pk-l-2srl-
P
Such a wonderful worl d doe s no t exist, however . Indeed , le t %* modulo q* be
the primitiv e characte r whic h induce s x - Tak e any q' and d with q* \ q', dq' \ q an d
let x' b e the character modul o q' induced b y x*- No w if / G Sk(q'', x')i th e functio n
/ | d (z) = f(dz) i s in Sk(q,x)i a s follow s fro m th e matrix identit y
d 0 \ fa p\ _ f a pd\ fd 0 \
0 l) [ 7 6)-\ild 6 J \01 )'
It i s also eas y t o show tha t th e Hecke operator s T(n) wit h (n , q) — 1 commute
with th e operatio n / \-> f |^ , so / bein g a Heck e for m implie s tha t / | d i s one .
However, sinc e
/ I d (z) = ^2a f(n)e(dnz),
14. H O L O M O R P H I C M O D U L A R F O R M S 373
the firs t Fourie r coefficien t o f / \d vanishes i f d > 1 .
The theor y o f primitiv e form s (o r "newforms" ) remedie s t o thi s defec t b y con -
sidering tha t cus p form s suc h a s / \d in Sk{q, X) a r e no t reall y o f leve l q but com e
from lowe r levels , an d tha t a bette r theor y o f Heck e operator s shoul d hol d whe n
such "oldforms " ar e pu t aside . Thi s wa s develope d b y Atki n an d Lehne r [AL] .
Therefore, le t £ b (g, x) D e th e subspac e o f Sk(q, x) spanne d b y al l cusp form s o f
the typ e / \d where / < G Sk(q' ', x') w ^ h q' < q and dq' \ q. The n le t S*(q, x) b e th e
orthogonal complemen t o f S\(q,x) wit h respec t t o th e Petersso n inne r product , s o
b
Sk(q,x) = S (q,x)®S*(q,X)
(note tha t i f x i s primitive , w e hav e S%(q,x) = Sk{q,x)i a n d tha t ^ X i s trivial , q
is prim e an d fc ^ 1 0 or k = 1 4 , w e als o hav e S%(q) = Sk(q), becaus e 5^(1 ) = 0 fo r
those k).
Since (T{n)f) \d= T(n)(f \d) for (n,q) = 1 , i t i s immediat e tha t T(n) act s o n
Sfr(q,x) f° r ( n5 q) — 1 - Sinc e thos e Heck e operator s ar e normal , the y ac t als o o n
the orthogona l complemen t S%(q, x)- I n particular , bot h space s stil l hav e a basis of
Hecke forms. B y definition, a Hecke form / whic h i s in S%(q, x) ls calle d a primitiv e
form (o f weigh t fc, level q, an d characte r %) • Th e mai n resul t i s the following :
THEOREM 1 4.1 2 (MULTIPLICIT Y ON E PRINCIPLE) . Given a sequence (A(n) )
of complex numbers, the subspace of S^(q,x) spanned by Hecke forms with eigen-
values A(n ) for all (n , q) = 1 is at most one-dimensional. In other words, such a
Hecke form f, if it exists, is unique up to multiplication by a scalar.
EXERCISE 4 . Prov e tha t th e multiplicity-on e principl e i s equivalen t t o th e as -
sertion tha t an y primitiv e for m / G S%(q,x) n a s non-zer o firs t Fourie r coefficient .
Suppose no w that T i s a linear operato r o n th e spac e S%(q, x) whic h commute s
with al l T(n) fo r (n , q) = 1 . The n i t follow s tha t T act s o n the commo n eigenspace s
of thos e T(n). B y th e multiplicity-on e principle , thos e ar e one-dimensional , eac h
spanned b y a primitiv e for m / , an d therefor e / i s als o a n eigenfunctio n o f T . I n
particular applyin g thi s propert y t o th e "bad " T(n), i.e . fo r (n,q) ^ 1 , w e obtai n
the desire d result .
PROPOSITION 1 4. 1 3 . Let f e S%(q,x) be a primitive form. Then f is an
eigenfunction of all Hecke operators,
T(n)f = \ f(n)f, for all n^ I
and for all n^l, a/(n ) = A/(n)a/(l) . Hence the first Fourier coefficient a / ( l ) is
non-zero.
It i s natura l t o normaliz e a primitiv e form , s o tha t i t become s a distinguishe d
basis o f it s commo n eigenspace ; the n th e basi s o f S%(q,x) ° f normalize d primitiv e
forms i s also unique. Tw o natural normalization s exist : w e will say that / i s Hecke-
normalized i f its first Fourie r coefficien t i s a/(1) = 1 , so that th e Fourie r coefficient s
are the same as the Hecke eigenvalues, an d that / i s Petersson-normalized i f ||/| | = 1
(such normalizatio n i s ofte n mor e usefu l i n th e contex t o f th e Petersso n formula) .
If / i s Hecke-normalized, the n o f course | | / | | - 1 / i s Petersson-normalized. Th e nor m
ll/H i s relate d t o th e valu e o f th e adjoin t squar e L-functio n o f / a t 1 , an d play s a
37 4 14. HOLOMORPHI C MODULA R FORM S
role i n som e importan t applications . Se e Sectio n 5.1 2 , an d i n particula r Corollar y
5.45 fo r th e compariso n betwee n th e tw o normalizations .
When n o othe r indicatio n i s present , "primitive " wil l alway s mea n "Hecke -
normalized primitive" . I n an y case , onc e a normalizatio n i s chosen , th e basi s o f
primitive form s o f S%(q,x) i s unique . I n particular , i t i s possibl e t o spea k o f th e
set o f primitiv e form s withou t ambiguity .
The Heck e L-functio n o f a (Hecke-normalized ) primitiv e for m / , fro m th e pre -
vious arguments , ha s a n Eule r produc t
L(f, s) = J] ( 1 - A/(p ) + XWP"- 1 -2*)-1 -
P
This i s equivalen t wit h th e formul a
(14.50) A /(mn)= £ x (d)d"-lXf(^).
d\(m,n)
This ca n b e als o writte n b y Mobiu s inversio n a s
k
(14.51) A /(m)A/(n)= £ ^d) X{d)d -'\f(^)\}Q.
d\(m,n)
Moreover, th e functiona l equatio n o f Theore m 1 4. 7 is als o simple r becaus e th e
primitivity o f / wil l sho w tha t Wf i s expressed i n term s o f / . Th e linea r operato r
W i s a n isometry , an d i t i s eas y t o sho w tha t W commute s wit h th e goo d Heck e
operators:
WTx(n) = x(n)TX(n)W, fo r (n , q) = 1
(where th e superscrip t x indicate s fo r whic h characte r th e T(n) i s defined) .
To com e bac k fro m Sk(q,x) t o th e origina l spac e 5fc(g,x) > w e us e anothe r
operator, th e if-operato r define d b y
KM = 7FI).
Note tha t K act s o n Fourie r expansion s b y conjugatin g th e coefficients ,
K z
f( ) = ^~^(n)e(nz) = 70)> say.
One mus t b e carefu l becaus e K i s no t C-linear , namel y K(af) = aKf. Bu t
from th e definitio n i t i s clea r tha t K 2 = Id, tha t K i s a n isometry , an d als o tha t
K commute s wit h al l T(n). Conside r no w th e compositio n (als o onl y R-linear )
W = KW : S k(qlX)^Sk(q,x)
which satisfie s th e commutatio n relatio n
T(n)W = x(n)WT(n) , fo r (n , q) = 1 .
Let / b e a primitiv e form . The n fo r (n , q) = 1 we hav e
T{n)Wf = X (n)WT(n)f = X (n)W(Xf(n)f) = X {n)Y^)W{f) = X f(n)W(f)
by th e formul a fo r th e adjoin t o f T(n). B y th e multiplicit y on e principle , i t follow s
that Wf mus t b e a multipl e o f / , henc e
14. H O L O M O R P H I C M O D U L A R F O R M S 37 5
PROPOSITION 1 4.1 4 . If f is a primitive form, then there exists rj E C, with
\r)\ = 1 , such that Wf — r\f.
That1 77 1 = 1 follow s fro m th e fac t tha t W i s a n involution , W = Id. Th e
eigenvalue rj is another ver y intricat e invarian t o f / . I t i s determined b y / bu t onl y
in specia l case s d o we have a formul a fo r it s valu e i n term s o f more accessibl e data .
PROPOSITION 1 4.1 5 . Let \ be a primitive character of conductor q. Then rj is
given by
k/2
ri = r(x)Xf(q)q- .
PROPOSITION 1 4.1 6 . Let q be squarefree, and \ trivial. Then 7 7 is given by
1
77 = fi(q)Xf(q)q -^.
From Propositio n 1 4.1 4 an d Theore m 1 4. 7 w e deduc e th e mai n resul t abou t
automorphic L-function s o f primitiv e forms .
T H E O R E M 1 4.1 7 . Let f be a primitive form. Then the Hecke L-function of f
has an Euler product expansion
L(f,s) = Y[(i-\ f(p) + x(p)p k-1 -2sr1
p
and has analytic continuation to an entire function. The completed L-function
S
A(f,s)=(^) T(s)L(f,s)
satisfies the functional equation
k
A(f,s) = i vA(7,k-s).
Notice that thi s theore m i s in perfect analog y wit h th e correspondin g resul t fo r
primitive Dirichle t characters . Eve n deepe r analogie s ar e reveale d b y th e insigh t o f
the Langland s program , whic h use s representation theor y t o unify th e whol e theor y
of L-function s (cf . [BG]) .
We conclude thi s sectio n b y statin g a non-trivia l generalizatio n o f the multipli -
city-one principl e whic h i s very usefu l i n applications .
T H E O R E M 1 4.1 8 (TH E STRON G MULTIPLICIT Y ON E PRINCIPLE) . Given a
sequence (A(n) ) of complex numbers and a fixed positive integer M, there exists at
most one primitive form f E S%(q,x) such that
Xf(n) = A(n) , for all n prime to M.
The mai n poin t i s tha t M ha s nothin g t o d o wit h th e leve l q. I n effect , thi s
shows that a primitive for m i s known whe n on e knows its eigenvalues fo r al l primes,
except finitely many . I n fac t on e ca n sho w usin g Rankin-Selber g L- functions tha t
finitely man y prime s suffic e (dependin g o n th e weigh t an d level) ; se e Propositio n
5.22 an d th e remar k following .
376 14. HOLOMORPHI C MODULA R FORM S
EXERCISE 5 . Sho w tha t i f / G Sk(q,x) i s a Heck e form , the n ther e exist s
a uniqu e primitiv e for m g G Sk(q',x') wit h q' \ q suc h tha t x'( n) — x{n) a n d
Ap(n) = A/(n ) for al l n prim e t o q.
14.8. Twistin g modula r forms .
We hav e alread y mentione d tha t w e intend t o exploit th e eigenvalues A/(n ) of
primitive modula r form s a s arithmetic harmonics ; however , befor e the y ca n be used
effectively fo r such a purpose, i t i s necessary t o get information o n the eigenvalue s
themselves. Fo r this , followin g ou r strategy , i t i s natura l t o prob e thei r behavio r
when twiste d b y othe r harmonics . Th e cas e o f primitiv e Dirichle t character s i s
basic an d the results ar e nice.
PROPOSITION 1 4.1 9 . Let f e Mk{q,x) be a modular form with Fourier co-
efficients a,f(n), not necessarily a Hecke form, q* the conductor of the Dirichlet
character x an d let IJJ be a primitive Dirichlet character modulo r. Let f ^ip be the
function on HI given by the Fourier expansion
(f®*p){z) = ^2^(n)a f(n)e(nz).
Then f <8>ip is also a modular form, more precisely f <8>ip G Mk(N, x^ 2 )? where the
level N is the least common multiple of q, g* r and r2 . If f is a cusp form, then so
is f ® ip.
PROOF. Sinc e ip is primitive , w e hav e fo r an y n th e expressio n fo r ip(ri) in
terms o f additive character s
u e
r(i/j)ip(n) = J2 ^( ) yy)
u (mo d r)
where r(yj) ^ 0 is the Gauss sum , whence
2
(14.52) {f®iP){z) = T®)-1 J2 /( +
u (modr )
It i s no w a matte r o f writing , fo r 7 = 1 , I i n To(N), th e matri x identit y i n
SL(2,R),
,0 l ) \ c d) \ c d -cj^ ) { 0 x
to obtai n
{{f®^)\kl){z)=x{d)T{^)-1 ] T 4{u)f(z + ^p)
u (mod r )
= x(#M 2 (/®^)W
which show s the modularity. A s for /<8>^ being a cusp for m whe n / is , the criterion
of Exercis e 1 applies immediatel y fro m (1 4.52) . •
By multiplicativit y o f tp and th e formul a (1 4.47) , w e se e tha t th e operatio n
of twistin g b y ip takes Heck e form s t o Heck e forms . O n the othe r hand , i t migh t
14. H O L O M O R P H I C M O D U L A R F O R M S 37 7
not tak e a primitiv e for m t o a primitiv e form , becaus e th e leve l N a s describe d i n
Proposition 1 4.1 9 migh t no t b e optimal . Thi s happens , fo r instance , whe n / = f x
is on e o f th e form s o f weight 1 associated t o a n ideal-clas s characte r \ m Sectio n
14.3. Indee d sinc e th e Fourie r coefficient s o f f x ar e supporte d o n integer s whic h
are norm s o f ideal s o f th e field K = Q(y/D)^ s o twistin g b y th e field characte r
I/J = XD has n o effec t a t all , sinc e i t i s identically 1 on norm s o f ideals. B y Exercis e
5, however , ther e i s a uniqu e primitiv e for m g, wit h leve l dividin g TV , which i s suc h
that
ip(n)\f(n) = A p (n), fo r al l n coprim e wit h q.
In th e specia l cas e when q and r ar e coprime , however , ther e i s no problem an d
f ® ijj is primitiv e wheneve r / i s (a s migh t indee d b e expected) . I n thi s cas e th e
functional equatio n o f th e twiste d L-functio n take s th e followin g form .
PROPOSITION 1 4.20 . Let f E S^(q^x) be a primitive form andip be a primitive
Dirichlet character modulo r with (r , q) = 1 . Then f 0 i\) is primitive of level
N = qr 2 and the Hecke L-function of f 0 ij; is entire and polynomially bounded in
vertical strips. Moreover, the completed L-function (l^.Jf.3) satisfies the functional
equation
A(s, f ®ip) = i kwfjfA(k - s , 7 ® ^>)
where rjf is the eigenvalue of f for the operator W, and the root number w depends
only on x an d i>, namely
w = X\TW{q) .
r
The fact tha t ther e is no pole for an y of the twists of a cusp form show s that th e
sign o f th e Fourie r coefficient s ar e changin g randoml y an d independentl y o f thos e
of an y Dirichle t character . Th e lac k o f periodicit y o f th e Fourie r coefficient s i s a
welcome featur e i n analyti c numbe r theory . Se e [DuI2] , [DuI3] , [DuI4 ] fo r som e
applications o f twistin g a s a replacemen t o f positivity .
Twists ar e importan t als o a s a simpl e cas e o f Rankin-Selber g convolutio n
GL(2) 0 GL(1 ) (se e Chapte r 5) , an d the y occu r i n so-calle d convers e theorem s
which characteriz e modula r form s b y mean s o f analyti c propertie s o f L-functions .
The first convers e theore m usin g twist s i s du e t o Wei l [We3] . Deepe r variant s ar e
important i n proving instances o f Langlands functoriality , se e the survey b y Cogdel l
in [BG] . We quot e Weil' s result :
THEOREM 1 4.21 . Let
S
Li(s) = ] T a{n)n~ s, L 2(s) = J2 H™)™~
be two Dirichlet series absolutely convergent for R e (s) > C for some C > 0 .
Assume that there exists integers k ^ 1 , q ^ 1 and M > 0 such that for any ip
primitive modulo m, with (m,Mq) = 1 , the Dirichlet series
_s s
L(f ® ij), s) = ^2 a ( n ) ^ ( n ) n , L(g 0 T/> , S) = ^ 6(n)^(n)n~
admit analytic continuation to entire functions bounded in vertical strips such that
the functions
A ( / ® V>, s) = (27T)- sT(s)L(f ® V, s), A(g ®^,s) = (2w)~ sT(s)L(g ® </>, s)
378 14. H O L O M O R P H I C MODULA R FORM S
are entire and satisfy the functional equation
A(/<g ) i/;, s) = w^(qm 2)k/2-sA(g 0 ^ , k - s),
with
w^p = i*x(m)^(g)T(^) 2 r _1 .
or
T/ien t/ier e ezzst e a cws p /orr a / G Sk(q,x) f some \ modulo q such that
L(f1s) = L 1 (s) andL(Wf,s) = L 2(s).
14.9. Estimate s fo r th e Fourie r coefficient s o f cus p forms .
The questio n o f th e siz e o f th e Fourie r coefficient s (o r Heck e eigenvalues ) o f
cusp form s i s on e o f th e classica l problem s i n th e stud y o f automorphi c form s fo r
themselves. Conside r / e Sk(q,x)- B y th e criterio n fo r cus p form s o f Exercis e 1
and th e Parseva l formul a w e obtain, fro m th e Fourie r expansio n o f / ,
$>/(n)| 2 e - 4 ™"= f\f{z)\ 2
dx^y-k
for an y y > 0 , s o tha t fo r an y N ^ 1 ,
Y,\af(n)\2«y-ke^y
and b y choosin g y = TV -1 w e derive a boun d fo r th e second-momen t o f th e Fourie r
coefficients
2
(14.53) V J | a/(n)| «iVfc.
Hence th e trivia l boun d (1 4.1 0) , namel y a>f(ri) < C n^/ 2 , follow s b y positivity .
This las t ste p i s rather crud e an d i t i s therefore expecte d tha t th e resultin g boun d
for th e individua l coefficien t cif(n) ca n b e improved. O n th e othe r hand , th e boun d
obtained o n averag e turn s ou t t o hav e alread y th e correc t orde r o f magnitude , a s
shown b y th e propertie s o f th e Rankin-Selber g L-functions . Thus , i t show s tha t
af(n)1 o n average , i s of orde r n~^~ .
The statemen t tha t a/(n ) < C n~^~+£ fo r an y e > 0 holds individuall y i s know n
as the Ramanujan-Petersso n Conjecture . Fo r primitiv e cus p forms , thi s was prove d
by P . Delign e [Del ] fo r k ^ 2 a s a consequenc e o f th e Rieman n Hypothesi s fo r
varieties ove r finite fields (th e Wei l conjectures) , i n th e ver y shar p for m
(14.54) |A/(n) | ^ r ^ n ^
for an y n ^ 1 , where r i s the diviso r function . Th e correspondin g formul a fo r k = 1
holds an d i s due t o Deligne an d Serr e [DeSe] . Thi s i s an extremel y powerfu l result ,
in particular , becaus e i t i s completel y uniform . I f / i s no t primitive , b y writin g i t
as a linea r combinatio n o f Heck e forms , w e deriv e fro m (1 4.54 )
a
/ ( n ) ^ r(n)n~2 _ ,
but th e implie d constan t no w depend s o n / quit e badly .
14. H O L O M O R P H I C M O D U L A R F O R M S 37 9
It i s often convenien t t o normalize th e Fourier coefficient s s o as to extract th e
expected orde r o f magnitude, writin g
(14.55) f(z) = y^a/(n)n~2~e(nz )
instead of (14.9), and similarly for primitive form s wit h A/(n) . Thi s has the effect of
putting the critical line of the L- function a t Re (s) — |, an d the functional equatio n
then relate s A(/ , s) an d A(/ , 1 — 5). Thi s normalizatio n i s mos t convenien t i n
analytic numbe r theor y an d will b e in effect fro m no w on, except wher e explicitl y
stated. I t i s als o th e normalizatio n use d i n Chapte r 5 , se e Section 5.1 1 for the
necessary adjustments . I n this context , Deligne' s boun d take s th e form |A/(n) | ^
r(n) fo r / primitive , an d the Hecke L- function become s
£ A/(n)n- s = n a - xf(p)p~s + x{p)p- 2srl-
n^lp
In recen t application s (fo r instance, o f the "amplificatio n method") , th e need
for lowe r bound s o n the Fourie r coefficient s ha s also arisen . Sinc e a n individua l
lower boun d i s impossible, a s there exis t form s / (primitive ) an d n (coprim e wit h
the level ) wit h Ay(n ) = 0 , this ca n only b e expected t o hold o n average. Indeed ,
from th e Rankin-Selber g metho d w e know tha t th e upper-boun d (1 4.53 ) ca n be
sharpened t o the asymptotic (remembe r tha t w e now normalize af(n) a s in (14.55))
(14.56) £ | a/ (n)|2= C/ iV + 0(iVf) ,
with cj > 0 the implie d constan t dependin g o n / . I n particular , fo r a fixe d / ,
the Fourie r coefficient s ar e not too small to o often. However , th e dependence of
the constan t o n / i n thi s formul a i s no t explici t enoug h an d so the uniformit y
which i s often require d fo r applications i s not present. I n this contex t a very usefu l
observation i s that i f / i s a primitive for m
\f{p2)-xf{p)2 = x(p).
From thi s simpl e formul a w e see that fo r p not dividing the level q it is not possible
for Xf(p) an d A/(p 2 ) t o be simultaneously ver y small ! Thi s i s very usefu l becaus e
it i s completely unifor m i n all parameters involved . Her e i s an example o f the us e
of thi s formula .
P R O P O S I T I O N 1 4.22 . Let f e S%(q,x) be a primitive form, \f(n) its Hecke
eigenvalues. There exists a sequence of complex numbers c(n) such that
-1
(14.57) ] T c(n)A/(n ) x V ^ l o g T V ) ,
(14.58) ] T |c(n)| 2 x y/NilogN)- 1
if N ^ > (log2g)2 , where the implied constants are absolute.
380 14. H O L O M O R P H I C MODULA R FORM S
I t is enough t o take c(n) — \{p) it n = p2 ^ N, c(n) = ~x{p)^f(p) i f
PROOF.
n = p ^ y/N, an d c(n) = 0 otherwise. The n th e left sid e o f (14.57) i s equal to
2
£ x(p)(A/(p )-A,(p)2) = J2 l^v/iVOogiV)- 1
by Tchebyshev' s estimat e fo r ir(yJW) an d u;(g) < C (log2g)(loglog3g)_1. O n the
other hand , c{n) satisfies (1 4.58 ) b y Deligne's boun d (1 4.54) . •
R E M A R K . Pro m the proof on e sees that th e numbers c(n) ca n be chosen to have
support o n primes o r squares of primes an d that \c(n)\ ^ 2. Thes e extr a propertie s
are no t important bu t could b e convenient fo r technical reasons .
14.10. Average s o f Fourier coefficients .
The Petersso n formul a (Propositio n 1 4.5 ) i s very usefu l i n applications; it per-
mits averagin g ove r natura l familie s o f automorphic forms , whic h ar e so to spea k
"complete". I n doing so, it shows that th e Fourier coefficient s o f automorphic form s
behave a s "harmonics", the y ar e nearly orthogona l (se e als o Sectio n 7. 3 for the un-
derlying philosophy). Thi s is also very true with the similar effect s o f the Kuznetso v
formula (se e Chapter 16).
It i s especially importan t t o deal wit h coefficient s o f primitiv e forms , whic h
benefit als o fro m multiplicativ e propertie s (amon g othe r things) . Th e issue arise s
that i n general ther e i s no basis o f Sk(q,x) ° f primitive forms . However , takin g a
basis o f Hecke form s (se e Propositio n 1 4.1 1 ) , say Hk(q,x)i a n d denotin g
their Fourie r expansion , Propositio n 1 4. 5 yield s
COROLLARY 1 4.23 . Let q ^ 1 , k ^ 2 . Let x be a character modulo q and
Hk(q,x) an y Heche basis of Sk(q,x)- F° r an V m, n ^ 1 , we have
(14.59)
V^ Xf(n)Xf(m) = <5(ra,n) + 2ni~k \ J c~ 1
Sx(m, n ; c)Jk-i ( ) •
feHk(q,x) 0 0
c=0 (mo d q)
where the superscript h means that f has been spectrally normalized to be of Peters-
son norm 1 . Without normalization this means the terms of summation are appro-
priately weighted, namely
<"*» E * . , - $ j^ E Z-
feHk{q,x) v J
feH UJ
"
k(q,x)
It i s important t o know ho w goo d the approximate orthogonalit y ca n really be,
that is , to estimate the series of Kloosterman sums . A simple but effective treatmen t
yields th e following corollar y (se e Theore m 1 6. 7 for a more refine d estimate) .
14. H O L O M O R P H I C M O D U L A R F O R M S 381
COROLLARY 1 4.24 . With notation as above, we have for any m, n ^ 1 ,
k
(14.61) Yl A / (n)A / (m) = <5(ra,n )
feHk(q,x)
i.
+ 0^r3((m,n))(m,n,g)2(mn)4^1 og^ l + ^ - — ^ j
qy/k ^ Vqk
where the implied constant is absolute.
PROOF. Us e th e estimat e Jk-i(x) < C min(l,:r//c ) an d th e Wei l boun d fo r
Kloosterman sum s (Corollar y 1 1 .1 2 )
(14.62) |5(m , n- c)\ ^ (m , n, c) 1 / 2 c 1 / 2 r(c).
I I I
For c = qr, say , thi s yield s the boun d (m , n, g) 2 (m, n, r) 2 (gr) 2 r(g)r(r). Henc e th e
sum o f Kloosterma n sum s i n (1 4.59 ) i s estimate d b y
(m,n,q)2-^- > - — / ( m , n , r ) 2 m m 1 ,- ^ .
V^ , v ^ \* kqr J
Hence th e resul t follow s b y th e elementar y estimat e
r^
which hold s fo r an y positiv e X. •
If th e spac e o f old form s vanishes , the n th e se t o f primitive form s Sfc(g , x)* i s a
distinguished basi s o f Sk(q, x)- I n particular , thi s i s so if x i s primitive fo r instance .
However, w e wil l discus s furthe r th e specia l cas e wher e q is prime , \ i s trivia l
and k < 1 2 , becaus e thi s wil l b e use d i n Chapte r 2 6 fo r studyin g non-vanishin g o f
L-functions. I n tha t cas e S%(q) = Sk(q) becaus e 5^(1 ) = 0 (th e firs t cus p for m fo r
SL(2,Z) i s o f weigh t 1 2) . I f / G Sk{q)* i s a primitiv e for m i t ha s rea l eigenvalues ,
and sinc e q i s squarefre e th e sig n ej = ± 1 o f th e functiona l equatio n i s give n b y
sj — —Xf(q)q1^2 (Propositio n 1 4.1 6 ; note we have changed normalizatio n o f Fourier
coefficients slightly) . I t i s ofte n desirabl e t o b e abl e t o averag e ove r form s wit h a
fixed sign , sa y e — ± 1 . Thi s ca n b e don e b y insertin g a facto r ( 1 -f ££/) , an d
applying th e summatio n formul a twice :
= Y? A / ( m ) A 7 R - sq 1 ^2 ] r " X f(q)Xf(m)Xj^j.
ff
Because / i s primitive an d q is the level , Xf(q)Xf(n) = Xf(qn) fo r al l n. W e deduc e
38 2 14. HOLOMORPHI C MODULA R FORM S
PROPOSITION 1 4.25 . Let q be prime, 1 < k < 1 2 , e = ± 1. For any m , n ^ 1,
we /iai>e
(14.63) 2 ^ A / (m)A / (n) = 6(m, n) - eq 1/25(m1 nq)
+ 2m~ k Y^ c~ 1
(S(m, n ; c) - eq1/2S(m, nq; c))Jk-i ( 7Tx/mn
\.
c>0
c=0 (mo d q )
Here als o w e ca n estimat e som e o r all Kloosterma n sum s gettin g a n approxi-
mation t o th e diagona l symbo l i n term s o f th e Heck e eigenvalue s o f primitiv e cus p
forms.
COROLLARY 1 4.26 . Let q be prime, 1 < k < 1 2 and e = ± 1. For any m , n ^ 1
with (m , q) = 1 we have
(14.64) 2 ^ X f(m)Xf(n) = 6(m,n)
£f=£
2eiri~k v ^ 1 ^, _ N T /47 r /m? v
^ (r, * ^) =—i
l V "
9
1I1
/ , ,u (km(ran)
n ) 4 4/ / ((mn)
mn)4\ 4 \
and
(14.65) 2j : ' A / ( m ) ^ 0 = ,(m,„) + O(«)(^)* V , 1(l + (™)i
T/ie implied constant in both estimates is absolute.
P R O O F . Sinc e q does no t divid e m we hav e <5(ra , gn) = 0. Moreover , th e firs t
term involvin g Kloosterma n sum s o n th e righ t sid e of (14.63) i s alread y estimate d
in Corollar y 1 4.24 . Fo r th e second , w e write c = qr an d estimat e th e contributio n
of th e term s wit h q | r, by th e sam e metho d gettin g a bound whic h i s of smaller
order o f magnitude .
When (r,q) = 1 , the Kloosterma n su m factorize s
S(ra, n; qr) = S(mq, n ; r)S(0, m; q) = — S(mq, n ; r).
This give s (1 4.64) . Th e remainin g su m o f Kloosterma n sum s i n (1 4.64 ) ca n als o b e
estimated directl y b y usin g Weil' s bound , an d th e resul t i s give n i n th e erro r ter m
of (1 4.65 ) (thi s i s worse tha n th e erro r ter m i n (1 4.64 ) i n term s o f q). •
The erro r ter m o f (14.65) i s large r tha n th e erro r ter m i n (14.64) an d wil l no t
be sufficien t (just ) fo r application s i n Chapte r 26 . Bette r estimate s wil l b e derive d
from (1 4.64 ) b y exploitin g a cancellation i n sums o f Kloosterman sum s ove r th e
variables m and n.
http://dx.doi.org/10.1090/coll/053/16
CHAPTER 1 5
SPECTRAL THEOR Y O F
AUTOMORPHIC FORM S
15.1. Motivatio n an d geometri c preliminaries .
Abelian harmoni c analysis , a s developed i n the previous chapter s (se e especially
Chapter 4) , give s powerfu l mean s o f transformin g o r estimatin g summation s ove r
integers, o r lattic e point s i n variou s geometrica l domains . However , i n analyti c
number theory , i t i s desire d t o hav e suc h tool s fo r sum s ove r othe r subset s o f
integers whic h ar e no t define d b y analyti c condition s alone . Commo n amon g thes e
are sum s ove r (a , 6, c, d) G Z4 restricte d b y th e determinan t equatio n
where h i s fixed. Fo r example , on e encounter s th e determinan t equatio n whe n
investigating th e powe r moment s (wit h amplification ) o f L-functions o n th e critica l
line t o handl e off-diagona l contributions .
Although th e classica l Fourie r analysi s applie s t o th e determinan t equatio n
quite well (especiall y whe n enhance d b y estimates fo r Kloosterma n sums ) th e spec -
tral theor y o n th e hyperboli c plan e produce s muc h stronge r results . Th e ke y poin t
is that fo r th e analysi s o f the determinant equatio n th e automorphi c form s ar e mor e
natural harmonic s tha n th e exponentia l function s o r Dirichle t characters . The y en -
ter th e moder n analyti c numbe r theor y throug h man y othe r doors . Fo r example ,
the ol d problem s concernin g th e clas s grou p o f quadrati c fields (Lagrange , Gauss ,
Dirichlet) canno t b e properl y understoo d withou t speakin g o f automorphi c theory .
Another incentiv e fo r ne w analyti c tool s i s to stud y automorphi c L-functions .
References fo r thi s chapte r ar e fo r instanc e [Bu] , [1 5] . Th e basi c theor y wa s
constructed b y Maas s [Ma ] an d Selber g [S6] .
As i n th e previou s chapte r w e conside r first th e grou p G — SX(2,R). Jus t
as Z i s a discret e subgrou p o f R , fro m whic h th e theor y o f periodi c function s an d
Fourier serie s arises , Y = To(l ) = SX(2,Z ) i s a discret e subgrou p o f G. Studyin g
the determinan t equatio n wit h h = 1 by harmoni c analysi s mean s takin g a kerne l
(smooth, compactl y supported ) k : G —> • C an d finding a summatio n formul a a l a
Poisson fo r th e functio n K : G —> C give n b y
K{g) = YJKl9)
which i s T-periodic ; JC{^g) — IC(g) for 7 G I\
383
38 4 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
However, a usefu l simplificatio n i s possibl e b y performin g firs t a n ordinar y
Fourier expansio n wit h respec t t o th e compac t abelia n subgrou p
(which i s isomorphi c t o th e circl e R/27rZ) . An y functio n k ca n b e expande d a s a
series
where k m satisfie s a simple transformation rul e km(gr) = e(mO)k(g) fo r al l r = TQ G
K. Therefore , instea d o f function s o n G , w e can loo k a t function s o n th e quotien t
GjK. Thi s quotien t i s diffeomorphi c t o th e Poincar e uppe r half-plan e b y
/A : - > M
(15.1) a b \ ai +b
c d) ci +d
Instead o f doin g thi s reductio n explicitly , w e work o n H I and hav e G ac t o n it .
Recall fro m Chapte r 1 4 tha t th e Poincar e uppe r half-plan e i s H I = {z G C |
Im (z) > 0} . W e wil l se e i t no w a s a mode l o f th e hyperboli c plane , a complet e
Riemannian manifol d o f dimensio n 2 wit h constan t negativ e curvatur e — 1 when
equipped wit h th e Poincar e metri c
2
(15.2) ds = y~ 2{dx2 + dy 2) = y~ 2
\dz\2.
The geometr y o f H I with thi s metri c i s accessibl e becaus e H I has a larg e isom -
etry group . Indeed , th e actio n (1 4.2 ) o f G = 5L(2,R ) o n H I by linea r fractiona l
transformations
^ ^ az + b fa c
(lb.6)
vJ gz
u = - , ro ra =7
cz + du \c d
is a n actio n b y isometries : thi s follow s directl y fro m th e formul a abov e an d fro m
(15.4) d{gz) = (cz + d)~ 2dz.
Hence G C Isom(HI) , th e isometr y grou p o f HI , wit h th e cente r o f G actin g
trivially. Actually , PSL(2,R ) = SX(2,R)/{±1 } i s th e ful l grou p o f orientation -
preserving isometrie s o f HI , an d Isom(HI ) i s generate d b y G an d th e reflectio n a :
z ^ > —~z.
Using thi s larg e isometr y group , man y geometri c propertie s o f H ca n b e estab -
lished fairl y quickly :
(1) Angle s i n H ar e th e sam e a s euclidea n angle s (simpl y becaus e th e metri c
is conforma l t o th e euclidea n metri c |dz| 2 ).
(2) Th e hyperboli c geodesie s i n HI are the half-circle s (i n the euclidea n sense )
orthogonal t o th e "boundary " MUoo , an d th e vertica l line s R e (z) = constant . I n
particular, suc h a circl e o r lin e i s transforme d int o anothe r b y g G G.
(3) A geodesi c circl e o f cente r w, {z G HI | d(z,w) = r } , is als o a euclidea n
circle (wit h differen t cente r an d radius) .
(4) Th e distanc e (i n th e hyperboli c metric ) i s explicitly give n b y
\z-w\ + \z-w\
d(z, w) = lo g = ^ -,
15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S 385
but i t i s actuall y ofte n mor e convenien t t o wor k wit h th e functio n u(z, w) give n b y
(15.5) u(z,w) =»Mz,v,)) = 4 ^-)£H
where u : R +— > R + i s th e functio n o f th e distanc e
(15.6) u{d) = i(cosh d - 1 ) = 2(sin h - ) .
REMARKS. (1 ) On e ha s t o b e carefu l tha t th e topolog y o f H I induced b y th e
Riemannian metri c i s differen t fro m it s (euclidean ) topolog y a s a subse t o f C . Fo r
instance, H I is complet e wit h th e Riemannia n metric , bu t o f cours e no t wit h th e
euclidean one .
(2) Notic e tha t th e compac t subgrou p K i s th e stabilize r o f i G HI, hence th e
isomorphism G/K ~ H I described abov e i s simpl y th e ma p g i-> gi o n G.
Recall tha t th e invarian t measur e associate d t o th e Poincar e metri c i s
(15.7) dfi{z) — y~ 2dxdy
(see (1 4.4)) .
When w e com e t o harmoni c analysi s o n quotient s o f H I by discret e subgroups ,
the congruenc e subgrou p r 0 (#)\HI i n particular , se e Sectio n 1 4.1 , thinking o f th e
abelian cas e o f R an d it s ow n discret e subgroups , man y ne w phenomen a occur .
First, an y discret e subgrou p o f R i s necessaril y o f th e for m aL wit h a 6 R , an d
they ar e therefor e basicall y "al l th e same" . Thi s similarit y allow s us , fo r instance ,
to deduc e quickl y a Poisso n summatio n formul a fo r a n arithmeti c progressio n fro m
the on e fo r Z (se e Sectio n 4.3) . O n th e othe r hand , ther e ar e ver y subtl e ques -
tions concernin g harmoni c analysi s o n ro(#)\H I (simila r t o thei r complex-analyti c
differences mentione d i n Chapte r 1 4) .
15.2. Th e laplacia n o n H .
Harmonic analysi s o n H I is associate d wit h th e hyperboli c Laplac e operator ,
which i s give n b y
/ B 2 ft 2
\
This i s a G-invarian t differentia l operator , i.e . fo r an y functio n / whic h i s twic e
differentiable w e hav e
A ( / | g) = A / .
Here an d hereafte r th e actio n o n th e righ t o f G o n function s i s o f cours e give n b y
(/ I g)(z) = f(gz).
There ar e man y way s t o analyz e A o n M , depending o n whic h coordinate s ar e
chosen. Usin g rectangula r coordinate s z = x + iy an d workin g b y separatio n o f
variables, on e consider s function s / o n H I which ar e periodi c o f perio d on e i n th e
variable x\ the n / ha s a Fourie r expansio n
(15.9) /(* ) = £/n(2/)e( r
38 6 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
with coefficient s
(15.10) fn(y)= f f{x + iy)e(-nx)dx.
Jo
Applying th e Laplac e operato r on e get s
2 2
Af(z) =y*J2 (4n n fn(y) - %{y))e(nx).
nez
Of specia l concer n ar e th e eigenfunction s o f A , th e function s / suc h tha t A / = Xf
for som e A E C . I t i s bes t t o writ e th e eigenvalu e A in term s o f a paramete r s
such tha t A = s( l — s). Becaus e A i s a n ellipti c differentia l operator , an y suc h / i s
automatically real-analyti c o n H .
Continuing wit h a functio n / periodi c i n x , w e se e tha t i t i s a n eigenfunctio n
with eigenvalu e A i f and onl y i f f n i s a solution t o th e ordinar y differentia l equatio n
r + ( A _ w )/ = o.
For n = 0 all solutions ar e linea r combination s o f y s an d y l~s (replac e y l~s b y
l 2
y / \ogy i f s = 1 /2) . The n fo r n ^ 1 , tw o linearl y independen t solution s t o th e
equation ar e give n b y standar d Besse l function s
1 /2
fn(y) = 2y K1
s_ /2(27r\n\y)
and
fn(y) = 2y 1 '2I13_ /2(27r\n\y)
which ar e distinguishe d b y thei r asymptoti c behavio r a s y — > -foo . Thi s lead s t o
LEMMA 1 5.1 . Let f : M—>> C be an eigenfunction of A which is l-periodic in
x, with eigenvalue X = s( l — s), and suppose f satisfies the growth condition
f(z) = o(e 27ry ), as y -+ +oo.
Then there exist complex numbers a 7 b, and a n for n ^ 0 , such that
f(z) = ay s + by 1 '8 + y 1 '2 ^ a nKs_ /2(2n\n\y)e(nx)
1
(if s — 1/2, replace y l~s by y l 2
/ \ogy).
In particular , thi s explici t Fourie r expansio n implie s tha t / i s in fact o f at mos t
polynomial growth ; precisely , w e hav e
/(2)«j/ff + j /
i
-
for y —> +oo , wher e a = R e (s) a s usual .
15.3. Automorphi c function s an d forms .
We no w conside r a discret e subgrou p o f G fo r whic h Vo l ( r \ H) < +o c (suc h
subgroups ar e calle d Fuchsia n group s o f th e first kind) . O f particula r importanc e
are th e Heck e group s To(q) fo r q ^ 1 .
We defin e differen t space s o f T-periodi c function s o n H :
^ ( r \ M ) = { / : H - + C | / | 7 = / , for al l 7 eT}
15. S P E C T R A L T H E O R Y O F A U T O M O R P H I C F O R M S 38 7
(the spac e o f automorphi c functions) ,
L2(r\H) = {/ G A(r\u) | ll/ H = J \f(z)\ 2dn(z) < +^}
(the spac e o f square-integrabl e automorphi c functions) ,
AS(T\M) = { / e A(T\M) I A/ = 5( 1 - s)f}
(the spac e o f automorphi c forms , o r Maas s forms , wit h eigenvalu e A = s( l — s)).
Let a be any cusp of T and a a a scaling matrix as in (1 4.5) . Fo r any / G L 2 ( r \ H )
the functio n f a = f | a a i s thu s 1 -periodi c i n x, an d ha s a Fourie r expansio n a s
in (1 5.9) . W e defin e furthe r
2
Lg(r\H) = {fe L (T\M) | / fli0 - 0 fo r an y cus p a }
(the spac e o f cuspida l automorphi c functions) ,
A°8(T\E) = L 20(T\M) n A S(T\U)
(the spac e o f cus p form s wit h eigenvalu e 5( 1 — 5)) .
Note, i n particular , tha t an y automorphi c for m i n L 2 ( r \ H ) (o r wit h moderat e
growth) admit s a Fourie r expansio n a t th e cus p 0 0 a s i n Lemm a 1 5.1 .
Our firs t goa l wil l b e t o describ e ho w t o decompos e element s o f L 2 (r\M) i n
"spectral" terms , simila r t o th e Fourie r decompositio n (serie s o r integrals ) o n R/ Z
or R . Thi s wil l b e don e i n term s o f th e eigenvalue s o f th e laplacia n actin g o n
automorphic functions .
We defin e th e domai n V o f A t o b e
V = { / G L2 ( r \ H ) I / an d A / ar e o f clas s C°° an d bounded} ,
thereby turnin g A int o a n unbounde d linea r operato r o n L 2 ( r \ H ) , define d o n th e
dense domai n V. Usin g Stoke' s formula , i t i s show n tha t A i s symmetri c an d
positive, an d b y general functiona l analysis , i t therefor e ha s a self-adjoint extensio n
to al l o f L 2 (r\M).
If the quotien t r \ H wer e compact , the n th e genera l Hilber t spac e theory woul d
imply tha t A ha s pur e poin t spectrum , an d tha t L 2 ( r \ H ) i s spanned b y eigenfunc -
tions o f A , a s i s th e cas e wit h R/Z . Du e t o th e lac k o f compactness , thing s ar e
more complicate d becaus e a continuou s spectru m exist s but , o n th e othe r hand ,
the presenc e o f the cusp s give s rise to Fourie r expansions , whic h ar e powerfu l tools .
15.4. Th e continuou s spectrum .
We start th e spectra l decompositio n b y constructing a space o f "obvious " auto -
morphic functions , namel y thos e constructe d b y th e familia r averagin g technique .
More precisely , fo r a tes t functio n ip : R +— > C whic h i s smoot h wit h compac t
support an d a cus p a , w e defin e th e incomplet e Eisenstei n serie s
(15.11) E a(z^)= ^2 ^(Ima-Sz )
7er a \r
which i s obviousl y i n L 2 (r\M) sinc e ip has compac t support . O f course , fo r th e
same reason , E a(',ip) ca n neve r b e a n eigenfunctio n o f A . However , usin g Melli n
388 15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S
inversion, w e find tha t
(15.12) E a(z, V ) = ^- J E a(z, s)^(s)ds
where a > 1 , - 0 is th e Melli n transfor m o f ip and E a(-,s) i s the Eisenstei n serie s
Ea(z,s) = Yl (IHKT-S^) -
7er a \r
the serie s bein g absolutel y convergent , uniforml y o n compac t sets , fo r R e (s) > 1 .
Because A(I m z) s = s( l — s)Im (z) , the Eisenstein serie s E a(z, s) i s an eigenfunctio n
of A , bu t unfortunatel y i t i s not squar e integrabl e s o it canno t b e use d directl y fo r
the spectra l decomposition .
We define a new space £ ( r \ H) t o be the closure in L 2 (T\H) o f the space spanne d
by th e incomplet e Eisenstei n series . Clearly , A act s o n £ ( r \ H ) . Th e solutio n t o
the spectra l decompositio n i n thi s subspac e lie s i n th e meromorphi c continuatio n
of the Eisenstei n serie s (wit h respec t t o th e s- variable) t o th e whol e complex plane ,
which wa s prove d b y Selberg . I n th e cas e o f To(q), th e analyti c continuatio n ca n
be see n quit e easil y b y computin g explicitl y it s Fourie r expansio n a t th e cus p oo .
Let a , b b e tw o (no t necessaril y distinct ) cusps . The n E a(-,s) ha s a Fourie r
expansion a t th e cus p b , which w e writ e
Ea(abz, s) = S aibys + <p a,b?/1 _s + y 1 /2 Yl <Pa,b{n, s)K s_1 /2{27r\n\y)e(nx).
The squar e matri x o f constan t term s <£(s ) = ((f a,b(s))a,b i s calle d th e scatterin g
matrix o f T .
In th e cas e o f T = 5L(2 , Z), the Fourie r coefficient s ar e (w e drop th e subscript s
a, b since ther e i s onl y on e cusp )
r ( s - i/2 ) c(2 s - i )
<p(s) = \J-K- T(s) C(2a ) '
-1/2
ab=\n\
and th e meromorphi c continuatio n ca n indee d b e see n fro m tha t o f th e Rieman n
zeta functio n ((s). I n thi s cas e th e ful l Fourie r expansio n o f E(z, s) i s
(15.13)
oo
E(z, s) = 0(s)y s + 0( 1 - s)y 1 -s + V^ r s_ /2(n)K1
1 s_ /2(27rny) cos(2^nx) .
l
For To(q) i n general , simila r expression s hol d i n term s o f Dirichle t L-functions .
Let R e (s) ^ 1 /2 . I t i s easil y see n tha t i n thi s regio n E(z, s) ha s onl y a simpl e
pole at s = 1 with constant residu e V~ l, wher e V = Vo l ( r \ H ). Fo r general fuchsia n
groups o f th e first kind , ther e migh t b e othe r pole s i n th e half-plan e R e (s) ^ 1 /2 ,
although onl y a finite number , al l i n th e interva l 1 / 2 < s ^ 1 , non e o n th e lin e
Re (5) = 1 /2 . Al l residue s ar e i n L 2 (r\EI) an d on e denote s 7^(T\H ) th e subspac e
that the y span ; thi s i s calle d th e spac e o f residua l spectrum .
15. S P E C T R A L T H E O R Y O F A U T O M O R P H I C F O R M S 38 9
For To(q), on e furthe r see s that s = 1 is the onl y pol e o f E(z, s) , with constan t
residue
Vq = Vol(r 0 (g)\H) = | g j ] ( 1 +p' 1 ).
p\q
Using th e self-adjointnes s o f A , on e ca n prov e th e functiona l equatio n o f the Eisen -
stein serie s
E(z,s) = $(s)E(z,l-s)
where E(z,s) i s th e colum n vecto r (E a(z,s))a; correspondingly , fo r th e scatterin g
matrix w e hav e
$ ( s ) $ ( l - s ) =Id .
One show s tha t th e orthogona l complemen t t o 7£(r\H ) i n £ (r\M) ha s continuou s
spectrum o f multiplicity equa l to the numbe r o f cusps, describe d b y the eigenpacke t
of Eisenstei n serie s fo r R e (s) = 1 /2 . Fo r To(g) , w e therefore derive :
THEOREM 1 5.2 . Let f e £(T 0(q)\E). Then f has the expansion
E
(15.14) f(z) = ±- I f(z)dfi(z) + J2 J- I </ > ^ \ + it))E a(z, \ + it)dt
where V q — Vo l (ro(g)\lH). This formula is valid in L 2 sense, and the integrals are
absolutely convergent point-wise and uniformly on compact sets if f G V.
The mai n poin t i n th e proo f i s that , whil e E a(-,s) i s no t square-integrabl e
(because o f ter m S a^ys + ip a^(s)y1 ~s i n th e Fourie r expansio n a t th e cus p b) , i t
is almos t s o whe n R e (s) = 1 /2 . Indeed , on e ca n sho w tha t E a(a^z,s) < C y/y
when y —> • oo , an d tha t th e extr a integratio n i n t i n (1 5.1 4 ) save s a facto r logy,
which i s sufficien t t o tur n th e right-han d sid e int o a n L 2 function . O n th e othe r
hand, on e migh t b e tempte d t o sa y tha t (1 5.1 2 ) o n th e lin e R e (s) = 1 / 2 give s th e
spectral decomposition , bu t i t i s no t tru e becaus e th e coefficien t ip(s) i s no t th e
inner produc t (£^ a(-,,0), Ea(-, s)) a s i t shoul d b e (sinc e (E a(-,s),Ea(-,s)) diverges ,
multiplying b y th e Eisenstei n serie s an d integratin g i s no t permitte d i n (1 5.1 2) . I t
is necessar y t o appea l t o th e functiona l equatio n t o ge t th e prope r formul a whic h
(despite what (1 5.1 2 ) migh t suggest ) show s that i t i s necessary t o us e the Eisenstei n
series a t al l cusp s b to ge t th e expressio n o f the incomplet e Eisenstei n serie s a t th e
cusp a (se e [1 5] , 22.1-22.3).
In th e cas e o f a genera l subgrou p T , th e Eisenstei n serie s remai n i n man y
respects rathe r mysteriou s objects , bu t the y ar e comparativel y les s s o fo r To(q),
where th e explici t Fourie r expansio n yield s crucia l estimate s i n a straightforwar d
way, wit h th e consequenc e that , fo r th e application s i n thi s book , th e continuou s
spectrum wil l b e easil y manageable .
15.5. Th e discret e spectrum .
We no w tur n ou r attentio n toward s th e complemen t o f th e spac e £ ( r \ H ) .
LEMMA 1 5.3 . The orthogonal complement of £(T\E) is the space L\ ( r \ H ) of
cuspidal automorphic functions.
This follow s b y a simpl e computation , whic h w e write dow n completel y t o giv e
a feelin g o f th e whol e theory : w e simpl y comput e th e scala r produc t (/ , E a(-, ip))
390 15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S
of a n L 2 -automorphic functio n agains t a n incomplet e Eisenstei n serie s
(f,Ea(.^)) = [ f(z)E a(z,'ip)dfi(z)
JF
= I f(z) X ) ^(Ima-S*)^* )
F
7^r a \r
1
Ef Ff{^{lma-
1 z)d^z)
JF
ieTa\T
which b y automorph y o f / transform s t o
(f,Ea(-,il>))= Yl I f{<r azj$(z)dn{z)
1/ >+ 00
2
/ f((J az)^(z)y~ dydx
/o Jo
since th e set s o~ x^F ar e disjoin t an d mak e a partitio n o f th e stri p 0 < R e (z) < 1
(see (1 4.5 ) o r thin k o f a = oc) .
This w e can no w writ e a s
2
(15.15) (f,E a(.,r(>))= / faAy)Hv)y~ dy
Jo
where f a,o(y) i s the constant ter m i n the Fourier expansio n o f / a t a , an d the lemm a
is no w clear .
Again, A act s o n the spac e LQ(T\IH[) . Bu t no w on e ca n sho w that th e resolven t
of th e laplacia n i s a compac t operato r o n Lo(r\H) . Fro m thi s i t readil y follow s
that A ha s pur e poin t spectru m o n Lo(r\H ) an d thu s thi s spac e i s spanne d b y
eigenfunctions o f A , whic h ar e calle d Maas s cus p forms .
Because A i s self-adjoint, th e eigenvalue s o f A mus t als o b e non-negative , an d
indeed the y ar e positiv e sinc e th e eigenvalu e 0 correspond s t o bounde d harmoni c
functions o n H , whic h ar e necessaril y constant . I n term s o f s thi s mean s tha t
either 1 / 2 < s ^ 1 , o r R e (s) = 1 /2 . Ther e ca n b e onl y finitel y man y eigenvalue s
A = 5( 1 — 5 ) wit h 1 / 2 < s ^ 1 , an d the y ar e calle d exceptiona l eigenvalue s fo r
T. Althoug h i t i s fairl y eas y t o construc t fuchsia n group s havin g arbitraril y man y
exceptional eigenvalues , Selber g conjecture d tha t the y d o no t exis t fo r congruenc e
(arithmetic) groups .
SELBERG'S EIGENVALU E CONJECTURE . For any q > 1 , we have
where Xi is the smallest eigenvalue of A acting on LQ(r 0(q)\lHI).
Further, Selber g prove d
THEOREM 1 5.4 . For any q ^ 1 , we have A i ^ ^_ .
This ha s bee n improved , a s mentione d i n Sectio n 5.1 1 , the bes t resul t t o dat e
being A i ^ 975/409 6 b y Ki m an d Sarna k [KiS] .
As wil l appea r clearl y i n Section s 1 5. 8 an d 1 5.9 , exceptiona l eigenvalue s ar e
somewhat analogou s t o exceptiona l zero s o f Dirichle t L-function s (se e (5.51 )) , an d
15. S P E C T R A L T H E O R Y O F A U T O M O R P H I C F O R M S 391
too man y o f the m ca n compromis e analyti c estimates . However , Theore m 1 5. 4 i s
analogous t o a zero-free stri p an d i s thus muc h stronge r tha n wha t i s known i n thi s
case. Th e mor e fruitfu l analog y i s wit h th e Ramanujan-Petersso n conjecture , an d
in representatio n theoreti c term s ther e i s a commo n formulation .
For genera l T , cus p form s ar e shroude d i n mystery ; indeed , i t i s b y n o mean s
clear tha t the y mus t exis t (se e [PS]) , excep t i n certai n case s wher e thi s i s ensure d
by symmetr y (fo r instance , i f th e reflectio n z —
i > • — ~z descend s t o th e quotient , od d
functions mus t com e fro m od d cus p form s sinc e al l Eisenstei n serie s ar e even) . Fo r
congruence subgroups , however , i t wa s show n b y Selber g usin g th e trac e formul a
that no t onl y d o cus p form s exist , bu t the y ar e i n a certai n sens e "dominating "
over th e Eisenstei n series . Thei r presenc e i s measure d b y th e countin g functio n
Nroiq)(T) = \{j | M<T} |
which i s shown t o satisf y th e Wey l La w
iVr„M(T) = v ° " r 4 f w ^ + 0(^rio S,r)
for T ^ 2 , wher e th e constan t i s absolute . Henc e ther e ar e man y eigenvalues ,
on averag e abou t qT betwee n T an d T - f 1 , bu t th e proo f i s indirec t an d doesn' t
produce an y o f the correspondin g cus p forms !
15.6. Spectra l decompositio n an d automorphi c kernels .
From th e tw o previou s section s w e ca n deduc e th e ful l spectra l decompositio n
theorem fo r th e automorphi c laplacia n o n ro(tf)\HI .
THEOREM 1 5.5 . Let u 0, ui, U2, ... be an orthonormal basis of the residual
_i
and cuspidal spaces, i.e. u^ — constant = Yol(To(q)\M) 2 G 1 Z(To(q)\H.) with
eigenvalue A o = 0 and Uj G .4^.(ro(g)\IH) ; with eigenvalue Xj = Sj(l — Sj) for
j = 1 , 2, . . . . Then any f G L2(To(q)\H.) has the spectral decomposition
(15.16) f(z) = Y </ , u3)u3(z) + Y 7- j </ > E *('> \ + **)>£<*(* , \ + tydt
valid in L 2 sense, and also converging absolutely and uniformly on compact sets if
f G V. Moreover, the Parseval formula holds
(15.17) ll/ f = ]T |(/,^-)| 2 + Y, T I K/> E*('> \ + it ))\2dt
We wil l no w appl y Theore m 1 5. 5 t o ge t a kin d o f Poisso n summatio n formul a
for quotient s o f th e hyperboli c plane . Fo r thi s w e conside r a kerne l functio n i n
two variables 1 fc : H I x H I -> C , sa y continuou s an d compactl y supported , whic h
is a "point-pai r invariant" , i.e . k(gz,gw) — k(z,w) fo r al l g G G. Thi s mean s
that k(z^w) depend s onl y o n th e distanc e d(z,w). W e expres s thi s propert y b y
saying tha t ther e exist s anothe r functio n i n on e variabl e k : R +— > C suc h tha t
k(z,w) = k(u(z,w)) wher e u i s the functio n i n (1 5.5) .
The secon d variabl e i s usefu l becaus e ther e i s n o privilege d basepoint .
392 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
Associated t o k is an invariant integra l operato r L acting o n functions o n HI by
Lf(z)= [ f(w)k(z,w)dn(w)
and, fro m th e assumption tha t k depends only on the distance, it follows that L is G-
invariant. I n particular, L map s automorphi c function s t o automorphic functions ,
and indee d fo r / G «4(ro(^)\H) whic h i s bounded w e have anothe r formul a
Lf(z) = J f(w)K(z,w)dn(w).
where th e kernel K i s defined by
(15.18) K(z,w)= Yl k(z^w).
jer0(q)/{±i}
The ne w kerne l K i s automorphic i n both variables : indeed , i n w by the averaging
construction, an d in z becaus e k is symmetric.
Applying th e spectral decompositio n i n the z-variable give s th e expression
K(z,w) = ^2(K(^w),Uj)uj(z)
+ Y, J " / W" > W )>E*('> \ + it))Ea(z, \ + it)dt
where th e coefficients (K(-,w),Uj) (respectivel y (K(-,w),E a(-^ + it))) ar e again
automorphic an d can be also expande d similarly . However , th e next lemm a show s
that the y ar e simply proportiona l t o the cusp form s Uj (respectivel y th e Eisenstein
series).
LEMMA 1 5.6 . Let k : R +— > C be smooth and compactly supported, k(z )w) =
k(u(z,w)) the associated kernel and L the integral operator as above. If f : H—> C
(not necessarily automorphic) is any eigenfunction of the laplacian with eigenvalue
A = 5( 1 — 5) , 5 = 1 / 2 + it, t G C, then f is also an eigenfunction of the integral
operator L, with an eigenvalue which depends on A and L, but not on f. More
precisely, we have
where A = h(t), the transform k \-> h being obtained by the following steps:
r+oc
q(v) = / k(u)(u — v)~ l'2du,
JV
g(r)=2q[(Smhr-)2),
irt
h(t) = f g(r)e dt.
15. S P E C T R A L T H E O R Y O F A U T O M O R P H I C F O R M S 393
For a proof, se e e.g. [1 5 , Th. 1 .1 6] . Th e transfor m abov e is called th e Harish -
Chandra/Selberg transfor m fo r 5L(2,R). I t can b e inverted by
irt
9(r) = - ^ / h(t)e~ dt,
1
i r+oc
1 /2
k(u) = / (v-u)- dq(v),
sufficient condition s fo r h being
( h{t) = h(-t),
(15.19) I h is holomorphic i n |Im (t)\ < \ + 5,
2 s
[h(t)^(\t\ + ir -
for som e 5 > 0 . Thes e condition s ar e in fac t sufficien t fo r th e analysis o f the
automorphic kerne l K(z, w), i.e . th e restrictio n tha t k has compac t suppor t ca n be
relaxed.
We deduc e fro m thi s th e spectral decompositio n whic h wa s our goal.
T H E O R E M 1 5.7 . Let h be a function satisfying the conditions (1 5.1 9), k the
Harish-Chandra/Selberg transform of h and K the automorphic kernel on r 0 ( g )
associated to k. Then the following expansion holds
K(z,w) = Y^ k{u(z,~/w)) = ^h{tj)u j{z)uj{w)
ier0(q)/{±i} j7*o
+ E — - / h(t)E a{z, \ + it)Ea(w, \ + it)dt
where the convergence is absolute and uniform on compact sets.
It i s importan t t o hav e som e contro l o n the growth o f the eigenvalues and
eigenfunctions i n this formula . Th e followin g i s often sufficien t fo r first estimations .
PROPOSITION 1 5.8 . Let T ^ 1 and z e H . We have
E M*)i 2
+ E^ / T
\ E*(z> \ + «)i2 * « T2
+ T y(z)
J
\t3\<T a * ~T
where the implied constant depends on V alone and
y{z) = max maxim (a" 1 72:).
a jer
For th e proof, se e [15], Section 7.2.
15.7. Th e Selberg trac e formula .
In Sectio n 1 5.6 , w e have obtaine d th e spectral expansio n o f an automorphi c
kernel, an d in doin g so , we have see n tha t i t i s usefu l t o conside r th e invarian t
integral operato r L associate d t o it, namely
Lf(z) = J f(w)K(z,w)d fi(w)
394 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
where K i s give n b y (1 5.1 8) . Anothe r formula , whic h i s als o a generalizatio n o f
the Poisso n summatio n formula , arise s whe n on e compute s th e trac e o f L , usin g
the spectra l expansio n o f K o n th e on e hand , an d it s definitio n o n th e othe r hand :
this i s the celebrate d Selber g trac e formula . O f course , th e correc t definitio n o f th e
trace o f an operato r o n a n infinit e dimensiona l spac e i s already a problem, bu t her e
we take a pragmatic approac h an d defin e th e trac e o f L (o r o f K) t o b e the integra l
of th e kerne l o n th e diagona l (whic h i s well-defined becaus e ou r kernel s ar e alway s
at leas t continuous) :
(15.20) TrK= f K(z,z)dfi(z).
If there were no Eisenstein serie s in Theorem 1 5.7 , the spectral expansion woul d
yield immediatel y
since (UJ) is a n orthonorma l basis , an d w e would obtai n th e pre-trac e formul a
jF
j ier
The right-han d sid e is still no t ver y explicit. T o go further, Selber g partitione d
the su m ove r 7 i n such a way that th e partia l sum s ar e computable. Thi s i s done by
dividing th e grou p int o it s conjugac y classe s (tha t thos e ar e non-trivia l i s a featur e
of th e non-commutativit y o f T). Fo r on e conjugac y clas s C , sa y o f 7 G T, w e hav e
C = {r~ V \ reT}
which i s i n bijectio n wit h Z(7)\T , wher e Z("Y) is the centralize r o f 7 i n T ,
Z(j) = { r G 7 I 7T = rj}
so th e contributio n o f th e clas s C t o th e su m i s equa l t o
l
E k(u(r- ^TZ,z)) = E k{u(p(Tz,Tz))
rGZ(7)\r rGZ( 7 )\r
which, b y invariance , give s th e contributio n
T r c X = / k{u{nz,z))dii(z)
JZ(~f)\M
to the trac e o f K. Th e advantag e w e gain in performing thi s summation ove r 7 G C
is tha t th e abov e expressio n fo r T r c K no w depend s o n th e conjugac y clas s o f 7
in ST(2,R) , no t i n I \ Henc e i n furthe r computation s w e can replac e 7 b y it s mos t
standard representativ e T~ 1 JT fo r suitabl e r i n G. Suc h classe s hav e geometri c
meaning, an d th e fac t tha t 7 i s a n elemen t o f T wil l translat e thi s meanin g int o
information abou t geometri c dat a o f th e quotien t spac e r \ H .
We first describ e th e classificatio n o f th e conjugac y classe s i n G = 5L(2,R )
using th e geometri c actio n o f G o n I , namel y b y considerin g th e fixed point s o f
the isometr y o f H I induced b y a n elemen t g G G. Thi s give s informatio n abou t
conjugacy classes , becaus e i f z i s fixed b y g, the n TZ i s fixed b y th e conjugat e
r~1gr o f g. Becaus e th e grou p i s ver y symmetric , th e fixed point s wil l b e enoug h
to classif y g up t o conjugacy .
15. S P E C T R A L T H E O R Y O F A U T O M O R P H I C F O R M S 395
Let g e G, 9 = [ , I , # 7 ^ ± 1 , actin g non-triviall y o n i U Q U 00. Th e
equation gz — z i s a quadrati c equatio n cz 2 - f (d — a)z — b = 0 , wit h discriminan t
A = (d — a) 2 + Abe = (Tig) 2 — 4 , an d ther e ar e thre e cases .
• Firs t case : \Tr g\ > 2 (g is called hyperbolic) . The n A > 0 , an d ther e ar e tw o
distinct fixed point s i n R (th e cas e c — 0, a ^ 1 also belong s t o thi s case , bu t on e
of the fixed point s i s 00) . Ther e exist s r G G which send s on e o f the fixed point s t o
0 an d th e othe r t o 00 ; the n b y th e observatio n abov e th e conjugat e r~ xgr fixes 0
and 00 , an d clearl y (changin g r int o r _ 1 i f necessary) , w e find tha t g i s conjugat e
to a uniqu e elemen t o f th e for m
(15-21) a(p ) = ±^ 1 ()/2 p _1
° /2)
for som e p > 1 , which act s geometricall y a s a dilatio n z ^ pz. W e cal l p th e nor m
of g an d writ e Ng = p; th e nor m classifie s exactl y a hyperboli c conjugac y class .
Notice tha t p ca n b e recovere d fro m g b y solvin g th e equatio n z + z~ x = Tr g an d
taking th e squar e roo t o f th e solutio n whic h i s large r tha n 1 .
Associated t o a hyperboli c motio n i s th e geodesi c 7 ^ i n H whic h i s th e half -
circle perpendicula r t o R a t th e tw o fixed points , o r i n th e cas e c = 0 , th e vertica l
line fro m th e rea l fixed poin t t o 00 . Thi s geodesi c i s preserved b y g (thi s i s obviou s
for a(p) , an d follow s b y conjugatio n i n th e genera l case) . Not e tha t th e nor m o f
g ca n b e recovere d b y log Ng = d(z,gz) fo r z G jg (g act s a s a translatio n o n th e
geodesic).
Conversely, t o ever y geodesi c 7 i n H , on e ca n associat e th e isotrop y subgrou p
T 7 i n G, containin g element s which preserv e 7. Thi s is generated b y ± 1, an elemen t
of orde r 2 whic h interchange s th e end-point s o f 7 i n R U {00}, an d th e subgrou p
of thos e element s fixing th e end-point s (whic h i s isomorphi c t o th e multiplicativ e
group o f positiv e rea l numbers , an d s o to R b y th e logarith m map) . Al l these fact s
follow b y conjugatio n t o th e cas e whe n th e geodesi c i s th e imaginar y axis , an d T 7
is generate d b y th e involutio n z1— > — z~x an d th e grou p o f element s a(p) above .
Let agai n g be hyperbolic . Th e centralize r Z(g) o f g in G is a subgroup o f T 7g ;
one check s tha t th e involutio n i s no t i n it , bu t th e othe r subgrou p is , s o Z(g) i s
isomorphic t o R (time s ± 1 , of course) .
• Secon d case : \Trg\ = 2 (g i s calle d a paraboli c motion) . The n A = 0 , s o g
has a uniqu e fixed poin t a , a G R i f c ^ 0 , an d a = 0 0 i f c = 0 . A s before , som e r
sends a t o 0 0 an d on e see s thi s wa y tha t g i s conjugat e t o a uniqu e elemen t
1x
(15.22) n(x) = ±
01
for som e x G R, whic h act s a s a horizonta l translatio n z i- > z + x. Th e paramete r
x classifie s th e paraboli c conjugac y classes . Th e grou p N = {n(x) \ x G R} i s th e
isotropy grou p o f 0 0 i n G. an d i t i s th e centralize r i n G o f an y o f it s elements . B y
conjugation, th e centralize r o f a paraboli c elemen t i s thu s isomorphi c t o R .
• Thir d case : |Trg | < 2 (g i s calle d a n ellipti c motion) . The n A < 0 , s o th e
quadratic equatio n ha s tw o conjugat e comple x roots , on e o f which , sa y z g, i s i n
M. Movin g z g t o i b y a n elemen t r G G , a shor t computatio n reveal s tha t g i s
396 15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S
conjugate t o a uniqu e elemen t
(15.23) r{0) = ±( COS.ea S i n ^
v v
' ' \ - s m 0 co s 0 J
with 0 G R/27rZ , whic h act s b y hyperboli c rotatio n o f angl e 0 around i. Th e grou p
K = {r{0) | 0 G R/27rZ}, isomorphi c t o R/27rZ , i s th e centralize r o f an y o f it s
(non-trivial) elements , an d th e centralize r o f an y ellipti c elemen t i s isomorphi c t o
K.
Now conside r a discret e subgrou p r o f G an d a n elemen t 7 G T. Le t Zri'j) b e
the centralize r o f 7 i n T ; i t i s o f cours e a subgrou p o f th e centralize r Z(j) o f 7 i n
G, an d i t i s a discret e subgroup . Whe n 7 i s hyperboli c o r paraboli c an d ^(7 ) i s
isomorphic t o M , thi s implie s tha t Zr(~f) i s infinit e cyclic , an d whe n 7 i s elliptic ,
this implie s tha t ^ ( 7 ) i s finite cyclic . I n al l cases , w e say tha t 7 (o r it s conjugac y
class i n T ) i s primitiv e whe n 7 i s itsel f a generato r o f th e centralize r Zri^f)- I t i s
clear tha t an y elemen t 7 i s o f th e for m ±7 ™ fo r som e intege r m ^ 0 and a uniqu e
primitive elemen t 70 . B y conjugatin g th e generato r 7 0 to on e of the standar d form s
described above , w e se e tha t th e actio n o f ^ r ( l ) o n H ha s th e followin g simpl e
fundamental domain , dependin g o n th e typ e o f 7 :
(1) I f 7 i s hyperbolic , 7 0 i s conjugat e t o (1 5.21 ) fo r p = N^ 0, actin g b y
z ^ pz, an d a fundamenta l domai n i s th e horizonta l stri p
Z r ( 7 ) \ H = {z = x + iy G M | 1 < y < p}.
(2) If 7 i s parabolic, 7 0 is conjugate t o (1 5.22 ) fo r som e x, an d a fundamenta l
domain i s th e vertica l stri p
Z r (7)\EI = {z = x + iy I 0 < x < 1}.
(3) I f 7 i s elliptic , 7 0 i s conjugat e t o (1 5.23 ) fo r som e 6. Le t w b e th e fixed
point o f 70 (th e center o f the rotation) ; a fundamental domai n fo r Zr(j) i s obtaine d
by takin g th e domai n betwee n tw o geodesie s goin g throug h w wit h a n angl e equa l
to 0 (by discreteness, notic e that 0 must b e of the for m 2TT/£ fo r som e integer £ ^ 1 ) .
In th e specia l cas e o f T — SX(2, Z), the paraboli c an d ellipti c conjugac y classe s
are ver y eas y t o describe . First , th e fixed poin t o f a paraboli c elemen t mus t b e
in Q U {00}. Bu t al l suc h point s ar e T-equivalent , b y Bezout' s theorem , an d sinc e
no = n(l ) clearl y generate s th e isotrop y grou p o f 0 0 i n 5X(2,Z) , w e find tha t it s
conjugacy clas s i s th e onl y paraboli c conjugac y clas s i n SX(2,Z) . Th e centralize r
ZY(JIQ) i s {n(x) I x G Z}, an d a fundamenta l domai n fo r Zr(no) i s th e vertica l
strip {z — x 4 - iy \ 0 < x < 1}.
By simpl e computations , on e finds als o tha t SX(2 , Z) contain s onl y tw o primi -
tive conjugac y classe s o f ellipti c elements , wit h representative s ( ~ x J , ( _ 1 0 ) o f
order 2 (a s a n isometry , bu t 4 as a matrix) , fixing i an d o f orde r 3 (a s a n isometry ,
but 6 a s a matrix) , fixing j , respectively.
EXERCISE 1 . Fin d th e primitive parabolic conjugac y classe s in r 0 (g), an d sho w
that the y correspon d t o th e se t o f cusp s a s describe d i n Sectio n 1 4.1 .
We com e bac k t o th e notatio n o f th e introductio n t o thi s section , wit h T a
discrete subgrou p suc h tha t th e quotien t r \ H ha s finite volume , an d ha s cusp s
(finitely many) . Th e lac k o f compactness create s som e analyti c difficulties , becaus e
15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S 397
it turn s ou t that th e kerne l i s not integrable o n the diagonal , s o the trace , as
we hav e define d it , is infinite. Thi s proble m i s solved b y considering truncate d
fundamental domain s F(Y) whic h ten d t o F fo r Y—) > +00, i n the sense tha t
IF(Y) f^ z)^^z) " ^ IF /( 2 )^/ i ( 2 ; ) f° r a n y integrabl e / . Th e kerne l K restricte d to
F(Y) i s no w integrabl e o n th e diagonal , an d on e ca n comput e th e truncate d trac e
TrY K= [ K{z,z)dfji(z)
JF(Y)
using th e spectra l expansio n fo r th e restrictio n o n th e on e hand , an d th e decom -
position int o conjugac y classe s o n th e othe r hand . O n th e spectra l side , thi s give s
for Y — > +0 0 a n expressio n o f th e for m T r K — A logy - h T\ + o(l), an d o n th e
geometric sid e another expressio n T r y K — A l o gF + T2 + o(l), fo r th e som e A > 0.
Hence w e have th e tautolog y A = A, an d th e non-trivia l formul a T\ =T 2, which i s
the trac e formula .
A smal l analytica l miracl e occur s whe n doin g th e exac t computations : th e
complicated Harish-Chandra/Selber g transfor m disappear s fro m th e picture , an d
all tha t i s left i s the functio n h on on e side , an d it s Fourier transfor m h on the
other side .
We no w giv e indication s o n th e shap e o f th e variou s parts , an d stat e th e com -
plete formula i n the specia l case T = 5T(2, Z). O n the spectra l side , the cus p form s
and residua l spectru m Uj contribute, a s i n th e cas e o f a compact quotien t
(15.24) £ > ( * ; )
3
and th e Eisenstei n serie s ar e show n t o contribut e
(15.25) M ^ ^ i ^ i . f +c°h{ty{i + lt)dt
where $(s) is the scatterin g matri x forme d b y th e first Fourie r coefficient s o f the
Eisenstein serie s an d tp(s) — det<3>(s) i s it s determinan t (se e Sectio n 1 5.4) .
The geometric computations ar e done independently fo r each type of (primitive )
conjugacy classes . Th e clas s o f th e identit y motio n ha s a contribution give n b y
(15.26) Voir\ H J th^t^h^dt.
A hyperboli c conjugac y clas s P with associate d primitiv e clas s PQ contribute s
A primitiv e ellipti c clas s R of orde r m (as a n isometry ) contribute s
+
^/ TT£\- 1 /* °° e- 2nt£/m
(15.28) £ (2msin- ) / ^—-^h^dt .
0<£<m J ~°°
Finally, al l primitiv e paraboli c conjugac y classe s contribut e th e sam e amount ,
which is
(15.29) ^ - ^ - { f e ( 0 ) l o g 2 - / $(l+it)h(t)dt}
398 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
s
where ijj(s) = jr( s) * ^n e logarithmic derivativ e o f the Gamma function .
For SL(2 , Z), afte r som e rearranging , an d taking int o accoun t th e descriptio n
of the cusps an d of the scattering matrix , w e obtain
T H E O R E M 1 5.9 . Let h be a function satisfying the conditions (1 5.1 9), h its
Fourier transform. Let (p(s) = 0(1 — s)6(s)~ 1 , where 0(s) = 7r~ sT(s)((2s), be the
scattering matrix o/SX(2,Z) . Then we have the formula
ft
(15.30) J2 (*j) + T " / — ( 2 + it)Wt)dt =
f h(t) (^ tanh(jrt ) - - ^ log 2 - i-tf>( l + it))dt
Z7r KZ7r J
P NP 2 - NP~2
hyperbolic
+E E (2 m sin -) ^^^KDdt.
elliptic
15.8. Hyperboli c lattic e poin t problems .
One o f the mos t direc t application s o f Theore m 1 5.7 , similar i n spiri t t o the
examples give n fo r th e Poisso n summatio n formul a an d th e Vorono i formul a i n
Chapter 4 , is to the study o f hyperbolic lattic e poin t problems .
Let D C I b e an open subset . Th e lattice poin t proble m fo r D i s the problem
of countin g th e number o f translate s o f a give n ZQ G HI by element s o f a discret e
subgroup T C G whic h ar e in D. Th e classical problem s o f Gaus s an d Dirichle t
already mentione d i n Chapte r 4 ar e exactl y analogue , takin g th e actio n o f the
lattice 1 ? on R2 b y translation an d D the points i n a circle or under a n hyperbola .
As i n these cases , analyti c numbe r theor y ca n have it s say only i f one considers a
sequence o f subsets gettin g larger , an d sufficiently regular .
We tak e a geodesi c bal l o f radiu s tendin g t o infinity , i n th e mor e convenien t
form
B(w, R) = {zeM | 4u(w, z)+2^R}
and loo k fo r good asymptotic s fo r the counting functio n
N(R) = \{7 eT\7w e B(w,R)}\.
As in the euclidean case , the expected valu e of N(R) i s the area of the geodesi c
ball, but whereas Gauss' s classica l argumen t (packin g with unit squares ) establishe s
the correspondin g resul t easil y wit h a fairly goo d erro r ter m R 1 /2 i n the euclidean
case, thi s simpl e argumen t i s very inefficien t i n the context o f hyperbolic geometry .
Indeed, th e error ter m i s proportiona l t o th e length o f the boundary o f the disc,
which i s small i n euclidean geometry , bu t is of the same orde r o f magnitude a s the
area i n the hyperbolic case . Thi s fac t follow s fro m th e isoperimetric inequality : i f
the boundar y dD o f D is smooth, A — Vo l (D) and L = length(cL4), the n w e have
4TTA + A 2 < L 2 , henc e L ^ A.
One ca n at leas t prov e elementar y uppe r bound s whic h ar e quite usefu l (i t is
used i n the proof o f Proposition 1 5. 8 and in Chapter 22).
15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S 399
PROPOSITION 1 5.1 0 . Let T be a discrete subgroup o/SL(2,R ) such that the
volume Vo l (r\M) is finite and let a be a cusp ofT.
(1) For any z e H we have
|{ 7 er a \r | im^-V > Y}\ < 11 0
cay
(2) For any z, w e H , an d J > 0 , we have
| { 7 € a- xTaa | u( 7 2, w) < S}\ < y ^ + T ) ( I m («, ) + c" 1 ) + * +
/ + 1
c a l m (it; )
u>z£/i an absolute implied constant.
Here we let
ca — min ^ c > 0 | I , G cr~1rcra /o r som e a , 6 , d I.
See [1 5] , Lemma 2.1 1 fo r th e proof .
Applying Theore m 1 5. 7 for suitabl e kernels , one can get a n asymptotic formula .
THEOREM 1 5.1 1 . Let T be a discrete subgroup o/SX(2,R ) with Vo l ( r \ H) <
+oo. LetI D G I . We have
2<5,<1
with an implied constant depending on T and z only.
SKETCH O F PROOF . W e conside r onl y T = T 0(q) a s before . I f k(u) ^ 0 is an y
function wit h k(u) ^ 1 for u < (R — 2)/4 , an d i f th e correspondin g automorphi c
kernel K(z,w) satisfie s th e assumption s o f Theore m 1 5.7 , we hav e b y positivit y
(15.31) N{R) ^ Y^ k{u(w, ~fw)) = 2K(w, w).
By Theore m 1 5.7 , we hav e
(15.32) K(w,w) = Th(t J)\uJ(w)\2 + T±- f h(t)\E a{w,\+it)\
2
dt
with notatio n a s i n th e theorem . W e take k(u) = 1 for u < (R — 2)/4 an d k(u) = 0
for u ^ (- R + S — 2)/4 , an d / c linear between .
We estimat e h(t) usin g Lemm a 1 5.6 , fo r a suitabl e functio n / o n E I which i s
eigenvalue o f A wit h A = \ + 1 2 . Fo r example , f(z) — y s yield s
s
h(t) = / k(i,z)y d/j,(z).
Ju
One finds first tha t
r(s + i)
for \ < s ^ 1 , wher e th e implie d constan t depend s o n s. Th e constan t eigenvalu e
for j — 0 yield s th e mai n ter m 7ri?(Vo l ( r \ H ) )_ 1, an d th e exceptiona l eigenvalue s
(if any ) wit h 0 < Xj < \ yiel d th e secon d term , u p t o th e erro r ter m 0(R 1 ^2 - f S).
1400 5 . SPECTRA L THEOR Y O F AUTOMORPHI C FORM S
Let T — RS~ 1 . Fo r Re (s) = |, integratin g b y parts yield s
5/2
h(t) < \s\- R^2(mm(\slT) + \ogR),
with a n absolute implie d constant . Henc e w e estimate th e Eisenstein serie s contri -
bution t o (1 5.32 ) a t a given cus p a by
/ h(t)\E a(w, \ + it)\2dt « R 1 '2 [ ^ (( 1 + tf + log R)Wa{^l^t)? dt
JR J-VT U + ^J
R/
I _( T + log^)'
0 V , , ^5 d t
l\t\>Vr^ ' ~ ~'( * + *)
1 /2
< (#T)
by integratio n b y part s usin g Propositio n 1 5.8 . Th e same boun d hold s fo r the
contribution o f the non-exceptional discret e spectru m b y partial summatio n fro m
Proposition 1 5.8 . Usin g thes e estimate s wit h 5 = it! 2/3, (1 5.31 ) an d (15.32) giv e
2<Sj<l
Similarly wit h k(u) = 1 for u < (R - S - 2)/ 4 an d k(u) = 0 for u ^ (R - 2)/4 , we
get th e lower boun d an d Theorem 1 5.1 1 is proved. •
R E M A R K . Not e tha t fo r T 0(q) w e have A i ^ § s o Sj < | b y (5.88) , s o th e
contribution o f exceptional eigenvalue s i s smaller tha n th e error term .
COROLLARY 1 5.1 2 . (1 ) Let Ni(X) be the number of integer solutions to the
equation ad — be = 1 with a 2 + b 2 + c 2 + d 2 ^ X. We have
2 3
N1(X) = 6X + 0(X / ).
(2) Let r(n) be the number of representations of n as a sum of two squares. We
have
J2 r(n)r(n + 1 ) = SX + 0{X 2/3).
PROOF. Not e first tha t fo r w = i, we have 4u(ji, i) + 2 = a 2 + 6 2 + c 2 -f <i2 for
any 71
=( . So (1) follows immediatel y fro m Theore m 1 5.1 1 for T — SX(2, Z)
and w = 2 , since Vo l (SX(2, Z)\H) = STT" 1 an d Ai = 91 .1 4.. . > \ fo r the modular
group.
For (2) , consider th e group
!•={(• 5 ) 6 5 « a , z , i » j { ^ ) } . ( ; - I)"'r0,2)(; ->
We hav e a + d = 2k, a — d = 2l, b Jre = 2m , 6 — c = 2n , wit h ad — be —
k2 - i 2 - m 2 + n 2 = 1 and a2 + 6 2 + c 2 + d 2 = 2(/c 2 + £ 2 + m 2 + n 2 ), s o Theorem
15.11 give s
J2 r(m)r{m + 1 ) = N(AX + 2) - 8 X + 0 ( X 2 / 3 ) .
a
15. SPECTRA L THEOR Y O F AUTOMORPHI C FORM S 401
15.9. Distributio n o f lengt h o f close d geodesie s an d clas s numbers .
The simples t applicatio n o f th e Selber g Trac e Formul a i s t o th e distributio n
of th e length s o f closed geodesie s o n the quotien t spac e X — r \ H, als o calle d th e
length spectrum. I n fact, close d geodesies are in one-to-one correspondenc e with the
hyperbolic conjugac y classe s of V. Indeed , i f g G T is an hyperbolic element , the n g
maps th e geodesic 7 (half-circle ) joinin g it s two fixed-points t o itself. I n particular ,
for an y 2 G 7 , th e geodesi c segmen t fro m z t o gz i s mappe d i n r \ H t o a close d
geodesic. Takin g th e model situatio n wher e g = a(p) wit h p > 1 as in (1 5.21 ) an d
z = i, observ e tha t th e length o f this close d geodesi c i s d(i,g(i)) — log p = log Ng.
Conjugates o f p, whic h hav e th e sam e fixed points , giv e ris e t o th e sam e close d
geodesic, an d conversely an y closed geodesi c i s obtained i n this way.
THEOREM 1 5.1 3 . Let T be a subgroup o/SL(2,R ) with Vol(r\H ) < +00 . We
have
8
Y^ logNP = X+ J2 s^X * +0(X 3'A)
where the sum is over primitive hyperbolic conjugacy classes and the implied con-
stant depends on T.
REMARKS. (1 ) For r wit h compac t quotient , thi s was proved b y Huber [Hub] .
(2) Th e exponent ca n be improved , se e e.g. Luo , Rudnick an d Sarna k [LRS] ,
where 3/ 4 i s replaced b y 7/1 0 + e fo r an y e > 0. Althoug h th e analog y wit h th e
Prime Numbe r Theore m suggest s tha t i t should b e X1/ / 2 + £ , an d the analogue of the
Riemann Hypothesi s holds , thi s i s yet unknown . Th e main difficult y i s that ther e
are man y mor e eigenvalue s tha n zero s o f the zeta function .
Also, a s i n Theore m 1 5.1 1 , i f T = r 0 (g), Selberg' s inequalit y A i ^ ^ mean s
that th e exceptional eigenvalue s hav e smalle r contributio n tha n th e error term .
SKETCH O F PROOF . Th e argument i s quite simila r i n principl e t o tha t o f the
previous theorem . W e choose h such tha t th e Fourier transfor m h localizes the sum
over hyperboli c conjugac y classe s t o thos e wit h NP ^ X + Y fo r som e paramete r
Y ^ 1 , say h(t) = 2 cosh(7rt)q(27rt) where q is a smoot h function , even , supporte d
on \t\ ^ log( X + Y) an d wit h 0 < q ^ 1 an d q = 1 on [-logX,logX] . Th e
contribution o f the discrete an d residual spectru m t o the spectral trac e is
Y,Htj) = X+ J2 s^X's+OiY + X^T)
1<s <
i2 3
with T = X 7 " 1 , an d th e sam e estimat e hold s fo r Eisenstei n serie s (t o se e this ,
estimate h b y Fourie r inversio n o n ^ < s ^ 1 an d o n R e (s) — 1 separately) .
The identity , ellipti c an d parabolic motion s hav e contribution s < C X1/2T b y simple
estimations, s o one gets
1
^9(log7VP)(log7VP)=X+ J2 ^7 X ^ + 0 ( y + X1/2T).
2<SJ<1
Subtracting thi s applie d a t X - f Y an d X, b y positivity, w e see that
J2 (log7VP)< F + X 1/2 T
X<NP<X+Y
402 15. S P E C T R A L THEOR Y O F AUTOMORPHI C FORM S
hence th e resul t b y choosin g Y = X 3 / / 4 . •
In [Sa2] , P . Sarna k deduce s th e followin g corollary .
T H E O R E M 1 5.1 4 . For any integer d > 1 such that d = 0, 1 (mod 4) , let h(d)
be the class number of Q(yd) and Ed be the fundamental unit. We have
2
] T M^)=Li(X ) + 0(X 3 / 2 (logX) 2 )
for X ^ 2, with an absolute implied constant.
P R O O F . Th e proo f depend s o n the followin g ma p betwee n indefinit e quadrati c
forms an d close d geodesies . Le t Q(x,y) — ax2 + bxy + cy2 b e a primitive integra l
indefinite quadrati c for m o f discriminant d > 0, i.e. (a , 6, c) = 1 and d = b 2 — 4ac >
0. B y linea r chang e o f variable , SL(2,Z ) act s o n suc h form s an d th e numbe r o f
equivalence classe s i s h(d).
The equation s Q(0, 1 ) = 0 has tw o rea l roots , 6\ = (—6 + Vd)/2 an d 62 =
(—6 — \/d)/2, t o which i s associated th e geodesi c i n HI joining # 1 an d #2 - The
stabilizer o f Q unde r th e actio n o f 5X(2,Z ) i s equa l t o th e stabilize r o f Q\ (or # 2 ),
and a generator i s give n explicitl y b y
n(n\ - f( to-bu0)/2 -cu 0
9W)
V ^ 0 (t 0 + bu 0)/2
where Ed = (to + uoVd)/2 (i n othe r words , (to,^o ) i s th e fundamenta l solutio n t o
the equatio n t^ — du^ = 4). Th e nor m o f thi s hyperboli c elemen t i s the n e\.
The ma p Q H > g send s primitiv e integra l quadrati c form s t o hyperboli c conju -
gacy classe s o f 5L(2,Z) . W e clai m i t induces a bijection betwee n classe s o f form s
and primitiv e hyperboli c conjugac y classes .
To se e this , le t g be an y primitiv e hyperboli c elemen t o f 5L(2,Z) . Le t 0\ an d
62 be th e fixed point s o f g an d K — Q(0i , #2)- Clearl y K is a real quadrati c field,
say K = Q(Vd) fo r a unique d > 0 , d = 0, 1 (mo d 4) . The n th e stabilize r o f 8\ is
generated b y g becaus e g i s primitive . Th e for m Q(x,y) = (x — 6\y){x — 62V) is a
primitive quadrati c for m wit h g(Q) = g. Thi s implie s surjectivit y o f th e ma p an d
injectivity i s easy t o check .
Hence th e norm s o f conjugac y classe s o f primitiv e hyperboli c element s ar e e 2d
with multiplicit y h(d). I n thi s cas e Theore m 1 5.1 3 become s
3 4
(15.33) Y. 2h(d)logs d = X + 0(X / ),
ed^Vx
hence th e resul t afte r summatio n b y parts . •
REMARK. Siege l [Sie2 ] ha s show n tha t
7
(15.34) Yl 1Hd) loge d =
8C(3 ^) 7^+0(XlogX)
ttx
for X ^ 2 . Notic e th e differenc e betwee n th e orderin g o f the discriminant s in
(15.33) an d (1 5.34) . A n asymptoti c formul a fo r h(d) ove r d ^ X i s no t known .
EXERCISE 2 . Prov e (1 5.34 ) usin g th e Dirichle t Clas s Numbe r Formul a (2.31 ) .
http://dx.doi.org/10.1090/coll/053/17
CHAPTER 1 6
SUMS O F K L O O S T E R M A N SUM S
16.1. Introduction .
Individual Kloosterma n sum s S(m, n ; c) ar e quit e wel l examine d b y algebrai c
methods (se e Chapte r 1 1 ) . Th e resul t i s Weil's boun d
(16.1) |5(m,n;c) | < (m,n,c)^c^r(c)
which i s th e bes t possible . Thi s an d earlie r estimate s hav e bee n use d i n analyti c
number theor y indirectl y i n many ingeniou s ways (se e for instanc e [Li2] , [At] , [110],
[Wi3]). Yu . V . Linni k [Li3 ] pointe d ou t tha t fo r certai n application s on e actuall y
needs estimate s fo r sum s o f typ e
(16.2) S mn(X) = Yl c^Sim, n ; c).
c=0(mod q)
The uppe r boun d (1 6.1 ) implie s
(16.3) 5 mn (I)«r((m,n)g)(m,n,^lhogI
for X ^ 2 where th e implie d constan t i s absolute . However , Linni k suggeste d tha t
a smalle r boun d shoul d b e true , i f X i s sufficientl y large , becaus e o f cancellatio n
of Kloosterman sums , whic h i s due t o a regula r chang e o f sign . Linni k conjecture d
that
(16.4) S mn(X) « X e,
where th e implie d constan t depend s o n m , n an d £ , an d h e als o provide d som e
heuristics t o suppor t thi s bound .
In hi s investigation s Linni k wa s drive n b y a n analog y o f th e zet a functio n
c
(16.5) z mn(s) = J2 ~2'sKn;c)
c=0(mod q)
to the hypothesi s o f Hasse on the L-function s o f elliptic curve s (t o be precise Linni k
spoke onl y fo r q = 1 and use d a differen t notation) . Thi s analog y turn s ou t t o b e
close onl y i n th e analyti c aspect s o f th e serie s (1 6.5 ) (lik e analyti c continuation ,
but no t th e Eule r product) . B y independen t analysi s A . Selber g [S7 ] revealed tha t
Zran{s) admit s a spectral decompositio n wit h respect t o automorphi c form s (fo r th e
group To(q)). Selber g le d th e foundatio n fo r th e spectra l theor y o f Kloosterma n
sums yet h e did no t delive r th e necessar y technica l tool s to cultivat e thi s fertil e soil .
403
40 4 16. SUM S O F KLOOSTERMA N SUM S
In this chapter w e sketch proofs o f central results of this profound theory . Ther e
are soli d presentation s o f selecte d topic s i n th e article s [1 3] , [1 1 3] , [DI ] an d i n th e
books [1 5] , [Sa3] , [Motl] , however , a conclusiv e expositio n i s yet t o b e written .
16.2. Fourie r expansio n o f Poincar e series .
There ar e variou s method s t o develo p th e spectra l decomposition s o f sum s o f
Kloosterman sums . Arguabl y th e bes t structura l treatment s appea l directl y t o
the Gree n functio n (se e [1 5]) , o r th e resolven t operato r (se e [GS]) . Nevertheless ,
we follo w th e traditiona l metho d whic h begin s b y dua l treatment s o f th e Poincar e
series (fo r th e grou p T = T 0(q))
F
(16.6) P m(z)= Yl (m7^), m = l , 2 , 3 , . . . .
Here F(z) i s a functio n o n H I which i s periodi c i n x o f perio d on e an d i t satisfie s
F(z) < C ya wit h som e a > 1 so th e serie s (1 6.6 ) converge s absolutel y (a = 1 will
be attaine d b y continuity) . Ou r treatment s o f P m(z) ar e paralle l t o thos e alread y
given fo r th e holomorphi c form s i n Sectio n 1 4.2 , an d ou r goa l i s th e Kuznetso v
formula (1 6.34 ) whic h i s a n analo g o f th e Petersso n formul a (1 4.1 5) . Th e extr a
care i s neede d t o justif y infinit e summation s ove r th e spectru m o f th e Laplac e
operator. T o thi s en d on e need s som e crud e estimate s fo r th e Fourie r coefficient s
of cus p form s i n th e spectra l parameters . W e shal l prov e muc h stronge r estimate s
than necessar y i n Sectio n 1 6.5 , bu t onl y afte r th e final Kuznetso v formula s ar e
established. However , on e coul d deriv e th e require d auxiliar y estimate s fro m th e
early form s o f Kuznetsov' s formulas . W e encourag e th e reade r t o d o s o an d t o
organize our argument s int o a correct logica l order (w e compromised th e order , bu t
not th e completeness , fo r reducin g th e lengt h o f presentation) .
Applying th e doubl e cose t decompositio n (1 4.1 4 ) an d the n th e Poisso n sum -
mation formul a a s i n th e proo f o f Lemm a 1 4. 2 w e arriv e a t th e Fourie r serie s fo r
Pm(z),
nz
Prn(z) = F(mz) + Yl E Y.< )f F (m(ar * H)z)e(-nZ)dt.
Now assum e tha t th e generatin g functio n F{z) satisfie s th e rul e F(z-\-t) = e(t)f(z)
for t £ M so b y th e actio n o f T o n H I we hav e
F m =e
( (c d)0 (v) F ( - m / c ( £ * + d ) ) -
From thi s actio n on e get s th e Kloosterma n sum s i n th e Fourie r serie s
(16.7) P m{z) = F(mz) + ^2 E S
(m'n; c )Fc(m>n:' y) e nz
( )
n c=0(mo d q)
where
2
F c (m, n; y) = f F(-m/c Z)e(-nOdi
Jim £=y
by shiftin g £ to £ — d/c. W e pu t
(16.8) F(z) =p(47ry)e(z)
16. SUM S O F K L O O S T E R M A N SUM S 405
and cal l p(y) th e tes t functio n fo r th e correspondin g P m (z), gettin g
(16.9) F c(m,n;y) = £ ^ . ( ^ f )e(-% " *)*.
In thi s generalit y w e cannot comput e th e integra l (1 6.9) . A n interestin g choic e
for th e tes t functio n i s
(16.10) p(y) = ys
where s i s a comple x paramete r a t ou r disposa l wit h R e s > 1 (ther e ar e othe r
eigenfunctions o f A whic h lea d als o to interesting results) . I n this cas e the Poincar e
series become s
s
(16.11) P rn(z,s) = ^ (47rml m ^z) e(myz)
and it s Fourie r expansio n become s
(16.12) P m (z, s) = {47rm) s{e(mz) + ] T Z mn(s; y)e(nz)}
n
where
c
Zmn{s\ y) = ^2 ~2sS(m, n ; c)Is(m, n ; y)
c = 0 ( m o d q)
and
2s
/ 8 (m, n;y)= [ |£|- e ( - < ~ m/Z<?)d£.
The genera l Poincar e serie s P m(z) ca n b e represente d b y P m(z, s) b y applyin g th e
Mellin transfor m t o p(y).
The particular Poincar e series P m(z, s) wa s introduced b y A. Selberg [S7] . Thi s
is nearl y a cus p form , bu t no t exactl y so , becaus e i t fail s t o b e a n eigenfunctio n o f
A. Precisel y w e hav e
(16.13) ( A + s(l - s))P m(z, s) = sP m(z, 5 + 1 ) ,
because ( A + s( l — s))F{z) = ^ixsyF{z) fo r F(z) = y se(z). Henc e w e ge t th e
recurrence formul a
(16.14) P m(z,s) = sRxP m(z,s + l)
where R\ = ( A + A ) - 1 i s the resolven t o f A a t A = s( l — s). B y th e spectra l theor y
R\ i s meromorphic , therefor e P m(z,s) ca n b e continue d meromorphicall y t o th e
whole comple x s-plane . Moreover , i n th e half-plan e R e (s) > \ th e pole s o f R\
are a t th e point s Xj = Sj(l — Sj) > 0 i n th e discret e spectru m o f A , an d the y ar e
simple, s o ar e th e pole s o f P m (z, s) a t s — Sj. I n fac t i t i s conjecture d b y Selber g
that Xj ^ | , s o Sj = \ + itj wit h tj rea l an d P m (^, 5) i s holomorphic i n R e (s) > | .
In what follow s we avoid the analytic continuation o f the Selberg-Poincar e serie s
and th e resolven t operato r altogether . Instead , w e rel y heavil y o n th e spectra l
decomposition o f automorphi c functions .
40 6 16. SUM S O F K L O O S T E R M A N SUM S
16.3. Th e projectio n o f Poincar e serie s o n Maas s forms .
We return t o the Poincar e serie s P m(z) fc> r a generating functio n o f type (1 6.8) .
Let f(z) b e a Maas s for m o f eigenvalu e A = | + r 2 give n b y th e Fourie r serie s
(16.15) f(z) = ay^ r + by^ r + 2 /^ a nKir{2*\n\y)e{nx)
(see Lemm a 1 5.1 ) . B y th e unfoldin g metho d w e deriv e
/•OO pi
(f,Pm)= / / f(z)F(mz)d liz.
Jo Jo
Inserting (1 6.1 5 ) an d (1 6.8 ) w e pic k u p th e ter m a m b y integratio n i n 0 < x < 1 .
Then, changin g th e variabl e y — > y/2irm w e obtai n th e formul a
/»oo
(16.16) </,P m) = (2^m)*a m / e-*K ir(y)p(2y)y-*dy.
Jo
Hence observ e tha t P m(z) i s orthogona l t o constan t functions . I n particula r fo r
p(y) — V s th e abov e integra l i s a produc t o f gamm a function s givin g
(16.17) (/ , P m) = 2nT(s - \ + r)T(s - \ - i r ) r ( s ) - W a m.
This i s clearl y a n analo g o f th e formul a fo r th e inne r produc t o f a holomorphi c
modular for m wit h a holomorphi c Poincar e serie s give n i n Lemm a 1 4.3 .
16.4. Kuznetsov' s formulas .
Now w e ar e read y t o deriv e explici t relation s betwee n Kloosterma n sum s an d
Fourier coefficient s o f automorphi c forms . T o thi s en d w e comput e i n tw o way s
the inne r produc t ( P m , Q n ) o f tw o Poincar e serie s Pm(z) an d Q n(z) generate d
by F(z) — (47ry)2p(4r:y)e(z) an d G(z) = (Airy)^ q(A7ry)e(z) respectivel y (not e w e
modified th e tes t function s p(y),q(y) b y th e facto r y*).
First b y th e spectra l decompositio n (se e Theore m 1 5.2 ) w e get
(16.18)
OO
J =l
OO
(P m ,#oo(*, \ + ir) > (£«>(•, \ + ir),Q n)dr.
/
where (UJ(Z)) i s a n orthonorma l basi s o f cus p form s an d (E 00(z, \ + ir); r G R) i s
the eigenpacke t o f Eisenstei n serie s associate d wit h a syste m o f inequivalen t cusps .
Suppose Uj(z) ha s th e Fourie r expansio n
(16.19) Uj(z) =Pj{0)y s> +y l
^Pj(n)Kltj{27T\n\y)e{nx)
n^O
where Sj = \ +itj wit h tj real , o r | < Sj < 1 because Xj = Sj(l — Sj) — \+i] > 0.
Then
(16.20) (uj,Q n) = {^n)^p 3{n)ujq{t3)
16. SUM S O F K L O O S T E R M A N SUM S 40 7
where
/•OO
1
(16.21) u q(t) = / e-yK lt(y)q(2y)y~ dy
Jo
by th e formula (1 6.1 6) . Similarl y fo r the Eisenstein serie s
(16.22) E a(z, ±+ir) = 5 ay^r + y> fl(± + ir)yi~ ir
+ 2/ 5 ^2^a(n,r)K ir(27r\n\y)e(nx)
n^O
we hav e
(16.23) {Ea&\ +ir),Q n) = (4^n)*r a(n,rH(r).
Inserting (1 6.1 9 ) an d (1 6.23 ) int o (1 6.1 8 ) w e obtain
OO
(16.24) (P m,Qn) = 47ry / ^|y^pj(m)pj(n)cjp(tj)a; g (tj)
00
1 Z* " i
an
Next w e compute ( P m , Q n ) b y unfolding wit h respec t t o Q n d applyin g th e
Fourier expansio n (1 6.7 ) fo r P m. W e obtain
/•OO /» 1
(PrmQn}= / / P rn(z)G(nz)d/JJZ
Jo Jo
\6(m, n)F(imy) + ^ 5(m , n; c)Fc (ra, n; y)e(iny))G(iny) — .
q\c
Changing th e variables w e arrive a t
(16.25) (P m, Qn ) = 47rvm^(^(m , n)(p, g) + ^ ^ ^ ( m , n; c)Vm[ 7r ^mn^
q\c
where
/•OO
1
(16.26) (p,q)= / e-yp(y)g(y)y' dy
Jo
and
We simplify V pq(x) b y changing £ = 2z(( 2 + 1 ) x , where £ runs ove r th e semi-circle
|£| = 1 , Re(£) > 0 clockwise. Lettin g R e (£) — r] w e have £ = ^(C^) _1 , |£l = ^ _ 1
and ICI"" 1 ^ = z(C^) -1 ^C- Henc e
(16.28) V :+y)C^ dyd(
pq{x) = if J p{^J^q{xyrj)e-^v
/o rKy'')^*'"" y(rj'
Combining (1 6.24 ) an d (1 6.25 ) w e arrive a t th e followin g
408 6 . SUMS O F K L O O S T E R M A N SUM S
PROPOSITION 1 6.1 . Let m,n > 0 andp(y),q(y) be smooth, bounded functions
on R + . Then we have
(16.29)
Y^Pj{rn)p3{n)uv(t3)ujq(tj) + ^— / f a (ra,r)r a (n, r)uj p(r)cuq(r)dr
4 7 F
3=1 a ^-oo
e/ w N V ^ - I n / x-r rf ^ y^TOU
= d{m,n)(p,q) + 2_ ^c M m
> n ; c )*>7(
c = 0 ( m o d g)
Now let us choose p(y) = q(y) = yz+tl/ wit h v G R. Fo r these tes t function s we
get
ujp(t) = ^T(\ + i* + « ^ ) r (| - i * + iv)T(l + iz/)"1,
whence
(16.30) (jj p(t)ujp(t) = 7r2 sinh(7rz/)/^ cosh7r(^ — t) cosh7r(z/+ t).
On th e other hand , (p,q) = 1 and Vpp{x) = B2i v(x), wher e
1
_
(16.31) B s(x) = 2ix / X s (Cx)C ^C.
In thi s cas e (1 6.29 ) become s (a n auxiliary versio n o f Kuznetsov's formula )
COROLLARY 1 6.2 . For ra,n> 0 and v e R we have
Pj(m)pj(n)
(16.32) £
_^ cosh7r(i / — tj) cosh7r(z/ + tj)
, \ ^ 1p T a (m,r)r a (n,r) ^
i -
^- ' 47 r J-oo cosh7r(z/ — r) cosh7r(i/ + r)
7r__2zy
/r / N v ^ 1- 0 / x^ , /47rA/^m\ \
= —- —- d(ra,n ) + > c S(m,n;c)B 2iv[ ) ) .
smn(7ri/) V ^— 'V c / /
vJ
c=0(mo d q)
The integra l (1 6.31 ) ca n be expressed i n terms o f the Bessel functio n J s(y) of
real positiv e argument . Indeed , b y Cauchy's theore m
Bs(x) = 2ix([ + f) KsiCxX^dC
oo
l
(Ks(ixy) - K s{~ixy))y' dy.
/
For y > 0 we have (se e (5.14.4), (5.22.5) , (5.22.6 ) o f [Leb] )
^Hy)-^^(e^J.M-e-^Js(y)).
Hence
— 7T ?
K.(iy)-K.(-iy) = ^^jT)(My) + J. B(y)).
16. SUM S O F K L O O S T E R M A N SUM S 409
Finally, thi s yield s
(16.33) B s{x) = / (J s(y) + J-sWjy^dy.
COS(7TS/2) J x
With th e real paramete r v one can generate quit e a lot of functions h(t) out o f
the particula r on e h(t) = (cosri7r( ^ — £)cosh7r(z / + t))~ 1 . N . V. Kuznetsov [Kuz]
produced fro m (1 6.32 ) th e following
T H E O R E M 1 6.3 . Suppose h(t) satisfies the conditions (1 5.20). Then for any
ra, n > 0 we have
(16.34)
^, , . N h{tj) v- ^ 1 f°° , , , .h(r)dr
> Pj{m)pAn) T^— + > — / T a(m,r)Ta(n,r)
£_jfj\ ;rj\ ^ Q g h ^ Z^ 4nJ a v
' a vy
cosher
w
j —1 ^ a
1x '4ir \/mn
5(ra,n)# 0 + ^2 c c ${m,n\c)g\
S{m,n;c)g(-
c = 0 ( m o d q)
where
oo
(16.35) 9o = n : rh(r) tanh.(nr)dr
/ -oo
and
2% rft(r)
(16.36) 9\x) = - J2irWT T ^ r
-
7T 3 cosh(7rr )
R E M A R K S . Th e formula (1 6.34 ) wa s also establishe d independentl y b y R. W.
Bruggeman [Bru] , bu t only fo r a certain typ e o f test function s h(t) an d with g(x)
in less refined for m tha n (1 6.36 ) o f Kuznetsov. Th e resul t als o holds (an d th e proof
is similar ) fo r ran < 0, except fo r the integral transfor m (1 6.36 ) whic h change s to
4 f°°
g (x) = — / K2ir{x)rh{r)smh{irr)dr.
The Kuznetso v formul a (1 6.34 ) ca n be derived fro m (1 6.32 ) b y means o f the
following
LEMMA 1 6.4 . Suppose h(t) satisfies the conditions (1 5.20). Then
oo cosh(7rr)dr —2h(t)
(16.37) (h(r+i)+h(r-i))
/ -OO
cosh7r(r — t) cosh7r(r + t) cosh(7rt )
PROOF ( S K E T C H ) . Mov e the integral o f h(r+ | ) dow n to the lin e I m (r) = — \
excluding smal l half-circle s centere d a t r = t — |, r = — t — |, an d mov e th e
integral o f h(r — | ) u p to the line I m (r) = \ excludin g smal l half-circle s centere d
a t r = £ + | , r = £ — | . Th e horizontal integral s cance l ou t because h(t) i s even.
The fou r integral s ove r th e small half-circle s yiel d i n the limit — 2h(t)/ cosh(7rt) by
computing residues . D
410 16. SUM S O F K L O O S T E R M A N SUM S
By integratin g (1 6.32 ) accordin g t o (1 6.37 ) on e obtain s immediatel y th e spec -
tral term s o n th e lef t sid e o f (1 6.34) . O n th e righ t sid e on e obtain s th e diagona l
term wit h
»-5?£<« r + * > + *<'-*"ds5S)
- 1 f°° rdr 1 f°°
7T2 J_ QO 2 tanh(Trr ) TT 2
J_00 2
which i s exactl y (1 6.35 ) becaus e h(r) i s even . Moreover , on e obtain s th e su m o f
Kloosterman sum s wit h
Inserting (1 6.34 ) an d interchangin g th e integratio n w e ge t
because h(r) i s even. Next , b y th e recurrenc e formul a
(s + 1 ) Js+1 (y) - (s - l)J s-i(y) = -2sy{J s(y)/y)\
we ge t
-2ix [<*> [<*> rh(r) 1( ,
g(x) = / / M2ir(y)/y) drdy
* Jx J-oo cosh(Trr )
which coincide s wit h (1 6.36) .
The formul a (1 6.34 ) finds it s basi c application s fo r estimatio n o f th e Fourie r
coefficients o f cusp forms i n spectral parameter s an d th e leve l (se e the next section) .
In th e leve l aspect alon e the preliminar y formul a (1 6.32 ) i s often sufficien t an d eas y
to use . However , th e extr a flexibilit y offere d b y th e larg e famil y o f function s h(t)
in th e Kuznetso v refine d versio n (1 6.34 ) i s helpful fo r tacklin g delicat e issues , suc h
as exceptiona l eigenvalue s o r th e distributio n o f lengt h o f close d geodesie s o n th e
Riemann surfac e r\M ; cf . [Ill] , [1 1 2] .
For problem s i n analyti c numbe r theor y w e nee d t o trea t th e sum s o f Kloost -
erman sum s rathe r tha n th e Fourie r coefficient s o f cus p forms . T o thi s en d w e
need a n equatio n i n whic h th e tes t functio n attache d t o th e Kloosterma n sum s i s
given a priori , an d th e one s attache d t o th e spectra l term s ar e obtaine d b y integra l
transforms wit h explici t kernel s (somethin g lik e th e Rieman n explici t formul a fo r
primes). Unfortunately , th e formul a (1 6.34 ) i s no t sufficien t becaus e on e canno t
represent ever y requeste d functio n f(x) b y th e integra l (1 6.36) . Moreover , eve n i f
g(x) admit s th e integra l representatio n (1 6.36) , i t i s no t eas y t o fin d it s origina l
function h(r).
Historically speaking , i t wa s E . C . Titchmars h [T3 ] wh o firs t trie d t o inver t
i » g give n b y (1 6.36 ) fo r g i n a dens e subspac e o f L 2 (R + , x~ ldx), bu t
the ma p h —
16. SUM S O F KLOOSTERMA N SUM S 411
not i n th e contex t o f automorphi c forms . The n jointl y wit h D . B . Sear s [ST ] the y
realized tha t th e imag e o f this ma p i s no t dense . Indeed , writin g (1 6.36 ) a s
9(X) = - / (J2it(x) - J-2it)(x)) } .M
IT JQ cosn(7rr )
it i s clea r tha t th e Besse l function s Jk-i(x) o f positiv e integra l orde r k — 1 with k
even ar e missed , becaus e the y ar e orthogona l t o J2it{x) — J-2it(x). However , the y
showed tha t { Ji(x)\ £ = 1 , 3, 5 , . .. } together wit h {J2it{x) — J-2it(x)\ 0 < £ < oo }
form a complete , orthogona l syste m i n L 2 (M + , x~ ldx), th e latte r i n th e sens e o f
continuous spectra l measure .
Precisely, le t f(x) b e a functio n o f clas s C 2 o n [0 , oo) suc h tha t
(16.38) /(0 ) = 0 , /<">(* ) < (s + l)- a
for a = 0,1 , 2 with a > 2 . The n w e hav e
(16.39)
/(*) = r w M - j-2u(x))(r(j2u(h) -j-2*mf{v)-). 2 /f.
Jo V oy }
smh(27rt )
1
+ Y. W - l)Jk-i(x) J k-i{y)f{y)y' dy.
k>0,k eve n ^ °
Here the integral component o f f(x) i s in the image of the integral transform (1 6.36) .
Therefore, fo r completenes s on e needs , i n additio n t o (1 6.34) , a formul a fo r sum s
of Kloosterma n sum s S(m,n;c) twiste d b y th e Besse l function s J/ c _1 (47r v / mn/c).
These w e alread y kno w fo r eac h k a s th e Petersso n formul a (1 4.1 5) . Pu t
• /"OO
1
(16.40) M f(t) = , / (J 2it(x) - J^u(x))f(x)x- dx
and
A k
(16.41) AT f(k) = \ ~Sl r J 1
k-i(x)f(x)x- dx.
Let pj{m) an d r a (ra, r) b e the Fourier coefficient s o f a complete orthonormal syste m
of cus p form s an d th e Eisenstei n serie s respectively . I n additio n le t ipjk(m) fo r
1 ^ j ^ dimSfc(r ) b e th e Fourie r coefficient s o f a complet e orthonorma l syste m o f
holomorphic cus p form s
oo
(16.42) f jk{z) = Y2 ^jk{m)m^~e(mz)
of weight k = 2 , 4, 6 , . .. Addin g Kuznetsov' s formul a (1 6.34 ) t o Petersson's formul a
(14.15) i n accordanc e t o th e decompositio n (1 6.39 ) on e derive s
412 16. SUM S O F K L O O S T E R M A N SUM S
THEOREM 1 6.5 . Let f satisfy (1 6.38). For mn > 0 we have
/A / \ °°
(16.43) Y. c-'S(m,n;c)f(^^\ = £ M/ft^ftHftfn )
c = 0 ( m o d q) ^ ' j =l
1 f°°
+ Y1 A~ Mf{r)f a(m,r)Ta(n,r)dr
0<fc=0(mod 2 ) l^j^dimS k{T)
Observe tha t th e diagona l ter m o n th e spectra l sid e i s gone !
One als o needs a formula fo r sum s of Kloosterman sum s S(ra , n; c) with ra,nof
different sign . Thi s i s a similar case , actuall y i t i s simpler becaus e th e holomorphi c
cusp forms d o not appea r (the y have no Fourier coefficient s a t negativ e frequencies) .
Put
/>oo
1
(16.44) K f(t) =4 / K 2lt{x)f{x)x- dx.
Jo
THEOREM 1 6.6 . Let f satisfy (1 6.38). For mn < 0 we have
(16.45)
1
£ c- S(m,n]c)f(Any^\ =f^K f(tj)pj(m)pj{n)
c = 0 ( m o d q) ^ ' j =l
1 f°°
+ AX ^ 4 ^ / fcf(r)f a(m,r)Ta(n,r)dr.
a
PROOF. Thi s follow s fro m Theore m 1 6. 3 (wit h g(x) replace d b y g (x)) an d
the followin g Kontorovich-Lebede v inversio n (se e (5.27.1 4 ) o f [Leb] )
1
(16.46) f{x) =^ J K 2lt{x) (J K 2lt{y)f{y)y- dy\ tsinh(27rt)dt .
We clos e thi s sectio n wit h remark s o n th e Kloosterma n sum s zet a function .
Applying th e formul a (1 6.43 ) fo r f(x) = x 2 s _ 1 formall y on e derive s th e followin g
spectral decomposition :
k 2s-l
/.^ „„ x ~(2ixJmn) „ , x v ^ T( s — s 7 -)r(s — 1 + 57-) , x , x
(16 47) 2 z (s)=
- ^fer 2 1
-fe
g coihU ) ''ftMftfr o
+ ^ 7r- (47r) - (fc-l)!r(a-|)r(s-l + |) ^ ^(m)^ fe(n)
fc>0,fc eve n l ^ j ^ d i m SK(T )
V- 1 r r ( 3 - | - i r ) r ( 8 -|+ir)_
T
Z . I47r
T /j _ T^i^A
0 0 cosh(7rr)
« ( m ' r) T » ( n ' ^
Of course , th e functio n f(x) = x 2s~l doe s no t satisf y th e condition s (1 6.38) , nev -
ertheless on e ca n prov e rigorousl y tha t th e formul a (1 6.47 ) hold s true .
16. SUM S O F K L O O S T E R M A N SUM S 413
The serie s
AnJmn
(16.48) L mn(s) = ^ c^Sim, n ; c)J t
c E 0 ( m o d q)
seems mor e natura l tha n Z rnn(s) becaus e i t satisfie s a suitabl e functiona l equatio n
(16.49) L mn(s)-Lmn(-s) + — ^ r a ( ^ m , — J ra ^ n , - — J = TT -1 sin( — -JS(m, n) ,
a
(see Theore m 9. 2 o f [1 5]) .
16.5. Estimate s fo r th e Fourie r coefficients .
In thi s sectio n w e deriv e fro m th e Petersso n an d th e Kuznetso v formula s im -
mediate estimate s fo r th e Fourie r coefficient s o f basi c modula r forms . T o thi s en d
we simply estimat e th e involve d sum s o f Kloosterma n sum s b y usin g Weil' s boun d
for th e individua l sum . Mor e advance d method s an d stronge r result s hav e bee n
established i n [1 3] , [1 1 3 ] an d [DI] .
Our problem s reduc e t o estimatin g th e serie s (1 6.48) . Firs t w e sho w tha t
(16.50) ] T c-^IS^njc) ! < 8(cr - ^)- 2 r((ra,n))(m,n, q)^r(q)q-i- a
c = 0 ( m o d q)
if | < a ^ 1 . Indeed , b y applyin g (1 6.1 ) th e lef t sid e o f (1 6.50 ) i s bounded b y
oo
c=l
and th e las t serie s i s bounded b y
cv+D E T ^d~°-
d\(m,n)
l
Hence (1 6.50 ) follows , becaus e r(d) ^ 2dS an d ((a + \) < 2(< r - \)~ .
For k ^ 2 we hav e
y.k-1
J f e _ 1 (*)« m in(l ) ? !--)«(f]
(fc-1)!
for an y 0 ^ a < 1 . Therefor e (1 6.50 ) yield s
_2tV™n\ r(q)^
(16.51) L mn (A: - 1 ) « (2 a - l)~ z ( \~ ) " y M ( m , n))(m, n , q)*
for an y | < a ^ 1 , where th e implie d constan t i s absolute .
Next fo r s = a + it wit h \ < a < 1 we have (se e (23.41 1 .4 ) o f [GR] )
Therefore (1 6.50 ) yield s
(16.52) L mn ( S ) « ( 2 a - l ) - 2 e ' r l s l / 2 ( / ^ ' ) lM T((m,n))(m,n,q^
\ \s\q J y/q
414 6 . SUM S O F KLOOSTERMA N SUM S
where th e implied constan t i s absolute.
Note th e right sid e o f the Petersson formul a (1 4.1 4 ) equal s
S(m, n) + 27ri kLrnn(k - 1 ) .
Hence, usin g (1 6.52 ) wit h 2< J — 1 = (log3ran) _ 1 , w e conclude
T H E O R E M 1 6.7 . Let k > 2 , k even. The Fourier coefficients of an orthonormal
system of holomorphic cusp forms of weight k and level q satisfy
(16.53) / 47rNfc_i Yl^3k{m)^jk{n) = 5(m, n)
3
-f O i ——T((m, n))(m, n , q) * (ran) * log 2 3ran J
where the implied constant is absolute.
Next notic e tha t th e right sid e o f the Kuznetsov formul a (1 6.34 ) equal s
1
f r 1 N h(is/2)s .
6(m,n)g0 + — / L m n (s) — —ds
2m J {(T) COS(7TS/2 )
for an y | < a < 1 . Insertin g (1 6.52 ) wit h 2< r — 1 ^ (log3mn) - 1 w e conclude
THEOREM 1 6.8 . Suppose h(r) satisfies (1 5.20). Then for m , n> 0
h(r)dr
(.6.54) £ ft(m)ft<B)_»|L+ £ ^ ra(m,r)ra(n,r)-
7=1 ^ a
cosh(7rr)
8(m,n)go + Of H r((ra , n))(ra, n, q)2 (ran)4 )
q
where
(16.55) H = rj~ 2 f \rih{r)\dr
_1
for any 0 <7 7 ^ (41 og3ran) , and the implied constant is absolute.
In particular , Theore m 1 6. 8 yield s
f e / T ) 2 lft(n)| 2
(16 56) V e - +Y— f°° e- (t/T)2^^-dt
4^* COSh(7T^ )^ 47 T J_00 COSh(7Tt )
= g 0 + O ^ J r ^ g - V ^ ) ^ , # n i log 2 3n)
where g o — TT~ 2T2 -\- 0(1) an d the implied constant s ar e absolute.
Another interestin g choic e o f the test functio n i s
(16.57) h(t) = (X u + X~ u)2(t2 + I ) "2
where X ^ 1 . Thi s i s non-negative o n the spectrum, i.e. , for t o r it real , an d large
for th e exceptiona l points , precisel y w e hav e h(t) ^> X2s~l i f s = \ + it > | .
Clearly g 0 < C 1 and the integral (1 6.55 ) satisfie s
7/ < rj- 2xi+2r] < C X2 (log3mnX) 2
16. SUM S O F K1
L O O S T E R M A N SUM S 4 5
_1
by choosin g rj = (41 og3ranX) . Therefor e (1 6.54 ) fo r th e tes t functio n (1 6.57 )
yields
2
(16.58) Yl |P;H| ^ 2 S j1
_
« l + — r(n)(n,^(nX)*(lo g3nX)
2
.
i<sj<i q
According to Selberg' s eigenvalue conjecture fo r th e grou p To(q) th e sum on th e
left sid e o f (1 6.58 ) i s void . Thi s ha s no t ye t bee n proved . However , th e inequalit y
(16.58) show s tha t th e exceptiona l point s appea r infrequently . Specifically , takin g
n — 1 and X — q 2 w e ge t
2
(16.59) £ |p J(l)| g 21
^- )«r(g)(log2g)2.
Here one can choose a basis ( a Hecke-Pizer basis ) suc h that |pj(l) | ^ > q~^~ £, gettin g
2(251
j ) 1 +£
(16.60) Yl 4 " «4 .
Hence w e deriv e th e followin g densit y estimat e
3 4a+£
(16.61) \{j; Sj > a}\ « g "
for an y a ^ \ an d e > 0 , th e implie d constan t dependin g onl y o n e . I f a > ^ , thi s
bound i s quit e smal l relativ e t o th e volum e o f JT 0(g)\H.
16.6. Estimate s fo r sum s o f Kloosterma n sums .
Now w e hav e everythin g read y fo r estimatio n o f sum s o f Kloosterma n sums .
The origina l su m o f Linni k give n b y (1 6.2 ) fo r q = 1 wa s treate d i n 1 97 7 b y
Kuznetsov [Kuz] . H e applie d hi s formul a (1 6.43 ) fo r a suitabl e f(x) an d imple -
mented Weil' s boun d fo r th e Kloosterma n sum s nea r th e boundar y o f th e summa -
tion rang e gettin g
(16.62) S mn(X) « X*(log2X) ^
for X ^ 1 an d ra, n ^ 1 , wher e th e implie d constan t depend s o n ra, n . Th e
exponent ^ her e i s remarkabl e b y compariso n t o th e | i n (1 6.3) , thoug h i t i s no t
yet arbitraril y smal l a s conjectured b y Linnik. I n practice on e is satisfied wit h sum s
of 5(ra , n; c) ove r th e modulu s c restricted smoothl y i n a n interva l rathe r tha n b y
the shar p cut-off . Thes e smoothe d sums , whil e no t compromisin g rea l applications ,
can b e give n th e tru e estimat e (se e (1 6.72)) .
Fix ra, n,q ^ 1 an d a functio n / o f clas s C 3 supporte d o n [|,1 ] . W e shal l
estimate
(16.63) S(X) = V cT^m , n ; c)f(2ny/mnX/c)
c = 0 ( m o d q)
for al l X ^ 1 . B y Theore m 1 6. 5 th e proble m i s transferre d t o estimatio n o f th e
Fourier coefficient s o f basi c modula r form s an d th e integra l transform s (1 6.40) ,
(16.41) o f f(xX/2).
416 16. SUM S O F K L O O S T E R M A N SUM S
Introducing th e power serie s expansio n
tJ2l+v
(-l)V
(16.64) J u(2x) = ^2
£=0
£T(£ + v + l)
into
M f J 1
f( ) = — T ^T / ( 2,-i(2x) - J 1 ^2s(2x))f(xX)x- dx
where s = h2 -\- it, we get
w M _ _ z L _ f ( t W + 28-i) 2a ^.-21
sin(27rs) ^ ' £\Y{£ + 2s)
+ th e same expressio n wit h 5 changed int o 1 — 5 ,
where f(s) i s the Melli n transfor m o f fix). Al l we need t o kno w abou t / i s tha t
\f^\ ^ K for a = 0,1, 2, 3 and some K, > 0. Integratin g b y parts u p to thre e time s
we deduc e tha t
(16.65) f(s) < K(\S\ + l ) " 3 i f Re (5) > 0,
where th e implied constan t i s absolute. Moreover , w e use the Stirlin g formul a
5 S
(16.66) r ( S )=(f) Q ( l+ o ( ^ ) ) ifRe( S)>0.
Hence, estimatin g al l term s o f th e powe r serie s fo r Mf(t), excep t fo r £ = 0 , we
deduce tha t
(16.67) M f{t) = u>(s, X) + 0 ' KX~3 X l 0 g 2X
\s\ cosh(7rt )
where
W ( a , X
- ^ j ( l ^ ~T(2-2s) J'
Next b y the functional equatio n r ( $ ) r ( l — s) = 7r/sin(7rs) w e write
(16.68) UJ(S, X) = T(2 s - 1 )/( 1 - 2s)X 1 ~2s - T( l - 2s)f(2s - l)X 2s
~\
We als o estimat e th e leadin g ter m LJ(S,X), bu t onl y fo r s = ^ + it wit h £ real,
getting
*;log2X
(16.69) M 7 ( t )< C
s| 3 cosh(7rt )
Similarly fo r the integral transfor m (1 6.41 ) w e deduce tha t
3
(16.70) Af f(k) ^K,k- X-\
Finally introducin g (1 6.69 ) an d (1 6.70 ) int o (1 6.43) , excep t fo r the points \ <
Sj < 1 , the n applyin g Cauchy' s inequalit y an d th e estimate s (1 6.53) , (1 6.56 ) w e
arrive a t
16. SUM S O F K1
L O O S T E R M A N SUM S 4 7
THEOREM 1 6.9 . Let f be of class C3 with compact support in [\, 1 ] and | / ^ | <
1 for a = 0,1 , 2, 3. Then for m n, q, X ^ 1 we have
(16.71) S(X) = £ u(s 3,X)p3(m)p3(n) + R(X)
\<Sj<l
where UJ(S,X) is given by (1 6.68) and R(X) satisfies
(16.72) R(X) < ( l + ( m , ^ ) 1 ( l + (n , g ) ^) "r(g)r(mn)(log2mnX) 3
u^'£/i £/& e implied constant being absolute.
Assuming Selberg' s eigenvalu e conjectur e th e su m ove r ~ < s 3<1 i s void ,
so S(X) — R(X) an d (1 6.72 ) become s a pur e boun d fo r (1 6.63) . I n particular ,
it i s know n tha t th e Selber g conjectur e hold s tru e fo r th e modula r grou p wher e
q = 1 (fo r a proof , se e e.g . [DI] ; Hejha l compute d tha t th e smalles t eigenvalu e i s
Ai = 91 .1 4...) . I n thi s cas e w e get th e unconditiona l resul t
(16.73) y ^ c~ 1 S(m, n ; c) f (2ir y/mnX / c) < r(mn)(mn) ^ (log 2mn)2 .
oo
Notice w e replaced (log2ranX) 3 b y (log2ran) 2 becaus e th e facto r log2 X i n (1 6.69 )
appears fo r smal l t whic h i s no t i n th e spectru m fo r th e modula r group .
Denote th e su m ove r th e exceptiona l spectru m (hypotheticall y empty ) b y
(16.74) E(X)= J2 u(s 3,X)p3(m)p3(n).
\<s 3<l
Suppose s\ = \ + OL i s the larges t point , i.e. , A i = s i ( l — s\) — \ — a 2 i s the lowes t
eigenvalue i n th e spac e o f cus p forms . Sinc e OJ(SI,X) ^ > X 2a fo r sufficientl y larg e
X w e deduc e b y comparin g (1 6.71 ) wit h (1 6.3 ) fo r m = n wit h pi(n) ^ 0 tha t
a ^ | whic h correspond s t o th e Selber g lowe r boun d A i ^ ^ . Th e bes t know n
estimate i s a < ^ du e t o Ki m an d Sarna k [KSa ] (se e Sectio n 5.1 1 ) . Th e estimat e
a
^ n * s °btained i n [1 1 4 ] b y simple r arguments . Applyin g Cauchy' s inequalit y
and LJ(S,X) < C X 2s~l log2 X uniforml y fo r \ ^ s ^ 1 we ge t
E(X)<&Em(X)*En(X)*log2X
where
2
Em{X) = Y J\Pj{m)\ X^-\
3
2a
For an y K r ^ I w e hav e E m(X) ^ (X/F) E m ( r ) , an d (1 6.58 ) yield s
1
Em(Y) < C l + T(q)T(m)q- (m,q)2(mY)^{log2mY)2.
Choosing Y = min(X , max(l, q 2 /m(m, q))) w e deriv e
Em(X) « [ 1 + ((m,q)m/q 2)iX2a + ((m , q) m X / q2)2a}r(q)r(m) (log 2m X ) 2 .
Combining th e abov e estimate s w e obtai n
418 16. SUM S O F K L O O S T E R M A N SUM S
PROPOSITION 1 6.1 0 . Suppose the lowest cuspidal eigenvalue for T$(q) is A i =
\ — o? with 0 < a ^ \. Then the exceptional sum (1 6.74) satisfies
(16.75)
E{X) < ( l + {{m,q)m/q 2)^Xa + ((m , g ) m / g 2 ) a X a )
( l + ( ( n , g ) n / g 2 ) ^ I a + ((n , g ) n / g 2 ) a X a ) r ( ^ ) r ( m n ) ( l o g 2 m n X ) 3
iu/iere the implied constant is absolute.
Because th e boun d (1 6.72 ) i s absorbe d b y (1 6.75) , th e latte r constitute s ou r
final estimat e fo r th e su m (1 6.63) . Althoug h th e Selber g eigenvalu e conjectur e i s
not ye t proved , ou r boun d fo r S(X) turn s ou t t o b e stron g enoug h i n th e leve l
aspect t o yiel d adequat e result s i n th e mos t importan t applications . Ver y ofte n i n
practice i t suffice s t o know that A i ^ ^ . Puttin g a = \ w e get th e followin g simpl e
unconditional estimat e
THEOREM 1 6.1 1 . For m,n,q,X ^ 1 we have
(16.76) S(X) < ( l + (m , q) ^ f) * (l + (n , q) ^) \{q)r(mn) (log 2mnXf
where the implied constant is absolute.
Similar argument s wor k an d th e sam e result s hol d tru e fo r th e su m (1 6.63 ) i f
S(ra, n; c) is replaced b y S(—m, n ; c) for an y m, n ^ 1 . T o this end replac e Theore m
16.5 by Theore m 1 6.6 .
http://dx.doi.org/10.1090/coll/053/18
CHAPTER 1 7
PRIMES I N ARITHMETI C PROGRESSION S
17.1. Introduction .
The Prim e Numbe r Theore m assert s tha t ever y residu e clas s a (mod q) wit h
(a, q) — 1 contains th e sam e proportio n o f primes , i.e. , fo r fixed q we hav e
(17.1) TT(X; q, a) ~ — — TT(X),
(17.2) ^ ( x;g>a)„ J_^(s )
as x tend s t o infinity . On e eve n ha s reasonabl e estimate s fo r th e erro r term s i n th e
asymptotics (1 7.1 ) , (1 7.2) . However , mor e importan t tha n th e erro r ter m i s th e
range o f uniformit y fo r th e modulu s q in term s o f x. Th e Siegel-Walfis z Theore m
(Corollary 5.29 ) assert s tha t
A
(17.3) il>(x ]q,a) = - ^ - + 0(x(logx)- )
for an y q > 1 , (a , q) = 1 , x ^ 2 and A ^ 0 , the implie d constan t dependin g onl y o n
A. Notic e tha t thi s estimat e i s non-trivial onl y i f q <C (logx) A .
w
The Gran d Rieman n Hypothesi s fo r L(s,x) ^ n x( m
° d q) implie s
(17.4) ^(x ; g, a) = - £ - + 0(x ^ (lo g xf)
where th e implie d constan t i s absolute , an d thi s i s non-trivial fo r q <C x? (logx) - 2 .
It i s conjecture d b y H . Montgomer y tha t
e
(17.5) 4( x;q,a)=*+0(q-ixi+ )
where th e implie d constan t depend s onl y o n e. Thi s implie s tha t th e asymptoti c
formula (1 7.2 ) hold s tru e uniforml y i n q ^ x 1 ^6. I t wa s shown , however , b y Fried -
lander an d Granvill e ([FG] , [Gra]) , extendin g previou s wor k o f Maie r [Mai ] o n
primes i n shor t intervals , tha t (1 7.1 ) an d (1 7.2 ) canno t hol d fo r q ^ x(logx)~ A fo r
any A > 0.
The GR H ma y b e ou t o f reac h fo r severa l generation s o f researchers , bu t w e
have alread y a satisfactor y substitut e o f (1 7.4) , o r rathe r th e extensio n o f (1 7.3) ,
in th e followin g form .
419
42 0 17. P R I M E S I N A R I T H M E T I C P R O G R E S S I O N S
THEOREM 1 7. 1 (E . BOMBIERI , A . I . VINOGRADOV , 1 965) . We have
A
(17.6) 7 ma x ?/>(#;</ , a) -—\<^x(logx)~
^ ( a . ? ) = i l <p{q)\
for any A ^ 0 , where Q — x 2 ( l o g x ) _ B with B = B(A), the implied constant
depending on A.
Here ar e brie f comment s o n th e histor y o f th e problem . Al l treatment s us e i n
one wa y o r anothe r th e ide a o f larg e siev e du e t o Linni k [Lil] . First , A . Reny i
[Re] establishe d (1 7.6 ) wit h Q = x e~£ fo r som e 0 > 0, the n K . Rot h [Rot ] go t
this fo r 0 = | whil e M . B . Barba n [Bar ] succeede d wit h 9 = | (an d die d soo n
after). Finall y A . I . Vinogrado v [Vi ] reache d th e leve l 0 = | whil e E . Bombier i
[Bo5] workin g independentl y i n th e sam e tim e prove d a slightl y stronge r resul t a s
stated i n Theore m 1 7.1 . Bot h author s derive d thei r result s fro m ne w densit y the -
orems fo r zero s o f L-function s whic h the y establishe d first . Bu t Gallaghe r [Ga3 ]
and Motohash i [Mot2 ] simplifie d th e argumen t t o avoi d th e us e o f zeros , instea d
feeding th e mai n idea s directl y int o the estimatio n o f bilinear form s ove r arithmeti c
progressions. Thes e development s ar e relate d t o th e identitie s o f Vinogradov , Lin -
nik, Heath-Brow n o r Vaugha n typ e fo r sum s ove r prime s discusse d i n Chapte r 1 3 .
In thi s chapte r w e borro w fro m thes e innovation s a s muc h a s possibl e t o prov e
Theorem 1 7. 1 with B(A) = 2A + 6 by employin g minima l work .
EXERCISE 1 . Prov e th e asymptoti c formul a fo r th e Titchmars h diviso r prob -
lem:
V r(p - 1 ) - ex wit h c = ^ 7 ^ 3 ) = 1.943596 . . .
as x—)• +oo. Thi s formul a wa s prove d firs t b y Linni k [Li7] . [Hint : Us e Dirichlet' s
hyperbola metho d t o reduc e t o countin g prime s = 1 (mo d d) wit h d ^ y/x, the n
use Theore m 1 7. 1 an d th e Brun-Titchmars h inequalit y (6.95). ]
See [Kol ] fo r a fairl y natura l analogu e o f thi s proble m fo r ellipti c curves .
The Bombieri-Vinogrado v theore m i s expecte d t o b e tru e wit h Q = x x~£ (th e
Elliott-Halberstam conjecture ) whic h follow s fro m (1 7.5) , bu t no t fro m th e GRH .
There ar e som e unconditiona l result s o f typ e
(17.7) T\(q)U(x;q,a)-^-) « x(logz)-4
with Q — £ 7_ £ , wher e a ^ 0 i s fixe d an d X(q) i s an y well-factorabl e functio n i n
the sens e o f siev e theor y (se e [BFI]) . Her e e > 0 , A > 0 are arbitrar y an d th e
implied constan t depend s o n a, £, A. Thi s resul t contain s usefu l informatio n abou t
equidistribution o f prime s p ^ x in a fixed residu e clas s a (mod q) o f modulu s q
larger tha n x^ , i n whic h rang e th e GR H doe s no t apply .
One ca n handl e th e large r modul i q if th e averagin g ove r th e classe s a (mod q)
is permitted . Thus , P . Tura n [Tu2 ] derive d fro m th e GR H tha t
2
4
(17.8) Yl* ( ^ * ; g , a ) - - 7 - r ) <x(logx)
a ( m o d q)
17. PRIME S I N ARITHMETI C PROGRESSION S 421
where th e implie d constan t i s absolute. Thi s estimat e i s non-trivial fo r q a s larg e
as x(log:r)~ 5 . Introducin g th e additiona l averagin g ove r th e modulu s M . B . Bar -
ban [Bar ] an d H . Davenpor t an d H . Halbersta m [DH2 ] establishe d unconditiona l
results o f comparabl e strength .
THEOREM 1 7. 2 (BARBAN , DAVENPOR T AN D HALBERSTAM , 1 966) . We have
( 17 - 9 ) E I T (^;4,a)--^) 2 «x(io gx)-A
for any A > 0, where Q = x(logx) - j B with B = B(A), the implied constant de-
pending on A.
There ar e analogou s result s fo r th e Mobiu s functio n i n arithmeti c progressions .
EXERCISE 2 . Prov e tha t
(17.10) y ^ ma x ^ fi(m) <x (logx)~A
m=a(mod q)
for an y A > 0 , where Q = ^ ( l o g x ) - ^ wit h B — B(A), the implie d constan t
depending o n A.
17.2. Bilinea r form s i n arithmeti c progressions .
Given a reasonable arithmeti c functio n / i t is natural t o expect it s value s to
be equidistribute d ove r primitiv e residu e classes , i.e. , on e shoul d hav e a non-trivial
bound fo r
(17.11) D f(x;q,a)= £ f(n) - -L£ /(n )
n^x ^ ^ n^x
n=a(mod q) (n,g ) =l
for al l (a,q) = 1, provide d g is not extremel y large . I n many case s on e ca n sho w
that
(17.12) D f(x;q,a) « ( £ I / H I 2 ) " * * (lo g = r ) ^.
n^x
This i s non-trivial onl y fo r q ^ (log;r) A . Havin g (1 7.1 2 ) on e ca n prov e b y th e larg e
sieve inequalit y (7.31 ) a bound fo r Df(x; q, a) fo r som e typ e o f / whic h i s good fo r
almost al l q $J x^ (logx) _ j B . I t is surprisin g ho w littl e i s require d o f /. Essentiall y
/ need s t o b e represente d a s a convolution o f tw o sequences , sa y / = a * (3, one of
which, say /?, is equidistributed ove r primitive residue classes uniformly wit h respec t
to th e modul i q ^ (logx) A , o r mor e generally , / shoul d b e wel l approximate d b y a
linear combinatio n o f suc h convolutions .
We begi n ou r investigation s b y examining a sequence (3 — (/3 n) o f complex
numbers supporte d o n 1 ^ n ^ N. W e assum e tha t fo r som e 0 < A ^ 1 it satisfies
9
(17.13) \D e(N;q,a)\^m\N^A
for al l (a,q) — 1, wher e
=c*>
42 2 17. P R I M E S I N A R I T H M E T I C P R O G R E S S I O N S
First w e show tha t (1 7.1 3 ) implie s th e following boun d fo r character sums .
LEMMA 1 7.3 . For a non-principal character \ (mo d r) and a positive integer
s we have
(17.14) | J2 Mn)\ < ||/3||iV^A3rT(S).
(n,s) = l
P R O O F . B y Mobius' inversio n w e chang e th e condition (n , s) = 1 and then we
split th e sum as follows:
(n,s) = l fc|s n=0(mo d fc)
= £Mfc)£/*w E ^x(n ) + EMfc) E &»*(") •
fe|s £|/ c (n,£ ) = l fc|s n=0(mo d fc)
Next w e estimate th e sum of ^nx{n) o v e r ( n> ^) — 1 by splitting int o classe s modul o
£r, an d for each clas s w e apply th e hypothesis (1 7.1 3) . Thi s give s u s (the leading
terms cance l ou t because \ ls non-principal )
||/3||A^A 9 £ £ | / i ( * M * r ) < \\(3\\N^ 9
Kv(r)T(s).
k\s £\k
ove rn
Then w e estimate th e sum o f finX( n) = 0(mod k) usin g Cauchy' s inequality .
This give s us
\\/3\\Ni Y, k ~"2 ^ MN'K-iT(s).
k\s
k>K
Adding bot h estimate s w e obtain (1 7.1 4 ) b y taking K = A- 6 . •
Now we proceed t o the main resul t
T H E O R E M 1 7.4 . Suppose ft = (/3 n) is a sequence supported on 1 ^ n ^ N
which satisfies (1 7.1 3). Let a — (am) be any sequence supported on 1 ^ m ^ M.
Then we have
(17.15)
Y ] ma x \D a*p(MN;q,a)\
« ||a| | ||/3||(AM5iV5 + Mi + Nl* +Q)(\ogQ) 2
where the implied constant is absolute.
PROOF. Usin g Dirichle t character s w e writ e
DMf}{MN;q,a) = -— £ x(a)(£amX(m))(£/3„x(n)) .
^ ' x ( m o d q) m n
x/Xo
Hence, reducin g t o primitive characters , th e left sid e o f (17.15) i s bounded by
£ i£ ^r) S * I £ «rnX(™)\\ J2 ^X(n)[
v v
s^Q ^ ' Kr^Q ^ ' X ( m o d r ) ( m , s ) = l (n,s )=l
17. PRIME S I N ARITHMETI C PROGRESSION S 423
If r i s small , sa y r < R, w e appl y (1 7.1 4 ) gettin g
||/3||tfiA3 E 4 4 E r
« H/5||A^iA 3^2(logQ)2.
5^Q ^ ^ r<C H
For th e remainin g r > R w e spli t th e rang e int o dyadi c segment s P < r ^ 2P an d
for eac h o f such reduce d sum s w e apply th e larg e siev e inequalit y (7.31 ) gettin g
1
I N ||/3| | E ^ ( ^ 2 + M)^(P 2 + A0ilog Q
R<P<Q
1
< H | ||/3||( Q + M i + JV * + M ^ / r )(logQ)2.
Adding bot h result s fo r i ? = A - 1 w e get (1 7.1 5) . •
17.3. Proo f o f th e Bombieri-Vinogrado v Theorem .
Suppose x s < n ^ x. B y (1 3.39 ) wit h y = z = x$ w e writ e A(n ) = A#(n ) - f
b
A (n), wher e
A«(n) = 5 2 A(%(m) ,
I
m^x 5
Ab(n)= V J A(%(m) .
im—n
14
a: 5 <m^ix 5
Here A(£ ) i s th e incomplet e logarith m
A
(17.16) X(e) = logl- E (*0-
I
Accordingly w e spli t
(17.17) D A(x;q,a) = D At(x;q,a) + D Ai>(x;q1 a) + 0 ( x ^ lo g a;)
where th e erro r ter m come s fro m th e contributio n o f term s n < x$ fo r whic h th e
above identitie s d o no t apply .
If / i s continuou s an d monotoni c o n [l,y] , the n b y elementar y argument s w e
derive
\Df(y;q,a)\^2\f(l)\+2\f(y)\.
Applying this fo r f(£) = log £ an d f(£) = 1 we obtain D\(y; q, a) < C xs logx . Henc e
DA$ (x ; q,a) « x s lo g x an d summin g ove r g ^ Q w e ge t
ma x
(17.18) E |^A«(x;g,a) | < Qx ^ logx .
We shal l estimat e D Ab(x;q,a) b y a n appea l t o Theore m 1 7.4. T o thi s en d w e
need A b(n) t o b e i n a for m o f convolutio n o f tw o sequences , an d w e almos t hav e i t
except tha t th e restriction n = £m ^ x make s £ and m interconnected . T o relax thi s
and t o kee p hol d o f th e size s o f Im, w e ar e goin g t o divid e th e interva l 1 < n ^ x
1424 7 . PRIME S I N ARITHMETI C PROGRESSION S
into shor t subinterval s o f type y < n ^ ( 1 -f 6)y wit h x~~s < 6 ^ 1 . Ther e are
0(5~1) suc h subintervals . W e cove r A b(n) b y partial sum s o f type
(17.19) £ £ A(<)/i(m )
£rn=n
L<^(1+<5)L
Af<ra^(l+<5)Af
with L , M taking value s ( 1 + <S)J in the rang e x 5 < L,M < x*, LM < x, except for
n < xs an d (1 + J ) _ 1 x < n < ( 1 + <5)rr in which range s th e coverin g i s not exactl y
with multiplicit y one . Estimatin g th e contribution i n these exces s area s triviall y
we obtai n
(17.20) D A, (x; q,a) = ^2^2 D (LM'i ^ a
) + 0(6q~ lx lo g x)
M
where D(LM\q,a) stand s for
1
L L A(%(m)- — J2H Wtirn). W
L<£^(1+<S)L ^ L<l^{l+8)L
M<m<(l+(5)M M<m^(l+(5) M
£ra=a(mod g ) (^ra,<? ) =l
For eac h o f D(LM;q,a) w e ca n appl y Theore m 1 7. 4 wit h A = (logx)~ A becaus e
the sequenc e /i(m ) satisfie s th e hypothesis (1 7.1 2 ) b y the Siegel-Walfisz Theore m
(also th e sequenc e X(£) satisfies thi s hypothesis) . Thi s give s
(17.21) 2 ^ ma x |D(LM;g,a) | <(5Ax(logx )
, (^<?) = ! '
2
for Q — Ax 2. Summin g ove r L , M (ther e ar e 0(6 ) such segments ) w e obtain b y
(17.20) an d (17.21) tha t
1
(17.22) ^T m&x\D A>(x;q,a)\<z:(6- A+ 5)x(\ogx) 3.
^Aa,q) = l
We choos e 6 — A 2 gettin g th e boun d A2x(logx) 3 . Addin g (1 7.22 ) t o (17.18) we
obtain
(17.23) V ma x \i/>(x;q,a) - ^-fi~\ < A2x(logx) 3 .
(a,q) = l <p(q)
Here w e ca n replac e ip(x) b y x + O(Ax) b y the Prim e Numbe r Theorem . Havin g
done this , Theore m 1 7. 1 follows wit h
(17.24) B(A) = 2, 4 + 6.
17.4. Proo f o f the Barban-Davenport-Halbersta m Theorem .
We shal l establis h Theore m 1 7. 2 for arithmetic function s / mor e genera l tha n
A. I t does no t requir e / t o be a convolution typ e a s in Theorem 1 7.4 , but onl y tha t
/ i s well distribute d ove r residu e classe s t o quite smal l moduli .
17. P R I M E S I N A R I T H M E T I C P R O G R E S S I O N S 425
THEOREM 1 7.5 . Suppose that for some 0 < A ^ 1 we have
9
(17.25) \D f{x-g,a)|< ||/MA
for all (a,q) = 1. Then
2
(17.26) Y, E * l^/(^9^)| « | | / | | 2 ( A x + Q)(logQ) 2
q^Q a ( m o d q)
where the implied constant is absolute.
PROOF. B y the orthogonality o f characters
E * l^/(*;<7,a) l = ^y E |E/(")X(») . m
o ( m o d (? ) X ( ° d <? ) n ^x
XT^XO
Averaging ove r q, then reducin g t o the primitive characters , w e see that th e lef t
side o f (17.26) i s bounded by
E ^ j E ^r) E* | E /(»)x(n)f .
VJ Vy
s^Q ^ l<r<C Q ^ x ( m o d r ) n<C x
(n,s) = l
At thi s poin t w e apply th e same argument s a s in the proof o f Theorem 1 7. 4 with
a — ^ — f gettin g th e bound
| 2 { x A 3 # 2 + Q + xH xR- l
}(\ogQ)2.
Choosing R = A * we arrive a t (1 7.26) . •
COROLLARY 1 7.6 . Let the conditions be as in Theorem 1 7.4- Let ab ^ 0 . Then
we have
(17.27)
aml3n a
E | E E
<7^Q a r a = 6 n ( m o d q )
-~rtn){ E -)( E)&>
(m,q) = l (n,g =l
(g,ab) = l ( m n , g )=l
« ||a| | ||/3||(M + Q)5(Ai V + Q)i(logQ) 2 .
PROOF. Th e difference o f the sum s i n (1 7.27 ) i s also give n by
Y* E * D a(M;q,u)Dp{N;q,v).
au=bv(mod q)
Hence th e resul t follow s b y Cauchy' s inequalit y fro m Theore m 1 7. 5 applied fo r
/ = a wit h A = 1 and for / = / ? with A . •
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/19
CHAPTER 1 8
THE LEAS T PRIM E I N A N
ARITHMETIC PROGRESSIO N
18.1. Introduction .
After havin g unifor m distributio n o f prime s i n primitiv e residu e classe s t o a
given modulu s q the natura l questio n t o as k i s how bi g i s the firs t prim e i n a give n
class? Denot e
(18.1) p(q, a) = min{p : p = a(mo d q)}.
Clearly p(q, a) canno t b e smalle r tha n ( 1 + o(l))ip(q) logq fo r som e a(mo d q). I t i s
conjectured tha t
1 +e
(18.2) p(q,a)<£q
while th e Rieman n Hypothesi s fo r Dirichle t L- functions o f character s x(mo d q)
implies
(18.3) p{q,a)<^^{q)\ogq)\
Unconditionally, i t i s easy t o deriv e fro m th e Siegel-Walfis z Theore m (se e Corollar y
5.29) tha t
p(q,a) <exp(<f )
for an y e > 0 , th e implie d constan t dependin g o n e (ineffectively) . Moreover , i f
there i s no exceptional characte r modul o g , then i t follow s fro m th e Prim e Numbe r
Theorem tha t
p(<7,o)<«clo«
where c i s a positiv e constant . Th e celebrate d theore m o f Yu . V . Linni k [Li4] ,
[Li5], [Li6 ] o f 1 94 4 assert s
THEOREM 1 8. 1 (LINNIK) . There exist absolute constants c ^ 1 and L ^ 2
such that
L
(18.4) p(q,a)^cq .
This beautifu l theore m i s on e o f th e greates t achievement s i n analyti c numbe r
theory. Originall y Linni k di d no t giv e a numerica l valu e o f L , thoug h hi s metho d
was effective , i.e. , bot h constant s c an d L coul d hav e bee n compute d give n th e
time an d will . Her e i s a selecte d lis t o f th e Linni k constan t produce d b y variou s
researchers
L = 1 0,00 0 Pa n (1 957) , L = 2 0 Graha m (1 981 ) ,
L = 1 1 1 Che n (1 965) , L = 1 6 Wan g (1 986) ,
L = 8 0 Jutil a (1 977) , L = 5. 5 Heath-Brow n (1 992) .
427
42 8 18. TH E LEAST PRIM E I N A N ARITHMETIC PROGRESSIO N
In thi s chapte r w e prove Linnik' s theore m withou t determinin g th e constant L.
Our treatment s borro w substantiall y fro m th e magnificent wor k of S. Graham [Gl],
[G2]. Fo r readers intereste d i n learnin g othe r method s (th e Turan power-sums) ,
we recommen d Chapte r 6 of E. Bombieri [Bo2] . Th e sharpest tool s ar e developed
in Heath-Brow n [HB4] .
All proof s o f Linnik's theore m us e in one form o r anothe r th e following thre e
principles abou t th e zeros o f the Dirichlet L-functions . Throughou t w e denote
(18.5) L q(s)= f jL ( S >X)-
x(mod q)
w m
A zer o o f L(s , x) b e denote d b y p = (3 + ry , o r b y p x = /3 X + i"f x i f the
dependence o n th e characte r nee d b e displayed . Fo r \ < a ^ 1 an d T ^ 1
we denot e b y N(a, T , \) th e number o f zero s p x counte d wit h multiplicit y i n the
rectangle
(18.6) a < a ^ 1 , \t\^T.
Thus
N
(18.7) N q(a,T)= Y, (<*>T>X)
x(mod q)
is the number o f all zeros of Lq(s) counte d wit h multiplicit y i n the rectangle (1 8.6) .
PRINCIPLE 1 (TH E ZERO-FREE REGION) . There is a positive constant c\ (ef-
fectively computable) such that L q(s) has at most one zero in the region
(18.8) a^ 1 - c i / l o g g T , \t\^T.
The exceptional zero, if it exists, is real and simple and it is for a real, non-principal
character.
PRINCIPLE 2 (TH E LOG-FRE E ZERO-DENSIT Y ESTIMATE) . There are positive
constants c\, C2 (effectively computable) such that for any \ ^ a ^ 1 and T ^ 1 ,
C 21
(
(18.9) N q(a,T) ^c(gT) -a).
PRINCIPLE 3 (TH E EXCEPTIONA L ZER O REPULSION) . There is a positive con-
stant C 3 (effectively computable) such that, if the exceptional zero fii exists, say
L{P1,Xi) = 0with
(18.10) l - c i / l o g g T ^ / ? i < l ,
then the function L q(s) has no other zeros in the region
(18.11) v>l-c 3 j ^, \t\^T.
The firs t principl e i s classical (du e to Landau) , an d is proved i n Sectio n 5.9.
The secon d principl e i s due to Linnik [Li6 ] and the third principl e i s a quantitativ e
version o f the Deuring-Heilbronn phenomeno n (als o due to Linnik [Li5]) . Bombier i
[Bo2] establishe d a somewha t stronge r versio n o f Principl e 3 . H e showed tha t i f
there i s a n exceptiona l zero , the n Principl e 2 can be mad e stronge r a s well . W e
extract fro m hi s result th e followin g
18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N 429
PROPOSITION 1 8. 2 (BOMBIERI) . There is a positive constant c 2 (effectively
computable) such that, if the zero fi\ satisfying (1 8.1 0) exists, then
(18.12) N q(a,T) « ( 1 -/JxHloggTXgT)^ 1 -")
for any | ^ a ^ 1 , where the implied constant is absolute (and effectively com-
putable).
EXERCISE 1 . Sho w tha t Propositio n 1 8. 2 implie s Principl e 3 .
EXERCISE 2 . Deriv e fro m Propositio n 1 8. 2 th e Siege l boun d (5.73 ) fo r rea l
zeros o f Dirichle t L-functions .
The numerica l value s o f the constant s ci , c2, c% an d c yield th e Linni k constan t
L, bu t fro m ou r estimate s thi s wil l b e quit e large . Assumin g (a s w e can ) tha t q
is sufficientl y larg e an d T ^ log g th e abov e principle s ar e know n t o hol d fo r th e
constants c\ = y^ , c2 = 3 , c$ = \.
18.2. Th e log-fre e zero-densit y theorem .
Recall th e Huxle y densit y estimat e (se e Theore m 1 0.4 )
(18.13) N q(a,T) « (qT)^'^ (logqT)A
where A i s a n absolut e constant . Therefore , (1 8.9 ) i s ne w onl y fo r zero s nea r th e
line R e s = 1 , namely , fo r a wit h
1
-
(18.14) l - a < AC log£
where
(18.15) C = \ogqT.
In thi s sectio n w e ar e goin g t o prov e Principl e 2 wit h th e constan t c 2 — 47,
however, th e metho d i s capabl e o f givin g a muc h smalle r constan t b y direc t modi -
fications.
Note tha t fo r th e principa l characte r N(a, T , Xo) equals th e numbe r o f zero s of
the Rieman n zet a functio n i n th e rectangl e (1 8.6) . Thu s b y th e Vinogrado v zero -
free regio n (Corollar y 8.28 ) an d th e Huxle y densit y estimat e (se e Chapte r 1 0 ) i t
follows tha t
13
(18.16) i V ( a , T , x o ) « T ( -).
This coul d als o b e establishe d withou t appealin g t o th e zero-fre e regio n alon g th e
lines o f this section , bu t w e exclude th e principa l characte r fro m consideratio n her e
to simplif y ou r exposition .
The mai n ide a behin d th e proo f o f Principl e 2 i s th e sam e a s fo r th e densit y
estimates i n Chapte r 1 0 , i.e. , w e construc t a Dirichle t polynomia l whic h serve s
as a zero-detecto r becaus e i t assume s enormou s value s a t th e zero s o f L(,s,x) -
Then, usin g th e dualit y ide a followe d b y estimatio n o f resulting characte r sums , w e
obtain th e desire d boun d fo r N q(a,T). I f w e ha d applie d thi s directl y t o L(s,x )
then a los s o f logarithmi c factor s woul d occur , whic h i s no t acceptable . Therefor e
some refinement s ar e necessary . W e eliminat e th e logarithmi c factor s b y reducin g
the coefficient s o f L(s,x) wit h a devic e simila r t o tha t whic h gav e u s a positiv e
43 0 18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N
proportion o f zero s o f ((s) o n th e critica l lin e (se e Chapte r 24) . Ther e ar e othe r
methods fo r establishin g Principl e 2 , suc h a s th e Powe r Su m Metho d o f P . Tura n
(used b y Bombieri , Jutila , Montgomery) .
We introduc e tw o kind s o f mollification s t o L(s , x). Pu t
OO
(i8.i7) ^ ) = E ( EA ^ ) ( E ^ ) n " s
1 d\n b\n
where A ^ i s a continuousl y cut-of f Mobiu s functio n an d 0^ is a kin d o f coefficien t
used i n siev e methods . Thoug h A ^ an d 0^ appear t o hav e simila r structure s the y
play her e distinc t roles . Th e firs t on e i s applied t o annihilat e th e earl y term s o f th e
series (excep t fo r n = 1 ) whil e th e secon d on e i s applie d t o sif t ou t (o r rathe r t o
diminish th e contribution ) o f th e term s whic h hav e smal l prim e factors . Precisel y
we choos e
(18.18) A d = /i(d ) minfl, i^SfZ l
for 1 < d ^ z wher e 1 < w < z, an d w e se t A ^ = 0 if d > z. The n w e choos e
(18.19) ^ =* E ^ t
(a,bq) = l
where G i s the normalizatio n facto r suc h tha t 0\ — 1, i.e. ,
/i2(a)
(18.20) G =Y . . .
V
a<:v ^ '
(a,9) = l
The coefficient s Ot, ar e comin g fro m th e A 2-sieve an d hav e th e followin g propertie s
(see Sectio n 6.5) :
(18.21) |0 6| <1
Ol,C>2
(18.23) G^ —log y
q
(in fac t on e coul d choos e Ob fro m essentiall y an y upper-boun d siev e o f leve l y) .
Since A ^ agrees wit h fi(d) fo r 1 < d ^ w i t follow s b y Mobiu s inversio n tha t
v—-v (I i fn = 1 ,
18.24 Y\ d=\
V}
4 ^I 0i f 1< n ^ w.
d\n
For an y n > w w e have th e trivia l boun d
Xd r(n)
(18.25) \52 \ ^ '
d\n
18. T H E LEAS T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N 4 3 1
however, i t wa s show n b y S . Graha m [Gl ] tha t
(18.26) E ( E A d ) % ^ . (1 +0(^)).
b to
w<n^x d\n ' <
Since w e d o no t see k th e bes t constant s w e mak e th e comfortabl e choic e o f th e
levels y,w,z, namel y
2 7
(18.27) V=(qT) , w=(qT) ,z = (qT) s.
For an y x(mo d q) we hav e th e twiste d serie s
oo
(18.28) ^(*,x ) = E ( EA " ) ( E ^ ) x ( n ) n " i -
1 d\n b\n
This factor s int o
(18.29) K(s,x) = L(s,x)M(s, X)
where
(18.30) M ( « ) = E ( E EA A ) ^ m ) ™ " s .
m [b,d]=m
Actually w e shal l tak e onl y a partia l su m o f K(s, x) 5
(18.31) K x(s,X)= J2 ( E A ^ ) ( E ^ ) x ( n ) n " S
l^n^x d\n b\n
with
23
(18.32) x = (qT) .
Notice tha t m = [6 , d] ^ bd ^2/2 and fo r X 7^ X o
s
(18.33) \Y^x(n)n- \^2q\s\(~y.
Hence, fo r s = a + it i n th e rectangl e (1 8.6 ) th e tai l o f K(s, x) satisfie s
(18.34) \K(s, X) ~ K x(s,x)\ < 2q\s\yzx^ < AqTyzx~ a < 1.
For s = p, a zer o o f L(s , x), w e hav e K(p, y) — 0 by (1 8.29) , s o (1 8.34 ) become s
(18.35) \K x{p,X)\^\-
Extracting fro m K x(p, x) th e firs t term , thi s inequalit y become s th e zer o detecto r
(18.36) | ]T Q T A d)(5»X(n)n- >3-
w<n^x d\n b\n
For th e trivia l characte r X — Xo th e detecto r o f zero s i s somewha t differen t
from (1 8.36) , s o w e shal l coun t thes e zero s separately . Le t <S(x ) denot e th e se t o f
zeros o f L(s,x) (counte d wit h multiplicity ) i n th e rectangl e (1 8.6) . B y (1 8.36 ) w e
deduce tha t th e tota l numbe r
(18.37) R= "£, \ S(X)\= £ j V ( a , T , X )
XT^X o x / x o
432 18. TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N
satisfies
*<2E E | E (E^)(E^KH ^
X s£S(x) w<n^.x d\n b\n
2
E |(E^)(E^)I| E E c*(*)x(n)r
w < n ^ x d\n b\n X s€<S(x)
for som e number s c x(s) wit h |c x (s)| = 1 . B y Cauchy' s inequalit y w e obtai n
R2 ^ AUV
where
, 2
(18.38) U= ) n ™ '~ -2a
Y, ( E \ *d)
w<n^x d\n
and
(18.39) ^ = E / (n ) ( E ^ ) 2 n 2 Q _ 1
|EE c
x(s)x{n)n
n b\n X seS(x)
Here / i s an y non-negativ e functio n suc h tha t f(n) ^ 1 for w < n ^ x.
Using (1 8.26 ) on e derive s b y partia l summatio n tha t fo r an y \ ^ a ^ 1 ,
(18.40)
log z/w I \ log z/w/ J
To estimat e V w e square ou t an d chang e th e orde r o f summatio n gettin g
(18.41) y ^ ^ ^ ^ | £ ( X l ^ 2 j S l+ s 2+ i_ 2a )|
Xl X 2S iS 2
where
B(X,s)=Y,f(n)(Y/^)2x(n)n-s
n b\n
for x — X1X2 and s = si - f 5 2 + 1 — 2a. Not e tha t R e (s) ^ 1 . Tak e an y / supporte d
on [w/v, xv] whic h i s continuous, bounde d an d piecewis e monotonic . The n w e have
E /(*,)(*)--sa +0(|.|i)
n=a(mod q )
where
(18.42) F(s) = J f(QC"dZ
and th e implie d constan t i s absolute . Henc e w e deriv e tha t fo r an y (d,q) — 1 ,
E f(n)x(n)n-° = 5(x)^F(s)+0(\s\q^)
n=0(mod d)
a nd
where S(xo) = 1 <Kx ) = 0 otherwise. Thi s yield s b y (1 8.28 ) an d (1 8.29 )
(18.43) S (XjS)= 5 ( x )r^__^ + 0 | s(gr_y.
qG \ w
18. T H E L E A S T P R I M E I N AN A R I T H M E T IC P R O G R E S S I O N 43 3
Introducing (1 8.43 ) int o (1 8.41 ) w e ge t b y (1 8.30 )
(18.44) V^Yl J2J2 \F(si + s2 + l-2a)\(logyr1 + 0(R 2 1 2
Tqvw- y)
One ca n find / suc h tha t fo r R e (s) ^ 1,
(18.45) F(s) < ( l + |5-l|logv)- 2logx.
For example , w e ma y tak e
^}
logw/f1 _ logg/ .
/(()=min 1 - ^ ,1 , 1
logv ' ' lo g
for ly/i ; ^ £ $J xv and /(£ ) = 0 otherwise. Indeed , writin g
/ ( 0 lo g v = l o g + ( W 0 - l o g + ( ^ / 0 - \og+(w/0 + \og +
(w/v0
one verifie s b y th e formul a
Jo
that
Fd - s)=/(S) = r / ( o * - 1 * = K - z , a / + w
Jo 5 lo g ^
(t/s - l)(x s — w 2v~s) . 1 / \
=2 l <mi n logx , —y -z .
s^logi; V \s\ \ogvJ
This give s (1 8.45) .
For s = s i + s 2 + l - 2a w e have j s - l j = j/^ i + /32 - 2 a + i(7i -721 ) ^ |7 i ~1'2\-
By (1 8.44 ) an d (1 8.45 ) w e ge t
(18.46) y « ^ V VV ( l + |7 i - 7 2 | l o g ^ ) - 2 + ^ V ^ 1 ? / 2 -
log y ^ rrf v
LEMMA 1 8.3 . Let \ (mo d g ) 6 e a non-trivial character, ^ ^ a ^ l , i > ^ 2 an d
t a real number. Then
1
(18.47) Yl t +1 7 "* l log^)" 2 < ^ (l - a + ~) lo g A ^ ( | t| + 1)
where A is an absolute constant.
PROOF. W e firs t suppos e tha t x * s primitive. W e hav e
where th e summatio n range s ove r al l zeros o f L(s,x) wit h /3 > 0 and B(x) i s a
constant wit h
Re£(x) = - ^ R e -
p
43 4 18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N
(see (5.29)) . Henc e i t follow s tha t fo r 1 < R e s ^ 2 ,
(18.49) V / R e— = \ log(gM ) + R e ^-(s, x ) + 0(1 ) .
-^i s — p2 L
p
For s — a + it wit h a > 1 we hav e
2
Rp _J _ = (^-/3 )> i / , , h-*h-
+
s-p (a - /3) 2 + ( 7 - ty ' (a - /3 ) V a - :
-( S ,x)| < EA ( n ) n_CT = ~\^ = v^i + ° ( 1 ) -
Hence (1 8.49 ) yield s (dro p th e zero s wit h f3 < a)
^| 7 -i|x-2
E ( 1 + 73r)"2 ^( CT - a )(5iog«(i*i + i) + ^T + o(i)).
Choosing c r = 1 + 1 /log v w e obtain (1 8.47) . I f x(mo d q) is induce d b y a primitiv e
character x*(mo d q*) with q* \ q, then th e lef t sid e o f (1 8.47 ) agree s wit h tha t fo r
X*, so th e resul t hold s fo r an y \ 7 ^ Xo -
Notice tha t 1 + ( 1 - a) lo g v ^ v 1 '*. Therefor e Lemm a 1 8. 3 an d (1 8.46 ) yiel d
(18.50) V « Rv ^^^pL + R2Tqly2 << c Rv l-a
log y lo g v w
by th e trivia l boun d R < C qT log gT an d b y choosin g
(18.51) v = qT.
Multiplying (1 8.40 ) an d (1 8.53 ) w e obtain b y (1 8.38 ) tha t
(18.52) R < (x 2vy-a = {qT) 47
^-a\
Hence w e ge t (1 8.9 ) b y addin g th e numbe r o f zero s o f L(s,xo ) whic h i s negligibl e
by virtu e o f (1 8.1 6) . •
18.3. Th e exceptiona l zer o repulsion .
In this section we are going to prove Principle 3 , however, ou r metho d i s capable
of givin g (1 8.1 3) . I t begin s b y a suitabl e modificatio n o f th e argument s whic h w e
used fo r th e proo f o f Principl e 2 in th e previou s section .
Suppose xi(mo d q) i s the exceptiona l characte r an d /3 i i s the exceptiona l zer o
of L(s , xi) whic h satisfie s
(18.53) 5 1 = 1 - ft ^ ci(loggT ) - 1
Throughout w e assume (a s w e can) tha t c\ i s a sufficientl y small , absolut e positiv e
constant. Ou r goa l i s to prov e ther e exist s a n absolut e constan t C o ^ 2c\ suc h tha t
the functio n L q(s) ha s n o zer o othe r tha n (3i i n th e regio n
(18.54) a ^ i _ ________ _ | t |<r.
Clearly thi s resul t implie s Principl e 3 .
18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N 435
To tak e ful l advantag e o f th e repulsio n propert y o f /3 i w e appl y th e metho d
of zero-detector s fro m th e previou s sectio n t o th e functio n £(s)L( s + 5i , Xi) rathe r
than t o C(s). W e hav e
oo
s
(18.55) C(s)L(s + 5i , xi) = £ p(n)n-
l
where p(n) i s a positive , multiplicativ e functio n give n b y
Sl
(18.56) p(n)=J2xi(a)a- .
a\n
REMARK. On e coul d wor k a s wel l wit h th e produc t
oo
T n
(18.57) a*)L(s,Xi)=J2 ( >^n~S
1
where
(18.58) r ( n , x i ) = 5 > i ( a )
a\n
but th e sligh t shif t i n L(s + <5i , Xi) turn s ou t t o simplif y som e technica l arguments .
As befor e w e introduce tw o kind s o f mollification s t o ((s)L(s -f - <Si , xi) gettin g
oo
f
(18.59) K( S) = Y, (ZXd){J2e<>)p(n>~S
1 d\n b\n
where A ^ are define d b y (1 8.1 8 ) an d 0^ are define d b y (1 8.1 9 ) wit h th e parameter s
y, w, z give n by (1 8.27) . The n fo r an y non-principal characte r x(mo d q) we consider
the twiste d serie s
oo
(18.60) K(s, x) = £ ( E Xd ) ( E 0b)p(n)x(n)n- s
.
1 d\n b\n
This factor s int o
(18.61) K(s, x ) = L(s, X )L(s + 5 uXXi)M(s', x )
where
(18.62) M(s, X) = Y,{J2Y, X^) T\(P(P)-X{p)p- s 2h
- Mrn)m-s.
m [b,d]=m p\m
a s
To se e thi s w e arrang e K(s,x) follows :
s
K(s> X) = J2{1 2Y, ^dO b)x(m)m- ^ p(mn)x{n)n-* .
m [6,d]= mn
436 18. TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N
Opening p(mn) b y (1 8.56 ) w e se e tha t th e innermos t su m i s
a n=0(mo d a/(a,m)) a '
s 6
= L(s, X)J2xi(c)c-^ £ XiX(a)a- ->
c\m (a,c ) =l
= L(s , x)L(s + ^ xxi ) £ X i (c)c"* I T 1 - X1 X(P))P" 8-41
c]ra p| c
and thi s yield s (1 8.61 ) an d (1 8.62 ) becaus e m i s squarefree . B y (1 8.61 ) i t follow s
that K(s,x) i s holomorphi c i n th e whol e comple x plan e (i f x — Xi? the n th e pol e
of L(s + #i , Xo) a t s = 1 — Si = fii cancel s wit h th e zer o o f L(s, Xi)) -
As before w e take onl y a partia l su m o f K(s, x) ,
s
(18.63) K x(s,X)= £ ( £ > ) ( £ > ) P(")x(")«~
l^.n^x d\n b\n
with x give n b y (1 8.32) , an d w e show by contour integratio n tha t th e tai l o f K(s, x)
satisfies
(18.64) \K(s,x)-K x(s,x)\^%
for 5 = a + it wit h < J ^ | an d |£ | $ J T. The n fo r s = p , a zer o o f L(s , x) whic h i s
different fro m /3 i i f % = xi ? w e have K(p , x) — 0 by (1 8.61 ) , therefore (1 8.64 ) yield s
(18.65) | J2 (£^)(Efc)/>Mx(")«~i H
if p = / J + i 7 wit h / ? ^ | an d |7 J < T .
The inequalit y (1 8.65 ) i s viewe d a s th e zero-detector . Thi s wouldn' t b e usefu l
if no t fo r th e fac t tha t p(n) i s ver y smal l quit e frequently . On e canno t exploi t
a possibl e cancellatio n o f term s becaus e o f th e complexit y o f th e coefficient s A^ .
Remember tha t i t i s du e t o th e introductio n o f th e factor s
(18.66) cj{n) = Y, Xd
d\n
that th e earl y term s wit h 1 < n ^ w ar e annihilated . Now , ou r strateg y wil l b e t o
remove these factor s (b y applying Holder' s inequality ) t o create a sum o f v{n) 2p(n),
where
(18.67) is( n) = Y2°b
b\n
over n wit h w < n $ J z. Whe n applyin g Holder' s inequalit y on e coul d als o remov e
the facto r ^(n) 2 , bu t i t woul d resul t i n a los s o f log g whic h i s no t acceptable . Fo r
this reaso n w e retain v{n) 2p(n). Still , th e su m o f u(n) 2p(n) i s manageable becaus e
the suppor t o f #5 is relatively smal l b y comparison t o th e rang e o f n. I n thi s wa y b y
taking advantag e tha t p(n) nearl y vanishe s quit e ofte n (dependin g o n ho w clos e t o
one is the exceptiona l zero ) w e derive fro m th e inequalit y (1 8.65 ) tha t /3 i s not clos e
to one , precisel y thi s inequalit y prohibit s p — /3 + ij t o b e i n th e regio n (1 8.54) .
18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N 43 7
Having outline d th e strateg y w e proceed t o details . B y (1 8.65 ) w e ge t
£ \oj{n) V{n)\p{n)n-P > \.
w<n<^x
Hence applyin g Holder' s inequalit y
2
(18.68) 6U VW ^ 1
where
2
V= J2 v (n)p3(n)n-\
w<n^x
2 1
W= Y, v (n)p{n)n- .
w<n^.x
For U w e hav e b y (1 8.40 )
(18.69) U^x^-n.
In V w e estimat e p 3(n) b y r 3(n) an d appl y a siev e o f dimensio n 8 (se e th e Funda -
mental Lemm a 6.3 ) gettin g
(18.70) V « fe^) « 1 .
We shal l sho w tha t
(18.71) W < £ i l o g z .
Multiplying thes e estimate s w e derive fro m (1 8.68 ) tha t
1 4{
(18.72) S 1 {logx)x ~^ > 23c 0,
where C Q is a positiv e absolut e constant . Assumin g (1 8.53 ) hold s wit h 2c\ ^ C o we
conclude b y (1 8.72 ) wit h x = {qT) 23 tha t p = 3 + 2 7 is not i n th e regio n (1 8.54 ) a s
claimed.
We no w procee d t o th e proo f o f (1 8.71 ) , i t i s her e wher e th e exceptiona l zer o
plays it s role . Ou r treatmen t i s essentiall y elementary , ye t som e argument s ar e
quite subtl e (w e borro w thes e fro m th e wor k o f Bombier i [Bo2]) . W e begi n b y a n
asymptotic evaluatio n o f W. T o thi s en d w e examin e th e generatin g zet a functio n
00
2
(18.73) W(s) = J2 v {n)p{n)n-s.
1
This i s just (1 8.60 ) wit h \ d = 0 d an d \ = Xo, so b y (1 8.61 ) an d (1 8.62 )
W(s)=L(s,xo)L(s + 6 1 ,Xi)M(s)
where
M(s)= J2 °ml[( P(p) - P-s-2Sl)m~*
(m,g) = l p\m
1438 8 . TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N
and
[b1,b2]=m
Hence W(s) i s holomorphic everywher e excep t fo r a simpl e pol e a t s = 1 with
res W(s) = ^ L ( l + 5 U Xi)M(l) .
s=i q
By contou r integratio n w e derive tha t
(18.74) W = ^ M ( l ) L ( l + <Ji , xi) lo g - + O (-) .
Now we need estimate s fo r M(l ) an d L( l + 5i, Xi)- Fro m siev e theory (se e Chapte r
6) w e ge t
1
(18.75) M ( l ) =£ ^n^P)-^" "1
2
' ) " 21
"
(ra,g) = l p|r a
«no-T)-*n('-T)-
w e nrs
To estimat e L( l + 5i,xi ) ^ examin e
1
(18.76) P(x,y)= Y, ( + Xi(p))p~ 1 -
LEMMA 1 8.4 . For x > y ^ q 2 and q sufficiently large we have
(18.77) P(x,y) ^4(Jilogx .
P R O O F . Conside r
where r(n, Xi ) i s given b y (1 8.58) , s o it i s a multiplicative functio n wit h r(p , xi) =
1 -f - Xiip)- Multiplyin g P{x,y) an d S(y) w e deduc e fro m th e non-negativit y o f
r(n, xi ) tha t
P(x,y)S(y)^S(xy)-S(y).
Then b y th e elementar y formul a (se e (22.1 1 ) )
S(x) = L(l , Xi)(logx + 7 ) + L'(l , xi) + 0(qix~* logs )
we ge t
S(xy)-S(x) = L(l,xi)logx + 0(qiy~hogy) ^ 2L(1 , Xi) logx
18. TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N 43 9
because L ( l , x i ) ^ > q~? an d y ^ q 2. O n th e othe r hand , w e deriv e a lowe r boun d
for S(y) a s follow s
S ( y ) > y - * £ T ( n , ; a ) ( i - - ) n~^
J(i)) s(
2iri ./(i s+ l)
-M+«»-'«*> >sr^.x.).
Combining thes e bound s w e complete th e proo f o f (1 8.77) . •
REMARKS. Lemm a 1 8. 4 show s tha t i f Si = 1 — fi\ — o(l/logg) , the n XI(P) —
— 1 for almos t al l primes i n segment s q 2 < p ^ q A. Fo r ver y smal l prime s th e abov e
arguments ar e rathe r crude , bu t se e Chapte r 22 , i n particula r (22.1 00) .
Now w e ar e read y t o estimat e L( l + 5i , Xi)- Fo r an y 1 < s ^ 2 we hav e
s
as)Hs,xi)~l[(l + T(p,xi)p- )
p
and b y Lemm a 1 8. 4 w e obtain b y partia l summatio n
J | ( l + r(p,xi)p~s)^exp(^r(p,xi)p"s) < e x p ( — ^ ) .
p>y p>y
Hence
s
L(slXi) « (s - l)ex p ( ^ ) H(l + r(p,xi)p- )-
p^y
In particular , w e ge t
(18.78) L( l + <5i,Xi)«<5i n ( ! + — ) •
Combining (1 8.74) , (1 8.75 ) an d (1 8.78 ) w e get W < <5 i logx-f-g - 1 whic h complete s
the proo f o f (1 8.71 ) , an d o f Principl e 3 .
18.4. Proo f o f Linnik' s Theorem .
Throughout ci , C2, C3 and c are th e absolut e constant s fro m Principle s 1 , 2 an d
3. W e assum e (a s w e can) tha t
(18.79) c,c 2 > 1 > ci,c 3 > 0 .
Let
4c2
(18.80) x ^ q
and pu t
(18.81) R = x 1 /2C2.
44 0 18. TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N
We us e th e truncate d explici t formul a (se e (5.65) )
YWJ Y\H) x ( m Q d q) px H X
where J2 restrict s th e summatio n t o zero s p x = f5 x + ryx o f L(s , x) i n cr ^ | , \t\ ^
R. Firs t w e estimat e th e su m ove r zero s p x wit h \T < \*yx\ ^ T for 1 < T ^ R by
using Principl e 2 as follows :
x
|£ x(a )2 ^ x l< |E E ^ = "I / ' ^ ^ ( « , r )
/ 2
X(modg) l T < | 7xKT X | 7x|<T
a
= ^ Nq(lT) + ?-(\ogx) [ N q(a,T)x da
1l
Jl/2
2c 2
^ — x HqT)^ + %x(\ogx) I ((qT^/x^da
T 1
Jl/2
2c x log x ^ , _ x
^ T l o g ( ^ T ) ^ ) ^ 2 c ( C 2+ 1 )
T
provided x ^ (gT) C2+1 whic h i s our case . Notic e th e absenc e o f logx i n the las t
bound. Henc e th e explici t formul a reduce s to
0
(18.82) , ( , ; , , „ ) , _ - _ £ ««> £ 7 7 + ( ^ + « <^
X(mod q) P x
where Y restrict s th e summation t o zeros p x wit h |7 X | ^ T , and th e implied
constant i s absolute. W e choos e
(18.83) T = q,
so th e erro r ter m i n (18.82) i s O(x(logq)/q(p(q)).
Now w e procee d t o a more precis e treatmen t o f the su m ove r th e zero s wit h
|7 X | ^ T . I f there exist s a real characte r xi(nio d q) for which L(s,\i) n a s a r e a l
zero f3\ — 1 — 5\ wit h
(18.84) J i <ci/21 ogg ,
an
we exclud e i t fro m J2 d we estimat e th e remainin g su m Yl b y an appeal to
Principle 1 as follows:
Px
X m
x(mod q Px x( od q) P x
pl-rj pl-rj
a a
-2 / x dNq(a,T) = 2x*Nq(\,T) + 2(logx) / N q(a,T)x da,
Jl/2 J\
where
(18.85) rj = Cl /2logq
if th e zer o /3 i doe s no t exist , an d
(18.86)7 7 = c 3 |log(2(Jilogg)|/21 ogg
18. TH E LEAS T PRIM E I N A N ARITHMETI C PROGRESSIO N 441
if fiiexists. B y Principl e 2 we hav e
2c
Nq(a,T) ^ cq ^-a\
so ou r su m ove r th e zero s p x othe r tha n /3 X is bounded b y
rl-ri
2cx2qC2+2cx(logx) I (q^/xy-^da
2«iogx / ^ y . . ^
log(xq~2c2"> \ x J
Therefore w e hav e establishe d th e followin g
PROPOSITION 1 8.5 . Let c,ci,C2,C 3 be the absolute constants from Principles
1, 2, 3. Then for x ^ q 4°2 we have
(18.87) ^ ( x;g,a) = ^ | i _X l ( a ) ^ + ^ x - ^ + 0 ^)|
where the term involving Pi does not exist if Pi does not, r\ is given by (1 8.85) or
(18.86) respectively, \9\ ^ 4 and the implied constant is absolute.
In th e first cas e o f Propositio n 1 8. 5 we deduc e
THEOREM 1 8.6 . Let ci , c2 and c be the absolute constants from Principle 1 and
Principle 2 respectively. Suppose for any real character \i (mo d q) there is no real
zero of L(s,xi) ^ n
(18.88) a> l - c i / 21 o g g .
4c2
Then for x ^ q we have
(18.89) ^ ; , , a )= 4 - (l + ^cexp(-^^)+ 0(^
where \0\ ^ A, and the implied constant is absolute (effectively computable).
In th e secon d cas e whe n fi\exists, w e assum e tha t x ^ q u wher e
/ 4 4 log: 8c
(18.90) z / = max(4c 2 ,—,
Ci C 3 | l o g C i |
and w e estimate a s follows :
r £ i— 1 r —Si
1 " X i ( a ) - r- ^ 1 - - 5 - > Pi ~ * " * >Pi-q
Pi P i
= i - g °' -6i^ — —— 5 i
1 + i/dilog g
^ i l o g g 4* i
l + z/ci/ 2 3c i
„/2 ^ - * „ / 2 {2di logq) »c3/4 ^ (25 i l ogq)c-3/4-l ^ »1 <*i_ l Q g g _
=
loggjc x - ^ -
4cci
Inserting thes e estimate s int o (1 8.87 ) w e deduc e
442 18. T H E L E A S T P R I M E I N A N A R I T H M E T I C P R O G R E S S I O N
T H E O R E M 1 8.7 . Let c, ci,C2,C3 6 e the absolute constants from Principles 1 , 2
and 3. Suppose there exists a real zero fii of L(s, \i) f or a rea ^ character \i (mo d q)
such that Si = 1 — /3i ^ c\/2\ogq. Then for x ^ q v with v given by (18.90) we
have
where the implied constant is absolute.
From Theorem s 1 8. 6 and 1 8. 7 it follow s tha t i n an y case w e have (becaus e
S1logq > q~i)
COROLLARY 1 8.8 . If q is sufficiently large and x ^ q L with
{4 4 los8 c 1
(18.92) L = max<Uc 2 ,— log8c , — — - —r \ ,
\c1 c 3 | log ci I J
then
(18.93) ^ ( x ; 4 , a ) >
where the implied constant is absolute.
This resul t i s a quantitative versio n o f Linnik's theorem .
http://dx.doi.org/10.1090/coll/053/20
CHAPTER 1 9
T H E GOLDBAC H P R O B L E M
19.1. Introduction .
Christian Goldbac h (1 690-1 764) , a membe r o f th e Petersbur g Academy , i n a
letter o f 1 74 2 t o Leonar d Eule r pose d th e proble m o f provin g tha t ever y numbe r
TV ^ 5 ca n b e represente d a s th e su m o f thre e primes . Eule r answere d wit h th e
equivalent hypothesi s tha t ever y eve n numbe r N ^ 4 i s a su m o f tw o primes .
Inspired b y many failure s o n these problems , Edmun d Landa u challenge d (i n 1 91 2 )
to sho w tha t ever y numbe r N > 1 is a su m o f a t mos t k prime s wit h k sufficientl y
large bu t fixed. Landau' s proble m wa s solve d i n 1 93 0 by L . G . Shnirelma n (se e e.g .
[Kh]). Th e wor k o f Shnirelma n opene d a ne w direction i n genera l additiv e numbe r
theory.
In 1 93 7 I . M . Vinogrado v [V6 ] succeede d i n solvin g th e origina l Goldbac h
problem fo r al l od d i V which ar e sufficientl y large . Befor e him , G . H . Hard y an d
J. E. Littlewood [HL1 ] made a serious attack b y means of a then ne w circle method .
Indeed, the y succeede d conditionall y unde r th e Rieman n Hypothesi s fo r Dirichle t
L-functions. Vinogrado v followe d thei r lin e of attack (wit h his own beaut ification o f
the circl e method) , an d remove d th e us e of GR H i n the estimat e o f the exponentia l
sum ove r prime s
a
(19.1) £ < P^
which i s required b y th e circl e method . Hi s treatment o f this particula r su m (siev e
and doubl e sum s methods ) becam e a fundamenta l too l fo r estimatin g a larg e clas s
of sum s ove r prime s (se e Chapte r 1 3) .
In thi s chapte r w e deriv e Vinogradov' s Thre e Prime s Theore m b y a mixtur e
of hi s origina l an d somewha t moder n ideas . W e replac e th e exponentia l su m ove r
primes (1 9.1 ) b y th e exponentia l su m wit h th e Mobiu s functio n fo r whic h w e hav e
a unifor m boun d (1 3.54 ) du e t o H . Davenport . I n thi s wa y w e minimiz e th e rol e
played b y th e circl e method , an d w e do no t appea l t o th e distributio n o f prime s i n
residue classe s (th e us e o f thes e propertie s i s accumulate d i n th e proo f o f (1 3.54)) .
One ca n avoi d completel y th e ide a o f circl e metho d b y usin g onl y th e elementar y
dispersion metho d (se e th e Rutger s Lectur e Note s o f 1 994 , Chapte r 6) .
None o f th e curren t method s promise s an y chanc e t o solv e th e Goldbach-Eule r
problem fo r tw o primes. However , i t wa s established independentl y b y N . G . Tchu -
dakov [Tchu] , va n de r Corpu t [Cor3 ] an d T . Esterman n [Est ] tha t almos t al l (i n
the sense of density) eve n numbers ar e represent able as the sum of two primes. Thi s
follows by a suitable generalization o f the Vinogradov resul t (on e of the three prime s
443
444 19. T H E G O L D B A C H PROBLEM
can b e take n fro m an y sequenc e o f integer s whic h i s relativel y dense) . H.L . Mont -
gomery an d R.C . Vaugha n [MV3 ] showe d tha t th e numbe r o f exceptiona l eve n
integers (thos e whic h ar e no t representabl e a s th e su m o f tw o primes ) i s quit e
small.
For eve n number s N w e conside r
(19.2) G 2(N)= Y, A(m)A(n 2 ),
n1-\-n2=N
and fo r od d number s N w e conside r
(19.3) G 3(N) = J2 A(m)A(n 2 )A(n 3 ).
ni+n2+n3=iV
Therefore
A
(19.4) G 3{N) = J2 (n)G2{N - n).
n<N
Hence a goo d estimat e o f G 2(N') fo r sufficientl y man y N' < N wil l yiel d a corre -
spondingly goo d estimat e o f Gs(N).
The heuristi c Mobiu s Randomnes s La w describe d i n Sectio n 1 3. 1 justifies th e
following stronge r for m o f th e Goldbac h proble m (whic h implie s it s solutio n fo r
sufficiently larg e TV):
CONJECTURE. For even numbers N ^ 4 ,
A
(19.5) G 2{N) = <5 2(N)N + 0(N{\ogN)- )
where
(19.6) © 2 (jV)=C2TT^
p\NF
p>2
and
(19.7) C 2 = 2;[](1 -(P-1 )- 2)-
Here A is any positive number and the implied constant in the error term depends
only on A.
We shal l prov e th e conjectur e fo r almos t al l eve n numbers .
T H E O R E M 1 9.1 . Let A,B,X ^ 4 . The number of even integers N with 4 ^
N ^ X for which
A
(19.8) \G 2(N) - <S 2(N)N\ > BN(logN)-
does not exceed CX(logX)~A where C is a constant which depends only on A,B.
The celebrate d resul t o f Vinogrado v fo r thre e prime s follow s immediatel y b y
(19.4) an d Theore m 1 9.1 .
19. T H E G O L D B A C H P R O B L E M 44 5
THEOREM 1 9. 2 ( V I N O G R A D O V ) . For odd numbers N ^ 7 we have
(19.9) G 3{N) = <5 3(N)N2 + 0(N 2 (lo g N)~ A)
where 6 3 (N) > 0 is given by
(19.10) ©aw= \ na - (p -1)-2) n(i+(? -1)-3) > 0,
p|iV pfT V
and A 25 any positive number, the implied constant depending on A.
19.2. Incomplet e A-functions .
Given z ^ 2 we split A(n ) a s follows:
(19.11) A(n ) = - ^ / / ( r a ) l o g m = A^r O + A^n )
m|n
where A^(n ) an d A b(n) ar e the partial sum s ove r th e divisors m ^ z an d m > z
respectively. The n w e write
(19.12) G 2(N) = Gf{N) + 2Gf (JV ) + C?2 b(W)
where
G«»(iV)= X ) A V ) A » M
ni+77 2 =AT
b bb
and G^ (N),G (N) define d similarly .
LEMMA 1 9.3 . Let N be an even number ^ 4 and z > 2 . T #e /za?je
(19.13) Gf(A 0 = e 2(N)N + 0(N (log z)~A + T(N)Nz'i + z3 )
/or an y A ^ 0, the implied constant depending only on A.
PROOF. W e hav e
G*J(N)= ] TM (m
1 )/i(m 2 )(logm
1 )(logm 2 ) ^ 1 .
r
mi, 771 2^2 : £i mi-\-l2'm2=N
The equatio n fim i + £ 2ra2 = A f i s equivalen t wit h th e congruenc e condition s
(rai,ra 2 )|Af an d
£mi(mi,m2)~ 1 = ^ ( ^ I , ^ ) " " 1 (mo d ra 1_
2(rai,ra2) )
with 1 < £ < Nm^ 1 . Therefor e th e number o f solutions i s Af[rai,ra 2 ] - 1 + 0(1)
giving
2
Gl\N) = N<5 2(N;z) + 0((zlogz) )
where
777.1,7712^2
(mi,m2)\N
446 19. T H E G O L D B A C H P R O B L E M
It remain s t o estimate 02(iV;2) . W e write
W;*)= £ 4f £ ^^(logdmOOogdm ^
*-^ d ^— '1
1
17717 2
1
1
1d\N 77 ,77 2^ 2
(mi,m2) = l
(mi77l2,d) = l
= E^ E ^( E ^x***)'-
d\N c^z/d m^z/cd
(m,cd) = l
For cd >Z2 w e estimate triviall y by
2
]T ^ c~ d~l(log z) 4
^2r(N) Z-^ {log z)4.
l
^ Cd>z 2
If cd ^ z? we derive fro m th e Prime Numbe r Theore m tha t
_£ ^2logcdm=^-{ l + O(r(cd)(l0gZ )- A )}.
m^.z/cd
(m,cd) = l
Hence
Now extending th e sum ove r cd ^ z^ t o the infinite serie s (a s we can wit h the error
term whic h i s already present ) w e obtain
d\n Y V }
(c,d ) = l ^ V }
This complete s th e proof o f Lemma 1 9.3 . •
Lemma 1 9. 3 reveal s tha t th e main contributio n t o the binary Goldbach-Eule r
equation come s fro m th e incomplete functio n A^(n) . I n other words , th e comple-
mentary functio n A b (n) give s considerabl y less . Thi s i s due to the sign chang e of
the Mobiu s functio n involve d i n Ab(n) whic h range s ove r a large segment. However ,
we do not have yet rigorous argument s to justify thi s heuristic. Bu t it is a relatively
simple matte r i n ternary additiv e problems .
19.3. A ternary additiv e proble m wit h A b.
In thi s sectio n w e give a bound fo r a general sum
Tb(N) = £ u evmA\n)
£+m+n=7V
where U£ an d vm ar e arbitrary comple x numbers . T o this en d we need a n estimate
for th e exponential sum
(19.15) S b
(a) = 5Z A b (n)e(an)
19. TH E GOLDBAC H PROBLE M 44 7
which i s unifor m i n a. B y Theore m 1 3.1 0 w e derive b y partia l summatio n tha t fo r
any a G l an d x ^ 2 ,
(19.16) 2_, ^( m)0-ogm)e(am) ^ x(logx) "
with an y A ^ 0 , th e implie d constan t dependin g onl y o n A. Sinc e
|S b (cOK 5 ^ 1 X ! MO ) (lo g m)e(c^m)
^<CiV/z z<m^N/£
we get b y (1 9.1 6 ) tha t
(19.17) S b
(a) < C 7V(log7V)(log^)-A.
REMARK. Any sequenc e o f number s i n plac e o f A b(n) coul d b e use d provide d
the associate d exponentia l su m (1 9.1 5 ) satisfie s S b(a) < C N(logN)~A fo r an y A >
0.
Now w e ar e read y t o prov e th e followin g
LEMMA 1 9.4 . For any complex numbers u£ and v m we have
b A
(19.18) ] Tu iVrnA (n) « \\u\\ \\v\\ N(logN)(logz)-
l+m+n=N
with any A ^ 0 , the implied constant depending only on A.
PROOF. W e hav e
Tb(N) = (J2 u ee(a£)^j ( ^ v me(am)^S
b
(a)e(-aN)da.
Hence (1 9.1 8 ) follow s b y (1 9.1 7) , Cauchy-Schwar z inequalit y an d th e Parseva l for -
mula
pi , , 2
/ X^ ( ^ da=^2\ui\ 2.
e a
19.4. Proo f o f Vinogradov' s thre e prime s theorem .
Let CN be an y comple x number s fo r N ^ X,N odd . W e writ e
b b
Y, c NG2(N)= £ c N(Gl\N) + 2G^(N) + G 2 (N)).
Then w e us e Lemm a 1 9. 3 with z = X+ gettin g
A
£c NGiH.N)= £ c N<52(N)N + 0(\\c\\xH\ogX)- ).
N^X N^X
For th e sum s o f CNG^(N) an d CNG^(N) w e appl y Lemm a 1 9. 4 gettin g th e sam e
bounds a s th e erro r ter m above . Henc e
C A
(19.19) ^2 NG2(N) = ^2 c N®2(N)N + 0(\\c\\xi(logX)- ).
N^X N^X
448 19. THE G O L D B A C H P R O B L E M
For th e specia l coefficient s CN = C^AT ) — &2(N)N thi s yield s
PROPOSITION 1 9.5 . For any X ^ 3 we have
-1A
(19.20) Yl (°2( N
)~ ®2(N)N) 2 < X 3 (logX)
N eve n
where A is any positive number and the implied constant depends only on A.
Clearly (1 9.20 ) implie s Theore m 1 9.1 . Fo r th e proo f o f Theore m 1 9. 2 we appl y
(19.19) t o (1 9.4 ) gettin g
3
G3(N) = J2 A(n)C 2 (iV - n)(N - n) + 0(N {logN)-A).
n<N
By (1 9.1 4 ) th e mai n ter m i s equal t o
V V
(c,d) = l ^ ^ ^ n<N
n=N(mod d)
The inne r su m i s equa l t o
N2
if (d , A/") = 1 by th e Prim e Numbe r Theorem . Sinc e
(d,ciV) = l r \ JV \ J
this complete s th e proo f o f (1 9.9) .
http://dx.doi.org/10.1090/coll/053/21
CHAPTER 2 0
T H E CIRCL E M E T H O D
20.1. Th e partitio n number .
The circl e metho d originate d i n th e pape r o f 1 91 8 b y G . H . Hard y an d S . Ra -
manujan [HR ] o n partitions . An y decompositio n n = n\ + ri2 + n 3 + • • • int o
non-negative integer s withou t regar d t o thei r orde r i s calle d a partitio n o f n. Le t
p(n) denot e th e numbe r o f suc h (unrestricted ) partitions . Thi s i s als o equa l t o th e
number o f ordere d solution s (xi,X2,... ) i n non-negativ e integer s t o th e equatio n
n — X\ + 2^ 2 + 3^ 3 H— • . Algebraically, i t i s also th e numbe r o f conjugacy classe s
in th e symmetri c grou p <5 n.
L. Eule r wa s fascinate d b y th e remarkabl e propertie s o f p(n). H e introduce d
the generatin g powe r serie s
oo oo
n 1
m
F(z) = J2p(n)z = U(l-z )-
0
and showe d tha t
F(z) =n(i-*m)=D-1)'* (3^-1)^/2
which is known a s Euler's Pentagona l Number s Theorem . Playin g with other powe r
series Eule r derive d man y amazin g formulas , suc h a s
P(n) = ~ ^2 cr(h)p(n - h)
0<h^n
where cr(h) is th e su m o f divisor s o f h.
Ramanujan noticed , an d late r gav e proof s o f th e congruenc e propertie s
p(n) = 0(mo d 5) , i fn = 4(mo d 5) ,
p(n) = 0(mo d 7) , i fn = 5(mo d 7) ,
p(n) = 0(mo d 1 1 ) , i fn = 6(mo d 1 1 ) .
He wa s als o fascinate d b y analyti c expression s fo r p(n) i n term s o f modula r forms .
Changing th e variabl e z int o e(z) on e transforms th e powe r serie s F(z) int o th e
Fourier serie s
oo
(20.1) f(z) = Y,p(n)e(nz).
o
449
450 20. T H E CIRCL E M E T H O D
We writ e f(z) = e(z/2A)/rj(z) wher e
OO
is th e Dedekin d et a function . Thi s i s a modula r for m o n th e uppe r half-plan e H I
with respec t t o th e modula r grou p SL2CZ) o f weight | an d a suitabl e multiplie r
system. Precisel y Dedekin d prove d tha t
(20.2) r](-fz) = 0(j)(cz + d)l2r](z)
for an y z G H an d 7 = ( a ) G SLiffi), wher e th e multiplie r #(7 ) i s define d b y th e
1
properties 0 ( - 7 ) = e(i)0( 7 ), fl^ J) ) = e ( £ ) an d
»((: s))-(^-i-5-<* 4
if c > 0 . Her e s(d, c) i s the Dedekin d su m
x(mod c ) V 7 V 7
(recall tha t ip(x) — x — [x] — | ) .
Note tha t th e cus p 0 0 i s non-singula r wit h respec t t o th e Dedekin d multiplie r
system (se e [1 4 ] fo r definition) , s o th e Fourie r expansio n o f rj(z) ha s n o constan t
term; i t i s a lacunar y serie s
^z) = ^2a ne(n
2
z/2A)
n
with a n = 1 i f n = + 1 (mo d 1 2) , a n = — 1i f n = ± 5 (mo d 1 2 ) an d a n = 0
otherwise. However , th e Fourie r coefficient s p(n) o f f(z) = e(z/24)/rj(z) ar e quit e
large. Not e th e trivia l boun d p(n) $ C 2 n whic h follow s fro m
00 0 0
n m
Y,P(n)2- = ~[[(l-2- )-\
10
One ca n deriv e a bette r boun d fro m th e modula r equatio n
'«=@^H)M4)
(see (20.2 ) fo r 7 = ( ~ )) . I n particular , fo r z = z/ y wit h y ^ 1 this yield s
p(n) < y ^ e x p ( 2 ^ ( | j + - ) ) .
Hence b y takin g y = 7Ty/n/6 on e get s
B
(20.3) p ( n ) « n ^ e ^
where i ? = 2TT/^/6. Hard y an d Ramanuja n prove d th e asymptoti c formul a
p{n) ~ (4v/3n)~ 1 e B v ^.
They als o gav e a n asymptoti c expansio n wit h a n estimat e fo r th e erro r term .
20. T H E C I R C L E M E T H O D 451
Refining th e metho d o f Hardy-Ramanujan , H . Rademache r [R ] establishe d i n
1937 the exac t formul a
THEOREM 20.1 . Forn ^ 1 we have
(20.4) P („) = - l 1 f ; c U ( n ) A i - 8 i n h ( | A n )
V
C= l
where B — 2TT/^6, X n = (n — 7^)2 and
(20.5) A c(n)= ^ * e{\s(a,c)-an/c).
a(mod c )
Notice tha t th e Rademache r serie s (20.4 ) converge s absolutel y b y virtu e o f th e
trivial estimate s |A c (n)| ^ c an d
—-( — sin h — An ) < c" 3 exp(— A n ).
dn\Xn c J V c/
Taking onl y th e firs t ter m i n (20.4 ) on e alread y get s a ver y goo d approximatio n
(20.6) KB )= J^(1_^. + 0 ( «-t«.)).
The startin g poin t o f the Hardy-Ramanuja n metho d i s the Cauch y integra l fo r
the coefficient s o f a powe r series . I n ou r cas e
1n
(20.7) p(n) = — I F{z)z- ~ dz
27T« J\z\=r
(this integratio n ove r a circl e gav e th e metho d it s name) . Changin g z int o e(z) w e
get b y periodicit y
(20.8) p(n) = / f(z)e(-nz)dz
Jw
where w i s an y poin t i n th e uppe r half-plan e E I and th e pat h o f integratio n i s an y
continuous curv e i n H I from w t o w + 1 . W e shal l choose th e pat h o f integratio n
as a continuou s chai n o f certai n circula r arc s o n whic h f(z) ca n b e analyze d quit e
precisely b y exploitin g it s modularity .
To construc t th e arc s i n questio n w e nee d som e well-know n propertie s o f th e
Farey sequenc e or series (se e e.g. [HW]) . Le t C b e a positive integer. Th e collectio n
of al l reduce d fraction s ^ wit h 1 ^ c ^ C an d (a , c) = 1 arranged i n th e increasin g
sequence
a' a a"
cc c"
is calle d th e Fare y sequenc e o f orde r C. Give n a poin t ^ i n tha t sequenc e le t ^ 7
and ^7 7 denote th e adjacen t points . Th e denominator s c r, c" ar e determine d b y th e
conditions
{C-c<c'^C, ad = l(modc) ,
v(20.9)
J < ^
\C-c<d'K:C, ad' = -l(mo d c )
45 2 20. T H E C I R C L E M E T H O D
and th e numerator s ar e a! — (acf — l) c \ a' (ac" + l) c * , so the point s ar e
a 1 a" a 1
(20.10) -c + —
c cc cc
Between a/c an d a'/c' ', a"/c " ther e are the mediants (a'+a)/(c'+c) , (a+a") / (c+c")
which ar e als o reduce d fraction s bu t d o no t belon g t o th e Fare y sequenc e o f orde r
C. The y hav e denominator s i n th e segmen t (C , c -f C] and satisf y
a' + a 1a a 1
cf + c c c(c -h d) +
c c^c-f-c' )
a" + a1 a 1 a"
c" + c c c( c + c") " c" c"{c + c")'
To eac h reduce d fractio n ^ wit h c ^ 1 , (a , c) = 1 we associat e th e circl e
2 1
(20.11)
2? 2^
which i s denote d b y C(a/c) an d i s calle d a For d circle ; i t i s a circl e i n H tangen t
to th e rea l lin e a t z = ^ . Tw o differen t circle s d o no t intersect ; the y ar e tangen t
if an d onl y i f the y belon g t o consecutiv e fraction s i n th e sam e Fare y sequence . Al l
the circle s C(a/c) ar e obtaine d a s th e image s o f th e horizonta l lin e I m z = 1 under
the actio n o f SL 2 (Z) . Indee d writin g
az + b 1
(20.12) 72: =
cz + d c(cz + d )
for 7 = (1 ac )d wit h c ^ 1 it follow s tha t
a% 2 2 + d/ c
Z —
^
c„ ++ 2c
o ^ 92
2c r» 22 "z + d/ c
if I m (z) = 1 , whence i t i s clea r tha t 72 : satisfies (20.1 1 ) .
Now le t C ^ 1 be given . Suppos e a!/c' < a/c < a"/c" ar e thre e consecutiv e
points fro m th e Fare y sequenc e o f orde r C. The n C(a/c) touche s C(a'/c') an d
20. T H E C I R C L E M E T H O D 453
C(a"/c") a t th e point s
-1
(20.13) + C' ac Wit h &
c(d + ic) '
// 1
(20.14) + C wit h c ac c(c" — ic)
Indeed, w e see that th e point (20.1 3 ) i s an appropriat e mea n of the centers of C(a/c)
and C(a'/c f)
-1
I /a' i \ I /a i \ l 11
2 ? W + 2d*) + 2d* V c + 2c^)\ 2^ + 2^2 c c(c ' -f ic )
by (20.1 0) , so this i s the tangenc y poin t o f the circle s C(a/c) an d C(a'/c f). Similarl y
one verifie s th e poin t (20.1 4) .
On eac h circl e C(a/c) wit h 1 ^ c ^ C w e choos e th e uppe r ar c 7 a c whic h
connects th e tangenc y point s (20.1 3 ) an d (20.1 4 ) wit h th e circle s o f th e adjacen t
fractions. Th e chai n o f suc h arc s j a c give s u s a n infinit e continuou s curv e whic h i s
periodic o f period one . W e choose a s th e pat h o f integratio n i n (20.8 ) th e fragmen t
of thi s curv e whic h i s containe d i n a vertica l stri p o f widt h on e beginnin g i n a
tangency point . W e ge t
(20.15) P(n) = YJ 1 2 H
^n)
c^.C a ( m o d c )
where
Hac(n) = / f(z)e(-nz)dz
by virtu e o f th e periodicit y o f f(z) an d e(—nz).
We shall evaluate each arc integral H ac(n) separatel y b y applying specific trans -
formation 7 e SL2{ rl). First , changin g th e variabl e z int o a/c + iz/c 2 (translatio n
- magnificatio n - rotation ) w e transform (20.1 1 ) ont o th e circl e \z — | | = \ gettin g
i ( an J a iz\ ( inz\
dz
cz \ c
where z i s o n th e righ t ar c o f th e circl e betwee n th e point s
(20.16) -ifC'ac C + IC
22/-H —ic
(20.17) z" = -ic Cac
c" — ic '
45 4 20. T H E CIRCL E M E T H O D
Given c ^ 1 and (a , c) = 1 the modula r relatio n (20.2 ) wit h z replace d b y —d/c
1/cz translate s int o
\c cJ \% J\ «ii.(*=)+if'+«)V-'-''
24cC\z J '
J \ c czs
where ad = l(mo d c) . Changin g z int o iz/c w e write thi s equatio n a s
where E c(z) = z? exp(j^( ^ — Jj) . Applyin g thi s t o th e integra l H ac(n) w e ge t
Hac(n) = ic~*e{\s{a,c) - an/c)I ac(n)
where
2
Iac(n) = I ^ / ( - - + -^E c{z)e ™*/Sdz.
Z
ac
(More precis e bu t cumbersom e notatio n woul d b e I a^c(n)).
To evaluat e 7 ac (n) w e replac e / b y it s constan t ter m i n th e Fourie r expansio n
(20.1) whic h i s p(0) = 1 , an d the n w e exten d th e ar c o f integratio n t o th e whol e
circle. Thi s wil l giv e u s th e mai n ter m
hip) = Ec(z)e2nnz/c2dz.
Changing z int o \jz w e transfor m th e circl e \z — || = \ int o th e vertica l lin e
Re (z) — 1 getting
Jc(n)=
L exp U+^vn-2i)J^2^
Therefore w e hav e
(20.18)
'•<»>-*W'.(3W
where /^(y ) i s th e Besse l functio n (se e [GR] , 8.4-8.5) . (Thi s unfortunat e coinci -
dence i n notatio n wil l en d shortly! )
20. T H E C I R C L E M E T H O D 455
It remain s t o estimat e th e differenc e I ac{n) — / C W = Iac( n) ~ ^ac( n) ~ Kc{ n)->
say, wher e
/:» = j'" (f(-- c + j) - 1) ^ ) ^ 4,
C
I'ac(n)= r Ec(z)e2™^2dz,
JO
2
C(n) = f E c(z)e ™^dz.
1
JZ '
For estimatio n o f I^ c(n) w e move the pat h o f integratio n fro m th e ar c t o th e chor d
of th e ar c becaus e o n thi s chor d i/z ha s larg e heigh t s o f(—d/c + i/z) — 1 i s
small. Indee d o n thi s chor d w e hav e \z\ ^ min( |
z'nX WnA) ^ 2cC~ l an d Re z ^
max(Re < c , R e < c ) < c 2C~2 b y (20.1 6) , (20.1 9 ) an d (20.9) . Th e lengt h o f th e
chord i s bounde d b y \z' ac\ + \z'^ c\ ^ AcC~ 1 . Moreover , o n th e chor d w e hav e
ra=l x 7
/ c V (2irn\
*{c) exp V^J
because R e (z' 1 ) = 1 , R e {z) ^ c 2C~2 an d p(m ) < 2 m . Therefor e
3
(20.19) / a*c(n) « (£)* ex p ( ^ ) .
Next w e estimat e I' ac(n). Th e lengt h o f th e ar c fro m 0 t o z' ac o n th e circl e
\z — 1 1 = \ i s bounded b y f | ^ c | ^ 7rcC _1 an d fo r z o n that ar c we have \z\ ^ cC~ l.
Moreover, w e derive (a s w e di d whe n estimatin g Ia C{n)) tha t
Therefore I' ac{n) satisfie s th e boun d (20.1 9) . Similarl y w e show that J^ c (n) satisfie s
the boun d (20.1 9) . Collectin g thes e bound s an d th e formul a (20.1 8 ) w e ge t
2
Hac(n) = ^ e ( | 5 ( a , c ) - ™)(1 2A n)-ija ( T ^ ^ ) + O ^ C ^ exp(27mC- )).
Inserting thi s int o (20.1 5 ) w e ge t
p(n) = J2 — A c(n){\2\n)-iU2 ( - A n) + 0(C"* exp^nC" 2)).
C C
c^C
Letting C ten d t o infinit y w e obtai n
(20.20) p(n) = -^-\-3>J2c-lAc{n)I± (-A„) .
C=l
456 20. TH E CIRCL E METHO D
Using th e formula s
I±(y)=(—-J sinhy , h(y) = y^—(y-*Ii(y))
we conclude (20.4) .
REMARKS. Sinc e rj(z) i s essentiall y a thet a function , th e multiplie r syste m
#(7) ca n b e expresse d i n term s o f Gaus s sum s rathe r tha n th e Dedekin d sums .
Consequently th e coefficient s A c(n) i n th e Rademache r formul a (20.4 ) ca n b e als o
expressed b y simpler functions . Indee d A . Selber g showe d (usin g directl y th e Eule r
Pentagonal Number s Theorem ) tha t
(
(20.21) A c(n)=(lY J2 -^COS(H^
^(mod 2c )
^(3^-l) = -2n(mod 21 )
Hence \A c(n)\ ^ 2r(c)c^ .
20.2. Diophantin e equations .
After hi s historica l pape r wit h Ramanuja n o n th e partitio n functio n Hard y
continued workin g o n th e circl e metho d jointl y wit h J . E . Littlewood . Fro m 1 92 0
to 1 92 8 the y publishe d a serie s o f eigh t paper s unde r th e commo n title , "Som e
problems o f Tartiti o Numerorum'" . The y realize d tha t th e modularit y o f th e
generating Fourie r serie s i s no t a n essentia l propert y fo r th e method , bu t rathe r
good estimate s fo r th e relevan t exponentia l sum s ar e sufficient . Thi s observatio n
opened numerou s ne w applications , th e mos t popula r on e bein g fo r th e Warin g
problem.
This concern s th e solvabilit y o f th e equatio n
(20.22) x\+xl + '-- + xl = N
in positiv e integer s # i , . . . , xn, wher e k ^ 1 and N ^ 1 are give n numbers . I n 1 77 0
Edward Warin g asserte d tha t (20.22 ) doe s have a solution provide d n i s sufficientl y
large i n term s o f k alone . Thi s wa s prove d i n 1 90 9 by D . Hilber t [H] . By th e circl e
method Hard y an d Littlewoo d obtaine d a n asymptoti c formul a fo r th e numbe r
of representation s (20.22) . I n thi s sectio n w e illustrat e th e metho d b y showin g a
somewhat late r resul t du e t o L-K . Hu a [H2] .
THEOREM 20.2 . Let n > 2 k. Then the number u(N) of representations of N
as the sum of n positive k-th powers satisfies
(20.23) u(N) = <5(N) r(\ + ]{® n N*- 1 ^ + 0(JV~*)} .
1 [njk)
Here &(N) is given as the sum of infinite series of the multiplicative function
(20.24) *.*) = E '( IE .(£))".( -
^q * —' v q / / v q J
a(mod q) i(mo d q)
20. T H E C I R C L E M E T H O D 457
The series
(20.25) e(N) = Y,c(q,N)
9=1
converges absolutely and satisfies c\ ^ <S(N) ^ C2, where c\, C2 are positive numbers
depending on k and n, but not on N; also 5 > 0 and the implied constant depend
at most on k,n.
We begi n b y applyin g th e metho d t o a genera l equatio n
(20.26) / ( z i , . . . ,z n) = 0
where / i s a polynomia l o f degre e k ^ 1 with intege r coefficients . W e see k integra l
solutions x = ( x i , . . . , xn) restricte d t o a bo x B C [—X, X]n wit h X ^ 1 . Thu s
(20.27 ) f(x) <^X k ifxeB.
The method work s if the numbe r o f variables i s much larger tha n th e degre e and th e
values of f(x) var y considerably (s o f(x) canno t b e constant i n many variable s afte r
any unimodula r transformation , se e ou r analyti c conditio n (20.35)) . Therefore , b y
a statistica l reasonin g w e anticipat e tha t th e numbe r Vj(JS) of solution s t o (20.26 )
in x e B fl Z n shoul d satisf y
uf(B)<^Xn-k.
Exactly, th e numbe r o f suc h solution s i s given b y th e integra l (Vinogradov' s varia -
tion o n th e Hardy-Littlewoo d powe r serie s setting )
(20.28 ) M®)= I ( E e(af(x)j)da.
xeBnz
As in the Hardy-Ramanuja n wor k o n the partitio n numbe r w e expect her e tha t th e
main contributio n come s fro m th e integratio n i n neighborhood s o f rationa l point s
with smal l denominators .
According t o th e Dirichle t approximatio n theore m ever y a satisfie s
(20.29) a < —, withl^g^P , M )= l ,
where P i s a fixed positiv e number . Le t P ^ 2Q ^ 2 . I f a satisfie s (20.29 ) wit h
q ^ Q , w e sa y a belong s t o th e "majo r arc "
a_
(20.30 )
m |a; J ~ | ^ ~ ^ ' withg ^ Q,(a,g ) = l j .
Notice tha t th e interval s o f Wl centere d a t differen t point s a/q d o no t overla p
because
\ - - ^ \ >— > — > —
\q qi\ qqi qQ qP
if a/q ^ a\/q\ wit h q\ ^ Q. Th e remainin g par t o f th e segmen t m = [0,1 ] \ Tl i s
called th e "mino r arc" .
458 20. T H E C I R C L E M E T H O D
REMARK. I n th e Hardy - Ramanujan wor k th e circl e a (mod 1 ) wa s divide d ex -
actly b y th e mediant s o f th e Fare y sequenc e o f orde r C — P, s o ther e wa s nothin g
in th e mino r arc .
For aGOJIw e evaluat e th e exponentia l su m
Sf(a)= £ e(af(x))
by splittin g int o residu e classe s
e e x
Sf(<*) = Yl (^ /(M) ) £ <n
^V
u ( m o d q) x(zBnZ
x = u(mod q)
where 6 = a — ajq. Sinc e |0 | i s small , namel y |0 | ^ (qP)~ l b y (20.29) , w e ca n
replace th e su m ove r residu e classe s u(mo d q) b y a correspondin g integra l wit h a
reasonable erro r term . Suppos e tha t P ^ > X k~x wit h th e implie d constan t larg e
enough s o tha t
£«*>
P
$J — fo r 1 ^ v ^ n , x G B.
Hence \0df /dx v\ < l/2g , s o Lemma 8. 8 applie d successivel y fo r eac h variabl e x v
i^(mod q) yield s
X\^-U
VJ e(0f(x)) = q-nBf{6) + o((l + ^ " " ' j
Q
a;Eu(mod q)
where Bf(0) i s th e integra l i n questio n
(20.31) B f(0) = [ e(6f(x))dx.
JB
A trivia l erro r ter m woul d b e ( 1 + X/q) n, s o w e onl y go t savin g o f th e siz e o f
one variabl e o f summation . Not e tha t Bf(0) doe s no t depen d o n th e residu e clas s
u(mod q) (i n mor e sophisticate d version s o f th e circl e metho d th e residu e clas s
does appea r i n th e mai n ter m wit h a profoun d effect) . Therefor e summin g ove r th e
classes w(mo d q) w e ge t
1
(20.32) S f{a) = B f{e)Cf(a/q) + 0{q{q + X)^ )
where Cf(a/q) i s the normalized , complet e exponentia l su m
(20.33) C f(a/q) = ±; £ e(^/(«)) .
ii(mod q)
Integrating (20.32 ) ove r th e majo r arc s w e ge t
/ S f(a)da = ] T c f(q) f B f(0)d0 + 0{P~ lQ2{Q + X)"' 1
)
Jm q ^Q J\e\^i/ qp
where
c
(20-34) <*(« ) = £ J2* £ (^/(«))-
a ( m o d q) w(mo d q)
Note tha t Cf(q) i s rea l an d \cf(q)\ ^ q.
20. T H E C I R C L E M E T H O D 459
First w e evaluat e asymptoticall y th e integra l o f 23/(0 ) ove r th e segmen t \0\ ^
1/qP. Suppos e tha t fo r an y 6 > 0 ,
k
(20.35) B f(6) < {6X )-l^Xn
where 7 > 0 . Hav e i n min d tha t X n i s th e trivia l boun d an d 6X k i s abou t 0f(x),
so we postulate a saving which i s only slightly large r tha n th e siz e of the amplitude .
This conditio n i s quit e realisti c fo r a smoot h functio n f(x) whic h change s fas t i n
several variables . Observ e tha t th e boun d (20.35 ) remain s th e sam e i f a constan t
is adde d t o / . Fo r example , i f
(20.36) f{x) = N - x\ xk
for 0 < x v < X = Ni, the n
k n/k
\Bf(0)\ = I / e(0x )dx <C 1
so (20.35 ) hold s wit h 7 = | - 1 > 0 , i f n > k. Usin g (20.35 ) w e ca n exten d th e
integration ove r \0\ ^ 1 /qP t o th e whol e rea l lin e gettin g
(20.37) / 8 f(0)d0 = V f(B) + 0{(qPX~ ky X n k
~)
where
oo
Bf(0)d0.
/ -OO
The conditio n (20.35 ) als o implie s tha t
n k
(20.39) V f{B)<^X -
(use (20.35 ) i f |0 | > X~ k an d estimat e triviall y i n |0 | < X~ k
).
Next w e evaluat e asymptoticall y th e su m o f Cf(q) ove r th e modul i q ^ Q.
Suppose tha t fo r an y q ^ 1 , (a, q) — 1 ,
2_r7
(20.40) C / ( a / g ) < < T
wThere r\ > 0 . Observ e tha t thi s boun d remain s th e sam e i f a constan t i s adde d t o
/ . Henc e
1
(20.41) c f(q)<^q- -ri.
Using (20.41 ) w e ca n exten d th e summatio n ove r q $C Q t o al l modul i gettin g
c
(20.42) £ / t e )= <5
/ + °( £rT?)
where
00
(20.43) 0/=5>/O?) -
1
Now collectin g th e abov e result s w e obtai n
n k
(20.44) f Sf(a)da = (5 fVf(B) + 0(AX -)
46 0 20. TH E CIRCL E METHO D
where
(20.45) A = Q- 1 1 + Q 2
{QPX~1
k
).
Observe tha t w e can take A ^ 2X~ 67r] wit h 5 a small, positiv e number b y choosin g
Q = X6^ an d P = xk-6^2^^\ bu t som e othe r choice s of Q an d P can b e better ,
see th e lin e abov e (20.62) .
The infinit e integra l (20.38 ) an d th e infinit e serie s (20.43) , ar e calle d th e "sin -
gular integral " an d th e "singular series " fo r the equation (20.26) . Thes e ca n be
brought t o expression s whic h allo w meaningfu l interpretation s i n terms o f loca l
densities.
First w e work ou t th e singula r integral . W e hav e
sin
Vf(B) = li m I* ( I e(9f(x))dx)d9 = li m / (27r{(*)T)dx,
Integrating ove r th e leve l se t {x € B; f(x) = t) an d usin g
r
°° sin27r t
-dt = 1
/ DO Kt
one can show that Vf(B) i s the ( n — l)-dimensional volum e of the se t {x E B; f(x) =
0} with respec t t o a suitable measur e whic h depend s o n /. I t is also approximatel y
equal t o the n-dimensional Lebesgu e measur e o f the set {x e B;\f(x)\ ^ \).
Precisely, w e deriv e b y th e conditio n (20.35 ) tha t fo r any V > 0 ,
k
(20.46) V f(B) = ^\{xe B; \f(x)\ < V}\ + 0{{VX' yXn-k)
where |^4 | denote s th e n-dimensiona l Lebesgu e measur e o f A. Thi s i s non-trivial
for V <C Xk, an d i t yields
(20.47) Vf{B) = Hi n ^\{x G B; \f(x)\ < V}\.
For th e proo f o f (20.46) w e use th e formul a
7-oo *0 I 0,i f \y\ > V.
and th e estimate s sin27r6» V = 2ir9V + 0(92V2), si n 2 ?^ < C 1. Henc e b y (20.35)
we obtai n
l
+ oU 9V\B f{9)\d9+ j {eV)- \Bf(9)\de)
l
-^\{x e B; \f(x)\ ^V}\ + 0((TV + {TV)- ){TXk)^Xn-k).
Choosing TV = 1 we ge t (20.46) .
20. T H E C I R C L E M E T H O D 461
For example , i f f{x) i s given b y (20.36) , the n (20.47 ) yield s
k k
V(N) = l J"-J (N-x 2 x S*-ldx2--.dxn.
x%-\ \-x„<N
EXERCISE 1 . Sho w tha t th e abov e integra l i s
(20.48) v(N) = r(i + irr^)"1^*- 1.
Now w e wor k ou t th e singula r series . Th e su m ove r a(mo d q) wit h (a , q) — 1
in (20.34 ) i s th e Ramanuja n su m
E* .(f/<«> ) - E »$y
a ( m o d q) d\q
d\f(u)
therefore
°f^ = i&(TH |Mmod q);f{u)=o(mod d)}|=E^W^
where d n~1 ujf(d) i s th e numbe r o f solution s t o th e congruenc e
(20.49) f(x) = (mo d d).
Since cjf(d) i s multiplicative , w e hav e
£/(<?)=n( w /(p a )- w /(p a _ i ))-
pa\\q
Hence th e singula r serie s i s als o give n b y th e infinit e produc t
(20.50) <S / = II*/(P )
V
where
oo
a 1
(20.51) 5 f{p) = 1 + $ > / ( p ) - ^ ( p " " ) ) .
Note tha t th e serie s (20.51 ) an d th e produc t (20.50 ) converg e absolutel y b y
virtue o f (20.41 ) . Th e number s 5f(p) ar e called "loca l densities" fo r th e reaso n tha t
a
(20.52) 5 f(p) = li m Uf(p ),
a—>oo
so Sf(p) i s a densit y fo r th e p-adic solution s o f f(x) = 0 . B y th e analogou s formul a
(20.47) on e can interpret Vf(B) a s a density measur e fo r th e real solutions of f(x) =
0.
If ujf{p oc) stabilizes , i.e. ,
1a+
(20.53) ^ / ( P ) = " / ( P Q ) , for a ^ a,
46 2 20. T H E C I R C L E M E T H O D
then 5f(p) = ojf(p a) i s the densit y o f solution s o f th e congruenc e
a
(20.54) f(x) = 0(modp ).
For som e polynomial s / (such a s (20.36) ) th e stabilit y propert y (20.53 ) hold s fo r
all p wit h a commo n a which i s sufficientl y larg e dependin g o n / . I n thi s cas e
(20.55) 6 /= rjW/(pa).
We now procee d t o th e estimatio n o f the integra l o f the exponentia l su m Sf(a)
over th e mino r ar c m = [0,1 ] \dJl. Firs t Hard y an d Littlewoo d estimate d 5/(a ) fo r
each a i n m and the n integrate d trivially . Fo r a polynomia l i n one variable w e have
LEMMA 20. 3 (WEYL) . Let g(x) = cx k + c i x ^ - 1 + • • • with c and k positive
integers. Suppose
(20.56) L - - U c — z, with (a,q) = 1 .
I q\ q
Then
a 1
(20.57) Y. < d(n)) « X ^+ | + ^) 21 K
for any £ > 0 and X ^ 1 , the implied constant depending only on e.
REMARK. Th e rang e o f summatio n i n (20.57 ) ca n b e shifte d t o an y segmen t
of lengt h X withou t affectin g th e bound .
PROOF. B y Propositio n 8. 2 th e exponentia l su m i s bounde d b y
1
2X1 X~ k ] T •••V J minpf,||adb!Tii---nfe_i||- )}
— X < 7 l i , . .. , 7 l f c _ i < X
<X{X"1+X-'C V JT 1
k^l(m)mm(X,\\acklm\\- )}
where th e firs t ter m comes fro m n\ . . . n^_i = 0 an d Tfc_i(m ) appear s a s a boun d
for th e numbe r o f representations o f m — ri\ • • • n&_i wit h l^n 1 l/<X sol^m<
M = X k~l. Afte r applyin g Tk-i(m) < C 22 m £ w e exten d th e summatio n t o
(20.58) Yl min(X , H^ir 1 )
where L = ck\M (i.e. , we ignore th e conditio n £ = 0(mo d cfc!)) . I f £ ranges ove r a n
interval o f lengt h q/2 th e point s a£(mod 1 ) ar e space d b y q~ x b y virtu e o f (20.56) ,
therefore th e su m (20.58 ) i s bounde d b y
(! + f ) E mi n ( x ) | ^ |1
" ) « ( l + ^ ) ( X + ,log ( ? ).
£(mod q)
This estimat e lead s t o (20.57) . •
From Weyl' s lemm a fo r a = a/q an d X = q we obtai n
20. T H E C I R C L E M E T H O D 46 3
COROLLARY 20.4 . Let g(x) be a polynomial of degree k ^ 2 with integer coef-
ficients. Let (a,q) = 1 . Then
l 2l
(20.59) £ e(^g{x))«{c, q)H - ~k+s
x ( m o d q)
where c is the leading coefficient of g, and the implied constant depends only on s.
R E M A R K . B y the Riemann Hypothesi s fo r curves ove r finit e field s (prove d by
A. Wei l [Wei] ) on e can obtain th e bound q^ +e, but wit h th e implie d constan t
depending o n g. However , ou r exponent 1 — 2l~k is more tha n sufficien t fo r appli-
cations i n this chapter .
For a in the minor ar c m we have (20.56 ) wit h Q < q < P, so the saving facto r
in the bound (20.57 ) is
0 i-fc
I_ L _ ^
Q + X + Y k/
which i s rather small . However , thi s ca n be multiplied i f / ( # i , . .. , x n ) has many
summands i n distinct singl e variables . Fo r example le t us consider
(20.60) f(x u ...,x n) = /i(:ri) + . .. + f n{xn)
where fi(xi) — C£Xk + • • • wit h Q > 0 for all 1 ^ £ < n. Then , b y Lemma 20.3
1
/I P \n2 l-k
n+
(20.61) Sf(a) = Sfl (a) • • • S fn(a) « X ' (_ + _ + _)
for an y a G m. Th e same boun d hold s fo r the integral o f Sf(a) on m. W e require
this to be of smaller orde r tha n X n~k whic h w e accomplish b y taking n sufficientl y
large i n terms o f k. Choosin g Q = X* an d P = Xk~$ w e ge t
(20.62) / S f(a)da<^X n—k — 5
with som e S > 0, if n > /c2/c+1. Not e tha t th e condition (20.35 ) fo r the polynomial
(20.60) hold s with 7 = n/k—1. S o the abov e choice of Q and P also gives A <C X~6,
see (20.45) . Moreover , w e verify (20.40 ) usin g Corollar y 20.4
Cf(a/q)<^q~n2l~k-^q-2~^
if n > 2k. Therefore (20.44 ) become s
n k 6
(20.63) J S f(a)da = (5fVf(B) + 0(X --)
where 0 / and Vf(B) ar e the singular serie s an d the singular integra l fo r the poly-
nomial (20.60) . Addin g (20.62 ) t o (20.63) w e conclude
46 4 20. T H E C I R C L E M E T H O D
T H E O R E M 20.5 . Let / i ( # i ) , . . . , f n(xn) be polynomials of degree k ^ 2 with
integer coefficients. Suppose n > /c2 /c+1 . Then the number of integral solutions to
the equation
(20.64) /l(Zl ) + " - + /n(*n)= 0
n
in a box B C [— X, X] with X ^ 1 satisfies
(20.65) u f(B) = <S fVf(B) + 0 ( X n " f e - 5 )
/or som e 5 > 0 ; where 0j an d V}(S) ar e t/i e singular series and the singular integral
for the polynomial (20.60).
The implie d constan t i n th e erro r ter m o f (20.65 ) ma y depen d o n th e poly -
nomials / i , . . . , fn bu t rathe r mildly ; i t doe s no t depen d o n th e constan t term s
of / i , . . . , / n . I n particular , th e asymptoti c formul a (20.65 ) applie s nicel y t o th e
equation (b y addin g a constan t term )
(20.66) fi(xi) + -- + f n(xn)=N.
Let c i , . . . , cn b e th e leadin g coefficients , an d suppos e al l ar e positive . Le t ff(N)
be the numbe r o f solutions to (20.66 ) i n positive integers . The y ar e contained i n th e
box B = [0 , X]n wit h X — cN* for som e constan t c ^ 1 depending o n / i , . . . , f n.
Therefore Theore m 20. 5 yield s
1 6
(20.67) i/ f(N) = & f(N)Vf(N) + OiN^- ')
for som e 5 > 0 . I n thi s cas e w e ca n comput e th e singula r integra l Vf(N) asymp -
totically. Firs t w e know fro m (20.46 ) tha t
n k
|{0 ^x u...,xn^X; |/i(a;i ) + • • • + f n(xn) -N\^ U}\ « UX ~
for an y U > 0 . Henc e an d b y (20.46 ) w e deriv e usin g th e approximation s fi(xi) =
QX£ + O p ^ " 1 ) tha t
Vf(N) = ^ | { 0 < X!,.. . ,x n < X ; | C l x{ + • • • + c*x $ - JV | < t / } |
+ 0{U- 1 Xn~1 + (C/X-* 5)7^-*).
In th e erro r ter m w e choos e U = x k~l^lJr^ gettin g 0{X n-k~^^l+^). I n th e
main ter m w e ca n reduc e U t o an y smalle r numbe r V withou t changin g th e abov e
error ter m b y virtu e o f (20.46) . Therefor e
1
(20.68) V f{N) = (c i • • • c nyiv{N) + O^- "*)
where V(N) i s th e sam e on e a s i n (20.48) . Insertin g thi s t o (20.67 ) w e conclud e
that
S
(20.69) u f(N) = ( Cl • • - c n ) - * r ( l + l^T^y'N^H&fiN) + 0(N~ )).
In particular , w e complet e th e proo f o f Theore m 20. 2 i f n > /c2 /c+1 , excep t fo r
the analysi s o f th e singula r serie s <5(N) about whic h w e shal l spea k later .
In orde r t o ge t result s fo r smalle r n w e look fo r improvement s i n the treatmen t
of th e mino r arc . Suppos e / ha s th e first variabl e separated ,
(20.70) f(x u ... ,x n) = g{xi) + h(x 2,... ,x n).
20. T H E C I R C L E M E T H O D 465
Then Sf(a) = S g(a)Sh(&), s o we ca n writ e
(20.71) I / S f(a)da\ < ( m a x | ^ ( a ) | ) / \S h(a)\da.
In th e mino r ar c w e estimat e S g(a) b y Weyl' s lemma . Recal l tha t fo r a G m w e
have (20.44 ) wit h Q < q ^ P. W e choos e Q = X* 5/3 an d P = X k~l+5, wher e J i s
a smal l positiv e number . Assumin g tha t g i s a non-constan t polynomia l w e ge t a
non-trivial boun d
(20.72) S g(a) < X 1 " 2 ^, i f a € m.
Since rj is small , w e cannot wast e muc h i n th e estimatio n o f Sh (a). Ou r goa l i s
n1
(20.73) / |S /l(a)lda<X - -fc+T'
JO
which i s sufficien t t o yiel d
n k
(20.74) / S f{a)da<^X - -^.
Moreover, th e abov e choice of Q and P make s the error term i n (20.44 ) o f admissibl e
order X 71 -^'. Addin g (20.44 ) t o (20.74 ) w e ge t
n k
(20.75) v f{B) - e fVf(B) + 0(X - ^).
It remain s t o prov e (20.73) . Not e tha t thi s boun d i s clos e t o th e tru e orde r
of magnitude . Indeed , b y droppin g th e absolut e valu e o f Sh (a) w e estimat e fro m
below b y th e integra l whic h i s exactly equa l t o th e numbe r o f solution s t o
(20.76) /*(x 2 ,... ,x n) = 0
in th e relevan t box , an d thi s numbe r i s expecte d t o b e o f orde r X n~1 ~k. I n vie w
of thi s remar k th e tas k o f showin g (20.73 ) ma y see m t o b e almos t a s difficul t a s
the origina l problem , o r eve n mor e difficul t becaus e w e hav e on e variabl e les s i n
(20.76) tha n i n th e origina l equatio n (20.26) . However , thi s situatio n i s not a s ba d
in practice, th e reaso n bein g that i t i s easier t o give an upper boun d fo r th e numbe r
of point s o n a variet y tha n t o coun t thes e wit h asymptoti c precision .
In 1 93 8 L-K . Hu a [H2 ] succeede d i n provin g (20.73 ) wit h an y rj > 0 fo r th e
polynomial whic h i s th e su m o f sufficientl y man y fc-th powers .
LEMMA 20. 6 ( H U A ) . Let k ^ 1 and
fe
(20.77) S(a)= J2 e(an ).
Then for any 1 ^ £ ^ k we have
2
(20.78) / |S(a)| 'da<A:2'-'+e
Jo
the implied constant depending on £ and k.
46 6 20. T H E C I R C L E M E T H O D
P R O O F . Fo r £ =1 w e get
I \S{a)\2da = X.
Suppose tha t (20.78 ) hold s fo r £Jowith 1 < £ < k. W e shall prov e i t hold s fo r £ + 1.
By repeated applicatio n o f the Weyl "differencin g process " £ times (se e Propositio n
8.2) w e arrive a t the inequality
e1
\S(a)f ^(2Xf- ~ £ " • £ ^(a/u.-./i^n;/n,.. . A) )
— X<hi,... ,h £<X nel
where / = I {hi,... , hi) is a subinterval of [1 , X] andp(x; / i i , . .. , hi) is a polynomial
of degre e k — £ with intege r coefficients . W e simplify thi s b y writing
1t
\S(a)f ^ (2Xf- - J2ame(am)
where a m i s the number o f solutions t o the equation
hi •••/i£p(n;fti,.. . ,ht)=m
in th e abov e range . Therefor e a^ < C X 1 an d a m < C X s i f m / 0 , becaus e
p(x; / i i , . .. , /i^) i s not a constant polynomial . W e can also writ e
\s{a)\2 = y]bme{am)
where 6 m is the number o f solutions to
" ™ks ~ nk s+i n\ 8 = m
£
with s = 2 ~ \ 1 ^ n i , . . . , n2s < X. Clearl y
2£
2 6 m = |5(0)| =X 2 £
and
1
60= [ \S(a)\2'da^:X2t-e+e
JO
by th e induction hypothesis . No w we derive b y the Parseval identit y
f \S(a)f +1
da < (2I)2 '-'- 1Va m () m « X 2 ^ " 1 * 2 ^ = X2
Jo m
completing th e proof. •
From Hua' s lemm a wit h £ — k on e gets (20.73 ) fo r any polynomial h in which
at leas t 2 k variable s appea r exactl y a s k-ih powers . Thi s hold s fo r the polynomial
(20.36) provide d n — 1 ^ 2 k (w e used on e variable fo r th e non-trivial estimatio n
(20.72) i n the minor arc) . I n particular, thi s complete s th e proof o f Theorem 20.2
except fo r the analysis o f the singular series .
In genera l thi s analysi s o f 0/ i s not simple. I n the special cas e of the equation
(20.23) on e can estimate th e series (20.43 ) b y elementary ye t long arguments , see
the comment s below .
20. T H E C I R C L E M E T H O D 46 7
The asymptoti c formul a
n k 6
(20.79) v f(B) = <5fV f{B) + 0(X ~-)
for th e equatio n f(x) = 0 in x £ B n Z n wil l b e meaningfu l onl y i f on e know s tha t
the singula r serie s <5f i s bounde d belo w b y a positiv e quantit y whic h doe s no t o r
only barel y depend s o n / , an d tha t th e bo x B i s larg e enoug h s o tha t Vf{B) ha s
order o f magnitude X n~k. Th e latte r i s essentially tru e i f the equation f(x) = 0 has
solutions i n rea l variable s layin g deepl y i n th e interio r o f B. However , a goo d lowe r
bound fo r <5f i s a more subtle questio n whic h w e shall addres s o n special occasions .
The positivit y o f 0/ follow s i f the equatio n f(x) = 0 is solvable in every p-adic field
by virtue o f the produc t formul a (20.50) . Actuall y th e loca l densities 5f(p) ar e ver y
close t o on e uniforml y fo r al l p sufficientl y large , s o th e proble m i s t o giv e goo d
lower bound s fo r 5f(p) a t finite numbe r o f place s whic h depen d o n / . I f f(x) = 0
has loca l solution s an d th e numbe r o f variable s n i s considerabl y large r tha n th e
degree /c , then (subjec t t o some extra incidental conditions ) ther e ar e plenty o f local
solutions s o tha t (5f ^ c(fc , n) > 0. Fo r example , i f f(x) — N — x\ — • • • — x\ (th e
case o f th e Warin g problem ) i t i s known tha t (se e H . Davenpor t [Da2] )
(20.80) <S(N) ^ c(fc , n) > 0 i f n > 4fc .
20.3. T h e circl e m e t h o d afte r K l o o s t e r m a n .
The circl e metho d i s a too l whic h i s suitabl e fo r detectin g th e equatio n m — n
by means of the additive characters e(ara) , wher e m runs over a given set of integers
and n i s fixed. I n a mor e genera l settin g on e ha s a sequenc e o f comple x number s
A — ( am ) wit h
^2 \ am\ < O O
mez
and th e proble m i s to evaluat e asymptoticall y a selecte d ter m o f A. W e hav e
Oin — I Sj,{a)e{—an)da
Jo
where
SA(OL) = ^ a m e(am).
m
Assuming w e hav e a n adequat e knowledg e abou t th e distributio n o f A i n residu e
classes t o smal l modul i w e ca n evaluat e SA (a) a t th e point s a whic h ar e clos e t o
rationals wit h smal l denominators . Th e se t o f suc h point s i s calle d th e majo r ar c
because th e mai n ter m i n th e asymptoti c fo r a n come s fro m thi s area . However , a
harder wor k i s neede d fo r estimatio n o f Sj,(a) fo r a i n th e remainin g are a whic h
is calle d th e mino r arc . Her e th e treatmen t depend s ver y muc h o n th e structur e o f
the sequenc e A. W e ar e particularl y please d wit h A bein g a n additiv e convolutio n
of many independen t sequences . I n thi s cas e SA(&) factor s int o several exponentia l
sums, s o eve n a smal l savin g i n estimatio n o f eac h o f thes e ca n lea d t o a success -
ful conclusion . I t i s intrinsi c o f th e Hardy-Littlewoo d typ e argument s tha t on e
needs a t leas t thre e summand s t o resolv e th e additiv e equatio n i n question . Th e
arguments certainl y fai l fo r binar y additiv e problem s (sometime s on e sumrnand ca n
be partitione d int o tw o s o th e additiv e proble m unde r consideratio n seem s t o b e
46 8 20. T H E C I R C L E M E T H O D
of ternar y type , bu t th e rang e o f resultin g variable s i s reduce d drasticall y s o thi s
rearrangement doe s no t reall y chang e th e problem) .
Another characteristi c o f th e equation s m = n fo r whic h th e circl e metho d a s
developed by Hardy-Littlewood work s is that th e expected number of solutions mus t
be larg e relativ e t o n , an d i t shouldn' t depen d essentiall y o n n . Fo r example , th e
method ma y wor k fo r a diophantin e equatio n f(x) = n i f th e numbe r o f variable s
is muc h large r tha n th e degree .
A rea l challeng e fo r th e circl e metho d i s t o detec t th e inciden t m = n whe n i t
is expecte d t o happe n irregularly . Moreover , a ne w ide a i s neede d t o trea t binar y
additive problem s
(20.81) "(n) = Y, J2 a™^
m-\-£=n
where A = (a m) an d B — (/^) ar e give n sequence s o f som e arithmetica l nature . I n
1926 H . D . Kloosterma n [Klo ] succeede d wit h th e equatio n
(20.82) <nx\ + a 2x\ - f a^x\ + a±x\ = n.
Actually hi s metho d work s fo r th e equatio n
(20.83) Q(x) = n
where Q(x) i s an y positiv e definit e quadrati c for m i n r ^ 4 variable s an d integra l
coefficients. (I n particular , usin g i t on e ca n recove r th e fou r square s theore m o f
Lagrange, whic h th e Hardy-Littlewoo d versio n coul d not) . I n th e nex t sectio n w e
shall trea t th e equatio n (20.83 ) b y extendin g th e origina l wor k o f Kloosterman .
To explai n th e novelt y o f th e Kloosterma n metho d le t u s speculat e o n th e
general additiv e binar y proble m (20.81 ) (th e quadri c equatio n (20.82 ) i s o f thi s
type, fo r on e ca n tak e m — a\x\ + a2x\ an d £ = a^x^ + #4X4 as the summand s wit h
a m , fy bein g th e representatio n number s b y th e correspondin g binar y forms) . W e
have
u(n) = I 5^(a)5e(a)e(—an)d a
Jo
where SU(a ) an d S& (a) ar e th e correspondin g exponentia l sums . Th e bes t bound s
for thes e exponentia l sum s whic h on e hope s t o prov e fo r almos t al l a ar e th e £2-
norms
MII = (£K*I 2 )*
77 1
and ||S| | respectively . Eve n i f th e bound s S^a) < C ||*4| | an d S 13(a)<C \\B\\ were
accomplished fo r a i n th e mino r ar c (whic h i s rarel y th e case) , i t i s no t sufficien t
because th e expecte d mai n ter m doe s no t excee d ||^4| | ||B|| . Therefor e on e canno t
rely onl y o n estimates . Fo r som e sequence s on e ca n transfor m th e correspondin g
exponential su m int o anothe r sum , sa y
(20.84) S A(a) = ^7 m ( « ) + E A(a),
77 1
where th e ne w term s 7 m (a) hav e th e statu s o f bein g "dual " t o a me(—am) an d
the erro r ter m E^(a) i s significantl y smalle r tha n \\A\\. O f course , th e dua l su m
is essentiall y a s larg e a s th e origina l one , bu t i t ma y hav e fewe r terms . Now ,
20. TH E CIRCL E METHO D 469
instead o f estimating , on e integrate s th e produc t o f dua l sum s i n a gettin g a n
extra savin g fro m th e cancellatio n whic h i s du e t o th e variatio n i n th e argumen t
of eac h individua l ter m 7 m (a). I n thi s wa y on e break s th e barrie r ||^4| | \\B\\ which
couldn't b e passed b y absolute valu e integration . Th e mai n ter m emerge s a s befor e
from th e integratio n i n smal l neighborhood s o f rational s wit h smal l denominators .
The diophantin e natur e o f a play s a rol e i n th e transformatio n (20.84) , s o i t i s
appropriate t o divid e th e circl e a (mo d 1 ) b y th e Fare y point s ^ wit h 1 ^ c <
C, (a,c) = 1 . However , i n contras t t o th e previou s practice , neithe r gap s no r
overlapping o f thes e segment s i s allowe d (becaus e o f th e prohibitio n o f employin g
estimates fo r th e exponentia l sum s SA(&) an d S& (a)). T o cove r th e circl e exactl y
we choos e naturall y th e divisio n point s t o b e th e mediant s o f th e Fare y sequenc e
of orde r C. Th e resultin g segment s
a' + a a + a" 1 1
m (D - (7 + c ' c + c" c c(c + c'Y c
+ •
c{c + c")
have lengt h whic h change s i n arithmeti c fashio n (se e (20.9) ) an d thi s propert y ha s
to b e considere d whe n integratin g differen t piece s o f th e dua l exponentia l sum .
Kloosterman control s th e lengt h o f th e segment s 9#(f ) b y Fourie r analysis , an d
consequently create s exponential s o f typ e e ( ^ ) fo r variou s u G Z , wher e d i s th e
multiplicative invers e o f a( mo d c) . B y thi s analysi s th e integratio n i n a (mod 1 )
amounts t o the summatio n ove r th e Fare y point s ~ with factor s e(^f). A t th e sam e
time a nic e coincidenc e ma y occu r tha t th e dua l term s 7 m (a) fo r a clos e t o - als o
have a phase of type ^-(~) fo r variou s v E Z (thi s happens fo r th e sequenc e of values
of a quadrati c form) . Multiplyin g thes e factor s an d summin g ove r a(mo d c ) on e
arrives a t th e Kloosterma n su m
n/ x v ^ fdm + an\
S{m,n;c)= 2 ^e [~ )
ad=l(mod c )
with m — —u — v. No w an y non-trivia l boun d fo r S(m,n;c) correspond s t o a
cancellation i n th e integra l o f th e dua l sum s givin g a critica l improvemen t o f th e
direct boun d ||^4| | ||S|| . Kloosterma n succeede d i n showin g tha t
S(m,n;c) < C {m,n,c)^c^r(c),
while th e Wei l boun d (Corollar y 1 1 .1 2 ) assert s tha t
(20.85) |5(m,n;c) | < (m,n,c)h^r(c).
Let u s emphasize tha t i f Q(x) wa s not a quadratic polynomial , the n th e dua l terms ,
if they exist , ma y hav e a phase othe r tha n e ( ^ ) , i n which cas e another exponentia l
sum t o modulu s c would appea r i n place o f the Kloosterma n sums , an d th e metho d
still work s b y a n appea l t o appropriat e result s fo r th e relevan t sums .
We no w procee d t o a detaile d presentatio n o f Kloosterman' s idea s i n a some -
what abstrac t form . W e begi n b y establishin g a Fourier-Kloosterma n expansio n o f
the delt a symbo l
if n = 0 ,
(20.86 ) 5(n) =
if n + 0 .
47 0 20. TH E CIRCL E METHO D
PROPOSITION 20.7 . Let C be a real number ^ 1 . We have
(20.87) 5(n) = 2R e / ^ ^ * (cd)"^ ^- ^)d x
w/iere • restricts the summation by (c , d) = 1 an d J z s the multiplicative inverse to
d modulo c.
REMARK. Fo r n = 0 the formul a (20.86 ) give s a cut e identit y
1 =
(20.88) J2Y, M" i
c^C<d^c+C
(c,d) = l
EXERCISE 2 . Prov e (20.88 ) directly .
For th e proo f o f (20.87 ) w e ma y assum e tha t C i s a positiv e integer . Le t
/ : M.— > C b e a periodi c functio n o f perio d 1 . W e shal l evaluat e th e constan t ter m
of th e Fourie r serie s o f / whic h i s equa l t o th e mean-valu e
/!(/) = / f(x)dx.
Jo
Splitting int o Fare y segment s 97t(^ ) w e obtain b y th e periodicit y o f / ,
M/) = £ £ / , /(*)<* *
^^ fl/c(c+c")
= £ £/ /(?+* >
(o,c) = l
the las t lin e followin g b y (20.9) . Suppos e tha t / ha s th e symmetr y f(—x) = f(x)
(i.e., / ha s rea l Fourie r coefficients) , the n
ri/cc * / /7 \
(20.89) M / ) = 2R e £ / £ /( * - -)dx .
c^C**0 C<d^c+C
cdx<l
Applying thi s t o f(x) — e(nx) an d changin g x int o x/cd on e get s (20.87) .
In th e remainin g par t o f thi s sectio n w e elaborat e (20.89 ) further , no t fo r im -
mediate application s bu t rathe r t o uncover mor e structure o f the formula. W e write
(20.89) a s
(20.90) fi(f) = 2R e f°° ( ^ c^Kffa c ) x 2 dx
20. T H E C I R C L E M E T H O D 471
where Kf(x; c) is a kind o f incomplete Kloosterma n su m associated wit h /
(20.91) K f(x;c)= £ * f(~-^]
C <d^m'm(x ,c+C) c ex
This ca n be expressed a s a complete su m in d(mod c) by applying th e formul a
~x — d' 1i f d ^ x,
)-{ 0 i
•C-d-
mm (•• . c . - c. fd> x
for C < d < c -f C and x ^ C. Henc e w e ge t
[x — d C-d
(20.92) K f(x;c)= ^ min M
J/ Vc ex J
d(mod c)
Moreover, th e "min " facto r i n (20.92 ) equal s 1 if x > c + C, and it is equal to
x-C
n—)+n—)
"x — d' •C-d' d
-
.c . c.
if C < x < c + C, therefor e
(20.93) +r W £^)_^))/(l.i)
d(mod c)
where th e last su m is present onl y i f C < x < c + C. Insertin g thi s int o (20.90 ) we
obtain
P R O P O S I T I O N 20.8 . Let C be a positive integer and f a periodic function on
R of period 1 . Then the constant term of f in its Fourier series is given by
(20.94)
,</)-2R. E c-/;»,!„(», i ^ )( £ * / ( £ - i ) ) . - *
l
dx.
c<C d(mod c)
R E M A R K S . Quit e ofte n i n applications th e first lin e o f (20.94) yield s th e main
term whil e th e second lin e ca n be successfully estimate d b y means o f exponentia l
sums ove r finite fields. Expandin g \p and / int o Fourier serie s the sum Kf(x\ c) can
be expressed i n terms o f complete Kloosterma n sum s (20.85) , but this optio n i s not
always practical . Fo r f(x) = e(—nx) on e get s by (20.94) anothe r expressio n fo r the
delta-symbol
5(n) = 2Re ^2c' lR{n\c) I e(-—) minfl , X ~ )x~ 2
dx
c^C
c" 1 / (R c(n;c) - R x(n;c))e( )x~ 2
dx
J c V CXJ
c<C
472 20. T H E C I R C L E M E T H O D
where R(n; c) = S(0 , n; c) i s the Ramanuja n su m an d
x — d\ fnd
Rx(n;c)= 2 ^ if; I-
d(mod c)
Note tha t
i? x (n;c)<C ( ^T d"2^c^r(c ) log2 c
<i|(c,n)
by applyin g th e Fourie r expansio n
<^)=E<^-M^)+o((1+c||^|)-')
V7
0<|^|< c
and Weil' s boun d fo r th e Kloosterma n sum s S(£,n;c).
20.4. Representation s b y quadrati c forms .
We procee d t o trea t th e equatio n (20.83 ) b y th e Kloosterma n circl e method .
Our goa l i s
THEOREM 20.9 . Let Q(x) be a positive definite quadratic form in r ^ 4 vari-
ables and integer coefficients. Then for any n > 0 the number r(n, Q) of integral
solutions m = ( m i , . . . , mr) to Q(m) = n satisfies
[ +£
(20.95) r(n,Q)= ' " &(n,Q) + 0(n^~i )
T(k)y/\A\
where k = | , \A\ is the determinant of Q and <5(n , Q) is the singular series
CO
(20.96) 0 ( n , Q ) = X)c- r0c(n,Q)
c=l
(20.97) 9c(n,Q)= ]T *£ e(^(Q(/i ) - n)) ,
<i(mod c ) h ( m o d c )
£/ie implied constant depends on e and on the form Q.
The quadrati c for m Q(x) i n x = ( x i , . . . ,x r ) i s associate d wit h a symmetri c
matrix A = (a^ ) o f ran k r wit h a ^ G Z an d a ^ G 2Z whic h i s positiv e definite . I n
SiegePs notatio n
\cg\X) = =
^./i[3? J — 2 X.ri X — ~F: / Q,ii%i \ / / OijXiXj .
The discriminan t o f Q i s define d b y
( D — (—1)5 |A| i f r i s even ,
\ D = \{-l) r-¥\A\ i f r i s od d
where |^4 | is th e determinan t (on e show s tha t D = 0 , l(mod 4 ) i f r i s even). Pu t
CO
(20.98) 0{z) = ] T e (Q(m)z) = ]TV(n , <2)e(nx )
20. TH E CIRCL E METHO D 47 3
where
r
r(n,Q) = \{meZ ;Q(m)=n}\.
This thet a functio n i s a modula r for m o f weigh t k = | , leve l 27V , where N i s suc h
that NA~ l i s integral, an d i t transform s b y a suitabl e multiplie r syste m define d i n
terms o f the Jacob i symbo l (se e Theorem 1 0. 8 of [1 4]) , but w e do no t us e thi s fact .
All we nee d i s the followin g Jacob i inversio n formul a
LEMMA 20.1 0 . Let z e H and x e R r. Then
(20.99) J ; e ( | ^ [ m + x] ) = | ^ | - i Q ) f c £ e {^A^[m]^mx\
SKETCH O F PROOF . Thi s follow s b y th e Poisso n summatio n formul a
J2 f(m + x)= ] T f{rn)e^mx)
mezr mez r
for f(x) = e(|A[x]) . Th e computatio n o f the Fourie r transfor m f(m) i s carried ou t
by a linea r chang e o f variable s whic h diagonalize s A , s o i t reduce s t o th e familia r
integral i n on e variabl e
JRily2--y)dy={^2e(-^).
D
Let ad = l(mo d c) . W e shal l us e Lemm a 20.1 0 t o transfor m 0(z — 7 ) int o a
theta functio n fo r th e adjoin t quadrati c for m
(20.100) Q*(x) = ^A-^x].
REMARK. Th e adjoin t for m Q*(x) doe s not hav e integra l coefficients . I fT V is a
positive intege r suc h tha t NA' 1 i s integral, the n NQ*(x) ha s integra l coefficients .
For example , N = \A\ satisfie s thi s condition , ye t a smalle r numbe r suffice s quit e
often. Not e tha t N r = 0(mo d \A\) s o every prim e facto r o f \A\ i s in N.
Splitting th e summatio n i n (20.98 ) int o residu e classe s h = (hi,... ,h r) o f
modulus c we ge t
0 Z
( ~l)= £ e
(-? 0 ( / l ) ) E e(Q(m)z).
h(mod c ) m=h(mod c)
Then b y th e Jacob i inversio n formul a (20.99 ) fo r x = hc~ l w e ge t
m=h(mod c) rn
Inserting thi s int o th e forme r su m an d changin g h int o dh modul o c we arriv e a t
47 4 20. TH E CIRCL E METHO D
LEMMA 20.1 1 . Let z e H I and ad = 1 (mo d c) . Then
,,0,0, , (l.£).|4-^(J)'EOm(^W_^0)
where Q* is the adjoint quadratic form and G m (d/c) is the Gauss sum
Gm
(20.102) (T)= H ^ W W + ' H ).
/i(mod c )
To pick u p th e coefficien t r(n,Q) i n (20.98 ) w e appl y (20.89 ) fo r f(x)
e(—nz)0(z), wher e z = x + iy wit h 2 / > 0 to b e chose n later . W e obtai n
,1/cC
(20.103) r(n , Q) = 2R e 2 ^ / T(c,n;a:)e(-n^)<i x
where
d
(20.104) T(c,n;x)= J^ e(n-)#(z--) .
c
cdx<l
Then b y Lemm a 20.1 1 we obtai n
.k
(20.105) T(c,n;x) = \A\-h- rl-\ ] TT m (c,n; x)e(-Q*(ra)/c 2 £)
Zj
m£l
where
(20.106) T m(c,n;x)= ^ * e(n-\G,
C«L^c+C
cdx<l
Now w e ar e goin g t o estimat e T m(c, n ; x). Firs t w e estimate th e Gaus s su m
LEMMA 20.1 2 . Let (c,d) = 1 and m eZ r. We have
<<c i
(20.107) Grn (~c)
where the implied constant depends only on the quadratic form Q.
PROOF. W e hav e
Gm
(l)\ = E * e(^(Q(x)-Q(j/)+*(a:-j/)m) )
^C^ ' - -(mo d c )
x,y(mod
J2* e[- cCyAz + Q(z) tzm
y,z(mod c)
< c r |{z(mod c):Az = 0(mo d c)} |
which yield s (20.1 07) . •
Next w e evaluate th e Gaus s su m precisely , bu t onl y i f th e modulu s i s od d an d
coprime wit h th e discriminant .
20. T H E C I R C L E M E T H O D 475
LEMMA 20.1 3 . Let (c, 2\A\d) = 1 and rn e Zr. We have
W
<*«*> ^-(T)HT)*R^ )-
P R O O F . I t reduces t o the one-dimensional cas e b y a local diagonalizatio n and
completing th e square,
(20.109 ) £ e(^-)=eJfjV~c
V
y(mod c) ' ^ '
if (c , 2a) = 1 (see (3.38)). Recal l tha t a nonsingular symmetri c matri x ove r a field
of characteristic ^ 2 is equivalent t o a diagonal matrix . Henc e one shows tha t ther e
exist a n integral matri x V and an integral diagona l matri x B = diag (&i,.. . , br)
such tha t (c , |V|) = 1 and tVAV = B(mod c) . Changing x to Vy modulo c , we ge t
A[x] = B[y\ = Y,b,yl.
Moreover, lettin g V tm = [di,.. . , dr ], we get txm = t yVtm = ^u d vyv. Henc e
rr r
Q(x) + txm=\ ]T(&„}/ 2 + 2dvy„) = \ ^ b v ( y v + bvdv)2 - \ ^ M ^-
Here th e last su m is congruent modul o c to
Moreover, w e have bi--b u = \B\ = \A\ |T/|2(mod c) . Therefore , applyin g (20.1 09 )
in eac h variabl e y v we conclude (20.1 08) . •
To estimat e th e sum (20.106) w e first complet e it as follow s
(20.110) T m(c, n;x)= Yl 7 WW rn, n\ c)
£(mod c)
where
e
'£d + nd\^G(m d
(20.111) K(£,m,n;c)= £* ( ^ T ^ Kc)
<i(mod c)
and
C<6^min(c+C,l/cx)
If |^ | ^ | w e have 7(£ ) < C (1 -f Kl)- 1- Therefor e th e problem reduce s t o estimatio n
of th e complet e sum s K(£, m, n; c).
The complet e su m K{t,,m,n',c) i s multiplicative i n c. Precisely , i f c — c§c\
with (co,ci ) = 1 , then K(l,m,n;c) — K^c°\i, ra, n ; ci)if ( Cl^(£, m, n; Co) where the
superscripts (co),(ci ) indicat e tha t th e additive character s e( — x) an d e( — x) ar e
used i n place o f e( —) an d e( —) respectively . Le t Co denote th e largest facto r o f c
having all its prime divisor s in 2\A\ and c\ be the remaining factor , s o (ci, 2|^4|) = 1 .
476 20 . T HE C I R C L E M E T H O D
We estimat e th e sum s (20.1 1 1 ) t o moduli CQ an d c\ separately. B y (20.1 07 ) an d
trivial summatio n ove r d o (mod Co) , we ge t
1
Cl)
(20.112) K( {e,m,n;c0)<Z:4+
where th e implie d constan t depend s onl y o n th e for m Q. The n b y (20.1 08 ) w e ge t
(20.113) K^(£ 1 1 )= 0^\ fei(f)v^)
m,n;c S r{c0£, c0(n + Q*(m)))
where
d(mod c i)
is eithe r a Kloosterman su m i f r is even , o r a Salie su m i f r is odd . I n eac h cas e
this su m satisfie s th e boun d (20.85 ) givin g
(
(20.114) i f ^ ) ( ^ m , n ; c i ) < ( ^ n + Q*(m),ci)ic^T(ci) .
Inserting (20.1 1 3 ) an d (20.1 1 4 ) int o (20.1 1 0 ) an d summin g ove r \£\ ^ | w e deriv e
LEMMA 20.1 4 . We have
(20.115) T m{c, n ; x) < ( n + Q*(m), cjitfc^ r(c) log2 c
where the implied constant depends only on the form Q.
We shall appl y th e boun d (20.1 1 5 ) t o al l terms i n (20.1 05 ) excep t fo r m = 0 in
the rang e 0 < x < l/c( c + C) i n whic h To(c , n; x) i s equa l t o
e
d\ v ^ / d
(20.116) T(c,n)= J2* (nc) ^ e {--Q(h))-
cJ ^— ' \ c
d(mod c) h(mod c )
We obtai n
T{c,n;x) = \A\-h-r(-) T(c,n)
k
^i / 2ixyQ' {m)"
+ o((c 0c)Mc)(log2c)(^
r
mez
where ^ mean s that m = 0 is excluded fro m th e summatio n i f 0 < x < l/c( c + C).
We estimat e thi s su m b y
Recall tha tT V i s a positive intege r suc h tha t NQ* i s integral an d (ci , 27V) = 1 . W e
take
(20.117) C = n^, an d y = C'2 = n " 1 .
This choic e implie s tha t fo r z = x + iy wit h 0 < x < (cC) _ 1 ,
and if (c(c + C ) )_ 1 < x < (cC) _ 1 , the n w e als o hav e th e lowe r boun d
c\z\y~i > (2C)" 1 i/-^ = |.
20. T H E C I R C L E M E T H O D 47 7
Moreover, w e hav e Q*(m) ^ > \m\ 2. Hence , fo r an y 0 < x < (cC) -1 w e deduc e
E V ^ W e x p ( - ^ g M ) <(c|*|y-i) «
for an y K ^ 0 . Applyin g thi s wit h K — k w e ge t
T( C ,n;x) = | ^ | - i c - r ( ^ ) f e T ( c , n ) + O(C(c 1 )( C0 c)5T( C )(log2c)n*)
where
2
aci) = Y /(ciJ + nN)i(e+l)- .
Inserting thi s int o (20.1 03 ) w e ge t
fl/cic+C)
r(n,Q) = \A\-* Y^ C-rT(c,n) / (i/z) k
e{-nz)dx
c^C J-l/c{c+C)
+ O(n^- iyj^(ci)(co/c)5r(c)log2c).
Here th e erro r ter m i s bounde d b y
---+£
n2 4 .
On th e othe r hand , th e integra l i s equal t o
1
(i/z)ke(-nz)dx = (270^)-V"
/ oo
up t o a n erro r ter m 0((cC) ) . Summin g thi s erro r ter m ove r c ^ C w e find tha t
its total contributio n i s absorbed b y th e erro r ter m alread y presen t (us e th e boun d
(20.115) fo r T{c,n)), therefor e
T(k)y/\A\^C
Finally, extendin g th e summatio n i n th e mai n ter m t o al l c w e obtai n (20.95 )
(notice tha t T(c,n) = g c{n,Q) b y changin g th e variable s h int o dh an d d int o — d
in (20.1 1 6)) .
Theorem 20. 9 i s als o vali d fo r ternar y form s ( r = 2k = 3) , bu t th e erro r
term i n (20.95 ) i s just to o larg e t o yiel d a n asymptoti c formula . However , a sligh t
improvement i s possibl e b y exploitin g additiona l cancellatio n i n sum s o f th e sum s
(20.113) fo r variou s modul i o f the relevan t Fare y sequence . I n th e cas e o f r = 3 one
obtains th e Sali e sum s
e
(20.118) K{a,b;c)= ^ {^) {—~
d(mod c )
These ca n b e compute d explicitl y i n term s o f quadratic root s modul o c. Fo r exam -
ple, i f (c , 2a) = 1 , then (se e Lemm a 1 2.4 )
(20.119) K(a,b;c)=e c{~)d ] T e(^) ,
x2=ab(mod c)
478 20 . TH E CIRCL E METHO D
which easil y yield s \K(a,b;c)\ ^ c^r(c). Clearly , thi s boun d i s bes t possible , ye t
one can deriv e a better estimat e o n averag e ove r th e modulu s c due t o th e variatio n
of e(^f) . Suc h a resul t belong s t o th e spectra l theor y o f automorphi c form s o f
weight k = § , an d i t lead s t o th e followin g formul a (se e [Du2 ] an d [1 6] )
T H E O R E M 20.1 5 . Let Q(x) be a ternary, positive definite quadratic form with
integral coefficients. Then for any n > 0 the number of integral representations
Q(m) = n satisfies
(20.120) r(n,Q ) = 4 7 r ^ V ( 5 ( n , Q ) + 0(ni-5iT )
where &(n,Q) is the singular series given by (20.96)-(20.91 ), and the implied con-
stant depends on Q.
The formula s (20.95 ) an d (20.1 20 ) ar e tru e asymptotic s onl y i f th e singula r
series <5(n , Q) doe s no t vanish . W e complet e thi s sectio n b y highlightin g a fe w
features o f (S(n , Q). W e kno w fro m th e genera l consideration s i n th e previou s
section tha t th e singula r serie s (20.96 ) i s th e produc t o f loca l densitie s (o f p-adi c
solutions t o Q(x) = n)
(20.121) 0(ra,Q )= IppM)-
p
Also th e leadin g facto r
1
(20.122) < W n , Q ) - '
T(kW\A\
embodies th e densit y o f rea l solution s t o Q(x) = n . W e shal l comput e th e loca l
densities S p(n, Q) fo r p\ 2\A\ usin g th e formul a (20.1 08 ) fo r Gaus s sums ; i t gives
/ 2
(20.123) &(«,<?) = ( ^ W v ]T * (^Ve(-r^ )i f (c,2|^| ) = l .
Suppose r i s even. The n th e abov e su m i s the Ramanuja n su m givin g
gc{n,Q) = XD(c)c k ] T V>(p)q
q\{c,n)
where D = (—l) r/2|^4| i s the discriminan t o f the quadrati c for m Q(x) an d XD(C) —
( ^ ) i s the Jacob i symbol . Henc e fo r an y P wit h (P , 2D) = 1
c\P q\{n,P) p| f
In particular, i f every prime powe r factor o f P i s larger tha n tha t o f n, thi s simplifie s
to
^c-^^^^^x^^y-^na-XD^p-^).
c\P q\n p\P
Taking P — p " + 1 , where v — ord p n, w e ge t th e loca l densit y
(20.124) 5 p(n,Q) = (1 + X D ^ P " * ) - ^! - XD^P 1 -")'1 ^ ~ XD(p")p" (1 ~fe))
20. T H E C I R C L E M E T H O D 479
for an y p \ 2D. Henc e i t i s clear tha t
(20.125) S p(n,Q)^0 iip\2D.
Moreover, takin g P t o be the product o f arbitrary larg e numbe r o f primes p \ 2D
we deduce tha t
(20.126) <5(n , Q) = <Ti_ fc(n, X*D)L% X 4 D )1_ f[ 6 p(n, Q)
p\2D
where L(S,X4D) denote s th e Dirichlet L-function , an d
(20.127) < 7 s (n,xH$>(dK.
d\n
The question when S p(n, Q) > 0 for p\2D is quite delicate. I f r > 4 one can show
that th e non-vanishing o f all the local densitie s 5 p(n, Q) wit h p\2D is equivalent t o
the existenc e o f m e 7U such tha t
(20.128) Q(m) = n (mo d 27 |A| 3 ).
Assuming thi s condition , on e can infer b y (20.96)-(20.97 ) tha t
(20.129) <5(n , Q) x JJ(1 + XD^P 1 ^)
p\n
if r i s even, r = 2k > 4. Fo r r = 4 a similar analysi s applies , excep t fo r 5 2 (n, Q).
If r i s odd, r ^ 5 , then th e exact computatio n o f the local densitie s o f S p(n, Q)
are mor e involved ; nevertheless , on e can derive directl y b y the trivial estimatio n
1 +
(20.130) | £ c(n,Q)|^cS if( c ,2|A|) = l
that th e singular serie s <S(n , Q) is bounded fro m belo w an d above b y positive con -
stants depending only on Q, provided the congruence (20.1 28 ) has integral solutions.
REMARK. I f r = 3 , we know onl y tha t 0(n , Q) ^ > n~ £ fo r an y e > 0 wher e
the implie d constan t depend s o n s, Q, still assumin g th e solvability o f (20.1 28 ) and
that S2(n.Q) > 0 . A t presen t th e implie d constan t i s not effectivel y computable ,
because th e result use s Siegel' s boun d (5.76) .
There i s mor e t o sa y abou t th e formul a (20.95 ) i n term s o f modula r forms .
Indeed, th e representatio n numbe r r(n,Q) i s th e n-t h Fourie r coefficien t o f th e
theta functio n (20.98 ) whic h belong s t o the space Mk(2N,$) o f modular form s of
weight fe, level 2A^ , and a suitabl e multiplie r # . Suppos e r — 2k ^ 4 , so the whole
space Mk(2N,d) i s spanne d b y th e Eisenstei n serie s (on e for eac h singula r cus p
with respec t t o $) an d a finite numbe r o f cusp forms . Therefor e
0(z,Q) = E(z,Q) + F(z,Q)
where E(z, Q) is a unique combinatio n o f the standard Eisenstei n serie s (calle d the
Eisenstein serie s of the form Q) and F(z, Q) is a cusp for m uniquel y determine d by
Q. Fro m thi s decompositio n
r{n,Q)=p{n,Q) + T(n,Q)
48 0 20. T H E C I R C L E M E T H O D
where p(n, Q) , r(n, Q) ar e th e correspondin g Fourie r coefficients . I t turn s ou t tha t
p(n< Q) coincide s wit h th e mai n ter m i n th e formul a (20.95 )
/o \ kk 1
[
p(n,Q)=800(n,Q)Y[6p(n,q) = ^n<5(n,Q).
(kW\A\
I (k)\/\A\
In vie w o f thes e observation s (du e t o Siegel ) th e erro r ter m i n (20.95 ) i s jus t a
bound fo r th e Fourie r coefficien t r{n, Q) o f the cus p for m F(z, Q). Havin g reveale d
this fac t w e coul d appl y th e Ramanujan-Petersso n conjectur e
r(n,Q) < r(n)n^~
(the implie d constan t dependin g o n Q) whic h wa s prove d b y P . Delign e i f k i s a n
integer, k ^ 2 , i.e. , i f r = 2k i s even , r ^ 4 . Ou r resul t (20.95 ) correspond s t o th e
Selberg boun d fo r th e Fourie r coefficien t r(n, Q). I f r i s odd, th e theor y o f Delign e
is not applicable .
Siegel [Sie3 ] als o observe d tha t th e Eisenstei n serie s E(z, Q) fo r a for m Q i s
genus invariant . Actuall y h e showe d tha t
w
E(z,Q)= J2 (QvM*M
Qve ge nQ
where Q v run s ove r inequivalen t form s i n th e genu s o f Q , an d w(Q u) ar e th e genu s
masses. T o defin e thes e number s w e first recal l som e definition s an d basi c fact s
from th e theor y o f quadrati c forms , i n particular , genu s theory .
Two quadrati c form s Qi,Q 2 (i t i s assume d throughou t tha t al l form s unde r
consideration hav e the sam e ran k r ^ 2 ) ar e equivalen t i f one ca n b e obtaine d fro m
the othe r b y a unimodula r (invertibl e ove r Z ) chang e o f variables, i.e . i f A\ — A 2U
for som e U £ M r (Z) wit h \U\ — ± 1 . Th e equivalen t form s for m a class . Th e
determinant i s a clas s invariant .
A quadrati c for m Q(x) = ^A[x] i s equivalen t t o itsel f i n a numbe r o f ways .
Denote b y
0(Q) = {U e MriZy^UAU = A}
the grou p o f automorphs o f Q. Thi s i s finite an d th e sam e on e for equivalen t forms ;
we denot e it s orde r b y \0(Q)\. I f r = 2 th e numbe r \0(Q)\ depend s onl y o n th e
determinant \A\, bu t i f r > 2 it varie s fro m clas s t o class .
Two forms Qi , Q2 are in the sam e genu s if they ar e equivalent ove r every p-adi c
field and ove r the rea l numbers (th e latte r conditio n i s redundant sinc e both Qi , Q2
are assume d t o b e positiv e definite) . Th e determinan t i s a genu s invariant . On e
can sho w tha t tw o form s Q\,Q2 o f th e sam e determinan t \A\\ = |^4_2 1 are i n th e
same genu s i f an d onl y i f the y ar e equivalen t ove r Z t o form s congruen t modul o
8|Ai||A2|. Th e numbe r o f classe s i n a genu s o f a fixed determinan t i s finite.
Clearly, th e numbe r o f representation s r(n,Q) i s a clas s invariant , bu t i t ma y
vary within a given genus, because the number o f automorphs ma y do so. Therefore ,
it i s natura l t o ad d th e followin g weigh t w(Q) t o th e representations :
^-i^L^isib)" 1 -
20. T H E C I R C L E M E T H O D 481
Note tha t th e tota l mas s o f th e form s i n a genu s i s
Que ge n Q
Accordingly, th e averag e numbe r o f representatio n o f n b y form s o f a give n genu s
is define d a s th e weighte d su m
r(n, ge n Q) = ^ w(Q u)r(n, Q v).
Q„e ge n Q
Collecting th e abov e notatio n an d result s w e arriv e a t th e Siege l mas s formula ,
V
(27r)knk~1e n
(20.131) r(n , gen Q) = p(n , Q) = / /rn ( > Q)-
1 {k)yJ\A\
20.5. A n o t h e r decompositio n o f t h e delta-symbol .
Dissecting th e uni t circl e b y th e Fare y point s ~ c of orde r C w e develope d a
Fourier typ e expansio n fo r S(n) i n term s o f th e additiv e character s e(n^) , wher e
dd = l(mo d c) . W e cal l ^ th e Kloosterma n fractions . Thes e appea r i n th e expres -
sion (20.87 ) wit h d ranging ove r th e interva l C<d^c + Co£ lengt h exactl y c , bu t
not alone ; the y ar e escorte d b y th e othe r factor s involvin g d. Thi s featur e seem s t o
be insignifican t fro m a n analyti c viewpoint . I n orde r t o rela x thes e "minor " factor s
one uses Fourier technique , an d inevitabl y on e is led to complete Kloosterma n sum s
S(m,n;c) i n additio n t o th e Ramanuja n sum s 5(0 , n;c). On e canno t avoi d the m
in Kloosterman' s treatment , eve n fo r smal l c.
In thi s sectio n w e giv e a differen t expressio n fo r o(n) entirel y i n term s o f th e
Ramanujan sum s
s ( 0 ,„ ; c ) = £ * „(2!)_ £„(')„.
"nd^
VC /
d(mod c ) d|(c,ra )
We begi n b y a simple-minde d treatment . A n intege r n i s equa l t o zer o i f an d onl y
if i t ha s a diviso r large r tha n \n\. Therefore , fo r an y q > \n\ we hav e
'<»> = ?a (£m o d q)•(=) •
Writing - i n it s lowes t term s w e ge t
(20.132) S(n) = -^5(0,n;c) .
c\q
Next w e average ove r q to gai n a n extr a variable . Le t w(u) b e a function supporte d
in a segmen t C < u < 2 C an d normalize d b y
CX D
(20.133) ^Tw(q) = l.
9=1
482 20. T H E C I R C L E M E T H O D
Summing (20.1 32 ) ove r q we ge t
oc o o
1
(20.134) 5(n) = ^ c f ^ O , n ; c) ]T r " ^^).
c=l r = l
This formul a i s vali d onl y fo r \n\ < C ; consequently , i t employ s th e character s
e(nd/c) t o moduli a s large as |n| , whic h is not a good featur e i n applications. I n thi s
respect a Kloosterman typ e expansion ha s the advantag e o f employing character s t o
much smalle r modul i relativ e to |n| , precisel y one can make it usefu l wit h c ^ 2|n | 2.
To refin e th e abov e ide a w e list her e th e followin g observations :
— ever y positiv e intege r divide s zero ,
— a non-zer o intege r ha s fe w divisors ,
— of th e tw o complementar y divisor s o f n > 0 one doe s no t excee d n 2.
Let w{u) b e a functio n o f u ^ 0 which vanishe s a t u = 0 and decay s rapidl y t o
zero a s u —> oc . W e normaliz e w{u) b y th e conditio n (20.1 33) . The n b y th e abov e
observations w e deduc e th e followin g identity :
6(n) = Yl{"(9)-»(!j))-
q\n
Detecting th e conditio n q\n b y orthogonalit y o f additiv e character s e(an/q), an d
then reducin g th e fraction s a/q w e obtai n
PROPOSITION 20.1 6 . Let w(u) satisfy the above conditions. Then for any in-
teger n we have
00
5
(20.135) S(n) = ^2 ( ° ' n ; c )Ac(n)
c=l
where A c(u) is a function on R given by
00
(20.136) A c(u) = Y^^r^iwicr) - w(\u\/cr)).
r=l
Since A c(u) i s not compactl y supporte d (eve n if the test functio n w(u) i s chosen
with compac t support) , w e make a practical alteratio n b y multiplying (20.1 35 ) wit h
a nic e functio n f(u) supporte d i n \u\ < 2N an d normalize d b y /(0 ) = 1 . W e ge t
00
(20.137) S(n) = ] T 5(0 , n; c)A c(n)f(n).
c=l
Now suppos e tha t w(u) i s als o compactl y supported , sa y i n th e dyadi c interva l
C ^ u ^ 2C (thin k o f w(u) a s a bum p function) . The n th e serie s (20.1 37 ) reduce s
to c ^ 2max(C , N/C) = X , say . Clearl y th e optima l choic e i s
(20.138) C = N l*
giving X = 2 C = 27 V i an d
(20.139) S(n)= J2 5(0,n;c)A c (n)/(n).
20. T H E C I R C L E M E T H O D 483
Therefore i t take s character s o f modul i c ^ 2N 2 to detec t th e even t n = 0 withi n
the integer s \n\ ^ 2N.
Of course , th e optima l choic e fo r th e magnitud e o f modul i i s no t necessary ,
so w e continu e ou r investigatio n withou t th e conditio n (20.1 38) , becaus e ther e ar e
some technica l benefit s fro m havin g C independen t o f N. Yes , our ne w expression s
(20.135) an d (20.1 37 ) stil l contai n onl y th e Ramanuja n sum s (ther e ar e nowher e
hidden Kloosterma n fractions!) , bu t wha t abou t th e facto r A c (n)? Assumin g tha t
the tes t functio n w(u) i s smooth on e can regard A c(u) a s a continuous analo g of th e
Ramanujan su m 5(0 , n; c). T o be practical w e must b e abl e to contro l th e variatio n
of A c(u) i n bot h variable s c , n, an d t o separat e c from u a t a lo w cost . T o thi s en d
we ar e goin g t o prov e
LEMMA 20.1 7 . Suppose w(u) is smooth, supported in the segment C ^ u ^ 2C
with C ^ 1 , normalized by (20.1 33), and has derivatives satisfying
1a
(20.140) w^(u)<^C- -
for 0 ^ a ^ A. Then for any c ^ 1 and u E R we have
(20.141) A c(u) < - — —+
{c + C)C \u\ + cC
and for 1 ^ a ^ A,
fl
(20.142) A[ '(u) < (ccyWui + ccy a.
PROOF. W e spli t
(20.143) cA c{u) = V(c) - wO^)
where
(20.144) V{y) = y^r~ lw{yr) - I r~ l
w(r)dr,
r=l J °
(20.145) W% ) = y^r- 1 w{y/r) - I r~ l
w(r)dr.
By th e Euler-Maclauri n formul a
J2F(r)= f(F(r) + {r}F'(r))dr,
rJ
where F i s of clas s C 1 o n R an d {r } = r — [r] i s th e fractiona l par t o f r , w e ge t
(20.146) V(y) = yJ°°{-}(^p-)'dr.
Since {x} < min(l,x ) ^ 2x( l + x ) - 1 , w e deriv e b y (20.1 40 ) tha t
1
(20.147) V ( y ) « C - ( l + C/y)- 1 .
48 4 20. T H E C I R C L E M E T H O D
In th e sam e way , b y applyin g (20.1 47 ) fo r w(u) change d int o w(l/u) an d y int o
1/?/, w e deriv e tha t
(20.148) W(y) < C~ l{\ + y/C)' 1 .
Inserting thes e estimates int o (20.1 43 ) w e obtain (20.1 41 ) . Fo r the proof o f (20.1 42 )
observe tha t Ac(u) vanishes , unles s \u\ > cC i n whic h cas e
Aia\u) = ^ ( - c r ) - a - V a > ( ^ ) < (cC)~ l
{\u\+cC)-a.
. cr.
•
Next w e shal l sho w tha t A c(u) approximate s t o th e Dira c distributio n quit e
strongly o n suitabl e tes t functions . T o thi s en d w e firs t establis h th e followin g
identity
LEMMA 20.1 8 . For any compactly supported function f of class C a on R we
have
(20.149)
J°° A c(u)f(u)du = /(0)t&(0 ) + / ( O K " 1 1 ° ° f3 a Q ( ^ ) {<l) dr
f°° / r \ f°°
-ca~l / p a{-) / w(u)u a w
f (ru)dudr,
where i3 a{x) = -^B a({x}) and B a(X) is the Bernoulli polynomial, so
a
0a(x) = ^2{-2iTim)- e(mx).
P R O O F . W e split th e integra l o n the lef t sid e of (20.1 49 ) int o two parts accord -
ing t o (20.1 43 ) an d evaluat e th e sum s ove r r b y th e Euler-Maclauri n formul a (se e
Theorem 4.2 )
F = {a
(20.150) Y
1 ^ l(F(r) + !3 a(r)F \r))dr.
rJ
First w e ge t
which lead s t o th e secon d ter m o n th e right-han d sid e o f (20.1 49) . W e are lef t wit h
r nu)w(^= r
J-00 \ C / C J-00
w{ \u\)±f{cru)duJM
r = 1
c
JO r
r ^d r
= r w{u) Vc f{cru)du - f(0)w(0) - M r ^ d r
Jo ^ h r
= J w(u) J {r}p a(r) — f(cru)drdu - f(Q)w{0)
by (20.1 50) . Changin g r int o r/c w e obtai n th e remainin g term s o f th e right-han d
side o f (20.1 49) . •
20. T H E C I R C L E M E T H O D 485
COROLLARY 20.1 9 . Let the conditions be as in Lemma 20.1 7. For any f com-
pactly supported and of class C a on R, and for c > 1 we have
oo
Ac(u)f(u)du = /(0 ) + E c(f)
/ -OO
where
oo
(C-2a\f(u)\ + \f a
\u)\)du.
/ -oo
P R O O F . Thi s follow s b y \f3 a(x)\ ^ 1 an d trivia l estimatio n o f th e integral s
in (20.1 49) , bu t wit h th e mai n ter m f(Q)w(0) i n plac e o f /(0) . However , b y th e
normalization conditio n (20.1 33 ) an d th e Euler-Maclaurin formul a (20.1 50 ) w e have
1
a
w(0) = l + O(C- - ).
Here th e erro r ter m i s absorbe d b y (20.1 52 ) becaus e
oo
(\f(u)\ + \f^(u)\)du.
/ -oo
•
Suppose f(u) i s supporte d i n \u\ ^ 2N an d ha s derivative s
{a a
(20.153) f \u) <^N~
for 1 ^ a ^ A. The n (20.1 52 ) give s
2 a
TV / c \ °/ C \
E 1 +
(20.154) ^«7c{c) ( lv) -
In particular, i f C — ViV, then thi s simplifie s t o E c(f) < C (c/C)a (chang e a into a +
1). Recal l tha t th e serie s (20.1 37 ) terminate s a t X = X(C , N) = 2max(C , N/C) =
2\fN. Th e boun d (20.1 54 ) i s ver y smal l i f c ^ X 1 _ £ b y lettin g a b e sufficientl y
large.
When applyin g th e formul a (20.1 37 ) t o solv e a n additiv e proble m i t i s conve -
nient t o have A c (n)/(n) expresse d i n additive character s e(nv) wit h \v\ quite small .
To thi s en d w e conside r th e Fourie r transfor m
oo
Ac(u)f{u)e(-uv)du.
/ -oo
By Fourie r inversio n
oo
gc(u)e(uv)du.
/ -oo
Inserting thi s t o (20.1 37 ) w e ge t
oo
gc(v)e(nv)dv.
/
486 20. T H E C I R C L E M E T H O D
REMARKS. Her e w e kee p th e restrictio n c ^ X , becaus e afte r applyin g th e
Fourier integra l (20.1 56 ) fo r A c(n)f(n) on e loses sight o f the suppor t o f the origina l
test functions .
Now w e nee d estimate s fo r g c(v)- B y Corollar y 20.1 9 fo r f(u)e(—uv) i n plac e
of f(u) w e find tha t
(20.158) g c(v) = 1 + o(± ( £+ ^ + | V|cC)°).
Moreover, w e hav e b y partia l integratio n
oo
(Ac{u)f{u))^e(-uv)du.
/ -OO
Hence w e deriv e b y (20.1 41 ) , (20.1 42) , (20.1 53 ) anothe r estimat e
a 2
(20.159) g c(v) < (\v\cC)~ + ( 1 + NC~ )(\v\N)-a.
The formul a (20.1 58 ) i s fine fo r smal l \v\ whil e th e estimat e (20.1 59 ) i s fo r large r
|v|, th e \v\ — 1/cC bein g th e transitio n value .
The tes t function s f(u) an d w(u) d o no t nee d t o b e compactl y supported , bu t
only decayin g rapidl y enoug h outsid e th e crucia l ranges . W e propose th e followin g
EXERCISE 3 . Giv e explici t computation s fo r th e specia l tes t function s
(20.160) f(u) = e x p ( - £ ) , w(u) = ^exp(-^- £) ,
where K is a normalizatio n constant .
It i s understandable tha t th e Fourie r developmen t o f S(n) o f Kloosterman typ e
(20.87) an d ou r formul a (20.1 35 ) hav e essentiall y th e sam e powe r i n applications ,
because they employ the additive characters to moduli of comparable size . However ,
the absenc e o f Kloosterman fraction s i n (20.1 35 ) ca n simplif y som e treatments. Fo r
example, whe n solvin g th e diophantin e equatio n / ( # ! , . . . , x n ) = 0 b y mean s o f
(20.135) on e i s le d t o a n exponentia l su m fo r a polynomia l i n n + 1 variables ove r
a finite field, wherea s th e Kloosterma n metho d bring s u s a rationa l function .
http://dx.doi.org/10.1090/coll/053/22
CHAPTER 2 1
EQUIDISTRIBUTION
21.1. Weyl' s criterion .
We begi n b y reviewin g th e genera l principle s o f equidistributio n theory . Le t
(X, 23 , djj) b e a probability space , where X i s a topological space , B is the a-algebr a
of Bore l sets , an d d/i i s a measur e normalize d b y
L dfjb(x) = 1 .
A sequenc e {x n} C X i s sai d t o b e equidistribute d (wit h respec t t o th e measur e
d/i) i f
lim h{n^N;x neB}\=fi{B)
N-+oc T V
for an y ope n se t B e B. Unde r som e mil d assumption s thi s propert y i s essentiall y
that
(21.1) li m1 J2 f( xn) = f f(x)drix)
for an y compactl y supporte d continuou s functio n / : X - > C .
Clearly th e equidistributio n implie s that th e sequenc e {x n} i s dense in X, ye t i t
is a more refine d property . Fo r showin g th e equidistributio n i t i s sufficient t o verif y
(21.1) fo r a syste m o f tes t function s / whic h generate s a dens e subse t o f CQ(X).
The syste m o f tes t function s ca n b e simpl y th e element s o f a n orthonorma l basi s
of L 2(X,dfi). I n particular , i f X = G i s a compac t group , th e matri x coefficient s
of irreducibl e unitar y representation s o f G for m a n orthonorma l basi s o f L 2(G)
with respec t t o Haa r measur e (th e Peter-Wey l theorem) . I f X = G* is the spac e of
conjugacy classe s of a compact group , then the characters of G form a n orthonorma l
basis o f L 2(G$,dg), wit h respec t t o th e measur e induce d b y Haa r measure .
Since the formul a (21 .1 ) alway s holds fo r f{x) = 1 which i s the characte r o f th e
trivial representation , specializin g th e abov e discussio n t o thi s cas e give s th e
W E Y L CRITERION . Let G be a compact group, G$ the set of conjugacy classes
in G. Then a sequence of elements x n G G^ is equidistributed with respect to
Haar measure if and only if for any non-trivial irreducible unitary representation
p : G - > GL(V), we have
(21.2) Y^ ^(P&n)) = o(N), as N - + oo .
A classica l exampl e i n numbe r theor y (initiate d b y H . Wey l [Wl ] i n 1 91 6 ) i s
the circl e X = R/ Z wit h th e Lebesgu e measur e dx. I n thi s cas e i t i s customar y
487
488 2 . EQUIDISTRIBUTIO N
to loo k a t a sequenc e o f rea l number s x n , an d fo r th e purpos e o f equidistributio n
to tak e th e fractiona l part s {x n} a s representative s o n th e circle . I f th e fractiona l
parts ar e equidistribute d o n th e circle , the n th e origina l sequenc e i s sai d t o b e
equidistributed modul o one . Th e character s e(hx) fo r h G Z, h ^ 0 , ar e the n th e
representations involved , an d th e Wey l criterio n fo r equidistributio n modul o 1 is
2_] e(hx n) = o(N), a s N — > oo , fo r an y h ^ 0 .
21.2. Selecte d equidistributio n results .
Originally Wey l introduce d hi s criterio n t o stud y th e distributio n modul o on e
of th e sequence s x n = / ( n ) , wher e / i s a polynomia l wit h rea l coefficients .
PROPOSITION 21 .1 . Let f e R[X] be a real polynomial of degree d^\. Assume
that f(x) = ax d + • • • with a £ Q. Then x n = f(n) is equidistributed modulo one.
PROOF. W e conside r f(x) — axd fo r simplicity . Le t h ^ 0 an d / ? = ha. Fo r
Q = X d~6, wit h 0 < 5 < 1 , appl y Lemm a 20. 3 (comin g fro m Propositio n 8.2 ) t o
an approximatio n \/3 — a/q\ ^ q~ 2 wit h (a,q) = 1 and 0 < q ^ Q. Thi s give s
] T e(/3n d) < X l+£(q~l + X " 1 + qX~ d d
y
with 7 d = 2 l~d. I f g ^ X dQ~1 = X 6, thi s implie s
d
^ e(pn ) < X 1 '1 fo r som e 7 > 0 .
If q < X dQ~1 1 denotin g g{t) — (a — -)t d, w e hav e b y partia l summatio n
d a
S(X) = £ e(Pn ) = £ e( ^ ) e ( 5 ( X ) ) - 2 « f fee(^)) 9'(t)e(g(t))dt.
q Jl
n^X n^X n^t ^
By splittin g int o residu e classe s modul o q we ge t
c( T )=- r + 0 ( g ) w i t h 6 =£ c ( T ) .
n^y ^ ^ x ( m o dq jH
Hence b y partia l integratio n agai n w e ge t
S(X) = ^j\(9mt + 0( q(l + ^ ) ) < < ^+ 0( q).
To th e complet e su m 6 w e ca n appl y eithe r th e bound s fo r Ramanuja n sum s (i f
d = 1 ) , Gauss sums (i f d = 2 ) or Weil's bound fo r higher degre e (als o Weyl's Lemm a
(20.59) would be sufficient), gettin g 6 < q 1 '2^ an d hence S(X) < Xq- l2 £
' + +X8.
Since g -> +0 0 becaus e / 3 ^ Q , w e hav e i n an y cas e
d
^ e(/3n ) - o(X ) a s l - i +00 .
The equidistributio n o f arithmetica l sequence s buil t ou t o f prim e number s ar e
fascinating an d mos t challenging . Fo r prime s themselves , i f on e take s a s spac e
21. EQUIDISTRIBUTIO N 489
G = (Z/gZ) x wit h th e countin g measure , the n equidistributio n i s equivalen t wit h
the Prim e Numbe r Theore m i n Arithmetic Progression s (se e Chapter 5) . Similarly ,
if K/Q i s a finite Galoi s extension , the n on e wishes t o stud y th e distributio n o f th e
Frobenius conjugac y classe s a p i n th e Galoi s grou p o f K/Q. Ther e th e Chebotare v
Density Theore m give s th e result :
THEOREM 21 .2 . Let K/Q be a finite Galois extension with Galois group G.
For any set C C G which is stable by conjugacy, we have
A o o l|{p^| f f j , e c}| = {g}.
However, i n thi s cas e a s fo r arithmeti c progressions , question s o f th e siz e o f
the erro r ter m an d uniformit y i n term s o f parameter s i s mos t importan t an d quit e
different i n perspectiv e tha n th e simpl e equidistributio n statement .
One is naturally le d to consider sequences x p = /(p) , especially when x n = f(n)
has alread y bee n prove d t o b e equidistribute d modul o one . Ther e th e method s o f
Chapter 1 3 ar e relevant . Indeed , Theore m 1 3. 6 implie s th e followin g theore m o f
Vinogradov
T H E O R E M 21 .3 . Let a 0 Q be an irrational number. Then the sequence ap is
equidistributed modulo one as p— > +oo .
PROOF. Le t (3 = ah wit h h ^ 0 . W e mus t prov e
e
(21.3) J2 (#P) = o(X{logX)~ l) a s X - > +oo .
p^X
Let Q = x ( l o g x ) - B fo r som e B > 0 , an d le t a/q b e a rationa l approximatio n t o /?
such tha t |/ J — a/q\ ^ l/(qQ) ^ q~ 2. I f q ^ xQ~ x = (logx) 5 , applyin g Theore m
13.6 yield s
^e(/3p)<x(logx)3-B/2.
If q < x Q _ 1 , the n b y partia l summation , splittin g int o arithmeti c progression s
modulo q and th e Siegel-Walfis z Theorem , w e hav e
yq)
P^x ^
for an y A > 0. I f w e take B > 6 and A > B + 1 , then (21 .3 ) follow s sinc e q —> +o c
as X — > +o o fo r ft irrational . •
Similarly, (1 3.55 ) implie s that a^fp i s equidistributed fo r an y fixed real numbe r
a^O.
The technique s o f exponential sum s ove r finite fields, an d particularl y th e pow -
erful result s base d o n the Rieman n Hypothesi s prove d b y Delign e (se e Chapte r 1 1 ) ,
give very goo d tool s t o stud y man y interestin g equidistributio n problems . W e giv e
an exampl e o f Fouvr y an d Kat z [FK ] usin g Theore m 1 1 .43 .
490 2 . EQUIDISTRIBUTIO N
THEOREM 21 .4 . Let P 1 (X),..., P r(X) be polynomials in Z [ X i , . . . ,X n] such
that the total degree of any non-trivial integral linear combination a\P\-\ \-a rPr
is ^ 2 . Let <j){x) —> • -fo o as x —> +co . Then, for p — > +oo , £/& e sequence
{(^-••^)}
w/iere 0 < r r i , . . . , x n ^ ( / ^ ^ / p l o g p , * s equidistributed modulo one.
SKETCH O F PROOF . Le t w(p) = 0(p)y^log p an d fo r a = ( a i , . . . , a r ) ^ 0 , le t
r
a\P\{x) + • • • + a rPr(x)\
* = £ - £ <-
O^Zi^u* (p)
By th e Wey l Criterio n o f (R/Z) r , w e mus t sho w tha t
(21.4) S < C w(p)n a s p -> • +oc.
First assum e w(p ) < p . Usin g Fourie r inversio n o n Z/pZ , w e hav e
^h
where
P
a • P(x ) — h • x^
(here / i • x i s th e usua l scala r produc t h\X\ + • • • + h nxx). W e hav e
1
T(M)« [ ] minfoH/K/pir )
where ||t| | i s th e distanc e t o th e neares t intege r a s usual . Fo r th e complet e sum s
5i(a, P, /i), we hav e tw o bounds : first , a resul t o f Wei l show s tha t
(21.5) S ' 1 (a,P,/i)<pn-^
(the implie d constan t dependin g onl y o n th e Pi) becaus e a • P(x) — h • x i s neve r
identically zer o fo r a ^ 0 b y assumption . Thi s boun d i s insufficien t becaus e ther e
are to o man y h. Bu t b y Theore m 1 1 .4 3 wit h / = a - P , g = 1 and V — Ag, ther e
are varietie s Xj o f dimensio n ^ n — j suc h tha t Si(a,P,h) <^ C C pn / 2 + ^ - 1^ 2 fo r
hi Xj. Weil' s boun d (21 .5 ) show s tha t X n = 0 . Denotin g X 0 = A n w e ge t
S « p " n E p^'- 1
)/ 2 £ |T(fc,ti;(p))| .
Lemma 9. 5 o f [FK ] give s
E |T(ft,n;(p)) | < p n - « "1 ) i i ; ( p y -1 ( l o g p ) n - « -1 )
21. E Q U I D I S T R I B U T I O N 491
from whic h
S<p1/2w(p)n_1(logp)
hence (21 .4 ) follows . I f w(p) > p, one dissects th e se t o f summation int o cube s wit h
sides < p an d appl y th e abov e boun d fo r eac h o f them . •
A differen t se t o f problems , stil l ver y natural , involve s th e "angles " o f expo -
nential sum s t o prim e modulus , o r th e loca l root s o f L- functions fo r varietie s ove r
finite fields. Quit e simila r i s th e questio n o f distributio n o f eigenvalue s o f Heck e
operators T p.
Maybe th e simples t non-trivia l exponentia l sum s ar e Gaus s sums . Le t x b e a
primitive characte r modul o p an d r(x ) th e associate d Gaus s su m (3.1 0) . B y (3.1 4 )
one ca n writ e
r(x) = e(a p(x))y/p,
where a p(x) £ R/Z i s calle d th e "angle " o f th e Gaus s sum . Th e questio n is , ho w
are thos e distributed ?
This questio n admit s a t leas t tw o variants . On e ca n conside r Gaus s sum s fo r
characters o f a fixed orde r d modul o prime s p = 1 (mo d d) , a s p —> +oc , o r on e
can loo k a t al l character s modul o p. I n th e forme r case , whe n d = 2 , Theore m
3.3 an d th e Prim e Numbe r Theore m sho w tha t e(a p(x)) i s equidistribute d amon g
the tw o element s {l,i } (not e tha t i n thi s cas e e(a p(x)) G { ± l , ± z } follow s fro m
r
( x ) 2 — r ( x ) r ( x ) = x(~l)P > s o th e angl e i s no t equidistribute d i n th e grou p
{±i,±i».
Kummer studie d th e argument s o f cubi c Gaus s sums . Fo r p = 1 (mo d 3) ,
there exist s a uniqu e prim e n e K = Q(e(l/3) ) C C suc h tha t TT = 1 (mo d 3 ) an d
NTT — p. Th e cubi c residu e symbo l XTT( X) = ( f ) 3 i s define d a s th e cub e roo t o f
unity suc h tha t
X(P-I)/S = Xn ( mod ^ fo rx e 0 / n 0
where O is the ring of integers in K. I t i s a character o f order 3 on the multiplicativ e
group o f the residue field O/TTO = Z/pZ , i.e . a (primitive ) cubi c Dirichlet characte r
modulo p. Usin g Jacob i sums , on e show s tha t th e Gaus s su m satisfie s
(^)'=«<*»<*•»--?•
see [IR] . It i s well-know n tha t TT/TT i s equidistribute d o n R/ Z (prove d usin g Heck e
characters o f K, se e Sectio n 3.8) , bu t t o ge t e(a p(x-K)) a certai n cub e roo t o f
unity "interferes" . Kumme r an d late r Hass e conjecture d a certai n non-unifor m
distribution base d o n som e numerica l evidence . Howeve r R . Heath-Brow n an d
S. Patterso n [HBP ] proved :
T H E O R E M 21 .5 . The angles e(a p{xx)) of cubic Gauss sums are equidistributed
in R/Z as p - > +oc ; p = 1 (mo d 3) .
The proo f i s base d o n technique s fo r sum s ove r primes , a s i n Chapte r 13, an d
properties o f a (metaplectic ) Eisenstei n serie s whic h ha s th e Gaus s sum s i n it s
Fourier coefficients .
In the case of all characters modul o p, we have the following theore m o f Deligne:
492 2 . EQUIDISTRIBUTIO N
THEOREM 21 .6 . Thep—2 angles (a p(x)) of Gauss sums of primitive characters
modulo p become equidistributed in R/ Z as p tends to oo , i.e. we have
-++00 p - 2 f-^ J 0
X (mo d p)
as p — > o c /o r an? / continuous function f : [0,1 ]— » C
PROOF. B y th e Wey l Criterion , i t i s enough t o sho w tha t
rT n
(2i-6) ii m - J - y- ( (xh
-m = 0
X (mo d p ) v
— see
for n ^ O . On e ca n reduc e t o n > 1 using th e formul a T(X)T(X ) = x ( l)p (
(3.14)).
We hav e
r(x)"= E ^ - x n ) e f1 +
; +
^ )
hence summin g ove r al l x (includin g \ — 1) w e get b y orthogonalit y o f character s
5 > ( x ) n = (p-l)tfn(l,P )
X
where if n (l,p) i s th e multipl e Kloosterma n su m define d i n (1 1 .55) . Subtractin g
the contributio n o f \ — 1 ? w e ge t
n
E* (^f) =p~ n/2 ((p - VW'P) + (-1)"*1)-
X (mo d p ) ^
As p— > +oc , w e hav e K n(l,p) < C p ( n _ 1 ) / 2 b y (1 1 .58) , wher e th e implie d constan t
depends onl y o n n , henc e (21 .6 ) follows . •
Of cours e fo r analyti c numbe r theor y a particula r interes t attache s t o th e clas -
sical Kloosterma n sum s (1 .56) . B y Weil' s bound , on e ca n writ e
S(a,b;p) = 2y/pcos(27r6 p(a,b))
for som e uniqu e 9 p(a,b) G [0,7r]. Th e proble m o f th e distributio n o f th e angle s i s
fascinating. Usin g al l a ^ 0 modulo p , N . Kat z [K4 ] solve d th e problem .
T H E O R E M 21 .7 . The p — 1 angles 9 p(a,a) for a ^ 0 are equidistributed with
respect to the Sato-Tate measure on [0,7r ] given by
djisT = 2TT _1 (sin 0)d0
as p —> +oo ; i.e we have
-IT E * M>(°)) ~ r WW-
a(mod p )
as p —> o o /or an? / continuous function f : [0 , n] —> C .
Moreover, h e state s th e followin g conjectur e wher e a i s fixed:
21. E Q U I D I S T R I B U T I O N 493
T H E SATO-TAT E CONJECTUR E FO R KLOOSTERMA N SUMS . Let a, b be fixed
non-zero integers. For p prime let 0 p G [0, IT] be such that
S(a,b;p) = 2yfp(cos 0 P).
Then as X —> • +oc 7 the angles 0 P, p < X, become equidistributed with respect
to the Sato-Tate measure d/usr-
It i s not eve n known whethe r S(a, b\p) change s sig n infinitely often , o r that th e
statement
(2-g)^S(fl^;p)=2cosflp<2
(for al l p larg e enough) i s false, fo r an y e G]0 , 2[. Se e [FoM ] fo r th e bes t progres s i n
this direction , combinin g siev e method s an d cohomologica l studie s o f exponentia l
sums a s i n Chapte r 1 1 .
In Corollar y 21 .9 , o n th e othe r hand , w e prov e tha t th e angle s o f Sali e sum s
T(a, b;p) (fo r fixed a , b with ab not a square ) ar e equidistribute d a s p—> > +oo, bu t
with respec t t o Lebesgu e measure .
The sam e Sato-Tat e distributio n i s conjecture d t o hol d fo r th e distributio n o f
coefficients a p associate d t o a n ellipti c curv e E/Q withou t CM , o r mor e generall y
for th e eigenvalue s Xf(p) o f a non-dihedra l primitiv e holomorphi c cus p for m / o f
weight ^ 2 . I f p is a prime not dividin g the conducto r o f /, the n b y Deligne's boun d
we hav e
\f(p) = 2 cos 0 p(f)
for som e 0 p(f) G [0,7r]. Fo r E, thi s read s
aE(p) = 2y/p cos 0 P(E).
Then w e hav e
T H E SATO-TAT E CONJECTUR E FO R MODULA R F O R M S . Let f be a primitive
holomorphic cusp form of weight ^ 2 which is not of dihedral type. With the above
notation, the angles 0 p(f), p ^ X, become equidistributed as X— > -hoc with respect
to the Sato-Tate measure dfisT-
Using th e modularit y o f / an d th e analyti c propertie s o f symmetri c powe r L-
functions, mor e progres s ha s bee n mad e concernin g thi s conjectur e tha n abou t th e
analogue fo r Kloosterma n sum s (se e Serre' s lette r a t th e en d o f [Shah]) . I n fact ,
there i s a convincin g argumen t i n favo r o f thi s conjectur e base d o n th e expecte d
functoriality o f symmetric power L- functions an d th e analytic properties that woul d
follow b y th e result s o f Chapte r 5 . W e ca n sketc h thi s argument , takin g th e cas e
of th e Ramanuja n A functio n o f weigh t 1 2 a s a n exampl e - th e leve l i s 1 whic h
eliminates complication s arisin g fro m ramification . W e hav e th e Eule r produc t
factorizing a s
1s
L(A,s) = i[(i-a pP- r (i-PPp-1sr
v
with p 1 1 ^2(ap + f3 p) = r(p) an d a p/3p = 1 . Her e r(p) i s the Ramanuja n r-function ,
not th e diviso r function . Fo r n ^ 1 , th e n-t h symmetri c powe r o f A i s th e Eule r
product
494 21. EQUIDISTRIBUTIO N
The functorialit y principl e o f Langlands woul d impl y tha t L(Sym n A, s) is an auto-
morphic L- function o f degree n + 1 in the sens e of Chapter 5 , with conducto r q — 1
and wit h n o pole . Sinc e a p(3p = 1, we hav e
sin(n + 1)0 P
Asym-Atf) = r- ^" = Pn(COsO p)
S i l l (7r >
where 0 P = 0 P(A) an d P n is the Tchebyshe v polynomial . Applyin g Theore m 5.1 0
we find tha t fo r n ^ 1,
J2Pn(cos6p) = o(7r{X))
as X— > +oo , henc e
lim -— — Y] Pn(cos6 p) = 0 = / P n (cost)d/x 5T (f).
It i s easy t o sho w tha t th e functions P n (cost) spa n C([0,7r]) , S O the Sato-Tat e
conjecture follow s b y the Wey l Criterio n fro m thes e conjecture d propertie s o f the
symmetric powers .
It i s currently know n tha t L(Sym n A, s) is automorphic an d fo r n ^ 4 : n = 2 is
due t o Gelbart an d Jacque t [GeJ ] an d n = 3 , 4 to Kim an d Shahid i (se e [KSh]) ,
and tha t i t is an L-function i n the sens e o f Chapter 5 , possibly wit h extr a poles ,
for n < 9.
If / i s dihedral, whic h i s equivalent t o E having C M in the ellipti c curv e case ,
the distributio n i s known (b y wor k o f Deuring) an d i s quite different . Notic e tha t
one ca n determin e thi s distributio n b y th e abov e argumen t i f one know s th e orde r
of th e pol e of L(Symn /, s) a t s — 1. Similarly fo r weight 1 forms wher e th e Heck e
eigenvalues ca n onl y tak e finitely man y differen t values .
Finally w e shoul d mentio n tha t equidistributio n problem s i n hyperbolic plan e
are also very interesting . Ther e on e has to us e not onl y cusp forms a s test function s
in th e Wey l Criterion , bu t also Eisenstei n series . A s an example, W . Duke [Du2 ]
proved tha t Heegne r point s becom e equidistribute d wit h respec t t o the Poincare
measure a s the discriminan t goe s to infinity. Als o w e refe r t o [Sa4] fo r discussion
of th e problem s o f "quantu m chaos " an d the links betwee n th e quantum uniqu e
ergodicity conjectur e an d subconvexit y bound s fo r certai n high-degre e L-functions .
21.3. Root s o f quadratic congruences .
Let f(X) = aX2 + bX + c be a quadratic polynomia l wit h intege r coefficients .
For a given prim e p \ 2a the congruenc e
(21.7) /(i/ ) = 0(mod p)
has a t most tw o solutions ; precisel y th e numbe r o f solutions is
P(P) = i + (£
where D — b 2 — 4ac is the discriminan t o f / an d (— ) i s the Legendr e symbol . Henc e
50% of primes yiel d tw o roots an d th e othe r 50 % yields none . Therefor e p(p) i s one
21. EQUIDISTRIBUTIO N 495
on averag e
(21.8) Y^p(p ) ~ 7r(x) , a s x —> • oo .
It i s a n interestin g questio n ho w th e tw o root s i/(mo d p) ar e distribute d a s p run s
over th e prime s wit h p \ 2a , p ) = 1 . Obviously , i f / ( X ) factor s ove r Q , the n th e
fractions {v/p} ar e no t dens e i n [0,1 ] ; i n fac t th e limi t point s ca n onl y b e ^ wit h
0 ^ u ^ a . Excludin g thi s cas e W . Duke , J . Priedlander , H . Iwanie c [DFI5 ] an d
A. Tot h [Tot ] showe d
T H E O R E M 21 .8 . Suppose f(X) is a quadratic polynomial with integer coeffi-
cients, irreducible over Q . Then for any 0 ^ a < (3 ^ 1 we have
(21.9) \{u (modp);p^xj(iy)=0 ( m o d p ) , ^ { - } < (3} ~ (/ 3 - a)n(x)
as x —> oo .
Equivalently, fo r an y continuous , periodi c functio n F(t) o f perio d on e
F F d t
(21.10) li m -±- £ £ (^ = / ^
V7 U
P ^ z /(i/)=0(modp ) ^
In othe r words , th e sequenc e o f number s i// p i s equidistribute d modul o one . Fur -
thermore, b y Weyl' s criterio n togethe r wit h (21 .8 ) thi s propert y i s equivalent t o
(21.11) l > f c ( p ) = O(T(*)) ,
p^x
as x —> oo , for an y positiv e intege r /i , where
(21.12) Ph (n)= £ e(^) .
i/(mod n )
/(i/)=0(mod n )
A consequence o f Theorem 21 . 8 is the uniform distributio n o f angles of the Sali e
sums
mx \~ ^ fd\/dm + dn\
T(m,n;P)= £ ( - ) ( ^ — ) •
d(mod p)
By Lemm a 1 2. 4 w e hav e
T(m,n;p) = 2co s ( — j T ( 0 , n ; p )
for p \ 2mn , wher e T(0,n;p ) — sPy/p(-) i s th e Gaus s su m an d i / i s a roo t o f
v1 = 4mn(mo d p) . W e thin k o f T(0 , n;p) a s th e normalizatio n facto r an d cal l
2-Kvjp the angl e o f th e Sali e sum . Therefor e Theore m 21 . 8 implie s
496 21. EQUIDISTRIBUTIO N
COROLLARY 21 .9 . If mn is not a square, then the angles of the Salie sums
T(m,n;p) are uniformly distributed modulo 2TT asp runs over primes.
This contrast s wit h th e conjecture d Sato-Tat e distributio n fo r Kloosterma n
sums.
In th e simples t cas e of the polynomia l f{X) = X 2 •+- 1 the root s ^(mo d p) wit h
p = 1 (mod 4 ) correspon d t o th e representation s o f p a s the su m o f two squares p =
r 2 + s 2 . B y choosin g — 5 < r < s wit h s odd , w e see that ever y suc h representatio n
gives th e uniqu e roo t v = rs(mo d p). Henc e
-1
= (mo d 1).
ps sp
In vie w o f thi s constructio n th e assertio n (21 .1 1 ) become s
J2* e(h^)=o(n(x))
r2-\-s2=p^.x
after clearin g th e almos t constan t perturbatio n e(h/sp) = 1 + 0(h/sp). Thi s resul t
is reminiscent t o th e followin g one :
r2+s2=p^x
which wa s establishe d i n [FI] . Othe r surprisin g application s o f Theore m 21 . 8 ar e
given i n [VdP ] an d [Ko2] .
In thi s chapte r w e present a proo f o f Theore m 21 . 8 fo r th e polynomia l
(21.13) f{X) = X 2 -D
with D < 0 (s o f(X) i s irreducible withou t furthe r conditions) . Th e genera l cas e of
/ wit h negativ e discriminan t ca n b e deal t wit h b y completin g th e squar e an d mod -
ifying slightl y th e arguments . However , th e cas e o f positiv e discriminan t require s
substantial change s (becaus e o f th e infinit e grou p o f unit s i n Q(y/D) i f D > 0) .
This cas e wa s settle d b y A . Tot h [Tot ] usin g a n alternativ e pat h passin g directl y
through Kloosterma n sums . Ou r approac h als o depends o n Kloosterman sums , bu t
indirectly vi a Sectio n 1 6.5 .
21.4. Linea r an d bilinea r form s i n quadrati c roots .
By th e Wey l Criterio n w e must sho w th e inequalit y (21 .1 1 ) fo r th e polynomia l
(21.13) wit h D < 0 an d h ^ 0 . W e ca n assum e tha t D i s squarefre e becaus e th e
Weyl su m (21 .1 1 ) wit h frequenc y h fo r D — m2E i s equa l t o th e Wey l su m wit h
frequency mh fo r E (u p t o bounde d amoun t fo r prime s dividin g m) .
According t o th e equidistributio n criteri a i t suffice s t o prov e tha t
=
(21-14) E^)s(D °("logx
as x — > oo , wher e h i s a fixed positiv e intege r an d g(y) i s a fixed smoot h functio n
supported o n 1 ^ y ^ 2 . T o thi s en d w e appl y Theore m 1 3.1 2 fo r th e sequenc e
21. EQUIDISTRIBUTIO N 49 7
*4 = (a n) wit h a n — ph(n)g(n/x). W e have t o verif y th e condition s (1 3.67) , (1 3.68 )
for A = x e^ wit h e(x) — > 0 . Th e firs t conditio n i s weaker tha n th e estimat e
n
(2i.i5) Y, | E ^( )fl®I«*( lo g*r A
for an y A > 2 , wher e £ ? = 5(A ) an d th e implie d constan t depend s o n A, h an d g.
For th e secon d conditio n i t suffice s t o prov e th e followin g estimat e
A
(21.16) Y^p nPh{mn)g(^\\^x{\ogx)-
for an y comple x number s f3 n supported i n th e interva l
B
(21.17) (logx) ^ n < x^(logx )
with \/3 n\ ^ 1 . W e ca n als o restric t th e suppor t of/3 n t o prime s (se e (1 3.68) ) whic h
helps a t technica l points .
In thi s sectio n w e deriv e bot h (21 .1 5 ) an d (21 .1 6 ) fro m on e estimat e fo r th e
linear form s
(21.18) C d(x)= Yl PkWafe)-
n=0(mod d) ^ '
However, w e nee d a ver y stron g boun d fo r Cd(x) whic h i s unifor m i n larg e range s
of d an d h.
PROPOSITION 21 .1 0 . Letl^h^x. Then
2
(21.19) C d(x) < (x$ + d-*(d,h)ix*)T (dh)log2x
where the implied constant depends on the polynomial f and the test function g.
For d — 1 the spectra l theore m fo r th e modula r grou p woul d giv e a stronge r
(best possible ) resul t
n
where the implie d constan t depend s o n h. Withou t smoothing , th e spectral metho d
yields
Yphin) < X 3 log x
as show n b y V . A . Bykovsk y [Byk] . Ou r approac h t o Cd(N) wa s inspire d b y tha t
of Bykovsky . W e shal l wor k wit h th e grou p To(d) s o w e hav e t o dea l wit h th e
exceptional eigenvalue s tha t migh t b e there . All thes e wil l b e take n car e o f i n
estimation fo r a certai n Poincar e serie s i n th e las t section .
That (21 .1 9 ) implie s (21 .1 5) i s clear . No w w e ar e goin g t o deriv e (21 .1 6) fro m
(21.19). Le t B(x) denot e th e doubl e su m ove r ra,no n th e lef t sid e of (21 .1 6) . Firs t
we install th e conditio n (m,n) = 1 and estimat e th e missin g par t b y
X^IZ|^( m n ^(~)| <*]TKI n~2 <x(logx)- s
498 2 . EQUIDISTRIBUTIO N
because f3 n ar e supporte d o n prime s i n th e interva l (21 .1 7) . Le t B*(x) denot e th e
double su m reduce d b y (m , n) — 1, so B(x) — B*(x) + 0(x(\ogx)~ B). W e arrang e
B*(x) a s follow s
**« < E E I E ^»(v ) E «(£ )
m /((5)=0(m ) (n,m ) = l V 7
f(v)=0(mn) V 7
Then b y Cauchy' s inequalit y B*{x) 2 < C C*(x) logx, wher e
<™-5> E | E «•<= ) E •(£ )
m f(6)=0(m) (n,m ) = l X 7
f(^)=0(mn) X 7
z/=J(m)
Squaring ou t an d changin g th e orde r o f summatio n w e obtai n
C*(x) = Ty2!3 nJn2V*(n1 ,n2)
Til 77- 2
where
-iw \ V ^ (mn l\_(mn2\ sr^* (h v 2 v 2,\
V 7 V 7
(m,n 1 n 2 ) = l /(^)=0(mnj) *
^i=i/2(m)
For ni = n 2 w e use the trivial bound D*(n , n) < C n~2x2. I f ni ^ ri2 , then (ni , ri2) =
1 sinc e w e assume d the y ar e primes , therefor e
z
^-^ \ x J\ x —' V mnino J
V 7
(m,mn 2 ) = l \ / \ / /( v)=0(mmn2)
Note tha t | ^ ^ ^ 2 , or els e th e su m V*{ni,n 2) i s void. Her e w e remove th e
condition (ra , nin2) — 1 and estimat e th e exces s part b y 0((nin 2)~3^2x2) agai n by
taking advantage of ni, ri2 being primes. Wit h the condition (m , nin2) = 1 removed
the resultin g complet e su m T>(ni,n 2) i s just th e linea r for m (21 .1 8 ) fo r d = nin 2l
h replace d b y h(n 2 - n ±) an d th e tes t functio n g(y) replace d b y ^ ^ g ( ^ ) g : ( ^ - ) .
After rescaling , Propositio n 21 .1 0 yield s
V(n1,n2) < C ( ( n i n2 ) - 4 £ 2 + ( m n 2 ) ~ ^ 4 ) x l 0 g 2 x
where th e implie d constan t depend s onl y o n f^g,h. Puttin g togethe r th e abov e
estimates on e ca n easil y complet e th e proo f o f (21 .1 6) .
21.5. A Poincar e serie s fo r quadrati c roots .
In this section w e interpret th e linea r for m Cd(x) give n by (21 .1 8 ) a s a Poincar e
series Ph{z) f° r th e grou p To(d) o f th e kin d considere d i n Section s 1 6. 2 an d 1 6.3 .
Throughout z = x + iy denote s a poin t i n th e uppe r half-plan e H , s o x i s no longe r
the variabl e fro m th e previou s sections .
We begin b y recallin g variou s connection s o f positive definit e binar y quadrati c
forms wit h modula r forms . A fe w relevan t fact s ar e als o reviewe d i n Sectio n 22.1 .
Fix a negativ e intege r D. Conside r al l form s (wit h eve n middl e coefficient )
<p(X, Y) = aX 2 + 2bXY + cY 2
21. EQUIDISTRIBUTIO N 49 9
of discriminan t b 2 — ac = D. Multiplyin g b y — 1 (if necessary ) w e can assum e tha t
a,c > 0 , s o p i s positiv e definite . Ther e ar e tw o zero s o f p whic h ar e comple x
conjugate. Choos e th e on e i n H
a
The modula r grou p T = SL 2(rL) act s o n th e quadrati c form s o f discriminan t D b y
linear chang e o f variable s
ip°{X,Y) = <p{aX + yY,pX + 6Y)
if a = f r. 1 G T. Thi s actio n i s compatibl e wit h takin g th e zeros , i.e. , az^
z^a. Le t F b e th e standar d fundamenta l domai n fo r T (se e (22.1 0 ) an d (22.1 1 )) .
Every for m o f discriminan t D i s equivalent (wit h respec t t o th e actio n o f T) t o th e
exactly on e for m wit h z^ G F. Denot e th e se t o f root s o f representin g classe s b y
A = {z (fie F] discr(<p ) = D}.
Therefore, ther e exist s a one-to-on e correspondenc e betwee n th e solution s o f b 2 —
ac — D wit h a, 6, c G Z, a , c > 0 an d th e point s o f th e orbit s {az;a G T/T z} fo r
z G A , wher e T z i s th e stabilit y grou p o f z. Henc e w e deriv e
e
ph(n)= J2 ( ^ ) = £ l r *1
l" £ e(hReaz).
2
b =D(mod n) ^ ' zeA aeroo \r
Im crz~yJ\D\/n
Next w e split th e inne r su m int o classe s with respec t t o th e subgrou p F 0 (d) gettin g
(21.20) P/. W = 5 ^ | r z | - 1 Yl E e(/iRear*) .
^EA rer 0 (d)\r aGr oo \r 0 (d)
Im arz—yJ\D\ln
We shall us e this formul a fo r number s n = 0(mo d d). Thi s congruenc e ca n b e buil t
into th e summatio n condition s fo r z an d r (bu t no t a) a s th e followin g congruenc e
(21.21) V\ D
\/lm TZ = ° ( m o d d).
Note tha t thi s conditio n doe s no t depen d o n th e choic e o f r i n th e cose t To(d)r.
Now w e ar e read y t o evaluat e th e linea r form s
(21.22) C d = ] T Ph(n)F(2nhy/\D\/n)
n=0(mod d)
where F(u) i s a smoot h functio n compactl y supporte d o n R + . Introducin g (21 .20 )
we arrang e thi s int o
(21.23) £ d = Elr *l - 1 Y! P
^rz)
z€A r e r 0(d)\r
500 21. EQUIDISTRIBUTIO N
where ^ restrict s r b y the congruenc e (21 .21 ) an d Ph(z) i s the Poincar e serie s fo r
the grou p To(d)
(21.24) P h{z) = ] T FQnhlm az)e(hRe az).
The formul a (21 .23 ) trul y represent s th e linea r for m Cd by th e Poincar e serie s
Ph{rz) becaus e ther e ar e relativel y fe w point s rz satisfyin g (21 .21 ) . Precisel y th e
number o f point s z G A is the clas s numbe r h(D) an d th e numbe r o f coset s To(d)r
with r G r i s th e inde x
[T:ro(d)]=dn(l + i ).
This alon e i s quit e large , however , no t ever y r satisfie s (21 .21 ) . Fo r ever y z G A,
let v(d,z) denot e th e numbe r o f coset s To(d)r fo r whic h (21 .21 ) holds . Le t (p =
aX2 + 2bXY + cY 2 b e th e quadrati c for m correspondin g t o z. The n notic e tha t
y/\D\/lm (z) = c = <p(0,1 ) . Henc e th e conditio n (21 .21 ) fo r
ap
75
is tha t
(21.25) <p((0 , l ) r) = ^(7,5 ) = a7 2 + 2b~/S + c6 2 = 0 (mo d d) .
Moreover, countin g th e coset s To(d)r mean s tha t w e coun t th e root s (7 , £) o f (f
modulo d u p t o a n invertibl e scala r modul o d (i.e . th e projectiv e solutions) .
We clai m tha t
(21.26) v(d,z) ^r(d).
To prove this, recal l we assumed tha t D i s squarefree, s o by the Chines e Remainde r
Theorem w e mus t prov e tha t i/(p , z) < 2 for p \ D prime . Sinc e p 2 doe s no t divid e
D, th e quadrati c for m aX 2 - f 2bXY + cY 2 i s non-zer o modul o p. Th e projectiv e
solutions t o (21 .25 ) fo r d — p wit h 5^0 (mo d p) correspon d t o root s o f th e non -
zero quadrati c polynomia l aX 2 + 2bX + c , henc e ther e ar e a t mos t 2 o f them . I f
p\ 5, the onl y possibl e projectiv e roo t i s the "poin t a t infinity " (l,p) , whic h occur s
if and onl y i f p | a. I n that cas e there was a t mos t on e solution i n the first situation ,
so th e tota l numbe r o f solution s i f alway s ^ 2 .
Note tha t (21 .1 8 ) become s (21 .22 ) fo r th e tes t functio n
(21.27) F(u) = g(2nhy/\D\/ux).
This particula r functio n i s supporte d i n Y" -i ^ u ^ 2 F -1
wit h
(21.28) 7rh^/\D\Y = x.
Therefore, b y th e abov e arrangement s an d observations , ou r proble m o f estimatin g
the linea r for m Cd(x) reduce s t o tha t o f th e Poincar e serie s Ph{z) fo r th e tes t
function F(u) give n b y (21 .27) . I n th e nex t sectio n w e trea t Ph(z) b y spectra l
methods.
21. E Q U I D I S T R I B U T I O N 50 1
21.6. Estimatio n o f th e Poincar e series .
We shal l estimat e Ph(z) fo r al l z G HI uniformly i n h an d th e leve l d. Ou r goa l
is
LEMMA 21 .1 1 . Let Ph(z) be given by (21 .24) with F(u) supported in Y" -1 ^
u ^ 2Y' forY ^ 2 such that \F^\ < Y j for 0 ^ j < 4 . 77ie n /or any z G M an d
1
r G T w; e /mi; e
(21.29) P ^ ) « (2 / + y- 1 )1 - ((fcy) i + < H (d , h)i {hY)^)r{dh){\ogY) 2
where y = I m z and the implied constant is absolute.
It i s eas y t o se e tha t th e boun d (21 .29 ) fo r Y given b y (21 .28 ) togethe r wit h
(21.26) yiel d (21 .1 9) . Henc e on e complete s th e proo f o f Theore m 21 . 8 fo r th e
polynomial f(X) = X 2 - D.
We still have to prov e Lemm a 21 .1 1 . W e start fro m th e spectra l decompositio n
(see Theore m 1 5.2 )
Ph(z) = Y^(Ph, Uj)uj(z) + Y,-tJ{S' Ea{ '> * + {t ))E*(z> 5 + U)dt
where (UJ(Z)) i s a n orthonorma l basi s o f Maas s cus p forms , togethe r wit h th e
constant functio n fo r th e zer o eigenvalue , an d a runs ove r th e cusp s o f r 0 ( d ) .
By th e Cauchy-Schwar z inequalit y
2
(21.30) \P h{z)\ ^ K d{z)Rd{h)
where
(21.31) K d{z) = £>(*i)K(*)|2 + J2^jRHt)\Ea(z, \ + it)\2dt
and
(21.32) W = E ^ I ^^ + EyE^l(ft.^^«)>P*-
Here h(t) > 0 i s a functio n decreasin g fas t enoug h a t infinity , introduce d fo r con -
vergence. W e choos e
(21.33) h(t) = (1 + t 2)-1 - ( 4 + t 2)-1 = 3( 1 + t 2)-\A + t 2)"1 .
First w e estimat e Kd(z). T o thi s en d observ e tha t (21 .31 ) i s th e spectra l de -
composition o f th e automorphi c kerne l
k
Kd(z) = Yl «z^z))
~rer0(d)
where k(u) i s th e Harish-Chandra/Selber g transfor m o f h(t) (se e Theore m 1 5.7) .
Moreover, notic e tha t k(u) i s non-negativ e becaus e th e Fourie r transfor m o f h(t)
is positiv e an d decreasin g o n 1 R +. Thes e observation s sho w tha t K(z) will onl y
increase i f one replace s To(d) b y th e modula r grou p T = r 0 ( l ) . Havin g don e
this th e dependenc e o n d i s gon e an d th e enlarge d kerne l i s T- invariant, s o w e ge t
502 2 . EQUIDISTRIBUTIO N
Kd{rz) ^ K\{TZ) = K\(z) fo r an y r e T an d 2 G HI. Usin g crud e estimate s fo r
cusp form s an d Eisenstei n serie s fo r th e modula r grou p w e deriv e
1
(21.34) K d{rz)<z:y + y-
for an y r ET an d 2 E I , wher e th e implie d constan t i s absolute . Actuall y fo r ou r
application w e coul d sto p a t th e inequalit y K^(rz) < Ki(z) becaus e z run s ove r a
fixed se t o f h(D) point s an d w e allo w th e implie d constan t t o depen d o n D.
Now i t remain s t o estimat e Rd(h). B y (1 6.1 6 ) w e hav e
(Ph,uj) = (2irh)hj(h)F(t j)
where pj (h) i s th e Fourie r coefficien t o f Uj (z) an d
/•O O
F(t)= / F(y)K it(y)y-Uy.
Jo
First w e explai n ho w t o estimat e F(t). Sinc e F(y) i s supporte d o n th e dyadi c
segment Y~ x < y < 2Y~ l wit h Y ^ 2 , i t i s usefu l t o appl y th e powe r serie s
expansion fo r Ka(y) gettin g a rapidly convergen t serie s of Mellin transform o f F(y)
at th e point s \ ± it + 2£, for £ = 0,1 , 2, The n integratin g th e Melli n transfor m
by part s fou r time s w e gai n th e facto r h(t) (se e (21 .33)) . Nex t w e estimat e th e
resulting term s b y Stirling' s formul a (5.1 1 2) , and su m the m u p t o deduc e tha t F(t)
on th e spectru m satisfie s
F(t) < /i(t)(cosh7rt)-^y^ {Y u + Y~ lt + 3 l o g y ) .
Actually thi s resul t i s a synthesis o f three differen t bound s derive d alon g th e abov e
lines separatel y i n th e range s 0 < it < | , 0 < £ < | , | ^ £ < 00 . Henc e w e obtai n
2
hit^KP^Uj)? < hYH( tj)\Pj(h)\ ,
where
H(t) = h(
f) AY 2it
+ Y~2it + 9log 2 Y).
COSh{7Tt)
A simila r estimat e i s deduce d fo r th e inne r produc t o f th e Poincar e serie s Ph an d
the Eisenstei n serie s E a. Fro m thes e estimate s w e ge t
Rd(h) « hYfeHMlpjih)? + £ ^ J^H(t)\ra(h,t)\2dt),
where r a (/i, t) i s the Fourie r coefficien t o f E a(z, \ +it) (se e (1 6.22)) . The n applyin g
(16.56) an d (1 6.58 ) w e arriv e a t
(21.35) R d(h)<^hY {I+ d- 1 (d,h)^(hY)1 *}T2{dh) log 4
Y.
Finally insertin g (21 .34 ) an d (21 .35 ) int o (21 .30 ) w e finish th e proo f o f Lemm a
21.11.
http://dx.doi.org/10.1090/coll/053/23
CHAPTER 2 2
IMAGINARY QUADRATI C FIELD S
22.1. Binar y quadrati c forms .
We are primarily intereste d i n the imaginary quadrati c field K ~ Q(v^D) , but
to pu t our treatment int o historica l perspectiv e w e begin b y reviewing th e theory
of binar y quadrati c form s
(22.1) <p(X, Y) = aX2 + bXY + cY 2
of discriminan t
(22.2) D = b2 - 4ac < 0.
We d o not assum e tha t D i s fundamental , however , w e only conside r primitiv e
forms, i.e.,
(22.3) (a,6,c ) = l.
This i s not a serious restrictio n becaus e an y form i s a multiple o f a primitive form .
Changing th e sign of all coefficients (i f necessary) w e can assume tha t
(22.4) a > 0, an d c> 0
so (f is positive definite . W e call a, b, c the first, the middle an d the last coefficient s
of (p, respectively. Th e quadratic for m <p(X, Y) factor s ove r comple x number s int o
linear form s
(22.5) <p{X, Y) = a(X + z aY)(X + z aY) = c{zcX + Y)(z cX + Y)
with
(22.6) z a = — an dz c= — .
2a 2 c
Actually th e roots z a,zc depen d o n b but we do not displa y it . Throughou t w e
choose the root s o that z a an d zc ar e in the upper half-plan e H . Not e that z azc = 1 .
The modula r grou p T — SL 2 (Z) act s on the primitive form s o f a given discrim -
inant D by unimodular transformation s
(22.7) ^(X,Y) = <p(aX + 1 Y,pX + 5Y), * * = ( " j ) e T
= a aX2 + baXY + c aY2,
503
504 22 . IMAGINAR Y QUADRATI C FIELD S
say, wit h
aa = aa 2 + baf3 + c/3 2
(22.8) 6* = 2acr y + 6(a< 5 + £7 ) + 2c/3<5
c a = a7 2 + 67< S + c<5 2.
This actio n i s compatible wit h actio n b y linea r fractiona l transformatio n o n H, an d
the root s ar e preserved .
Two form s p,i\) ar e sai d t o b e equivalen t i f the y belon g t o th e sam e T-orbit ,
i.e., ip = (p a for som e a £ T ; in this case we write cp ~ ip. We denote the equivalenc e
class whic h contain s (p b y
(22.9) [<p] = {ip° : a e T}.
Obviously, ever y equivalence class is represented b y a unique form <p(X, Y) = aX 2 +
bXY + cY 2 whos e roo t z a i s i n th e standar d fundamenta l domai n o f th e modula r
group F = F~ U F+ , wher e
(22.10) F' = {zeU: \z\ > 1 , - - < x < 0},
+
(22.11) F = { ^ e M : \Z\ ^ 1 , O ^ x ^ i } .
such a for m <p i s called reduced ; i t i s characterized i n term s o f coefficient s b y
(22.12) -a<b^a^c wit h b^O i fa = c.
Since I m (z a) = v /TDJ/2a ^ \ / 3 / 2 , th e first coefficien t o f a reduce d for m satisfie s
(22.13) a < V|L>|/3 .
Hence i t follow s tha t th e numbe r o f reduced form s i s finite. Thi s numbe r h — h(D)
is called the class number o f discriminant D becaus e it equal s the number o f distinct
equivalence classes .
The transformation s r eT whic h fix a form (p ar e calle d automorph s o f (/?; they
form a finite cycli c grou p Aut(<p ) = { r G T : (pT = (p} of orde r
(22.14) u> =
f6 if
if
D = -3,
D = -4,
4
2 if D < -A.
Equivalent form s hav e conjugat e group s o f automorphs , namel y
1-
A\lt((pa) = c r A u t ( < / ? ) c r .
For reduce d form s Aut(c^ ) i s generated b y
2
# = ( \ "/ ) if(^(x,y)-x + xr + F2,
1 -
s
=(i ) if^(x,r ) = x 2 + F2 ,
l
T=\ ~ J i f D<-4.
22. IMAGINAR Y Q U A D R A T I C FIELD S 505
Note tha t fo r all integers k the forms p(X + /cY , Y) ar e equivalent; suc h form s are
said t o be parallel. Th e parallel form s hav e th e same firs t coefficien t a while the
middle coefficient s ar e in the same residu e clas s modul o 2a.
In "Disquisitione s Arithmeticae " Gaus s introduce d compositio n o f form s o f
a give n discriminant . Hi s construction wa s subsequently simplifie d b y Dirichle t
but onl y fo r pair s o f form s p 1 (X,Y) = a-^X 2 + b\XY + c\Y 2 an d <p 2(X,Y) =
2 2
a2X + b 2XY + c 2Y satisfyin g
(22.15) ( 6Ji +_ 62 2 )
fll} > a 2 1.
Note that 61 , b2 have the same parit y (namel y 6 1 = b 2 = D(mo d 2) ) so | ( 6i +62) is
an integer . I f the Dirichle t conditio n (22.1 5 ) holds , the n th e forms p>\,p 2 ar e said
to b e united. Clearly , thi s relationshi p extend s t o the classes of parallel forms . For
the unite d form s ther e exist s a unique 6(mo d 2aia 2 ) suc h tha t
2
(22.16) b = 61 (mod 2a x ), b = 6 2(mod 2a 2 ), b = D(mod 4aia 2 ).
Putting b 2 — D = Aaia 2c w e derive th e form
(22.17) p(X, Y) = aia2X2 + bXY + cY 2
which i s primitive o f discriminant D; it satisfie s th e identity
(22.18) <f(x,y) = ^1 (^1 ,2/1 )^2(^2,2/2 )
with x = X1 X 2 — q/1 2/ 2 an d y = a\X\y 2 + 62/1 2/ 2 + ^2^22/1 - Thi s i s the Dirichle t
formula fo r composition o f united classe s o f parallel forms .
Now we use the Dirichlet formul a t o define compositio n o f any tw o equivalenc e
classes. W e sa y that a positive intege r m i s properly represente d b y p if
(22.19) m = <p(a , 7) wit h (a , 7) = 1 .
Since ip is positive definite th e number of proper representations , denote d by r* (ra),
is finite . Equivalen t form s represen t properl y th e same numbers ; thes e ar e exactly
the number s whic h appea r a s the first coefficient s o f forms i n the equivalent class .
Every primitiv e for m represent s properl y a number m whic h i s co-prime wit h any
fixed positiv e integer . Therefor e give n tw o classe s A\,A 2 w e can choose represen -
tatives (/?i,(/? 2 wit h (ai,a 2) = 1 ; such form s ar e united. The n (22.1 7 ) determine s
the thir d clas s A = A\A 2 = [<p\. On e can show tha t A doe s no t depend o n the
chosen representative s pi,(p 2, s o the Dirichlet compositio n induce s a well-define d
multiplication o f classes . Th e law of compositio n i s commutativ e an d also asso -
ciative (th e latter propert y i s not obvious). Thi s make s th e set H o f all distinc t
equivalence classe s a finit e abelia n grou p o f order h. Th e identity elemen t o f 'H ,
which w e denote b y 1, is the class whic h contain s th e principal for m
p = X 2 - jY 2 i f D = 0(mod 4),
(22.20 )
ip = X 2 + XY + ^—T^-Y 2
HD = l(mod 4).
The invers e clas s o f <p(X, Y) — aX2 + bXY + cY 2 i s th e clas s whic h contain s
Tp(X,Y) = aX 2 — bXY + cY 2; thes e ar e calle d th e opposit e forms . Not e th e
Jp(X,Y) i s equivalent t o (p(Y,X) (th e for m whic h i s obtained b y interchanging the
506 22. IMAGINAR Y QUADRATI C FIELD S
first an d th e las t coefficients) . Therefor e th e set s of numbers properl y represente d
by a class A an d it s invers e A~ 1 coincide.
The question of which numbers ar e represented properl y by a given form, ha s no
simple answer . However , ther e is a simple, sufficien t an d necessar y conditio n fo r a
positive integer m to be properly represented b y some primitive form o f discriminant
D, namel y i t is the solvabilit y o f th e congruenc e
2
(22.21) b = D(mod 4m) .
Let Rf)(m) denot e th e numbe r o f al l representations o f m by a complete syste m o f
inequivalent form s o f discriminan t £> , thus
iMm) = £ £ ^(m/d 2)
2
Vd \m
where ip runs ove r inequivalen t forms . I f (m, D) = 1 we hav e
(22.22) R D{m)=wY,XD(d)
d\m
where XD i s the Kronecker symbol . W e shal l se e tha t fo r special discriminant s
(idoneal numbers ) th e abov e formul a yield s th e numbe r o f representations fo r a n
individual for m (becaus e exactl y on e clas s o f forms represent s m) . Thi s "conve -
nient" situatio n i s explained i n th e genu s theor y du e to Gauss, whic h w e ar e goin g
to presen t i n more moder n terms .
First w e classif y th e binar y quadrati c form s o f a given discriminan t accordin g
to rationa l unimodula r transformation s rathe r tha n th e integra l ones . Tw o form s
if, ip are sai d t o b e i n th e sam e genu s if ijj(X, Y) = ip(aX + 7Y, fiX + SY) fo r som e
a = I r. 1 G 5Z'2(Q)- I n fact i t suffice s t o apply a wit h al l entries havin g
the denominato r SD. Obviously , equivalen t form s ar e i n the sam e genu s bu t no t
conversely. Indeed , th e invers e class A~l i s in the genu s of A, though A' 1 doe s no t
always coincid e wit h A. Ever y genu s ha s th e sam e numbe r o f classes. Th e genu s
which contain s th e principa l clas s is called th e principa l genus ; w e denot e i t by Q
and pu t
(22.23) h x = \G\.
A beautifu l theore m o f Gauss assert s tha t Q consists o f square s o f classes ,
(22.24) Q = {A2 : A G H}.
Thus Q is a subgroup o f W. Th e facto r grou p T — Ti/Q i s called th e genu s group .
We sa y tha t a for m <p i s ambiguous i f it is equivalent t o Tp; i n other words ,
the clas s A — [tp] ha s exponen t tw o i n the clas s grou p %. Suc h classe s for m th e
subgroup
(22.25) £ = {AEH:A2 = l}.
Thus £ is isomorphic t o T becaus e it is th e kerne l o f th e homomorphis m A —> A 2.
Putting
(22.26 ) ft 0 = \S\
22. IMAGINAR Y Q U A D R A T I C F I E L D S 50 7
we hav e h — h§h\.
The grou p o f ambiguou s classe s £ i s th e easie r par t o f %. A n ambiguou s for m
(p whic h i s reduced i s characterized b y havin g it s roo t z a o n th e boundar y o f F +,
(22.27 )
2a
Hence a reduce d for m ip(X, Y) = aX 2 + bXY + cY 2 i s ambiguou s i f an d onl y i f
a = 6 , a = c or b = 0 in which cases the discriminant factor s int o D = a(a — 2c), D =
(6 — 2a ) (6 + 2a) o r D = —4ac , respectively. Usin g thes e characterization s on e ca n
compute ho; we obtai n
(22.28 ) h0 •yr+S — l
where r i s the numbe r o f distinct, od d prim e divisors of D an d s = 0,1 , 2. Precisely ,
if D = l(mo d 4) , the n s = 0 , an d i f D = -AN, the n
s= 0 i f N E E 3 , 7(mod 8) ,
(22.29 ) s= 1 i f J V = l,2,4,5,6(mod8) ,
5=2 i fT V = 0(mo d 8) .
If a positiv e intege r m prim e t o D i s represente d properl y b y a for m o f dis -
criminant D (i.e. , (22.21 ) i s solvable) , the n i t i s represente d b y form s onl y o f on e
genus, actuall y th e residu e clas s m(mo d D) alon e determine s thi s genus . I n thi s
connection Gaus s assigne d t o a discriminan t D a syste m o f r + s genu s characters .
These ar e th e Legendr e symbol s \p f° r ever y p \ D,p > 2 , an d i f D = —AN, w e
have extr a s character s modul o 8 listed below
none if i V = 3,7(mod8) ,
X4 if i V = l,4,5(mod 8
X8 if A ^ = 6(mod8) ,
X4X8 if 7 V - 2 ( m o d 8 ) ,
X4 and x s if i V = 0(mod8) .
Let S denot e th e syste m o f genu s characters . W e hav e
(22.30 ) n X=X D
xes
Hence i f m i s represente d b y a for m o f discriminan t D an d (m , D) — 1 , the n b y
(22.30) an d (22.21 ) w e obtai n Yl xesX(m) = Xr>( m) = 1 - Tw o suc h number s m,n
are represented b y forms o f the sam e genus if and onl y if x(m) = x( n) f° r al l % ^ <5.
Therefore a genu s o f form s i s characterize d b y a collectio n o f sign s e x = ± 1 suc h
that
(22.31)
ii'
xes
and w e hav e exactl y 21 5' x possibilities .
508 22. IMAGINAR Y QUADRATI C FIELD S
Now we return t o th e formul a (22.22 ) which , i n vie w o f the genu s theory , give s
the number o f representations o f m b y forms representing the genus whose invariant s
are
(22.32) e x = x(m) fo r al l x G 5.
If ther e i s onl y on e clas s i n eac h genus , the n Rr>{m) reduce s t o th e numbe r o f
representations b y a singl e form . Henc e a question : whic h discriminant s D satisf y
hi = \G\ = 1 , o r equivalentl y
2
(22.33) A = 1 fo r ever y AeH.
After Eule r w e call suc h a discriminan t "numeru s idoneus" , (i n Englis h i t i s calle d
"idoneal number" , o r "convenien t number" ; i n Frenc h "nombr e convenable") . A s
a matte r o f fac t Eule r considere d discriminant s o f typ e D = —AN an d calle d N a
numerus idoneu s rathe r tha n D. Gaus s gav e a lis t o f 6 5 such numbers :
( N = 1,2,3,4,7 with h(D) = = 1 ,
N = 5,6,8,9,10 (plu s te n more ) with h(D) = = 2 ,
I N = 21,24,30,33 (plu s twent y more ) with h(D) = - 4 ,
N = 105,120 (plu s fiftee n more ) with h(D) == 8 ,
{ N = 840,1320,1365,1848 with h(D) = = 1 6 .
The las t numbe r i n Gauss ' lis t N = 1 84 8 i s th e larges t know n numeru s idoneus .
W. E . Brigg s an d S . Chowl a [BC ] showe d tha t ther e i s a t mos t on e suc h numbe r
beyond 1 0 65 .
Idoneal numbers N hav e the property tha t th e form x 2 + Ny 2 represent s prime s
essentially i n n o mor e tha n on e way , wherea s a composit e numbe r ha s severa l rep -
resentations i f an y (fo r detail s se e [Cox]) . Thi s propert y offer s a n algorith m fo r
primality testin g whic h wa s a n inspiratio n fo r Eule r t o stud y th e idonea l number s
in th e first place .
22.2. Th e clas s group .
Prom no w on we assume tha t D i s a negative fundamenta l discriminant . A fun -
damental discriminan t i s characterized b y th e propert y tha t ever y for m tp(X, Y) =
aX2 + bXY — cY 2 wit h b 2 — 4ac — D i s primitive , i.e. , (a , 6, c) = 1 . Suc h negativ e
discriminants ar e o f typ e D = l(mo d 4) , D squarefree , o r D = — 4N, A f squarefre e
with
(22.34) N = 1 ,2,5 , 6 (mo d 8) .
The associate d Kronecke r symbo l X£>( m) = ( ^ ) ( s e e (3-43) ) i s a real , primitiv e
character o f conducto r —D. I t turn s ou t tha t th e fundamenta l discriminant s ar e
exactly th e discriminant s o f quadrati c fields .
Throughout w e us e th e notatio n fro m th e theor y o f th e imaginar y quadrati c
field K = Q(y/D) a s describe d i n Sectio n 3.8 . Th e quadrati c form s (£>(X , Y) (wit h
a > 0 , c > 0 , b 2 — 4ac = D < 0 ) correspon d i n a one-to-on e wa y t o th e primitiv e
ideals
r b+y/D-)
(22.35) a = a, = a[l,z a]cO.
22. IMAGINAR Y Q U A D R A T I C F I E L D S 50 9
Hence th e terminolog y an d result s fo r quadrati c form s whic h wer e use d i n th e
previous sectio n translat e naturall y fo r ideals . I n thi s sectio n w e go through thi s
transition.
Recall th e following :
/ - the group o f non-zero fractiona l ideals ,
P - the subgroup o f principal ideals ,
H = I/P - the class group ,
h=[H\- th e class number ,
D = (VD) - the different .
Every clas s A G H ha s a unique, primitiv e idea l (22.35 ) wit h z a i n the funda -
mental domai n F — F~ U F+ o f the modular grou p Y = SL 2(Z); suc h idea l i s said
to b e reduced. Thu s th e number o f reduced ideal s i s h. Th e subgroup
G = {A 2 : A e H}
is calle d th e principa l genu s an d the facto r grou p T = %/Q i s calle d th e genu s
group. Henc e tw o non-zer o ideal s a , b belon g t o th e sam e genu s i f an d onl y i f
a — be2 for som e eel. On e can also sho w tha t a , b are in the same genu s i f and
only if
(22.36) Na = NbNj fo r some 7 e K*.
A clas s A G % is said t o be ambiguou s i f A = A - 1 , i.e. , A 2 = 1 . Th e group of
ambiguous classe s
£ = {AeH:A 2 = l},
as the kernel o f the homomorphism A —) • A 2, i s isomorphic t o the genus grou p T.
The correspondenc e betwee n quadrati c form s an d primitive ideal s cast s ambiguou s
forms int o ambiguou s ideal s ( a is ambiguous i f a2 is principal); i t proves tha t ever y
ambiguous clas s A ha s its reduce d primitiv e idea l a wit h z a o n the boundar y o f
F + . Recal l tha t a reduce d ambiguou s for m ip(X,Y) = aX 2 + bXY = cY 2 ha s
a — b, a = c or 6 = 0 , hence i t yield s a factorization o f the discriminant int o
-D = d ±d2 = a(4 c - a) , (2 a - b) (2a + 6) , 4ac ,
respectively, wit h d 2 > d\ > 0 and ( d i , ^ ) = 1 (the coprimality o f these factor s
results fro m D bein g fundamental) . Conversely , a factorizatio n — D = d\d 2 wit h
d2 > d\ > 0 , (di,d 2) = 1 yields a reduce d ambiguou s form . Fo r example , i f
D = l(mo d 4 ) we obtain
d l 2 2
f <p(X, Y) = dxX2 + d xXY + ^ r ,i f d 2 > 3a7!,
[ y,(X,y ) = ^ X2 + d l_^XY^dl±AY2, i f * < 3d!.
Correspondingly, a reduce d primitiv e ambiguou s idea l yield s a factorizatio n
of th e differen t D = (y/D) int o D = D1 U 2 with (^1 ,^2 ) = 1 ; not e tha t Cl(d\) =
C£(D2). Distinc t suc h factorization s (u p to the order o f factors di,d 2) yiel d distinc t
ambiguous classes , therefor e
(22.37 ) ft 0 = \S\ =2t~1
510 22. IMAGINAR Y QUADRATI C FIELD S
where t = u(\D\) is th e numbe r o f distinc t prim e divisor s o f D (not e tha t (22.37 )
agrees wit h (22.28 ) i n cas e o f fundamenta l discriminants) .
The genu s o f a n idea l ca n b e determine d b y value s o f rea l character s o f th e
class group . Thes e character s ar e assigne d t o ever y factorizatio n D = D1 D2 int o
two fundamenta l discriminant s D\,D2- Not e tha t Di,i^ 2 hav e differen t sign s an d
are coprime , s o w e hav e exactl y 2 t~l distinc t factorization s u p t o th e order . W e
define XD 1 ,D2 n r s ^ o n prim e ideal s b y
lXD2(Np) i f p f D 2
(this i s wel l define d becaus e XD(NQ) = 1 if (a , D) = 1 ) , an d w e exten d XD 1 ,D2 t o
all non-zer o fractiona l ideal s b y multiplicativity . W e obtai n a characte r XD UD2 :
I— > {±1} suc h tha t XD 1 ,D2(a) — 1 f° r a ^ d E P. Indeed , fo r a = (a ) wit h
a = ±(m + ny/D), (a,D) = 1 we have XD UD2{O) = XDA*) = XDi(^(m 2 - n 2D)) =
XD1(n^) — 1 (similarl y on e ca n chec k th e othe r cases) . Therefor e XD 1 ,D2 € W;
these ar e exactl y al l rea l character s o f th e clas s grou p calle d th e genu s characters .
Two ideal s a , b G / ar e i n th e sam e genu s i f an d onl y i f
x(a) = x(b ) fo r al l genu s characters .
The genu s theor y o f Gaus s wit h it s explici t characterizatio n o f ambiguou s
classes prove d t o b e ver y usefu l fo r primalit y testin g an d factorizatio n techniques .
A metho d fo r factorin g — D whic h exploit s th e clas s grou p structure , a s describe d
by D . Shank s [Sha] , require s onl y 0(|D|4 ) operations .
If — D i s prime, then th e whole class group i s the principal genu s Ti = Q, I n thi s
case H tend s t o b e cycli c mor e ofte n tha n not . Afte r Gauss , a discriminan t D (no t
necessarily negativ e prime ) i s said t o b e regula r i f the principa l genu s i s cyclic (thi s
is not th e sam e a s th e regula r prime s occurrin g i n th e theor y o f cyclotomi c fields).
We stil l d o no t kno w i f ther e ar e infinitel y man y regula r discriminants . However ,
the numerica l evidenc e support s th e existenc e o f a large proportio n o f these. I t ha s
been conjecture d i n [Ge ] tha t th e proportio n o f regula r t o al l discriminant s (bot h
negative) i s
00 _ . .
(C(6)IIC(n)) =0.8469... .
n=4
Gauss genu s theor y gives u s a ful l descriptio n o f th e 2-Sylo w subgroup s o f th e
class group . Ther e ar e som e interesting , however , incomplet e development s fo r
other p-Sylo w subgroup s o f H, i n particular, fo r p = 3 by Davenpor t an d Heilbron n
[DH]. Accordin g to the Cohen-Lenstr a heuristic s th e probabilit y tha t a n imaginar y
quadratic field ha s i n it s idea l clas s grou p a n elemen t o f orde r p i s
00
l-n^-p - ")-
71=1
22. IMAGINAR Y 1
QUADRATI C FIELD S 5 1
22.3. Th e clas s grou p L-Functions .
Let x : H—) > C * b e a characte r o f th e clas s group . Defin e
(22.39) L K(s,X) = J2x(*)(Nay s
a
=
for R e (s) > 1 wher e th e serie s converge s absolutely . Notic e tha t LK(S,X)
LK{S, X)- Fo r an y primitiv e idea l a = [a , b+ 2 ] we hav e
J2X(a)LK(s,X) = hJ2(Nb)- s = ^a- s £ \a\~ 2s
by writin g b = (a)a . Sinc e a - 1 = Z + z a Z, w e deriv e
E i«r 3'= E E i™+^r 2s
O^aGa- 1 (m,n)^(0,0 )
m+n 2s = 2 2s W\\-s
c(2«)E E i ^i" ^ ) ( ^ ) " ^«> s)
(m,n) = l
where
E(Z, S )= 5 3 (im 7«r = ^EEj'*i m+n;j: i" 28
Ter^vr (m,n)= i
is th e Eisenstei n serie s fo r th e modula r group . Thi s give s u s
1h/J\D
(22.40) ^ x(a)L K(s, X ) = — ( ^ - [ ) C ( 2 * ) £(*„ , *)•
We pu t
s
(22.41) 6>(s)=7r- r(s)C(2s),
(22.42) E*(z,s)=0(s)E(z,s),
s
(22.43) A K(S,X) = (2n)- r(s)\D\iLK(s,X).
In thi s notatio n (22.40 ) become s
(22.44) £ x(a)A/ f (*, x) = ^E*(z a, s).
By Fourie r inversio n (th e orthogonalit y o f characters ) w e obtai n
(22.45) A K(s,X) = -J2x(a)E*(z a,s)
a
where a runs ove r a set o f primitive, inequivalen t ideal s ( a good choic e being the se t
of reduced ideals) . B y virtu e o f this formul a th e Heck e L- function inherit s analyti c
properties o f th e Eisenstei n series .
It i s well know n (se e (1 5.1 3) ) tha t E*(z, s) ha s th e Fourie r expansio n
(22.46) £?*(* , s) = 6{s)y s + 0(1 - s)y 1
~s
oo
+ 4 v ^^r s _ 1 / 2 (n)K s _ 1 / 2(27rny)cos(27rnx)
512 22. IMAGINAR Y QUADRATI C FIELD S
where
(22.47) ,(«)-E(sr- ad=n
The Fourie r expansio n yield s analyti c continuatio n o f E(z, s) to the whol e comple x
5-plane, i t shows tha t E(z,s) i n Re (s) ^ \ ha s onl y a simple pol e a t s = 1 with
constant residu e ~ , and i t satisfies th e functiona l equatio n
(22.48 ) E*(z,s) = E*(z,l-s).
Inserting (22.46 ) int o (22.45 ) w e obtain th e Fourie r expansio n
,
(22.49) A Jf (, )X )=ij;xW{'W(f) +»(l-«)(f)1 '
4
\D\\W D\
+ ( ^ ) £T._ Va ( » ) * .1
_ /2 (*n—)cos(™-)}
where a runs ove r primitiv e representative s o f ideal classes . Not e tha t th e Fourie r
series converge s rapidl y sinc e th e Besse l functio n K J/(y) ha s exponentia l deca y
1 / 2
^(y)=(^) e - ( i+ ^ ) .
2yJ
for y > 0, v e C where \0\ ^ \v 2 - \\ (se e (23.451 .6 ) o f [GR]).
We deduce b y (22.45 ) or (22.49) tha t LK(S, X ) n a s analytic continuatio n t o th e
whole comple x s-plane , i t is entire excep t fo r a simple pol e a t 5 = 1 if \ i s trivial
with
27T/ i
r
(22.50) esCx(s)
5=1
w\/\D\
and i t satisfies th e functiona l equatio n
(22.51) A / f (s,x) = A jK:(l-s,x).
All th e above propertie s o f LK(S,X) c a n ^e also deduce d fro m th e integral
representation ( a la Riemann, se e (4.77) ) du e to Hecke
<22-52> «''* > - s ^ ij + /"(»- + . -U (-4fe,)*
where 5(% ) = 1 if x is trivial, o r else <5(x ) — 0 , an d
(22.53)/ XW = E# W
a
where a runs ove r al l non-zer o integra l ideals . Integratin g termwis e w e ge t
22. IMAGINAR Y QUADRATI
1 C FIELD S 5 3
where a =T V a an d T(s,x) i s the incomplete gamm a functio n
s l
T(s,x)= / e-yy - dy.
JX
Hecke derive d th e formula (22.52 ) b y splitting LK{S, X) m ^o a sum of Epstein zet a
functions (apparentl y introduce d first b y Dirichlet)
s
(22.55) Z a(s)= J2 1 2 ^a(m,n)-
(m,n)#(0,0)
where (p a(X,Y) = aX 2 + bXY + cY 2 i s the quadratic for m correspondin g t o the
ideal a . Thi s i n turn i s the Mellin transfor m o f the theta functio n
mn
Both 6 a(z) an d f x(z) ar e modular form s o f weight on e for the group ro(|D| ) an d
character XD- I f X i s n ° t real , the n f x(z) i s a primitiv e cus p for m wit h Heck e
eigenvalues
(22.56) A x (n) =£ X (a).
Na=n
If x i s real , sa y x — XD 1 ,D2 wit h D1 D2 — D, the n th e Heck e L-functio n factor s
into Dirichle t L- functions. Precisely , i t ca n be verified b y comparing loca l factor s
of Eule r product s an d using (22.38 ) togethe r wit h th e factorization la w of ideals in
K - Q(y/D) tha t
KRONECKER FACTORIZATIO N FORMULA .
L
(22.57) L K(s,XDltD2) = (*,XD
1 )L(S,XD2).
For th e trivial characte r w e have th e Dedekind zet a functio n
00
(K(S) = Y,( Na)~S = C ( s ) ^ XD) = £ r(n, XD )n-°.
1a
In thi s case , (22.45 ) become s
(22.58) A K(s) = -J2E*(z a,s).
a
Comparing th e residues a t s = 1 on both side s w e obtain th e celebrated
DIRICHLET CLAS S NUMBE R FORMULA .
(22.59) L( ,XD) = ^m.
W\/\D\
514 22 . IMAGINAR Y Q U A D R A T I C F I E L D S
EXERCISE 1 . Prov e tha t fo r D < - 4,
(2-XD(2))h(D)= J2 XD(n) .
0<n<^
Another interestin g formul a come s fro m (22.45 ) a t s — \. B y th e Fourie r
expansion (22.46 ) w e see that E(z, \) = 0 and
oo
E'(z,\) = ,/ylogy + 4y/y^T(n)K0(2nny) cos(2>Knx).
l
Hence th e central valu e o f LK(S, X) * S give n by
L
*(§'X) = ^ r 1/ 4 I > ( a ) £ ' ( M ) -
a
Inserting th e Fourier expansio n (22.61 ) w e arrive a t
1av
Using Ko(y) < C y~*e~y on e derives b y trivial estimatio n tha t
av
This is interesting fo r the trivial clas s character xo- Assumin g L(^,XD) ^ 0 (whic h
follows fro m th e Rieman n Hypothesi s fo r L(S,XD), bu t ma y conceivabl y b e es-
tablished sometim e withou t recours e t o th e GRH ) th e lef t sid e i s LK{\,XO) —
C,{\)L\,XD) ^ 0 so we derive
D
D\\K V\ \
MD)ȣ(^p)'iog
where a runs ove r th e complete syste m o f primitive, reduce d ideals . Not e tha t w e
dropped th e facto r 2 in th e logarith m (fin d ou t why this i s permissible!) . Recal l
that a = Na < y/\D\/3 s o al l term s abov e ar e positive . Thi s implie s tha t h(D)
cannot b e very small , namel y
h(D)^>\D\*log\D\,
where th e implie d constan t i s effectivel y computable . Moreover , thi s implie s tha t
a positiv e proportio n o f the primitive , reduce d ideal s hav e th e nor m Na x \/\D\.
Duke [Du2 ] has show n unconditionall y tha t th e point s z a ar e equidistribute d i n
the fundamenta l domai n F wit h respec t t o the hyperbolic measur e d/j, = y~ 2dxdy.
His resul t i s ineffective becaus e i t use s Siegel' s boun d h(D) » \D\2~ S (se e (5.76)).
The Clas s Numbe r Formul a (22.59 ) reveal s th e algebraic natur e o f the specia l
value L(l , XD)- On e can derive infinitel y man y formula s o f that kin d b y comparin g
22. IMAGINAR Y Q U1
ADRATI C FIELD S 5 5
coefficients i n the Taylo r expansio n a t s — 1 on bot h side s of (22.45) . O f particula r
interest ar e th e constan t terms . O n th e lef t sid e w e hav e
(22.61) A K(s,x)~^-LK(l,x)
if x is non-trivial , an d
(22.62) A K(s,X) ~ - ( ^- + l o g ^ l +
1 ^( , XD))
if x is trivia l b y th e followin g expansions :
r ( s ) = i - ( * - i )7 + "-
C(«) = —7 + 7 + ---
s—1
L(s, X D) = L(l , XD) + (S- 1)1/(1, XD) + • • •
and th e Clas s Numbe r Formul a (22.59) . O n th e righ t sid e o f (22.45) w e nee d a n
expansion o f E*(z,s) whic h w e shal l deriv e fro m th e Fourie r serie s (22.46) . Firs t
notice tha t 20(s)y s ~ — s- 1 + 7 — log47ry a s s—» 0 by th e followin g expansions :
(y-Y = i +siogy+...
\7T/7 T
r(s) = - -7 + • • •
S
C(25) = - - -5 l o g 2 7 r + .--
using th e value s C(0 ) - - § and C'(0 ) = -| log27r. Moreover , 20( 1 - s ) y 1 - 2 - §
by usin g th e valu e £(2 ) = ^-. Henc e
(22.63) 2£*(z , s) - - 1 - + 7 - l o g 4 ^ + \y + 2D(s)
5—1 O
where D(z) i s the tai l i n th e Fourie r expansio n (22.46 ) a t s = 1 , precisel y
00
D(z) = 4 v /^^r 1 /2(n)K 1 / 2 (27rny)cos(27rnx).
1
y
Since Ki(y) — {^I2y)^e~ an d Ti/ 2 (n) = <j_i(n)n2, thi s become s
(22.64) £>(z ) = 2Re ^ a _ i ( n ) e ( n ^ ) .
1
If w e execute th e summatio n a s follow s
00 0 0 0 0
_ 1
D(z) = 2Re ^ ^ m e ( m ^ ) = - 2 ^ l og | 1 - e{nz)\
11 1
we find tha t
(22.65) D(z) = ~y-2\og\ V(z)\,
516 22 . IMAGINAR Y QUADRATI C FIELD S
where rj(z) i s th e Dedekin d et a functio n
oo
(22.66) V(z) = e(j i)'[[(l-e(nz)).
The et a functio n i s a modula r for m o f weigh t \ o n th e grou p T = SL 2 (Z) wit h a
suitable multiplie r syste m involvin g Dedekin d sum s (se e e.g . [1 4]) . The n 77(z) 24 =
A(z) i s th e famou s Ramanuja n A function , th e primitiv e cus p for m o f weigh t 1 2 .
Therefore th e functio n
2
(22.67) F(z)=(-%L^\r,(z)\
is T-invarian t (her e th e facto r (2/y^DJ) 1 / 2 i s introduced fo r normalizatio n s o tha t
F(za) = a~ 2 \rj(z a)\2). I n thi s notatio n (22.65 ) become s
(22.68) D(z) = ~y - \ lo g ^ - logF(z) .
Inserting (22.68 ) int o (22.63 ) w e obtai n
l
(22.69) 2E*(z,s) —- + 7 - \og2ir^J\D\ - 2\ogF(z).
5—1
Next insertin g (22.69 ) int o (22.45 ) w e obtai n
(22.70) A x (s,x)^^(x)^(^-+7-log27rV^l)--Vx(a)logF(za).
w \s — 1 / w ^—^
0
ls
Now matchin g (22.70 ) wit h (22.61 ) , o r (22.62 ) i n cas e x trivial , w e deriv e a n
exact equatio n whic h i s calle d th e
KRONECKER LIMI T FORMULA . If x is a non-trivial class group character, then
(22.71) L * ( l , x) = -^£ x (a)logF(zn).
W
V \ D\ a
Moreover,
(22.72) ~(l, XD)=log\D\-'y+^logF{za).
a
Consider (22.71 ) fo r a non-trivia l genu s characte r \ — XB,C which correspond s
to th e factorizatio n D = BC int o non-trivia l fundamenta l discriminant s B, C. On e
of thes e i s negativ e an d th e othe r i s positive , sa y B < 0 < C. B y th e Kronecke r
formula (22.57 ) w e have 1/^(1 , XB,C) = L(l, x#)L(l , \c) a n o - b y the Dirichle t Clas s
Number Formul a (22.59 ) w e hav e L ( 1 , X B ) = 27rh(B)/w(B)y/\B\. Dirichle t als o
established th e clas s number formul a fo r positiv e discriminant s (se e (2.31 )) , namel y
(22.73) L ( l , xc) = 2h(C)\C\~i loge(C )
where h(C) i s th e clas s numbe r o f th e rea l quadrati c fiel d Q(VC) an d e(C) i s th e
fundamental unit . Combinin g thes e formula s w e infe r fro m (22.71 ) tha t
(22.74) £ic) 2h(B)HC)/w(B) = -Q F(a) - x(»)
22. IMAGINAR Y Q U A D R A T I C F I E L D S 517
Besides thei r significanc e fo r algebrai c number s th e Kronecke r limi t formula s
can b e use d t o estimate derivative s o f L- functions. Fo r thi s purpos e i t is more
convenient t o operat e wit h D(z) rathe r tha n F(z), s o we retur n t o (22.68 ) gettin g
by (22.72 ) an d (22.59 )
(22.75) I U x D , . | j : ( i+ _ ^I ( 7+ l0 ,1 £t+1 D(^))
where a runs ove r inequivalen t primitiv e ideal s an d a = Na. I t is no t easil y see n
that th e right sid e of (22.75 ) doe s not depen d o n the choic e of representatives o f the
ideal classe s whil e thi s i s clea r i n (22.72) . A goo d choic e i n practic e i s th e syste m
of primitive , reduce d ideal s becaus e th e point s z a have th e larges t heigh t makin g
D(za) eas y t o estimate . Estimatin g triviall y D(z a) < C 1 we obtai n
where th e implie d constan t i s absolute .
wa s
REMARKS. A differen t formul a fo r 1 /(1 , XD) derive d b y S . Chowl a an d
A. Selber g [CS] , namel y
2
\D\iL'(l,XD) = -* E XD(n)logr(^ j) + ^( 7 + log27r).
W
0<n<\D\ ' '
22 A. Th e clas s numbe r problems .
Estimation o f th e orde r o f the clas s grou p o f a numbe r field i s one o f th e mos t
intricate problem s i n arithmetic. I n particular, on e wants t o know which fields have
class numbe r on e sinc e i t means tha t th e rin g o f integer s o f suc h a field has th e
unique factorizatio n property . A smal l clas s numbe r (relativ e t o th e discriminant )
is no t a common feature . A s wit h man y storie s i n arithmeti c thi s on e start s wit h
Gauss. I n hi s Disquisitione s Arithmeticae , Gaus s i s seriousl y concerne d abou t th e
class numbe r h(D) o f th e imaginar y quadrati c field K = Q ( A / D ) .
GAUSS CONJECTURE . We have h{D) -» oo as -D -> oo .
Even i f thi s conjectur e i s prove d th e rea l issu e i n practic e i s
GAUSS CLAS S NUMBE R PROBLEM . Find an effective algorithm for determining
all imaginary quadratic fields K = Q(\/D) with a given class number h = h(D).
Gauss kne w tha t h(D) = 1 for th e followin g nin e discriminant s
(22.77) -D = 3,4 , 7,8,1 1 ,1 9,43,67,1 63 .
Here i s a remarkable characterizatio n o f thes e discriminant s
PROPOSITION 22. 1 (Rabinovitch , 1 91 3) . For D = l(mod 4 ) we have h(D) = 1
if and only if the polynomial
f(X) = x*-x + i^
represents primes for all natural numbers x < ^ ^ .
The questio n becomes :
518 22. IMAGINAR Y QUADRATI C FIELD S
GAUSS CLAS S NUMBE R O N E PROBLEM . Prove that the above list of nine neg-
ative discriminants D with h(D) — 1 is complete.
Recall tha t h(D) i s linke d t o L ( 1 , X D ) b y th e Dirichle t formul a (22.59 ) bu t
it doe s no t hel p much . I n fac t thi s formul a serve d Dirichle t t o estimat e L ( 1 , X D )
rather tha n h(D) givin g
2-7T
(22.78) L( , XD)>
wJ\D\
since h(D) ^ 1 . Th e clas s numbe r on e proble m boil s dow n t o improvin g th e lowe r
bound (22.78 ) fo r \D\ > 163.
Next, recal l tha t th e effectiv e lowe r boun d
(22.79) ft(D)»|£>|ilog|D|,
follows fro m th e formul a (22.60 ) subjec t t o th e hypothesi s tha t L ( | , X D ) ^ 0 - Thi s
can solv e th e CNO P assumin g GR H (i f th e constan t ar e mad e explicit) .
There ar e muc h stronge r connection s betwee n h(D) an d zero s o f L ( S , X D ) > fo r
example
PROPOSITION 22. 2 (Hecke-Landau , 1 91 8) . If L{S,\D) does not vanish in the
region s > 1 — a/ lo g \D\, then
(22.80) h(D) > b ^ .
where a, b are some positive constants, effectively computable.
In moder n time s th e abov e resul t i s routine , an d on e ca n pus h thi s implica -
tion muc h deepe r whe n powere d b y ne w technologie s (Linnik' s densit y theore m
and Turan' s power-sum s method) . However , th e followin g implicatio n wa s quit e
surprising i n it s time .
PROPOSITION 22. 3 (Deuring , 1 933) . If the Riemann Hypothesis for the zeta
function ("(5 ) is false, then h(D) > 1 for all sufficiently large \D\.
A year late r Mordel l improve d Deuring' s resul t t o th e effec t tha t h(D) - > o o a s
—D—» 0 0 unde r th e sam e assumption . I n th e sam e tim e Heilbron n expande d th e
assumption t o th e falsit y o f th e Rieman n hypothesi s fo r an y Dirichle t L-series .
PROPOSITION 22. 4 (Heilbronn , 1 934) . / / GRH is false, then h(D) -> • 0 0 as
-D - > 00 .
Combining bot h Propositio n 22. 2 an d 22. 4 on e complete s a n unconditiona l
solution o f th e Gaus s Conjectur e tha t h(D) — > 0 0 a s — D— > 00 . O f course , th e
result i s not effectiv e becaus e w e do no t kno w whic h on e o f th e tw o proposition s i s
needed. Th e proble m i s that i f som e L(s , x) ha s a zer o of f th e lin e R e (s) = | th e
resulting lowe r boun d fo r h(D) depend s o n thi s hypothetica l poin t whic h ha s n o
numerical value . Therefor e th e CNO P stil l canno t b e reduce d t o a finite numbe r o f
computations. Som e degree o f effectivization wa s achieve d i n the followin g fashion .
22. IMAGINAR Y Q U A D R A T I C F I E L D S 519
PROPOSITION 22. 5 (Heilbronn an d Linfoot, 1 934) . There is at most one imag-
inary quadratic field K = Q(V~D) with h = h(D) = 1 which is not in the list of
Gauss.
For th e above achievement s som e attentio n shoul d b e given t o E. Landa u fo r
his earl y idea s i n [La2] . Promptl y afte r Heilbron n an d Linfoot , h e wa s able t o
prove th e following remarkabl e estimate .
PROPOSITION 22. 6 (Landau , 1 935) . There exists an absolute, effective con-
stant c > 0 such that for every h ^ 1 all but one of the discriminants D < 0 with
h(D) = h satisfy
(22.81) \D\ ^c/i 8 (log3/i) 6 .
Next C . L. Siegel [Siel ] use d simila r argument s t o establish hi s famou s
THEOREM 22. 7 (Siegel, 1 935) . For any e > 0 there exists a constant c(e) > 0
(not computable) such that
€
(22.82) L( , XD)>C(S)\D\- .
Hence
2 £
(22.83) h(D) > c(e)\D\^ -.
Both paper s o f Landau an d Siegel appeare d i n the first volum e o f Acta Arith -
metica, 1 935 . A simple proo f o f Siegel's theore m wa s given b y D. Goldfeld [Gol ]
(see Chapte r 5) . The next interestin g resul t i s
THEOREM 22. 8 (Tatuzawa , 1 951 ) . SiegeVs bound is effective except for one
possible D for each e > 0. Precisely, if 0 < e ^ ^ and slog |D | ^ 1 , then with at
most one exception
£
(22.84) L( , XD)>IS\D\- .
In 1 98 0 J. Hoffstein [Hof ] established furthe r numerica l improvement s o f Tatu-
zawa's estimat e usin g Goldfeld' s technique .
The CNO P wa s solved fo r the first tim e b y Kurt Heegne r [Hee ] in 1 952 , but
his arguments (modula r forms , comple x multiplication) wer e understood an d recog-
nized onl y afte r hi s death i n 1968. A n additional not e of tragedy wa s that thi s firs t
recognition cam e tw o years afte r anothe r solutio n wa s found b y A. Baker. Baker' s
method [B ] is very different : i t use s a n effectiv e lowe r boun d fo r linea r form s of
three logarithm s o f algebrai c numbers . I n th e meantime , H . Star k [Stl ] gav e a
third solutio n whic h turne d ou t to be similar t o Heegner's, The n Star k looke d int o
Baker's proof an d realize d tha t i t sufficed t o use linear forms onl y in two logarithms,
so that th e problem coul d hav e bee n solve d alread y i n 1949 by Gelfand an d Linnik.
Retrospectively, i t seem s tha t H . Webe r wa s capable o f solvin g th e CNO P lon g
before. Se e [Cox] fo r a presentation o f the Heegner-Stark arguments .
Next wa s the problem o f class number two . Thi s has been solve d independentl y
in 1 97 1 by A. Baker [B ] and H. Stark [St2] . The y establishe d tha t ther e are exactly
eighteen negativ e discriminant s D with h(D) = 2 , namely
-D = 1 5,20,24,35,40,51 ,52,88,91 ,1 1 5,1 23,1 48,1 87,232,235,267,403,427 .
52 0 22. IMAGINAR Y QUADRATI C FIELD S
Our stor y culminate s a t tw o remarkabl e achievement s b y Goldfel d (1 976 ) an d
Gross an d Zagie r (1 983) . Goldfel d [Go2 ] gav e a n effectiv e lowe r boun d fo r h(D),
the qualit y o f whic h depend s o n th e ran k o f a n auxiliar y ellipti c curve , an d it s
satisfying th e Birc h an d Swinnerton-Dye r Conjecture . Gros s an d Zagie r ([GZ1 ] ,
[GZ2]) produce d a curv e wit h th e require d properties . Th e combine d result s o f
Goldfeld, Gros s an d Zagie r solve s th e Clas s Numbe r Proble m fo r an y fixed /i , u p
to a finite amoun t o f computation . J . Oesterl e [Oe ] reduce d th e implie d constant s
in Goldfeld' s wor k s o muc h a s t o mak e th e resul t practica l fo r computers . H e
succeeded i n showin g th e followin g nea t estimate :
(22.85) /l(jD)> _L(log|I?|)ri(l_M).
p\D
We shal l presen t Goldfeld' s approac h i n Sectio n 22. 7 wit h som e variation s o f ou r
own, an d w e survey th e constructio n o f Gross an d Zagier i n the Appendi x t o Chap -
ter 23 .
The bes t effectiv e lowe r bound s fo r h(D) whic h curren t technolog y allow s u s
to hop e fo r ar e o f typ e h(D) > c g(log\D\)9 fo r an y g > 0 an d som e computabl e
constant c g > 0, ye t i t i s too shor t t o solv e som e othe r popula r problem s suc h a s
E U L E R IDONEA L NUMBE R PROBLEM . Find all discriminants D for which the
class group ofQ(y/D) has one class in each genus.
If D i s an idonea l discriminant , the n h(D) = 2 t _ 1 wher e t = w(\D\) i s the num -
ber of distinct prim e divisors of D. Sinc e ^(|D|) ca n be as large as log \D\/ lo g log \D\
the proble m require s a n effectiv e lowe r boun d
(22.86) h(D) > |£>|c/iogiog|£> | wit hc > log2 .
We kno w b y Landau' s estimat e (22.81 ) tha t ther e ar e onl y finitely man y idonea l
discriminants bu t w e canno t lis t the m al l becaus e o f ineffectiv e constants .
22.5. Splittin g prime s i n Q(y/D).
If the clas s number h = h(D) i s small, the n ther e ar e only fe w prim e ideal s p of
degree one with smal l norm. Indeed , i f p = p p with (p,p ) = 1 , then p h i s a principa l
ideal generate d b y ^(m + ny/D) wit h n =fi 0 , whence p h — \{m2 — n2D) ^ A wher e
(22.87) A = M .
Therefore th e leas t prim e p\ — pi(D) wit h XD(J>I) — 1 satisfie s
(22.88) p i ^ VA.
Hence X£>( n) agree s with /x(n ) o n al l squarefree number s n ^ \/ A wit h (n , A) = 1 .
This propert y i s no t likel y t o hol d i n lon g segment s (becaus e XD i s periodi c whil e
\i i s not) , therefor e (22.88 ) suggest s tha t h i s rather large .
In thi s sectio n w e establis h severa l estimate s whic h hol d fo r an y D < — 4 bu t
are interesting onl y when h is relatively small . Le t po (p) be the number o f solution s
to
2
(22.89) b = D(mo d 4a )
22. IMAGINAR Y Q U A D R A T I C F I E L D S 52 1
in fr(mod 2a). This i s the multiplicative functio n wit h po(p a) = 1 + XD(P) ifp\D,
PD{P) — 1 if P I D an d PD{p a) = 0 if p \ D,a > 1. Thu s th e generating Dirichle t
series fo r PD{°) n a s th e Euler produc t
a(-)=E^)°-=n(i+i) n (^iX 1 -^)"
Since p # (&) i s the number o f primitive ideal s o f norm a we have
CK(S) = C(2sr 1 CK(s) = C(2S)- 1 C(S)L(S,XD)
whence
(22.90) p D(a) = ^ XD(6)/X(C) .
6c21 a
First w e show th e following basi c inequalitie s
(22.91) £ ) / M a ) ^ ^] C M<0 -
a^>/A a ^ i / | D | / 3
This follows because the class number ft is equal to the number o f primitive, reduce d
ideals a = [ a , ^ ~ ] wit h b satisfying (22.89) . Fo r a ^ y/K th e points
<**) *->-*£•
have heigh t Im2; a ^ 1 , s o choosin g b with —a<b^a, thes e point s li e i n the
standard fundamenta l domai n of V = 51 ^(2 ) provin g the first inequalit y o f (22.91).
The secon d inequalit y follow s alon g th e same line s sinc e ever y reduce d idea l a has
a = Na ^ y/\D\/3.
Now suppos e p i < • • • < p r ar e the firs t r prime s whic h spli t completel y i n
K = Q(>/D). B y (22.91 ) w e have
Er ' < h,
where a i , . . . , ar ru n over non-negativ e integer s an d r' i s the number o f positiv e
exponents. Henc e i/ r(log A/2 log pr ) < ft where
27
i/P(a) = E "'-
Since u r(a) ^ ^ f , thi s implie s lo g A < (2ftr!) 1 / r (logp r ) « rh 1 ^ \ogp r i f r ^ 2),
i.e.,
(22.93) log* . > H *.
1
2
In particular , th e secon d prim e whic h split s completel y satisfie s p 2 ^ A ^.I f
ft ^ (logA)s , the n
(22.94) \{p ^ exp( x /loiA) : XD(P) = 1 } | < 2<?
522 22 . IMAGINAR Y QUADRATI C FIELD S
provided A is sufficiently larg e i n term s o f g, namel y lo g A > (4g) 4fir. Moreover ,
(22.91) yield s
X(P)=I
Hence fo r an y positiv e intege r r we obtai n
\{p^ A * :XD(P) = 1 }\ ^ r h i .
Choosing r = [| log 8ft] thi s give s
(22.95) \{p < A^ los8h : X(P) = 1 } | < log8ft .
Next w e exten d th e basi c inequalit y (22.91 ) t o large r ranges . T o thi s en d w e
use th e followin g elementar y estimat e (se e Propositio n 1 5.1 0) :
|{ 7 er 0 O \r:im 7 ^>y}|<i + y
which hold s fo r an y Y > 0 and z G H. Applyin g thi s fo r th e point s z a in the
standard fundamenta l domai n o f T — SL2 (Z) wit h Y — yfK/A w e deduce tha t
1(L4
(22.96) ] T p D(a)<fc(l +
a<A v A
for an y ^ 4 > 0. A s befor e thi s implie s
2
H(E ) <H 1 + 77x)-
X(P)=I
Hence fo r an y positiv e intege r r we obtai n
|{p < A : X D(p) = 1}| < rh^ + 5 r ^ ( —) r .
Choosing r = [| log 2ft] -f 1 we conclud e tha t
(22.97) |{ p < A : X D(p) = 1}| « ( 1 + A\D\~*'lo*2h) log2/ i
for an y A > 0, wher e th e implie d constan t i s absolute .
In Chapte r 2 3 we shall nee d th e abov e estimate s i n variou s forms , therefor e w e
end thi s sectio n b y derivin g th e require d results . Firs t restrictin g th e summatio n
in (22.96 ) t o squarefre e number s w e obtai n
(22.98) Y?T(a, XD)^h(l + ^ ).
Hence fo r an y positiv e A, B w e derive b y partia l summatio n
l 2
(22.99) Y! r(a, XD)a- / «h^ + ^y /2
B<a^A
1
(22.100) Y? T&XD)^ « h(± + ^ ).
22. IMAGINAR Y QUADRATI C FIELD S 52 3
Next w e ar e goin g t o estimat e th e su m
b
(22.101) S(A,B)= 1
Y? 2
r{a)r{a,XD)a- 1
Ka^A
(a,C)=l
where C stand s fo r th e produc t o f prime s i n D an d al l th e prime s p ^ B wit h
XD(P) = 1 .
We hav e
T
S(A,B) < Y^Y? ^XD)T(.a2,XD){aia2rll2
l<a\a-2^A
(aia2,C) = l
1 /2
< 2 Y? ^Xz))a- + 2^ b^T(a1 ,XD)r(a2,xz))(aia2)-1 /2
.
B<a^A aia 2^A
a1,a2>B
Applying (22.99 ) an d (22.1 00 ) w e ge t
B<a^A v
2
, / l i4\V 2h / \ A\W , 2 / l logi4\/A\i/ 2
<</I
U +
A) +
7 I ( B + A ) +h (B + ^K)W •
Rearranging term s thi s boun d simplifie s t o
(22.102) S(A, B) < h(l + - - = ) ( ^ + ^ ) V 2 + ^ A ^ v ^ l o g A
IB*
where th e implie d constan t i s absolute , an d effective .
22.6. Estimation s fo r derivative s L^(1 ,XD) «
If th e clas s numbe r o f th e imaginar y fiel d K — Q(y/D) i s unusuall y small , sa y
(22.103) ^ ^ ( ^ H ) ,
then s o i s the valu e o f th e Dirichle t serie s L(S,XD) a t 5 = 1 , namel y
(22.104) L (
1 I X D ) = 0 ( _ J _ )
by virtu e o f th e Clas s Numbe r Formul a (22.59) . Thoug h ther e i s no direc t relatio n
between th e clas s numbe r an d derivative s o f L(S,XD) a t s = 1 all o f thes e value s
are als o affecte d b y th e improbabl e hypothesi s (22.1 03) .
Truncating th e Dirichle t serie s fo r L^(S^XD) w e ge t
!
(22.105) L ^ ( l , X D ) - ^ ^ ^ ( - l o g n ) / c + 0(|D|^- 1 aogx)fe+1 )
n
n^.x
by th e Pol y a-Vinogradov inequalit y (1 2.50 )
(22.106) Yl XD(n)<£\D\hog\D\.
52 4 22. IMAGINAR Y QUADRATI C FIELD S
Hence b y absolut e summatio n
fc 1 fc+
(22.107) L< >(l,xz>)«(k>g|Z>|) .
It i s amazing tha t thi s trivia l boun d i s the bes t know n s o far (apart fro m som e
improvements o f th e implie d constant) . B y th e Rieman n Hypothesi s fo r L(S,\D)
one ca n sho w tha t
fc+1
(22.108) L^( ,
1 XD) « (loglog|Z?|) .
A sligh t improvemen t o n (22.1 07 ) ca n b e derive d fro m th e rathe r subtl e result s o f
Graham an d Ringros e [GRi ] fo r special D having onl y smal l prim e divisor s (se e
Chapter 1 2) . I n thi s sectio n w e give estimates whic h ar e bette r tha n (22.1 07 ) i f th e
class numbe r h(D) i s relatively small .
First w e examine 1 /(1 , XD)- W e know b y Landau's analyti c metho d (se e Chap -
ter 5 ) tha t L(s, XD) canno t hav e two real zero s (counte d wit h multiplicity ) clos e to
5 = 1 , therefore L(l , XD) being smal l shoul d impl y 1 /(1 , XD) i s not small . Thi s ca n
be show n quickl y b y elementar y arguments . T o thi s en d w e evaluat e th e su m
r - U_ V - XD[m)(^ x _ , „(m\\ , „(y/\D\
n
Mn,X»)=£^(l0g*
^—' m V m \x +7 + 0(™)) 0(^V,)
J JV + y J
m^.y
2
= L(l, XD)(logx + 7) + L'(l,XD) +
o (- + ^ log a:)
\x v /
for an y 2 < y ^ x , where the error terms above came from estimatio n o f the relevan t
character sum s wit h y < m ^ x b y means o f the Pol y a-Vinogradov inequality .
Hence w e deduc e th e approximat e formul a
1
(22.109) ^ n - r ( n , X D ) = L(l,XD)(logx + 7) + ^ ( l , X D ) + 0 ( | J D | ^ - h o g x ) .
Notice that o n the left sid e all terms are non-negative an d fo r n = dm2 wit h d \ D we
have T(TI,XD) ^ 1 - Summin g ove r thes e number s (wit h d squarefree t o ensur e th e
uniqueness o f th e representatio n n = dm 2) w e infer tha t (b y choosin g x = e~ 7D)
L(l,XD)\og\D\ + L\l,XD) > (^ +0( ]^-l)y(D),
where v(D) i s almost constant , precisel y
i+
(22.uo) ^)=n( ^)-
p\D V
Assuming (22.1 04 ) thi s yield s 1 /(1 , XD) > (^ + o(l))v(D).
Now we derive a precise asymptotic fo r 1 /(1 , XD) by appealin g to the Kronecke r
Limit Formul a (22.75 ) whic h allow s u s to take ful l advantag e o f the smal l clas s
number condition . On e coul d deriv e th e sam e thing s fro m (22.1 09 ) bu t w e choos e
the alternativ e pat h sinc e it works for an y class character an d reveal s other features .
First, estimatin g trivially , w e ge t
(22.111) L'(l , XD) = y * ( D) + 0 ( ^ == lo g \D\)
22. IMAGINAR Y QUADRATI C FIELD S 52 5
where £(D) i s the su m ove r th e norm s o f primitive reduce d ideal s
(22.112) (D) = ^ V1.
a
We shal l approximat e 1 (D) b y the produc t
(22,13) i W - n H )n ( »+ ;)('-;)"•
p\D F p^\D\ F F
XD(P)=1
Clearly w e have th e uppe r boun d
1(D) < J2 PD(a)a~ l ^ P(D).
a^\D\
For a lower boun d w e apply Rankin' s tric k (recal l tha t A = |.D|/4) :
a a l s
e(D) > Yl Pv( )~ > CK(S) - £ PD(a)a~
where CK( S) ~ C(2s) _1 Cfi:(5) i s the zeta functio n o f K = Q(\/D ) (reduce d t o th e
primitive ideals )
-8
CK(s) = J2PD(a)a- s = H(l+p- s) [ J (I+P-^I-P ) -1 ,
a p\D XD(P)= 1
and s is any real numbe r > 1 to be chosen a t the end to optimize th e obtained
results. Applyin g (22.88 ) b y partial summatio n w e ge t
,_ Ush
E s
Here w e can replace l i s by 33 because i f s ^ 3 we have als o th e trivial boun d
3/s\/A derive d b y using PD(O) ^ a and h ^ 1 . The n w e estimate CK( 5 ) from
below b y the partial Eule r produc t restricte d t o primes p ^ \D\, say PS(D), an d
replace P S(D) by P(D) with admissibl e correction . Precisely , usin g th e inequalit y
wit n
f](l - x p) ^ 1 - J2 XP
^-'-('^)('-?)('^)"('-F)"
_2 1 -p 1 g 0/ „ ^plog p
J
" p + 1 l-ps ^ [
P21
-
we ge t P S(D)/P(D) > 1 - 2(s - l)rj{D) wher e r/(L> ) is given by
(22H4) , < C ) =£ £*? •
X£?(P)#-1
Hence £(D) > P(D){1 - 2(s - l)rj(D)} - 33h(D)/(s - 1 )>/A . Choosin g s close to 1 ,
namely 5 = 1 + (33h(D)/ri(D)P(D)y/\D\)^, w e derive th e unconditiona l formul a
2
(22.115) £(D) = P(D) - 0(rj(D)P(D)h(D)/^/\D\)^
where 0 < 6 < S V ^. Not e tha t rj(D) < C log \D\. Henc e w e obtai n
526 22 . IMAGINAR Y QUADRATI C FIELD S
PROPOSITION 22.9 . Put e{D) = h(D)\D\~i lo g \D\. We have
(22.116) £(D) = P(D){1 + 0(e(D)i)}
where the implied constant is absolute and effective.
Inserting (22.1 1 6 ) int o (22.1 1 1 ) w e come u p wit h
PROPOSITION 22.1 0 . Suppose e(D) = h(D)\D\~i log\D\ ^ 1 . Then we have
(22.117) L'( ,
1 XD) = (^+0(e(D)i))p(D).
The produc t ove r splittin g prime s i n P(D) ca n b e reduce d considerabl y t o tha t
over a ver y fe w smal l prime s b y mean s o f (22.97) . Indee d (22.97 ) give s
4 /1
(22.118) Yl P' 1
< ( B~X + l ^ r °g2/llog|^|)log2/i.
B<p^\D\
XD(P) = 1
Applying thi s boun d wit h B — zlog2h w e deduc e fro m (22.1 1 7 ) th e followin g
PROPOSITION 22.1 1 . Suppose
(22.119) h^ l^i/iogiogiz ^
(22.120) 2^z^log\D\.
Then we have
1
(22.121) 1
L'( , XD) = ±«D) n ( ^ ) 1( +
J)" 1 ( 1 + ° @ )-
p^z log 2h
XD(P) = 1
where v{D) is defined by (22.1 1 0).
Now w e estimat e highe r derivatives . W e star t fro m th e su m
Sk(x) = J ^ X D M n - ^ l o g n ) *
n$Cx
which approximate s t o (— l) k L^ k \l,XD) wit h a n erro r ter m give n b y (22.1 05) . O n
the othe r hand , w e hav e
Sk(x) = £ ] > > ( n , XD )^(logmn)fc.
mn^x
By th e convergenc e o f ^ / i ( m ) m _ 1 ( l o g m ) ^ fo r 0 ^ £ ^ & , an d i n cas e o f £ = 0
using
^/i(m)m-1<(log22:)-1,
we obtai n
5 f c ( x ) « ^ ^ ^ l ( l o g 2 n ) ^ r i ^ 1 .
^-^v n log(2x/n) /
n^x '
22. IMAGINAR Y Q U A D R A T I C F I E L D S 52 7
Estimating th e term s wit h y < n ^ x b y mean s o f (22.1 00 ) w e ge t
1
Sk(x) « J2 ^ ^ W n ) * - + -(logy)*" 1 + ^ = (log*)* .
Here w e estimat e log n b y logy , the n exten d th e summatio n t o n ^ D 2 an d appl y
(22.109) gettin g
Sk[x) « L / (l, X D)(logy) f c -1 + - ( l o g y ) ^ 1 + -^=(log*) fc .
yV A
We choos e x — D 2 an d y = 2h 2 t o arriv e a t
k
(22.122) L( \l,XD1 1
)<:\L'(l,XD)\(log2h) k
- +h\D\- H\og\D\)k.
Combining (22.1 22 ) an d (22.1 1 7 ) w e obtai n
PROPOSITION 22.1 2 . Suppose h(D)log\D\ < \f\D\ , then for any k > 1 ,
1k
(22.123) L^( ,
1 XD) « P(D)(\og2h(D)) -
where P(D) is the product given by (22.1 1 3) and the implied constant depends only
on k.
REMARKS. Notic e tha t (22.1 22 ) implie s th e trivia l boun d (22.1 07) , an d i t be -
comes stronge r whe n th e clas s numbe r i s relativel y small . I n th e derivatio n o f
(22.122) (whic h i s unconditional ) w e mad e a n appea l t o th e Prim e Numbe r Theo -
rem i n th e for m
(22.124) \ r ^ M ( i o g m ) *«i.
m
One ca n avoi d thes e no t s o simpl e result s altogethe r b y examinin g a specia l com -
bination o f th e truncate d derivative s Si(x) wit h 0 ^ £ < k.
EXERCISE 2 . Sho w tha t fo r x ^ 2 and a primitiv e characte r %(mo d |D|) ,
2
2 V r ( n ) x ) ^ ^ = (log x-7l)L(l,x)-27L,(l,x)-i"(l,x)+0(|JD|^-hog2x).
^—' n
Derive fro m thi s tha t
d\D
provided L(1 ,XD) < (lo g |Z)|) —2 an d \D\ i s sufficientl y large .
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/24
CHAPTER 2 3
EFFECTIVE BOUND S FO R
THE CLAS S N U M B E R
We have already seen how the assertion of h(D) bein g small relatively to the dis-
criminant D lead s to simple, yet fake, approximations t o the special values L(^,XD)
and L ^ ^ ( 1 , X D ) - I n thi s sectio n w e conside r centra l value s o f derivative s o f certai n
L-functions associate d wit h automorphi c form s twiste d b y th e characte r XD- I n
this cas e th e leve l o f th e twiste d for m i s muc h large r tha n th e conducto r o f th e
character (i t i s abou t Z) 2), s o w e shal l b e abl e t o observ e positiv e effect s onl y fo r
derivatives o f orde r g > 2 and onl y whe n th e clas s numbe r i s extremel y smal l (es -
sentially h(D) < C (lo g |D|)2 _ 1 ). A s i n th e previou s case s ou r result s ar e effectiv e
but w e d o no t dwel l o n computin g al l th e implie d constants . Ou r motivatio n i s
to illustrat e th e masterwor k o f D . Goldfel d [Go2 ] whic h afte r Gros s an d Zagie r
[GZ1] ha s application s t o giv e effectiv e lowe r bound s fo r h(D). Althoug h fo r thi s
application Goldfel d (an d J . Oesterl e [Oe] ) considere d onl y th e L- function o f a n
elliptic curve , w e tak e an y primitiv e cus p for m o f a fixed leve l an d a centra l char -
acter (possibl y th e trivia l centra l character) . A t th e en d w e derive the lowe r boun d
for h(D) i n question , an d w e make suggestions fo r furthe r searc h o f the L- functions
which ca n b e employe d fo r estimatin g th e clas s number .
23.1. Landau' s plo t o f automorphi c L- functions.
Throughout, / i s a primitive cusp form o f weight k > 1 and characte r £(mo d N)
on th e grou p T 0(N) suc h tha t e(—l) = (— l) k . Therefor e (se e Chapte r 1 4 ) / ha s
the Fourie r expansio n
oo
f(z) = ^2\(n)n—e(nz)
l
with coefficient s X(n) whic h ar e th e eigenvalue s o f Heck e operator s T n fo r al l n .
Notice w e normalize d thes e s o that th e associate d L-functio n
oo
£(*,/) =£A(n)n- s
1
has a n Eule r produc t o f typ e
(23.1) L(s, /) = JJ(1 - \(p)p-° + e(p)p- 2s)-1
v
and th e complet e L-functio n
A(sJ)=^yT(s+^)L(s,f)
529
530 23. E F F E C T I V E B O U N D S F O R T HE CLASS N U M B E R
(which i s entire) satisfie s th e functional equatio n
(23.2) A(s,f) = w(f)A(l-sJ).
Here w(f) i s a complex numbe r dependin g o n / wit h \w(f)\ — 1 and / i s the cusp
form whos e Fourie r coefficient s ar e A(n) .
Let x b e the character associate d wit h th e field K = Q(y/D), tha t i s th e
Kronecker symbo l x(n ) = ( —); i t is a real, primitiv e characte r o f conductor \D\.
We creat e th e for m / 0 x b y twistin g th e coefficients wit h x,
oo
The twiste d for m / ® \ i s a cus p for m o f weight fc,character e and leve l ND 2, but
it i s not always primitive . However , ther e exist s a unique primitiv e cus p for m
oo
Z
fx( ) = J2 A x W n ^ e ( n 2 )
1
of leve l M\ND 2 an d character e x suc h tha t e x(n) — e(n) and Ax (n) = A(n)%(n )
for al l (n,ND) = 1. Therefore th e associated L-functio n
oo
1
has the Eule r produc t
23 1
Hs,fx)= I I (.l-KWp-'+extop- )-
p\ND
(23.3) [ ] ( 1 - X( P)x(p)p-s + E W P "2 * ) - 1.
The complet e L-functio n
A(s,/x) = ( ^r ) r ( s + ^i)L( S)/x)
is entir e an d it satisfies th e functional equatio n
(23.4) A(s,f x) = w(f x)A(l-s,Jx)
with \w(f x)\ = 1 .
Given / an d \ w e conside r th e produc t L-functio n (i n Landau's style )
oo
(23.5) L(s) = L(sJ)L(sJx) = Y,W S'
l
Note tha t L(s) has an Euler produc t o f degree four .
Now w e hav e al l the information abou t L(s, f) an d L(s, fx) tha t i s needed to
work wit h L(s). Th e complet e produc t L-functio n define d by
A(s) = QsT2(s + ^ ) L (5 , /)L(s , /x ) with Q = y/MN/Air 2
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 531
satisfies th e functiona l equatio n
(23.6) A(s ) = wA(l - s)
with th e roo t numbe r w = w(f)w(f x). Ou r first ai m i s t o evaluat e asymptoticall y
the derivativ e A^ 9\s) a t s — \ o f order g ^ 0 . W e make n o further hypothesi s unti l
the tim e come s t o examin e th e mai n term s i n ou r formula .
23.2. A partitio n o f A ^ ) ( \ ) .
We begi n b y expressin g A ^ ( ^ ) a s a su m o f tw o rapidl y convergin g conjugat e
series i n coefficient s o f L(s), simila r t o th e formul a (5.1 2) . T o this en d w e comput e
the contou r integra l
1=-^-. [ Ais+^s-^ds
2W(i)2
in tw o ways . Firs t movin g t o th e lin e R e (s) = — 1 we pas s a pol e o f orde r g + 1
at s — \ wit h residu e A ^ ) ( | ) . The n w e observ e tha t th e ne w integra l o n th e
line R e (s) = — 1 is equal t o — (—l)9wl b y th e functiona l equatio n (23.6) , therefor e
we obtai n A ^ ( | ) — I + (—l) 9wL Her e / denote s th e comple x conjugat e o f / .
Next w e comput e / b y integratin g termwis e th e Dirichle t serie s (23.5) . W e obtai n
/ = Q 1 /2S, s o
1 /2
(23.7) Q - A^(|) = 5+(-l)^ 5
where
,2,s, '-t^v®
and V(y) i s the invers e Melli n transfor m o f g\T 2(s + f ) s ~ ^ - 1 ,
s 2 9 l
(23.9) V(y) = ^~J y~ r (s + ±)s- - ds.
Note tha t (mov e th e integratio n t o R e (s) = | )
(23.10) V(y)= Yl c,(log^'+0(^log§ )
if 0 < y ^ 1 with th e leadin g coefficien t c g = T 2 ( | ) . Moreover , V(y) < y k 2
e~ ^
if y ^ 1 (mov e th e integratio n t o R e (s) = y/y an d estimat e b y Stirling' s formul a
(5.113)). Combinin g bot h case s w e deduce th e boun d fo r th e cut-of f functio n
1
(23.11) F ( 2 / ) < e - v ^ i o g ^ ( l - f 2/- )
which hold s fo r al l y > 0.
By virtue o f (23.1 1 ) th e serie s (23.8 ) begin s t o deca y exponentiall y wit h n ^ > Q
so w e ar e lef t essentiall y wit h Q terms , whic h i s stil l a lot . However , i f th e clas s
number h i s ver y small , the n man y coefficient s a(n) vanish . Unde r thi s fictitious
condition a mai n ter m fo r S wil l emerg e fro m a lacunar y subsequenc e o f th e coef -
ficients (essentiall y fro m a(m 2)). Thi s mai n ter m i s proportiona l t o th e valu e a t
s = 1 o f th e symmetri c squar e L-functio n attache d t o / (whic h doe s no t vanish ,
see Theore m 5.4 4 an d (5.1 01 )) .
53 2 23. EFFECTIV E BOUND S FO R TH E CLAS S NUMBER
Before estimatin g S we pul l ou t al l th e ramifie d place s an d a few smal l one s
which spli t i n the fiel d K = Q(y/D). Precisely , le t C be th e produc t o f primes in
ND an d al l prime s p ^ B with x(p) = 1 - W e shal l choos e B later, i t will b e quit e
small bu t fo r no w w e assum e onl y B ^ |D| . W e writ e
c|C°° V (m,C ) = lV ^
By th e Eule r produc t (23.1 ) th e coefficient s \{n) satisf y
2
(23.13) A(m)A(n ) = ] T s(d)X(mnd~ ).
d\(m,n)
Hence fo r (m , ND) = 1 we hav e
a(rn)= ^ ^(d)r(n,x)A(n )
d 2 n=ra
where ^ = £%. Accordingl y S is partitioned int o
a c 2
( ) V ^ ^(d) \~ ^ / .X(n)/cd n^
S=
c|C°° V (d,C ) = l (n,C ) = lV ^
We expec t tha t th e mai n contributio n t o 5 comes fro m n = m2 and wit h r(ra 2 , \)
deleted, whic h is
a c 2 2
() V ^ ^(d) ^ \(m ) /cd r"2y
Sl =
fe^.Xf .-*- X. —n-Q->
cjC 00
The remainin g term s yiel d
.= Sf.E.^E«^-i)M^) I
c|C~ v ~ (d,C)= l (m,C)= l
V'(d) v ^ ,A(n ) /cd 2 n
v v
„ Poo v
c|C°° A r*\ — -\
" (d,C)= l" ^ r*<\ — l'\
(n,C)= ^
Thus w e hav e 5 = Si + S2 -f 53. W e shal l trea t eac h o f these thre e sum s sepa -
rately. Th e firs t su m S i will b e evaluate d asymptoticall y b y a n appea l t o analytic
properties o f the symmetri c squar e L- function attache d t o the cus p for m / , an d
the othe r tw o sum s will be estimate d b y mean s of (22.86) an d (22.88 ) respectively .
We estimate S 3 essentially b y th e clas s numbe r whil e S 2 turns ou t t o b e negligible .
For th e coefficient s o f th e cus p for m / w e use Deligne' s boun d
(23.14) |A(n) | ^r(n).
This rathe r advance d resul t i s not essential , however , it is useful t o simplify expo -
sition.
23. EFFECTIV E BOUND S FO R TH E CLAS S NUMBE R 53 3
23.3. Estimatio n o f £3 and 52-
We hav e
r(n,X)= J ] a 1( + x(jp)) I I ( l + a (l + x(p)))-
zot l a
p - \\n p' \\n
Hence, writin g n = am , where a is squarefree w e see that r(n, \) ^ T (a, x ) r ( m 2 ) -
2
Moreover, b y Deligne's boun d |A(n) | < r{n) ^ r{a)r(m 2). Therefor e
cIC00 vV
c|C°° dd m
ma>
a>
v
ll ^
(a,C) = l
Applying (22.88 ) an d (23.11) w e deduce tha t
1 0
S 3 « h(l + - ^ ) { ^ c ( l ) + i/c(!)B-*(logA)'+ }
where
c\C°°
fe fc
We hav e a ( / ) = £* A(p )A x (/- ), whenc e fo r any 5 > 0 by (23.14)
00 2
Ms) < n(E (p')p"' ) = ii(i -^ s)-4=fc(*),
r a
P|C 0 p| C
say. Fo r VD{1) we may have a more precis e estimat e
(23.15) MI ) ~ r K1+ — ) « ^ )
where v{D) is defined b y (22.110).
To estimate $ 2 first notic e tha t r(ra 2 , x) = 1 unless m has a prime diviso r wit h
x{p) = 1 , indeed i t is clear fro m th e formul a
r(m2,X)= IJ( l + a(l+X(p))).
p Q ||m
Therefore b y (23.11) an d Deligne's boun d
S2«M§)(logA)*+1 ] T ^^(r(m 2
,X)-l)e-™/^
(m,C) = l
mp/y/Q
m p\C
X(P)=I
Applying (22.86 ) w e infer tha t
l w
(23.16) S 2 « h(l + -^=y c(k)B- (logAy+
We simplif y bot h result s b y taking B in the following rang e
2
(23.17) /i (logA) 2 s + 2 ° < B < A 1 / l o s 2 h .
534 23 . E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R
For s — |, 1 we have UQ{S) ^ £,c( s) ^ ^D(S) < C hr(\D\) ^ h 2, whenc e we conclude
that
(23.18) S 2 + S 3 <^c(l).
23.4. Evaluatio n o f 5 i .
To thi s en d w e us e comple x integratio n an d analyti c propertie s o f th e serie s
which com e out . Insertin g (23.9 ) fo r V(cd 2m2/Q) int o (23.1 2 ) w e ge t
(23.19) s 1 = 4 L- f Q s 2
^ {sJr!l)Z{2s + l)s- 1
9
- ds
27TI J (1 )
where Z(s) i s the correspondin g Dirichle t serie s
Z(S)=(£a(c)c-*/2)(£ ^d)d~')( ^ A(m 2
)m-*).
c|C°° (d,C ) = l (m,C )=l
Recall tha t ip = e^ . W e write Z(s ) = L(s,i/>)M(s, f)P(s, f) wher e
P(s,/) = (£ a (c)c~S/2)(E M)d- ay1 ( E A(m 2)m-S)"\
c|C°° d|C° ° m\C°°
L(s,xp) i s the Dirichle t L-functio n fo r th e characte r if) — e\ an d
oo
A
(23.20) M(s, f) = J2 (m2)m_S-
l
Every Dirichle t serie s abov e ha s a n Eule r product . T o compute the m w e factor th e
Hecke polynomia l fo r L(s , /) int o
(23.21)
1 - X(p)p- S + e(p)p- 2s = ( 1 - a(p)p- s)(l - ^P")
with a(p) + fi(p) — X(p) an d a(p)j3(p) = e(p). Similarly , thi s hold s fo r th e loca l
factors o f L(s , fx) wit h a x (p) = a(p)x(p), P x(p) = f 3(p)x{p) i n plac e o f a(p), /3{p).
This yield s
(23.22) A(p' .)= ;
a— p
Here an d hereafte r w e drop th e argumen t p i n th e root s a, ft, a x, (3 X for simplicity .
By (23.22 ) w e deriv e
£ Wfr-* = ~d - oVT1 - ^d - ft-)" 1
= (i - avr'fi+«/3p-s)(i - frr 1
- 1
=(1 - aV) ( l - aPp- syl{\ - p 2p~s)-\l - e 2
p-2s).
Hence th e C-par t i s equal t o
p(s, f)=n(i - ap-s'2ri{\ - pP-'2)-\i - axP-i2)-\i - n xP-s/2ri
p\C
rj(i - ^p-*)(i - ftoa+ep-T'a - ft-s)-
23. E F F E C T I V E BOUND S F O R T H E CLAS S N U M B E R 53 5
After factorin g 1 — a 2p~s an d 1 — fi 2p~s w e arrange thi s produc t a s
(23.23)
^,/)=n( i+ ^" fl )" i ( i -^" fl )
p\C
H ( l + ap-*/2)(l + Pp~s^2)(l - a xp-l2)-\\ - P xp-
s/2 l
y.
P\c
2
Moreover, w e ge t M(s,f) = L(2s,e 2)~lL(s, Sym /) wher e
2
L(s, Sym /) = [ T ( l - a V 8 ) _ 1 ( l - a / J p ' T ^ l - J ^ P -8 -1
)
P
is the symmetri c squar e L-functio n attache d t o / ; se e Section 5.1 2 . Thi s L-functio n
appears a s a facto r i n th e Rankin-Selber g L-functio n
oo
(23.24) L ( s , / ® / ) = ]TA 2(n)n-s.
l
Indeed, b y (23.1 2 ) fo r m = n w e deduce tha t
2
(23.25) L(a, f ® /) = L(s, e)M(s, f) = J ^ LL ( 8 , Sym / ).
The //-functio n L(s,Sy m / ) admit s meromorphi c continuatio n t o th e whol e com -
plex s-plan e an d ha s a functiona l equation . A s prove d b y Shimur a [Sh2] , i t i s
holomorphic excep t possibl y fo r simpl e pole s a t s = 0 an d 8 = 1 .
The cases for whic h L(s , Sym2 /) ha s a pole at s = 1 are completely understoo d
(the dihedral forms ) an d ver y rare. W e shall assum e tha t ou r functio n L(s , Sym2 / )
has n o pol e a t s = 1 . B y (5.1 01 ) o r Theore m 5.44 , w e hav e
2
(23.26) L ( l , S y m /)^0.
Besides th e abov e arithmeti c propertie s o f th e zet a function s w e nee d crud e
estimates o n th e lin e R e (s) — ~. B y (23.23 ) w e ge t
(23.27) | P ( S ,/)|^c(|)«/>.
For the L-functio n o f the character I/J = e\ w e use the convexity boun d (se e Sectio n
5.9)
(23.28) L ( 5 , ^ ) < H | 2 3 | i .
Hence w e arriv e a t
4
(23.29) Z(s)<^ \s\ \D\*h
where th e implie d constan t depend s o n th e cus p for m / .
Having gathere d th e arithmeti c an d analyti c propertie s o f th e relevan t zet a
functions w e ar e no w read y t o evaluat e th e serie s Si . Movin g i n (23.1 9 ) t o th e lin e
Res = —1 / 4 w e obtai n
(23.30) S i =S 0 + O(h)
536 23 . E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R
where So i s th e residu e a t s = 0, o r simpl y
d9
s 2
(23.31) S 0 = -—Q T (s + \)Z(2s + 1), a t s = 0.
REMARK. Th e error term i n (23.30 ) could be improved by factor | D | - 1 / / 1 6 usin g
Burgess' bound i n place of (23.28 ) bu t w e do not nee d anythin g better tha n (23.30) .
The dependenc e o f So o n th e characte r ip — e\ need s t o b e exposed . T o thi s
end w e se t
(23.32) R(s) = Q sT2{s + §)M(2s + 1, f)P(2s + 1, /)
and differentiat e L(2s + l,ip)R(s) b y Leibni z rul e gettin g
u 9
(23.33) S 0= Yl 2 ( u)L{u)(l^)Riv){0).
u-\-v=g
Next, whe n computin g th e derivative s R^(0) w e think o f Q s a s a dominant facto r
in R(s). Recallin g tha t al l prime s i n C are bounde d b y Q , w e deriv e b y repeate d
application o f Leibni z rul e an d th e produc t representatio n (23.23 ) tha t
(23.34) fl (*)(0) = J R(o)(logQr(l + o ( i ^ ) )
where
(23.35) t(C )= ^p-hogp.
P\c
This approximatio n woul d b e useles s i f the erro r ter m wa s no t small , bu t thank s
to (23.1 7 ) w e ge t b y (22.81 ) tha t t(C) = t(D) + <3(log2ft). W e hav e als o t(D) < C
u(\D\) < lo g 2ft, therefor e t(C) < log2ft .
Finally b y (23.30) , (23.34 ) an d (23.36 ) w e obtai n
(23.36) s 1 =;?(o) Y, 2 "(«)L(u)(i^)aogi^ir(i+o(^^))+o(/
u-\-v=g
log 2ft
l ).
23.5. A n asymptoti c formul a fo r A ^ ( | ) .
Adding (23.1 8 ) t o (23.37 ) w e obtai n
2 log 2ft
(23.37) S = R(0) J2 "(^) i(M) (l^)(log| J D|r(l + o ( ^ g f ) ) + 0 ( ^ c ( l ) )
u+v=g
where R(0) = r 2 ( f ) M ( l , / ) P ( l , / ) . Recal l tha t
(23.38) M (1, / ) = L(2, e2 ) " 1^ ! , Sym 2 / )
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 537
and P ( l , /) i s given by (23.23) . Hav e in mind tha t P ( l , / ) depend s on D bu t rathe r
mildly, precisel y
(23.39) J . ( l , / ) . n ( l+ ^)-,(l-^)(l + ^ )
\ y/i> *{ VP J ^ VP >
where th e produc t i s over prime s p \ D and p ^ B with \{p) — 1- Insertin g (23.37 )
to (23.7 ) w e obtain th e desire d expressio n fo r A ^ ( | ) i n term s o f derivative s o f th e
L-function fo r th e characte r ip = ex a t th e poin t 3 = 1 .
Prom now on we assume that e is a real character, th e symmetric squar e Sym 2 /
has rea l coefficient s (thoug h / ma y hav e bot h rea l an d imaginar y coefficients ) an d
P ( l , / ) i s also real. The n th e approximat e formul a (23.37 ) fo r S and S is the same .
Recall th e formul a (23.7) . Sinc e w e d o no t wis h th e mai n term s cancele d w e mus t
match th e parit y o f g with the sig n of the functiona l equatio n (23.6 ) fo r th e produc t
L-function L(s) = L(s,f)L(s,f x), i.e. , w e requir e
(23.40) w = w(f)w(fx) = (-If.
Now insertin g (23.37 ) int o (23.7 ) w e arriv e a t
(23.41)
Q-U<»>(!) = 2 r 2 ( f ) M ( i , / ) P ( i , / )
52 2«(^)L(«) (l^)(l0gW(l + 0 ( ^ ) ) + 0 ( ^ c ( l ) ) .
u-\-v=g ' '
We shal l refin e thi s rudimentar y formul a i n specia l cases .
Suppose e = 1 , so ty—XD- W e can use the estimates fo r L ^ ( l , XD) establishe d
in Sectio n 22. 6 unde r th e mino r assumptio n h(D)\og\D\ ^ \f\D~\ (se e (22.1 22) )
reducing (23.41 ) to
(23.42)
Q-*A^(i) = 2^r 2(|)M(l,/)P(l,/)
L'(l, XD)(\og \D\y~l (l + o ( j j ^ )) + 0(huc(l)).
Also we could replac e 1 /(1 , XD) b y ^-P(D) usin g (22.1 1 7) . Thoug h w e made s o fa r
rather mino r restrictions for the class number (exactl y such that th e segment (23.1 7 )
is not void) , our asymptoti c formul a (23.42 ) i s interesting onl y if h(D) < C (log \D\) 9,
or els e th e erro r ter m exceed s th e mai n term . Havin g assume d th e clas s numbe r
is tha t small , notic e tha t ther e ar e a t most 2g prime s p < exp(y/\og \D\) whic h
split i n K = Q(VD) (se e (22.95)) . Choosin g B = (log |Z>|)4^+20 (whic h i s withi n
(23.17)) w e find tha t i/ c (l) x u D(l) < v{Df b y (23.1 5) , the n 1 /(1 , XD) ~ v(p)
by (22.1 1 9) , an d
,2,43, P ( u )p\D, n H r V ^ p\D
v
+ihn(i+^f)
by (23.39) .
538 23. E F F E C T I V E B O U N D S F O R T HE CLAS S N U M B E R
23.6. A lowe r boun d fo r the class number .
As before x = XD is the character of the imaginary quadrati c field K — Q(\/D).
If the lef t sid e of (23.42) vanishes , we are left wit h a lower bound fo r the error ter m
which contain s th e class numbe r h — h(D). W e put this observatio n as
PROPOSITION 23.1 . Suppose f is a primitive cusp form of weight k ^ 1 , level
N and trivial central character. Suppose the product L-function vanishes at s = \
to order
(23.44) m = ordL(sJ)L(sJ x)^3.
Let g — m — 1 or g = m — 2 be such that (-1 ) 9 = w — w(f)w(f x). Then
1
(23.45) h(D) > 0(£>)(lo g \D\y-
where
(23.4S ) W = n(^n ' + ^ff-
p\D
The implied constant depends only on f and m and it is effectively computable.
The theor y o f ellipti c curve s provide s plent y o f example s o f cus p form s / G
S2(ro(iV)) for whic h th e associated L-functio n LE(S -f \) — L(s,f) shoul d hav e
large order of vanishing at s = \ , precisel y equal to the rank of the group of rational
points o n E, b y the Birch-Swinnerton-Dyer Conjecture . Beside s th e rank w e nee d
to kno w th e root number s w(f), w(f x) t o control th e parity o f g. I n general, fo r a
primitive for m / G Sk(To(N)) o f squarefree leve l N w e have (se e e.g . Propositio n
14.16 or [1 4] )
k
(23.47) w(f) = i /2(N)\(N).
There i s no simple formul a fo r w(f) i f N i s not squarefree. I f the greates t commo n
divisor (N,D 2) i s squarefree, the n th e twisted for m i s primitive, i.e. , fx = / ' ® % ,
has leve l M = [N, D2} an d the root numbe r is
(23.48) w(f x) = x(-Ni)n(N 2)\(N2)w(f)
where N = N XN2 wit h (Ni,D) = 1 an d 7V 2|£>. Thi s simplifie s t o w(f x) =
X(-N)w(f) i f (N,D) = 1 . Exploitin g (23.47 ) an d (23.48 ) w e shall deriv e fro m
Proposition 23. 1 our final result
T H E O R E M 23.2 . There exists an absolute, effectively computable constant c > 0
such that for any fundamental discriminant D < 0 we have
(23.49) h{D) ^c6{D)\og\D\
where
(23.50) 6 | (D) = Jl( i + I ) - 3 ( i + ^ ) - 1 .
p\D
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 53 9
PROOF. Firs t w e handl e th e cas e whe n p = 3 7 split s i n Q(y / D); i n thi s case
(22.72) implie s h(D) log37 ^ log|jD/4 | whic h i s bette r tha n (23.49) . Therefore ,
from no w o n w e assume tha t
(23.51) XD(37 )+ 1 .
We begi n wit h th e ellipti c curv e
(23.52) E 0 : y2 = x 3 + 1 0x 2 - 20 x + 8
which wa s considere d b y Gros s an d Zagier [GZ1 ] ; thi s i s a modula r curv e o f con -
ductor NQ = 3 7 and ran k VQ = 0 . Th e correspondin g primitiv e cus p for m i s
oo
f0(z) = £ X 0(n)nh(nz) e S 2(T0(N0))
l
and th e associate d Hasse-Wei l zet a functio n i s LE 0{S) = L(s — | , /o ) wher e
oo
L{$Jo) = ^ A 0 ( n ) n _ s .
l
This satisfie s th e appropriat e functiona l equatio n wit h sig n w(fo) = 1 . Fo r th e
proof o f Theore m 23. 2 w e tak e th e twiste d curv e
(23.53) E : -139y2 = x 3 + lOrr 2 - 20 x + 8 .
This i s a modula r curv e o f conducto rT V = 3 7 • 1392 an d ran k r = 3 . Th e corre -
sponding primitiv e cus p for m i s
oo
f(z) = J2 Kn)nh(nz) e S 2(T0(N))
i
with A(n ) = Ao(n)x~i39(n) , i.e. , f = f 0 0 x-1 39 , an d th e associate d Hasse-Wei l
zeta functio n i s LE(S) = L(s — |, / ) wher e
00
L(s,/)=]TA(n)n- s .
1
The sig n o f th e functiona l equatio n fo r / i s
w(/) = X - i 3 9 ( - l M / o ) = X-i39(-l ) = - 1
according t o (23.48) . Henc e L ( ^ , / ) = 0 , bu t Gros s an d Zagie r showe d tha t als o
L'{\if) — 0, therefor e L(s,f) vanishe s a t s = \ t o th e orde r a t leas t three . (I n
Appendix 23.A , w e sketc h th e proo f o f th e resul t o f Gros s an d Zagier) .
Now we examine the twisted for m f®XD £ S 2(YQ{ND2)) an d it s correspondin g
primitive for m f XD G S2{T0(M)) o f leve l M\ND 2.
C A S E 1 3 9 | D. The n M = ND 2, f XD=f®XD = fo® X-1 39D, an d b y virtu e o f
(23.48)
M / X D ) = X-1 39D(-A r i)M^2)A 0 (^ 2 )
where N XN2 = 3 7 wit h (N UD) = 1 and N 2\D. I f N x = 3 7 an d N 2 = 1 , w e ge t
wtfxo) = X-i39D(-37 ) = X-i39(-37 ) = -n;(/o ) = - 1 . If ^1 = 1 and N 2 = 37 , we
get w(f XD) = - X - I 3 9 D ( - 1 ) A 0 ( 3 7 ) = -A 0 (37) = - w ( / 0 ) = - 1 b y virtue o f (23.47) .
540 23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R
CASE 1 39|D . W e writ e D — — 139C wher e C i s a positiv e fundamenta l discrimi -
nant coprim e wit h 1 39 . Accordingl y th e field characte r factor s int o \D — X-139XC
and / <g ) XD = / o ® X-139 ® Xc- Her e th e for m / o 0 X-1 3 9 * s n° t primitiv e becaus e
the trivial characte r X-1 3 9 simply kill s the place p — 139 in its L-function , however ,
it i s induced b y th e primitiv e for m /o - Therefor e th e primitiv e for m whic h induce s
/ ® XD i s f XD = / o ® Xc, i t ha s th e leve l M = [37 , C2 ] = 37C 2 /(37, C) an d th e
root numbe r
w(fXD) = xc(-N 1 )fi(N2)X0(N2)
by virtu e o f (23.48) , where N XN2 = 3 7 with (N UC) = 1 and N 2\C. I f N x = 3 7 and
N2 = 1 , we get w(f XD) = Xc(—37 ) = X-i39£>( —37) = —1 , as i n th e forme r case . I f
N± = 1 and N 2 = 37 , we get w(f XD) = -A 0 (37) = - 1.
Now we conclude tha t i n all cases w(f) — w(f XD) = —1 , whence the L-functio n
L(s) = L(s, f)L(s, f x) ha s roo t numbe r w — w(f)w(f XD) = 1 . Sinc e L(s) vanishe s
at s = ^ t o order m ^ 3 (actually m ^ 4 ) we can appl y Propositio n 23. 1 with g = 2
proving (23.49) . •
EXERCISE. Deriv e fro m (23.49 ) th e followin g effectiv e boun d
(23.54) h(D) >exp(-(loglog|D|)i)(log|£>|) .
[Hint: Us e genu s theor y i f D ha s man y smal l prim e factors. ]
23.7. Concludin g notes .
Our formul a (23.41 ) i s not restricte d t o the L-function s associate d wit h ellipti c
curves, an d on e hope s t o us e i t mor e efficientl y b y employin g othe r L-function s
which vanis h t o hig h order a t th e centra l point . A n interestin g propositio n i s to tr y
the L-function o f a cusp form with non-trivial centra l character, th e point bein g tha t
the leadin g ter m i n (23.41 ) ha s th e orde r o f magnitud e L(l,exr>)(lo g \D\) 9 whic h
for e ^ 1 is better tha n th e bound L'(l , xz))(log \D\) g~l deduce d fro m (23.42 ) whe n
e = l.
There ar e severa l example s o f cuspida l L-function s othe r tha n th e Hasse-Wei l
zeta function s o f ellipti c curve s whic h vanis h a t th e centra l poin t s — \ t o th e
order a t leas t tw o (bu t stil l with th e trivia l centra l character , unfortunately ) foun d
by F . Rodrigue z Villega s [RV] . Thes e ar e th e L-function s fo r Heck e character s o f
an imaginar y quadrati c field whic h ar e describe d i n Chapte r 3 by formula s (3.92) -
(3.95). Suppos e £ is such a character o f the field Q(y/—q) wit h q > 3 , q = 3(mo d 4 )
and o f weigh t £ ^ 1 ,£ = l(mo d 2) , i.e. , £ i s a characte r whic h o n principa l ideal s
takes
«(->»-(T)(R)'
for a = \{m + riy/^q). Ther e ar e exactl y h(—q) character s o f thi s type , eac h o f
which yield s a primitiv e cus p for m
/«(*) = ^2tta)(Na)h(zNa) G S e+
1 (T0(q))
a
(see (3.89)) , an d th e associate d L-functio n
a
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 541
satisfies th e following equatio n (self-dual )
(g)-r(.+ ^,0_.( g)'--r(,-. + <) i(1 -,o
(see Theore m 3.8 ) with th e root numbe r w = (—1 )^~( 2 ). I n the Main Theore m
of Rodriguez-Villega s [RV ] the centra l valu e L ( | , £ ) i s expresse d a s a squar e of
a linea r combinatio n o f certai n thet a functions , showin g tha t L ( | , £ ) ^ 0 . I t is
also prove d tha t L ( | , £ ) i s positive i f £ = 1 (because i t is true o n average ove r all
characters £ and these character s ar e Galois conjugate s o f each other) . Fo r £ = 3
the non-vanishin g o f L ( | , £ ) i s guaranteed onl y i f the class numbe r h(—q) is not
a multipl e o f 3 (thi s conditio n i s required t o show ther e i s only on e Galois orbit) .
Consistently, fo r q = 5 9 we have h(— 59) = 3 and L ( | , £) = 0 . Sinc e th e roo t
number is w = —(-^) = 1 we get three distinc t function s L(s,£) eac h o f whic h
vanishes at s = \ t o order a t least 2 , and the corresponding cus p forms hav e weigh t
23.A Appendix : Th e Gross-Zagier L-functio n vanishe s t o orde r 3 .
We will sketch the proof b y Gross and Zagier tha t th e Hasse-Weil zet a functio n
of the curve (23.53 )
-139y 2 = x 3 + 1 0x 2 - 20 x + 8
vanishes t o order ^ 3 at the central critica l point . Precisely , w e explain th e proof
of:
THEOREM 23.3 . For the above L-function, we have ! / ( / , \) = 0.
Since th e sign o f the function equatio n fo r L(f,s) i s —1 , thi s implie s tha t th e
order o f vanishin g i s a t leas t 3 a s desired . Th e proof i s base d o n a formul a o f
Gross and Zagier for the special valu e of the derivative of the L-function o f E (ove r
a suitabl e auxiliar y imaginar y quadrati c field) i n terms o f the height o f a specia l
point. T o state this , w e recall som e notation .
Let fcbea numbe r field an d M^ th e set of (normalized ) absolut e value s o f &; ,
denoted | • \v fo r v G M^; those includ e archimedea n a s well a s non-archimedea n
absolute value s an d we have the product formul a
TT \x\ v = 1 for any x G k*.
veMk
For an y x = [xo : x\ : x^\ i n the projective plan e P 2(fc), th e absolute logarithmi c
height o f x is
(23.55) h(x) =^ lo g Y[ max{|x 0 | v ,|xi| v ,|x 2 | v }
I- * ^ veM k
(this is well-defined becaus e of the product formula) . I f E c P 2 is an elliptic over /c,
then th e function x H- » h(x) i s almost a quadratic for m (wit h respec t t o the group
structure). Denotin g
hE(x)= h m,
n—>+oo 2
Tate an d Neron showe d tha t HE is a quadrati c for m wit h HE(X) = h(x) + O(l),
called th e canonical heigh t o n E. W e only nee d th e obvious fac t tha t 1 IE(0) — 0 ,
which follow s becaus e h(oo) = 1 ; see [Sil] for more details .
542 23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R
Let E/Q b e an ellipti c curv e wit h conducto r N. I t i s modula r an d a (non-
obvious) consequenc e i s tha t ther e exist s non-constan t map s TT : XQ(N) — > E.
Among these , ther e i s a unique on e such tha t 7r(oo ) = 0 and 7r*(dz) = cf(z)dz fo r
some constan t c > 0, where dz i s the translation-invariant differentia l for m o n E
and / i s the primitive weight 2 cusp form associate d t o E. Th e map TT is then calle d
the modula r parameterizatio n o f E an d in fac t i t i s induced b y the holomorphi c
function H -> C defined by
f
TT(Z) = 2nic / f(w)dw.
Let K/Q b e any imaginar y quadrati c field with discriminan t D < 0 such that ever y
prime p dividin gT V splits i n K (i n particular, i s not ramified) . Not e tha t ther e
are infinitel y man y suc h fields becaus e th e conditions ar e XK{P) = 1 for p | iV,
which ar e congruence condition s fo r D. Ther e exis t 2 ^ ^ ) ideal s n c O suc h tha t
O/n ~ Z/iV Z (wher e O C K i s the ring o f integers) , thos e bein g i n one-to-on e
correspondence wit h solution s ft (mo d 27V ) to the congruence f3 2 = D (mo d 4iV) .
To each ideal class a of K i s associated the corresponding comple x point z f l G l
(see Sectio n 22.1 ) . On e easily show s tha t ther e i s an SL(2, Z)-equivalent poin t z a
such tha t
_ - E + V ^P
Za
~ 2A
with N | A an d B = / ? (mo d 2N). Thi s poin t i s uniqu e modul o T 0(N). Th e
Heegner poin t o n E associate d t o K an d n is defined t o be
y* = X>(*a) e £(C),
a
where the sum runs ove r al l ideal classe s o f K.
THEOREM 23. 4 (TH E GROSS-ZAGIE R FORMULA) . Let E/Q be an elliptic curve
with associated cusp form f, K an imaginary quadratic field as above with discrim-
inant D, x the Kronecker character associated to K. Let
LK(f,s) = L(f,s)L(f® X,s).
Then L K{f,\) = 0 and
where \\f\\ is the Petersson norm of f G S2{N), u G {1,2,3} is half the number of
units of K, deg(-7r ) ^ 1 is the degree of the map TT : XQ(N) —> E.
REMARKS. (1 ) In the applicatio n below , th e modularit y o f E an d th e existenc e
of TT wil l be obvious.
(2) Wha t i s implicit (an d already quit e deep ) i s that yx i s an algebraic poin t
in f£(C) , s o that it s canonical heigh t i s defined. Thi s come s fro m th e modular in-
terpretation o f X${N)\ point s r G ro(7V)\HI ar e in one-to-one correspondenc e wit h
pairs (E,H), wher e E is an elliptic curve over C and H c E(C) is a cyclic subgrou p
of orde r N. Thi s bijectio n i s induced b y the map r f- » (C/(Z 0 rZ) , ( N - 1 ) ) , and
the pai r correspondin g t o z a i s (C/a, q _ 1 a / a ) , wher e N K/qq = N. Thi s curv e has
complex multiplication , henc e finitely man y Galoi s conjugates , whic h implie s tha t
za i s defined ove r O.
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 54 3
More precisely, on e shows that yx G E{K). Th e z a, wher e a runs ove r the idea l
classes i n K, ar e i n fac t a singl e Galoi s orbi t unde r th e Galoi s grou p o f K.
SKETCH O F PROO F O F THEORE M 23.3 . Th e principl e i s tha t i f E/Q i s a n
elliptic curve , wit h associate d cus p for m / , suc h tha t th e roo t numbe r o f E i s + 1,
and i f D < 0 i s a fundamenta l discriminan t o f a n imaginar y quadrati c fiel d K/Q
such that th e twisted curve E^/Q ha s root number — 1, then one has L(f<g>Xi \) ~ 0 -
If i n additio n th e Heegne r poin t yx G E(C) turn s ou t t o hav e h(yx) = 0 , the n
£'*(/, §) = £(/,§)£'(/®x,|) = o.
If L(/ , \) 7 ^ 0 (whic h ca n b e rigorousl y checke d numerically) , the n i t follow s tha t
£'(/®X,5) = 0 .
We no w appl y thi s followin g [GZ2 ] t o th e curv e
E : y 2 = x 3 + 1 0x 2 - 20 x + 8
with th e imaginar y quadrati c fiel d K = Q(\/—1 39) , s o tha t £?£ > i s indee d th e
curve (23.53) . Thi s curve s satisfie s th e assumption s o f Theorem 23. 4 since w e hav e
X-i39(-37) = - 1 . We choos e th e idea l n = ( 1 + u> ) (where u = ( 1 + >/-1 39)/2) .
The discriminan t o f £ i s A = 2 1 2 -37 and th e j-invariant i s j = 2 ^ - = 409 ™00.
Thus th e give n Weierstras s equatio n i s no t minimal . A correspondin g minima l
model i s the curv e
E' : y 2 + y = x 3 + x 2 - 3x + 1 ,
the coordinat e chang e bringin g E' t o E bein g (x,y) \-> (4 x — 2, Sy + 4) . Th e curv e
E^ ha s discriminan t 37 , an d sinc e i t i s squarefree , th e conducto r o f E i s 37 . W e
denote b y / th e weight 2 primitive cus p for m o f level 37 corresponding t o E. Writ e
E(C) = C / ( Z 0 rZ ) fo r som e r G H , a s give n b y th e theor y o f ellipti c functions .
Since th e conducto r N = 3 7 i s squarefree , th e sig n o f th e functiona l equatio n
is give n b y SE = a#/(37 ) whic h give s SE = 1 , an d numerica l checkin g give s
£,(/, I ) = L ( £ , 1 ) « 0.725681 061 9361 5278.. . ^ 0
as required. I n fact, on e can check that E(Q) c ^ Z/3Z i s generated b y (rr , y) = (2,4) .
The idea l clas s grou p o f K i s o f orde r three , generate d b y a = [5 , 1 + v 2 ~ 1 3 9 ] .
The thre e Heegne r point s fo r th e chose n n ar e give n b y
„ _ 1 5 1 + x/ :r 1 39 _ _ - 7 1 + \/ z T39 . _ 3 + y /z: 1 39
Za Z
~ 37 0' ~a~ 37 0' Z
°~ 7 4'
Denoting b y TT th e modula r parameterizatio n X Q ( 3 7 ) —> E, w e nee d t o prov e tha t
7r(za) -f 7r(z- a) + TT(ZO) = 0 € E(C), o r equivalently , tha t
(23.57) ir(z a) + 7r(5_ a) + 7r(z a) G Z 0 rZ .
Notice tha t thi s i s a proble m state d i n analyti c terms . T o prov e it , on e use s th e
following lemm a fro m th e theor y o f ellipti c functions :
LEMMA 23.5 . Let A C C be a lattice, and let f ^ 0 be a A-periodic meromor-
phic function onC. Let si,... , Sd be a set of representatives of A-equivalence classes
of poles and zeros of f, with multiplicity. Then we have si + • • • + Sd G A.
This i s th e complex-analyti c analogu e o f par t o f Propositio n 1 1 .33 , whic h w e
proved ther e fo r ellipti c curve s ove r finite fields. I t i s i n fac t onl y a specia l cas e o f
the Abel-Jacob i Theore m (se e [Rey ] fo r instance) .
54 4 23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R
P R O O F . Choos e a fundamenta l parallelogra m P C C fo r C/ A an d represen -
tatives fo r th e Si suc h tha t Si i s i n th e interio r o f P fo r al l i. The n conside r th e
integral
2*iJ9pZf(z)dZ Y l
by th e residu e theorem . I f 7 an d 7 + A o are paralle l side s o f P , wit h A o G A, the n
(taking int o accoun t orientation ) th e substitutio n z H * Z — zo o n th e secon d on e
yields b y A-periodicit y
™ y7 * -mdz+*n / 7+Ao
z
W)dz=2^ y 7 W)dz e Ao Z
since th e integra l i s th e windin g numbe r aroun d 0 o f th e close d loo p t \-t f(^{t)).
Hence th e resul t follows . •
We wil l exhibi t a n ellipti c functio n o n C/( Z 0 rZ ) wit h a pol e o f orde r 3 at 0
and simpl e zero s at th e thre e points n(zb) 1 thereb y provin g (23.57 ) b y Lemma 23.5.
Start wit h th e meromorphi c functio n u o n Xo(37 ) give n b y
u(z) = J * £ -
where7 7 is th e Dedekin d et a functio n give n b y
V(z)=e[^)l[(l-e(kz))
(see (22.66)) .
Letting q = e(z) , w e hav e a t infinit y th e expansio n
so u ha s a pol e o f orde r 3 a t 00 . Sinc e rj doesn't vanis h excep t a t infinity , thi s i s
the onl y pol e o f u. Moreover , becaus e7 7 is o f leve l 1 , th e substitutio n z »- > —1 /z
shows tha t u ha s a zer o of orde r 3 at th e cus p 0 , an d n o othe r zeros . No w conside r
v(z) = u(z) — U(ZQ). I t ha s stil l a pole of order 3 at infinity , an d a zero at ZQ. Thi s
is a simpl e zero , an d v a s tw o othe r simpl e zero s a t z a an d Z- a. Indeed , becaus e
there ar e a s many zero s as poles, it suffice s t o prove v(z± a) = 0 , or u(z± a) = U(ZQ),
to sho w this .
Notice tha t u(z) 1 2 = A(^)A(372:) -1 wher e A i s th e Ramanuja n function . Fo r
a poin t z G X Q ( 3 7 ) correspondin g t o a pai r (E,H) o f a n ellipti c curv e an d a
cyclic subgrou p o f orde rT V = 37 , a s i n th e remar k above , on e ha s A(z) = A(E)
and A(372: ) = A(E/H). Fo r a Heegne r poin t i b , thi s pai r i s (C/b,n _ 1 b/b). I t
follows tha t U(ZQ) 1 2 = u(z± a)1 2. The n on e ca n chec k (eve n numerically ) tha t
U(ZQ) = u(z± a) or , mor e precisely , prov e tha t U(ZQ) = 1 + w G K, wher e 3 7 =
N(l + CJ) , an d us e th e fac t fro m Comple x Multiplicatio n theory , tha t i f a i s th e
generator o f th e Galoi s grou p o f th e Hilber t Clas s Fiel d o f K correspondin g t o a ,
then u(z± a) — u(zo) a±a = U{ZQ).
To conclude , w e need t o pus h th e functio n v t o E(C). T o d o this , pu t
W(z)= n "t o
n(z')=z
23. E F F E C T I V E B O U N D S F O R T H E CLAS S N U M B E R 545
for z £ E(C). I t i s easy t o check tha t thi s i s a well-defined meromorphi c functio n
on i£(C) , an d it has a pole of order 3 at 7r(oo) = 0 and simple zero s a t 7r(z b). Thi s
finishes th e proof . •
R E M A R K . Usin g program s suc h a s PARI/GP, on e can nowadays find explici t
equations an d formulas fo r all objects i n the above constructions . Som e ar e stated
in [MSD] , Sec . 5 . We thank C . Delaunay fo r computing th e Heegner points . I t is
easier t o deal wit h th e curve
F : y 2 + y = x3 + x2 - 23 x - 5 0
instead o f E. B y [MSD] , w e have E ~ F/fis, wher e fis i s the group o f 3-root s
of unit y (on e ha s F[3] ~ // 3 x Z/3Z as Galois-module), s o F i s isogenous t o E, i n
particular, E an d F (an d thei r twists ) hav e identica l L-functions .
The curv e Xo(37 ) i s the nonsingular mode l o f the plan e curv e wit h equation s
X 0 (37) : y 2 = -x6 - 9x 4 - Ux + 37,
and th e modular parameterizatio n o f F i s the ma p
TT(X, y) = (\(37x- 2 - 5) , |(37yx" 3 - 4)) .
The Hilber t Clas s Fiel d H o f K = Q(V~1 39 ) i s th e splittin g field ove r K o f
P = X 3 +_4X 2 + 6X + 1 . Denot e b y 6La the real roo t o f P, by 0o th e root i n H
and 6 a = 6Q. The n 7r(zb ) £ F(H) i s given by
7r(zb) = (Ai(0 b ) + A 2(0h)V=139,B1 (0h) + B 2 (fl b )>/=l39)
with
1_
Ax = -— 2 (1 1 52X 2 + 6398X + ^ | § 9)
412
1
4 2 = 1 Q n ^ l 2 ( 3 2 5 7 9 X 2 + 53474 X + ±2|8i )
ioy • 4i
2
B x = J_(_39658X + 225512X + 84810)
413
B2 = - * (2662280X2 + 6561700X + 1270577).
l o c / • 41
One ca n then chec k "b y hand" tha t 7r(z a) + 7r(z_ a) + 7r(5o ) = 0 in F(C). (Thi s
means tha t th e three point s 7r(zb ) a r e on a line i n P2 ). Note , however , tha t TT(ZO)
is of infinite orde r (i n F(H)) becaus e th e class numbe r o f Q(y/—139) i s larger tha n
the degre e ( = 2) of the modula r parameterizatio n n (thi s is a theorem o f Nakazato
[Nak]).
This page intentionally left blank
http://dx.doi.org/10.1090/coll/053/25
CHAPTER 24
THE CRITICA L ZERO S O F TH E
RIEMANN ZET A FUNCTIO N
The zero s p = \ + ry o f £(s) o n th e critica l lin e ar e calle d th e critica l zeros .
Let N 0(T) denot e th e numbe r o f critica l zero s wit h 0 < 7 ^ T . Recal l tha t N(T)
denotes th e numbe r o f al l zero s p = /3 + 27 wit h 0 < /3 < 1 and 0 < 7 < T, an d
that w e hav e (se e Theore m 5.24 )
(24.1) N(T) = Ll o g l _ + O(iogT) .
The Rieman n Hypothesi s assert s tha t iVo(T ) = N(T). B y th e densit y theore m
(Theorem 1 0.1 ) w e kno w tha t almos t al l zero s ar e nea r th e critica l line . Rieman n
showed by hand computation s tha t ther e ar e zeros exactly on the line Re s = \ , th e
first critica l zer o being p\ = \ + i^\ wit h 7 1 = 14.1 3 . .. . Using moder n computer s
it ha s bee n checke d tha t million s o f th e firs t zero s o f £(s ) ar e o n th e lin e R e s = \
and the y al l ar e simpl e (se e e.g. [LRW]) .
24.1. A lower boun d fo r N 0(T).
In 1 91 4 , G . H . Hard y prove d tha t £(s ) ha s infinitel y man y zero s o n th e lin e
Re s = | , and seve n years later h e showed jointly wit h J . E . Littlewoo d [HL2 ] tha t
(24.2) 7 V o ( T ) > T
for al l larg e T . I n this sectio n w e giv e a proof o f this result . The n i n the nex t
section w e improve thi s boun d t o th e followin g estimat e o f A . Selberg ,
(24.3) 7 V 0(r)>TlogT,
which say s tha t a positive proportio n o f zero s lie s o n th e critica l line .
We se t
where g(s) = 7r~" S//2r(s/2). B y th e functiona l equatio n g(s)((s) = g(l — s)((l — s)
it follow s tha t f(u) is real an d even . W e conside r th e tw o integral s
rt+A
(24.5) I(t) = j f(u)du,
pt+A
J = du
(24.6) ^ J \fM\
where A is a fixed positive number . Clearl y \I(t)\ ^ J{t). I f
(24.7) |/(t) | < J{t),
547
548 24. T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N
then f(u) mus t chang e sig n i n the interval (t,t + A), henc e f(u) mus t hav e a zero
in (t,t + A) , s o has C (| + ^ u) becaus e g(s) doe s no t vanish anywhere . Therefor e
our proble m reduce s t o showing tha t (24.7 ) occur s quit e often . T o this en d we ar e
going to prove a lower boun d fo r I(t) an d an upper boun d fo r J(t) o n average ove r
the segmen t [T,2T] .
LEMMA 24.1 . Let A ^ 1 . There exists a function K(t), which also depends on
A, such that
(24.8) J(t) ^A-K(t),
2
(24.9) / \K(t)\ dt<^T,
ifT^A2, where the implied constant is absolute.
LEMMA 24.2 . Let A ^ 1 . We have
2
(24.10) / \I(t)\ dt<^AT
ifT^A6, where the implied constant is absolute.
By Lemmas 24. 1 and 24.2 the estimate of Hardy-Littlewood ca n be deduced as
follows. Le t T b e the subset o f [T,2T] wher e \I(t)\ = J(£), thu s
/ \I(t)\dt= f J(t)dt.
IT JT
We get by Cauchy's inequalit y an d by Lemma 24. 2
/ \I(t)\dt^ f |J(t)|d*<A*T .
On th e other hand , w e get by Lemma 24. 1 and Cauchy's inequalit y
/ J(t)dt > A\T\ - / K(t)dt = A\T\ + 0 ( | T | * T * ) .
Combining th e abov e bound s w e deduce tha t th e measur e o f T satisfie s \T\ <C
A~2 T wher e th e implied constan t i s absolute. Fo r A sufficientl y larg e thi s give s
\T\ ^ \T. I n other words , the set S = [T, 2T)\Ton whic h (24.7 ) hold s has measur e
\S\ ^ \T. Clearl y S contain s a sequence {£i,.. . , tn} o f A-spaced point s o f length
R > A _1 |<S| ^ T/2A . Fo r each t r, 1 < r < R, ther e i s a sig n chang e o f f(u) i n
tr < u < tr + A, hence ther e i s a critical zer o p r = \ + i^ r wit h t r < 7 r < t r + A.
This prove s tha t N 0(T) > T / A which i s (24.2).
It remain s t o prove th e two lemmas .
P R O O F O F LEMM A 24.1 . W e hav e
pA PA
J(t) = / |CC i + it + iu)\du ^ / C ( | + ^ + iu)du
JO ' J O
i rA i
^A - / (C( | + it + iw) - l)d u = A - #(*) ,
'Jo '
24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N 549
say. T o estimate K(t) w e us e th e approximatio n
(24.11) <(*) = J2 n-' + 0(T-*]
l^n^T
which i s valid fo r s = \ + it wit h T < t < 3T (se e (8.3)) . Henc e
iA
1 - n~
K(t) = \ J2 *-*" " logn + 0(AT"2) ,
Kn^T
and b y Theore m 9. 1 w e conclude tha t
r2T
2 2 2
/ |lf(*)| d t < T J2 n-\logn)- + A <z:T.
Kn^T
D
P R O O F O F LEMMA 24.2 . Usin g th e convexit y boun d ((s) < C \s\* w e arrang e
the integra l o f (24.1 0 ) a s follow s
r2T r 2T,r A
2
/ = / \I(t)\ dt= / / f(t + u)du\
JT JT UO '
pA pA p2T
= f(t + u 1 )f(t + u 2)dtdu
1 du2
Jo Jo JT
rA pA p2T
= f(t)f(t + u 12-u )dtdu
1 du2^0(A3T2)
Jo Jo JT
A n2T
/ 3
( A - M ) / f(t)f(t + u)dtdu + 0(A Ti).
-A JT
Now w e approximat e f(t)f(t + u) b y a Dirichle t polynomial . Firs t b y Stirling' s
formula
T(a + it) = (27r) i (it)*"* ( t / ef e - 5 ' ( l + O ^ " 1 ) )
(-°m>
for < J > 0 , t > 0 we find tha t
g{\ + tf)£(± + i t + m ) /2TT \"
|fl(| + *t)s(5+ ** + *«)! V *
Then b y (24.1 1 ) an d th e convexit y boun d £(s ) < C |s|* w e ge t
i ,m^n. /27rm 2 \ 2 2
/(«)/(* + «) = E E ( ™ ) - * ( 7 m +0(A T-i).
l<m,n<T
Hence
4
/ = J2Y1 c(m,n)(mn)-i+0(A T^)
where
/my* (2irrri 2\ T
dtdu
-!/;0"«(T>^) dt
550 24 . T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N
with \(x) = x ~ 2 ( s i n : r ) 2 . I f ra ^ n , w e integrat e b y p a r t s gettin g
m( sin | A l o g ( 7 r m 2 / T ) ^ ^ f sin \ A l o g ( 2 7 r m 2 / T ) \ '
c(ra, n) lo g — <
log(7rm /T)2
+ log(27rm 2 /T) /
/•2T
/sin|Alog(27rm2/t)
+ \ log(27rm 2
/t)
27rm2 -2
log 7rm
< mi in
n ^I LA
+ log
If m = n , i t i s easie r t o integrat e ove r t firs t an d the n ove r u gettin g
2 - 2 ^ fnm 2\ 2
c(m,m) = 2 T / (A-H) -. du
J-L_A 2 — zu \ 1
27rra2'-1
< A T mi n ^ 1 , log 7rm
T + log
Using thes e estimate s on e derive s t h a t
^ ^ c(m,n)(mn) * < A T
which complete s th e proo f o f (24.1 0) .
•
2 4 . 2 . A p o s i t i v e p r o p o r t i o n o f critica l z e r o s .
In 1 942 , A . Selber g modifie d th e argument s o f Hard y an d Littlewoo d showin g
t h a t a positiv e proportio n o f zero s o f £(s ) lie s o n th e lin e R e s = | . H e di d no t giv e
the percentag e number . A ver y stron g resul t i s du e t o B . Conre y [Co2] , namel y
that
(24.12) N0(T) > |iV(T )
for sufficientl y larg e T. Conrey' s approac h i s base d o n Levinson' s m e t h o d (se e
[Lev]) whic h i s quit e differen t fro m th e on e use d b y Selberg .
Selberg's refinemen t o f t h e Hardy-Littlewoo d argument s appear s i n mollifica -
tion o f £(s ) b y a Dirichle t polynomial , it s rol e bein g t o diminis h t h e contributio n
of larg e value s o f ((s). Thi s ide a wa s first exercise d b y Boh r an d L a n d a u [BL] ,
however, les s effectivel y t h a n i n t h e hand s o f A . Selber g [S8] . Selber g consider s
(24.13) m \ga+iu)\
C(h+iu)\<p(l+iu)
where (p(s) i s a Dirichle t polynomia l o f typ e
S
(24.14) ^)=£^V
and h(x) i s a real , continuou s functio n o n [0,1 ] suc h t h a t
(24.15) h(x) = l + 0(x )
(24.16) h(x) < 1 -x
24. T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N 551
while th e coefficient s 7 ^ hav e arithmetica l natur e (the y ar e als o real an d bounded) .
Clearly f(u) i s rea l an d even . Th e reaso n tha t \ip(\ + iu)\ 2 i s applie d rathe r tha n
<p(| + iu) i s t o ensur e tha t an y sig n chang e o f f(u) a t u = 7 come s fro m th e sig n
change o f £ ( | +iu) becaus e \<p(\ +iu)\ 2 ^ 0 . Th e mollifie r \ip(\ +iu)\ 2 give s som e
extra zero s t o /(it) , bu t the y ar e no t counte d b y th e sig n changes !
A reasonabl e gues s o f th e coefficient s 7 ^ come s b y tryin g ip(s) t o approximat e
£(s)~2. Indee d Selber g choose s 7 ^ t o b e exactl y th e coefficient s i n th e Dirichle t
series representatio n o f th e Eule r produc t
(24.17) cw-^nC 1 -£)*=£-***"'•
Vd
We hav e fo r s > 1 ,
V ~ p*J ^ V ~ ~ ps) ^ \ ~ p*J '
whence |7^ | ^ |7^ | ^ 1 , where 7 ^ ar e th e coefficient s o f
-
(24.18) cw^n^-ir^E™* '-
Pd
As i n th e Hardy-Littlewoo d proo f w e shall compar e th e integral s
pt+A
(24.19) I(t ) - J f(u)du,
pt+A
(24.20) J ( t ) = / \f{u)\du
in th e rang e 0 ^ t ^ T . Du e t o th e mollification , w e shal l b e abl e t o wor k wit h
smaller A ,
1
(24.21) A x (logT)"
where th e implie d constan t i s absolute . W e shal l prov e thre e estimate s
LEMMA 24.3 . Let ( l o g T ) - 1 < A ^ 1 and D = T e with 0 < 6 < ^ . Then we
have
(24.22) / |/(t)|<t t
Jo
T
(24.23) / [T|\f(t)\
/(t)f<ft<T ,
Jo
/o
I
2
(24.24) f |/(t)| d t < A T ( l o g T )1- ,
where the implied constant depends only on 0.
Prom thes e estimate s w e derive Selberg' s boun d (24.3 ) a s follows. Le t £ b e th e
subset o f [0,T ] i n whic h \I(t)\ < J{t). W e obtai n
A= J J(t)dt> f(J(t)-\I(t)\)dt= [ (J(t)-\I(t)\)dt = B-C
J£ J£ JO
552 24 . TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N
say. B y th e Cauchy-Schwar z inequalit y an d b y (24.23 ) w e deriv e
A2^\£\f J 2
(t)dt=\£\ f (f \f(t + v)\dvYdt
pT pA /*T+ A
< A\£\ / / \f(t + v)\ 2dvdt ^ 2A 2 |£| / \f(t)\ 2
dt < A 2
\£\T.
Jo Jo Jo
On th e othe r hand , w e have b y (24.22 )
pT pt+A pT
B= / \f(u)\dudt > A / \f(u)\du > AT ,
Jo Jt JA
and b y (24.24 )
\I(t)\ 2dt < A T ^ l o g T ) " 1 .
C2 ^T f
Jo
Combining the above estimates we obtain A T < C A2T(logT)~2+A|£ 12T2, whenc e
\£\ ^ > T , provide d A l o g T i s sufficientl y large . Th e se t £ contain s a sequenc e o f
points {ti, ... , tR\ whic h ar e A-space d o f lengt h R ^ A _ 1 | £ | > T l o g T . Fo r eac h
tr ther e i s a sig n chang e o f f{u) i n (£ r ,£ r 4 - A), s o ther e i s a zer o o f £ ( | + iu) i n
this interva l becaus e \<p{\ 4 - iu)\2 i s non-negative. Henc e w e conclud e (24.3) .
It remain s t o prov e th e estimate s fo r th e thre e integrals . Th e firs t on e (24.22 )
is easy. W e hav e
2
[ \f(t)\dt>\[ {(±+it)tp (±+it)dt
Jo ] IT/2
JT/2
By th e approximatio n (24.1 1 ) an d th e trivia l boun d <p(s) < C D* fo r R e (5) = ^ w e
get
C{s)^2{s)=YJ^n-s^O{DT-^)
where a\ = 1 and \a n\ ^ r 3(n) fo r n ^ N = £> 2T. Assumin g D < T i ( l o g T ) - 1 w e
obtain
I/ r
/ 22
((s)<p2(s)dt>^-2 £
2 ^
|a
Vlog
n |n-i(logn1
)- +o(^)>|
:
if T i s sufficiently large , whic h prove s (24.22) .
The othe r tw o estimate s (24.23 ) an d (24.24 ) ar e muc h harde r t o establis h be -
cause we have to appeal to the nature o f the mollifie r <p 2(s). We split th e argument s
into severa l group s an d evaluat e a fe w integral s o f th e zet a functio n whic h ar e o f
independent interest .
T H E O R E M 24.4 . Let si = \ + ivi,s2 = \+iv 2 with \v\\ < 1 , \v2\ ^ 1 . Let a, b
be positive integers with (a , b) = 1 . We have
(24.25) J C(s 1 +it)as2+it)a^+H^-itdt = TP v(^\+0((ab)iTl(logT)
6
)^
where v = v 2 — v 1and P V(X) is defined by
(24.26) P t,(X)=C(l-w) + C( l + w ) T - — •
1 + IV
24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N 55 3
The implied constant in (24-25) is absolute.
Notice tha t b y th e Lauren t expansio n o f £( 1 + iv) w e ge t
(24.27) P V(X) = : + 27 - 1 + 0(\v\)
iv
which i s quit e goo d i f \v | i s small. I n particular , w e hav e
(24.28) P 0{X) = logX + 2-y-l.
COROLLARY 24.5 . If (a , 6) = 1 , we have
(24.29) [ U\ + it)?QUdt = -^ (lo g JL + 2 7 - l ) + O iabTl (IogT)« )
w/iere £/* e implied constant is absolute.
We begin the proof of Theorem 24. 4 by evaluating the following partia l integral s
2 fll it
(24.30) S = / C(si + **)C(* 2 + i * ) a " " 6"Sa+it*
for T < T i < T 2 ^ 2T . Changin g th e variabl e t -> • t - v 1 w e ge t
2
5 = {ab)~h iv f C(| + it)C( | + i v + i t ) ( - ) * * * + £ ,
where E 1 is the non-overlappin g par t o f th e integral , an d i t satisfie s
(24.31) £ < ( a & ) - * r i
by applyin g th e convexit y boun d £(s ) < C |s|* o n th e lin e R e s = ^ .
Next w e approximat e £(s ) b y it s partia l su m
(24.32) Cx(s )= 5 > " ' .
Recall tha t fo r R e 5 = a ^ | an d |s | ^ nX w e hav e
(24.33 ) C W = Cx(« ) - ~ + 0(X-") .
1— 5
In particular , fo r 5 with R e 5 = \ an d T < C |s| < TTT we hav e
(24.34) C(« ) = Cr(a ) + 0 ( r - i ) .
Hence
5 = (o6)"H i t ; / 2
C T (| + « ) C T ( | ~iv- H) (ly'dt + E + E^
where E^ account s th e contributio n o f th e erro r ter m i n (24.34) , thu s i t satisfie s
(24.35) E o o < ( a 6 ) - i ( T l o g r ) i
by virtu e o f th e followin g estimat e (obtaine d fro m (7.52 ) b y Cauchy' s inequality )
3T
|C(!+it)|d*<T(logT)i.
Jo
55 4 24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N
Inserting th e Dirichle t polynomial s (24.32 ) w e ge t
(24.36) S= f 2 ^^(am)-i'i\bn)-i+iv^itdt + E^E 00.
First w e pul l ou t th e term s o n th e diagona l am = bn. Sinc e a, b are co-prim e
it follow s tha t vn — bl an d n — at wit h t ^ L = Tc~ l wher e c = max(a , b). Henc e
these term s contribut e
1 +iv
s0 = (T 12-T )(ab)- {L(i-iv).
Introducing th e approximatio n (24.33 ) w e arriv e a t
(24.37) S 0 = (T 2 - H X a & r W c a - iv) + —) + O(l) .
V iv J
Now we consider th e contributio n t o the integra l (24.36 ) o f the term s am ^ 6n,
say <S* . B y integratio n w e ge t
(24.38) 5 * = 5 ( T 2)-5(Ti),
where
(24.39) S(t) = i ]T ^ ( a m ) - i - ^ ( 6 n ) " 4 + " + i t (lo g °^) ~ \
am^bn
We shal l sho w tha t th e mai n contributio n t o S(t) come s fro m th e term s nea r th e
diagonal, i.e. , fro m m,n with 1 — £ < ^ < 1 + £ fo r som e smal l e to b e chose n
later. Pu t
1
(24.40) S e(t)=i £ £ {amr ^\bnr^+it{\og1
a
^y .
0<\am—bn\<£bn
For th e remainin g term s w e hav e | log ^ | > |, and sinc e lo g x i s monotonic , w e
can appl y partia l summatio n t o estimat e th e contributio n o f thes e term s b y
e-\ab)-X2 (\ 2
ogT) \U\ + it)Cy(k ~ iv
' **) l
for som e x,y wit h 1 < x,y < T. No w b y Weyl' s subconvexit y boun d Cx(s) < ^
|s|i(log3|^|) 2 , whic h i s valid fo r s with R e s = \ an d x < 4|s|2 (se e (8.22)) , i t
follows tha t
1 1
(24.41) S(t)=S e(t) + 0(e- (ab)- *T*(\ogTf).
For th e term s i n S £(t) w e write am — bn + h wit h O < \h\ < ebn, an d us e th e
approximations
(£)"-(' + £)""--"""^(sk)
getting
1
Se(t)= ^ ] T ih" (bn)lve-ith/bn + E1
am—bn=h
0<\h\<ebn
24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N 55 5
where E\ account s fo r th e contributio n o f th e abov e erro r terms . W e hav e
\am—bn\<ebn
< ( 1 + st) ^(enba- 1 + l ) ( 6 n ) - x < C (1 + e r ) ( e a_ 1r + b'1 l o g T ) .
n
Clearly w e ca n interchang e a and b (see th e firs t inequality) , s o combinin g bot h
estimates w e deriv e a symmetric boun d
2
(24.42) E x <C (ab)-^(sT)
provided eT ^ l o g T , whic h conditio n w e hencefort h assume .
Now w e writ e S e(t) a s
ih
S£(t)= J2 'X J2 {bn) iv
e-ith'bn
1 ^E
+
H~<h<H ' Ni<n<N 2
h^O n= — hb(mod a)
where
H+ = min{ebT, e(l + e^aT}
1
H~ = - min{£6T , s(l-e)~ aT}
1
Ni = |/i|/e6 , 7V 2 = min{T , (a T - / i ^ " }.
For t h e inne r su m w e appl y t h e followin g formul a
£e (/H) = -/ e{f(x))dx + 0{\)
Nl<n<N2 a j N
i
n=a(mod a)
which hold s tru e fo r an y smoot h functio n / suc h t h a t \f f\ ^ (2a)~ l an d / " ^=- 0 in
the interva l [iVi , JV2] with N\ < N2 (thi s follow s fro m Propositio n 8.7) . Assumin g
2
(24.43) 2e abT ^ 1
we obtai n
1 fN 2
iv
J 3 {bn) e'ith'bn = - (bx) iv
e-ith'hxdx + O(l).
n= — hb(mod a)
Here w e replac e N 2 b y T m i n { l , a / 6 } makin g a n erro r 0(\h\/ab). T h e n w e chang e
the variabl e x int o xt/2-Kb gettin g
1 + i v £t 2n h
1/ + \ r / \ \ /IAi l
Ni<_n<N2 x 7
^t/znai
n— — hb(mod a)
where d = min{a, 6} . Insertin g thi s int o S £(t) we ge t
2 l
5 e ( t ) = Xr (-pj r x- "^( J ] (2irih)- e{-hx))dx + E 1 +E2
a
\ KJ Jt/2-KdT 0<\h\<et/2nx
H~<h<H+
556 24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N
where
(24.44) E 2 < Yl ( ^ + I) < (ofc)"^ r + loS T -
0</i<2edT
The inne r su m ove r h in the abov e integral i s a partial su m in the Fourie r expansio n
for —i[>{x) = [x] — x + | . Applyin g th e approximatio n
1
-^(x) = ^ (27ri/i)- e(/ix) + 0 ( ( 1 + ||x|| # r 1 )
-//i<^i</f2
MO
where H = min{ifi , i/2 } (compar e thi s wit h (4.1 8) ) w e ge t
S
f / t \ livv /.0
r c0 2 iv
e(t) = - — / x- ~ il>(x)dx + E 1 +E2 + E3
tt& V27T / J t/2ndT
J hi
where
2
£ 3 <-r/( l + e * ^) x dx= — [\\x +-r )—
ab K J ab K J
Jt/2irdT % £ Jt/27rdT ^ X
eab
We simplif y thi s boun d t o
(24.45) £ 3 < e " 1 ^ ) " ^ lQ gT-
Finally w e use th e formul a
which hold s fo r 0 < y < 1 and R e w > —1 , gettin g
m . I f f )"flL± M - < <5£ + iffl!^ + Bl + * + «
ao \2TTJ I +IV ab iv ab 1 +z v
Now addin g u p th e relevan t formula s w e obtai n
T2P
S = {ab) — l-\-iv
\^)-TlP\*tb) +R
where P V{X) i s defined b y (24.26 ) an d R i s the total error term. B y (24.31 ) , (24.35) ,
(24.37), (24.42)-(24.45 ) w e ge t
J 2 « (afc)-*(T * + e - 1 : H +£ 2 T 2 )(logT) 6 + (logT) 2
provided eT ^ log T an d 2e 2abT < 1 . W e choos e e = T~ 3 gettin g
6
JR«(a6)-5Tf(logT)
provided 2a 6 ^ TB. W e als o hav e th e trivia l boun d
2
<S<(a&)-5 / | C ( i + i < ) | cte«(a6)-iTlogT.
./o
24. T H E C R I T I C AL ZERO S O F T HE R I E M A NN ZET A F U N C T I O N 55 7
Combining bot h result s w e get rid of the condition 2ab < T Q claimin g a weake r
result
S = (ab) — l-\-iv WAW„(^)
' \27rabJ
+ 0((ab) 2 Tl (log T)6 ).
This yield s (24.25 ) b y addin g th e results for dyadic point s Tj.
R E M A R K S . Th e erro r term s i n (24.25) an d (24.29) coul d b e improved substan -
tially. Fo r example, i f a = b = 1 , our Corollary 24. 5 yield s
2
(24.47) J \C(l +lt)\ dt = T(log^-+2j-l^E(T)
with E(T) < C T§ log6 T, whil e i t is known tha t E(T) < T&+ £ . I t is conjecture d
that (24.47 ) hold s wit h E(T) < Ti+ £ (se e [Iv]).
Theorem 24. 4 can be easily generalize d t o integrals o f type
GV(T) = J C(l+ tf)C(| -it- iv) (l) ltg{t)dt
where g(t) is a smooth functio n o n [0,T]. Fo r g(t) = 1 we get by (24.25)
if (a , 6) = 1 , \v\ ^ 1 and T ^ 2 , the implied constan t bein g absolute . Henc e we
derive by partial integratio n tha t i n general
biv f T / / t \ iv\ «
Gv(T) = -j=j « , ( i ) ( c ( i - i „ ) + < ( i + it,)( —) )d * + 0(a6GT5 (log T) 6 )
where
c-tofOI + f 1 /(01 ^ .
For #(£ ) = (2TT/£)™/ 2 thi s yield s
COROLLARY 24.6 . Let (a, 6) = 1 , |v| ^ 1 and T ^ 2 . TAer a we Aave
/* T / a \ i t
/2TT\ Q
(24.48) I C (|+ii)C(i_rt-iv)(-) ( _ ) d t
W f^+uO / T w + C^) / Tx* | + 0 ( a 6 T l ( )7 )
v
^fab \ 2 + iv \27ra 2J 2 - iv \27rb 2
JJ \ B JJ
where the implied constant is absolute.
Now w e procee d t o the proof o f (24.23). Le t a^ be the coefficients of
2
^
558 24 . T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N
where /3d = 7d/i(logd/logD) , thu s OLI < C r(£). W e hav e b y Corollar y 24. 5
fT\f(t)\2dt= f T
\t(l+lt)y\\+it)\Ht
Jo Jo
2
= TT*aa b(ab)-^ [ \a±+it)\ (a/bfdt
J
ab °
8
= AT(log ^- + 2 7 - l ) - BT + 0(D 6T9 (logT) )
where
(24.49) A =
J2J2J2 "adotbdiabd)- 1 ,
d (a,b)=l
1
(24.50) B = ^2J2^2a adabd(abd)- \ogak
d (a,6 ) =l
Therefore i t suffice s t o prov e tha t
(24.51) A < ( l o g I } ) - \
(24.52) B < 1 .
First w e treat A i n considerable detai l an d the n w e modify th e argument s t o reduc e
the cas e o f B t o tha t o f A.
We begi n b y diagonalizin g th e quadrati c for m A a s follow s
d6 a
= E d ~l (E / w1) ( E a ^a~1 )2 = E *(<*M2,
d S\d a d
where
a
Ad= Yl «1
a_ =
X I Pd 1 Pd2(did2)~1
a=0(mod d ) a=0(mo d d)
2
for d ^ D . Se t d x = 5 xk an d 5 2 = < M with 5i5 2 | d°°, d | 5i<J 2, (JM, d) = 1 getting
M2ld°°
d|M2
where
Ad(S)= J2 ^kk~\
(fc,d)=i
From no w o n w e tak e h{x) = 1 — x , s o tha t
These ar e the origina l coefficient s o f Selberg, howeve r on e could choos e h{x) almos t
arbitrarily subjec t t o th e condition s (24.1 5 ) an d (24.1 6) . I n thi s particula r cas e we
24. TH E CRITICA L ZERO S O F TH E RIEMAN N ZET A FUNCTIO N 559
have a simpl e Melli n integra l representatio n
+
(24.54) log x = ^- [ xss 2 ds.
(e)
Now
Ad(S)logD= £ 76kk -Hog+^-=l6^-. [ Z(s + l)(D/5) s
s-2ds
m
{k,d)=i ^
where Z(s) i s th e correspondin g zet a functio n
z{s)= E i kk-2 = J{{i-p-^=Us)k{s)-^
(fe,d) = l p\d
with
CdW = II(1-P"T 1-
P\d
We hav e ( d{s + 1 ) < Cd(l ) = d/<p(d) , CO 5 + 1 ) > H " 1
an d (D/(5) s < 1 on th e lin e
R e s = £ with e — 1/ log D. W e als o hav e
2
/ \s\-^ ds^e-^2 = (logD)-2.
J(e)
Hence w e deriv e A d(S) < C |7a|(d/<p(d)logD)3. Thi s yield s A d < \(d)d/ip(d) log D,
where
A
(^) = X ! Z l i 7 ^ 7 ^ l ( ^ 1 ^ 2 ) ~ 1 < X ^ r_1
S Ttfi^ a = —T^v -
M 2 |d°° Hd° ° M 2=r ^ '
d\5\52 d\r
because \^s\ < 7 5 an d 7 • 7 = 1 . Therefor e w e hav e
2
(24.55) A d4Zd(<p(d))- (log D)- 1 .
Finally insertin g thi s t o th e diagona l for m o f A w e argu e a s follows :
1
A(\ogDf« £ dV3(d) = E ^ I K 2 2
-1 )"3
d^D d^D p| d^
1
« E^Ift +*"*)< E ^ E - ^
d ^ D 2 p| d d^D 2
™| d
_1
^c(|) E ^ «iog^
d^D2
which complete s th e proo f o f (24.51 ) .
For th e proo f o f (24.52 ) w e write logaf r = logafrd 2 — 21 og d an d w e spli t B ac -
cordingly, sa y B = B' — 2B". B y the argument s whic h le d u s to th e diagonahzatio n
of A w e deriv e simila r expression s fo r B' an d B". Thu s w e hav e
B
' = Yl ^( d ) Yl 5 Z ^aOLbiflb)- 1
log ab = 2 J2 P( d A
) dBd
d a,6=0(mo dd)d
560 24 . T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N
where
a = 0 ( m o d d) q
by th e formula L — A • 1 . Applyin g (24.55 ) w e derive
Bd « ( Y.(d,q)A(q)q-1)d^-2(d)(\ogD)-1 « d^ 2
(d),
q^D*
This boun d i s larger tha n tha t fo r Ad by factor log/} , henc e the former argument s
yield B' < C 1 as required fo r (24.52) . Moreover , w e have
B =
" £ S S ^d^(a^)-1logd - ^^(d)4|,
d (a,6 ) =l d
where
<5|d S\d Sq\d
= ^A(q)qv(d/q) < 2 V (rf)^A(<;) = 2^(d)logd .
g|d qr| d
Hence B" ^ 4Alog£ > <C l by (24.51 ) . Thi s complete s th e proof o f (24.52).
Finally w e sketch a proof o f (24.24) . W e have
rT, /> A
2
1= \I(t)\ dt= / / f(t + u)dv dt
Jo Jo Uo
2
= / ( A - H ) / f(t)f(t + u)dtdu + 0(D Ti)
J - A JG
and b y the argument (Stirling' s formula ) whic h wa s applied i n the proof o f Lemma
24.2 w e ge t
/(()/(< + «) = ca + i()«l -"- * V ( ! + !<>*>2a - « - <»)(f)* + ° ( f f )
=ss> t .)cu-«-<.)(;r(^ + <)'
a 6
Integrating ove r t b y Corollary 24. 6 we ge t
1
/ f(t)f(t + iz)d * = 2 T ^ ^ J2 aadotbdiabd)-
^° d (o,b)= l
{^(VS"+^(Wfn+o<^s<^-
Now integratin g ove r u we ge t
(24.56) 7 = 2rVJ^^aodaM(a6d)-{$(^#) + * ( ^ )}
d (a,b)= l
+ 0(D 4 Tt (log T)7)
24. T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N 561
where
rA
v C(i + ^ )
A2 + iu
/ l I I - H1
7/ . I \
iu
= / (A-u)(X -X-iu)p-+ [ (A-\u\)X iu C
( ^u) - - M dl£
J0 v A ;
2 m 7_A V ' " V 2 + iu 2m /
Putting
(24.57) w(z) = /° ° f ^ ) d u
we writ e $ ( X ) = § A - Aw(AlogX ) + 0 ( A / l o g X ). Insertin g thi s fo r X
(T/2ir)i(abd)-1 int o (24.56 ) w e arriv e a t
I = -2ATA' + 0(ATA" + ATA'"(logX)' 1 + £> 4 Ti(logT) 7 )
where
A
' = E E E ^ « M W ^ ( A lo§ ^^).
d (a,b ) =l
d (a,b ) =l
^" = EEEia^awi(aM)_1-
d (a,6 ) =l
The su m A" ca n b e estimate d i n th e sam e wa y a s A,
A" ^ d Y [ ( l + ^)Al«(logD)-^d^l[(l + ^y «(lo gD)-K
d p\d d p\d
The su m A'" look s lik e A except tha t it s term s ar e take n wit h absolut e value .
Consequently, i n th e analyti c argumen t applie d fo r A'" th e Rieman n zet a functio n
occurs a s ({s) 1 /2 rathe r tha n C( 5 ) - 1 / ' 2 resultin g i n a loss o f facto r log D i n th e
bound fo r A!" b y compariso n t o A, i.e. , on e get s A'" < C 1.
For th e su m A' on e derive s th e sam e boun d a s fo r A , namel y A' < C ( l o g D ) - 1 ,
since th e functio n uj(A\ogX) i s bounded an d i t ha s nic e derivatives . Fo r example ,
one ca n us e th e Melli n transfor m o f uj(AlogX) i n X which bring s a sligh t shif t i n
the argumen t o f the involve d Dirichle t series , an d thi s make s n o essential chang e i n
the res t o f th e proof . Anothe r approac h woul d b e t o reduc e A' t o A b y arithmeti c
arguments a s w e reduced B' t o B. T o thi s en d writ e th e powe r serie s
" (-!)*+ ! (2s)2*+ l
^ ( 2 f c + 2) ! 2f c + l '
562 24 . T H E C R I T I C A L ZERO S O F T H E R I E M A N N ZET A F U N C T I O N
For z = A log X = I A log(T/27r ) — A log(a6d) thi s reduce s th e proble m t o estimat -
ing sum s o f typ e
A(k)
= ^SS^adaftdCaMJ-HlogaM)* .
d (a,6 ) =l
Writing
(logabd)k = Y, A*(« )
q\abd
where A ^ i s the vo n Mangold t functio n o f degree k w e arrange A( k' i n term s o f A^
and us e th e boun d (24.55) .
Now collectin g th e abov e estimate s fo r A'\ A", A'" w e ge t
/ < AT(lo g £>)_1 + L> 4 T§(logT) 7 .
Taking D = T^o w e complete th e proo f o f (24.24) .
R E M A R K S . Selberg' s mollification metho d work s well for countin g zero s on an d
near th e critica l line . H e showe d tha t
(24.58) N{\ + 4(5 , T) < T 1 '6 log T
uniformly fo r 5^0. Henc e i t follow s tha t almos t al l zero s o f ((s) li e i n th e regio n
where rj(t) i s an y positiv e functio n whic h increase s t o infinity .
http://dx.doi.org/10.1090/coll/053/26
CHAPTER 2 5
THE SPACIN G O F ZERO S O F
THE RIEMAN N ZETA-FUNCTIO N
25.1. Introduction .
Throughout thi s chapte r w e assume th e validit y o f the Rieman n Hypothesi s fo r
C(s). Thi s allow s u s t o pu t th e critica l zero s p — | + r y i n a n increasin g sequenc e
according t o thei r ordinate s
(25.1) . . . ^ 7- 2 ^ 7- 1 < 0 < 7 1 < 7 2 < • • •
where 7_ n = — 7n . Recal l tha t th e countin g functio n N(T) = \{n | 0 < 7 n ^ T}\
satisfies
(25.2) N(T) = £ lo g ^ + 0(lo g T)
for T ^ 2 (se e Theore m 5.24) . Henc e i t follow s tha t
(25.3) 7 n ~ 27rn(logn) _1 , a s n - > cxo,
so th e number s
(25.4) C n= ^7nlog|7n |
have uni t mea n spacing , i.e. , ( n ~ n a s |n |— > oo . Actuall y (25.2 ) i s muc h mor e
precise tha n (25.4) , stil l i t i s no t satisfactor y fo r understandin g th e subtl e aspect s
of th e distributio n o f 7 n 's, i n particular , i t tell s ver y littl e abou t gap s betwee n
consecutive zeros . Th e Rieman n Hypothesi s alon e doe s no t she d enoug h ligh t o n
the proble m (i t doe s improv e th e erro r ter m i n (25.2 ) onl y b y facto r lo g log T) .
Our firs t goa l woul d b e t o understan d th e statisti c o f the sequenc e o f number s
£ n . On e expect s tha t th e consecutiv e spacing s
(25.5) 6 n = Cn+ i - C n
are no t purel y rando m (i.e . ar e no t poissonian) , a s expecte d fo r th e normalize d
distances betwee n primes , o r fo r th e normalize d distance s betwee n eigenvalue s o f
the Laplac e operato r fo r th e modula r grou p (se e th e final remark s o f Chapte r 1 0
and [Sa4 ] respectively) , bu t rathe r the y follo w th e Gaussia n Unitar y Ensembl e
distribution fro m rando m matri x theory . W e say that a sequence 0 < C i ^ C 2 ^ • • •
follows GU E i f
1 f°°
(25.6) - ] T f(6 n) ~ / f(s)P(s)ds
563
564 25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N
for an y nic e functio n / : M +— > C (sa y o f Schwar z class) , wher e P(s) i s th e limi t
density distributio n o f consecutive spacin g o f eigenvalues o f random unitar y matri -
ces (th e limit wit h respec t t o the rank) . Explicitl y thi s distributio n wa s determine d
by Gaudi n an d Meht a [GM] , an d i s given b y P(s) = det( 7 — Q s), wher e Q s i s th e
integral operato r o n L 2([—1 ,1 ]) whos e kerne l i s
sin^(z-y)
(25.7) Qs(x,y)
TT(X - y)
Specifically th e densit y ca n b e expresse d b y th e followin g infinit e produc t
oo
(25.8) PO O = ]7(1 -A,-(a) )
0
where 1 ^ Xo(s) ^ Ai(s ) ^ .. . ar e th e eigenvalue s o f th e integra l operator .
In genera l th e GU E la w a s state d abov e i s very har d t o establish , becaus e har -
monic analysis cannot contro l consecutive points of a sequence, bu t i t can locate th e
points i n smal l domains . Therefor e mor e tractabl e problem s ar e abou t correlatio n
of sets o f points i n question . W e begin b y presentin g th e origina l wor k o f H. Mont -
gomery [Mo4 ] o n th e pai r correlatio n o f zero s o f th e Rieman n zet a function , an d
then w e stat e mor e genera l result s an d conjecture s fo r th e n- level correlatio n (se e
definitions i n Sectio n 25.3) .
25.2. Th e pai r correlatio n o f zeros .
The mai n too l fo r dealin g wit h th e correlatio n problem s i s the explici t formul a
of Rieman n (o r it s variations ) whic h connect s a su m ove r th e zero s o f £(s ) wit h a
sum ove r prim e numbers . Genera l version s ar e prove d i n Chapte r 5 , bu t her e w e
use a formul a i n somewha t customize d form . Le t g G C£°(R) an d le t
iur
(25.9) h(r) = / g(u)e du.
J — oo
Put TR(S) = 7r~ s / 2 r(«s/2), the local factor a t th e infinite plac e for th e Euler produc t
of C(s) . The n
EM7)-»(i)+*(-5) + s£*">{S( i + i r ) + S ( 1- i r ) }*
(25.10)
- £ ^ ( < ?n( l o g n ) + s ( - l o g n ) ) .
Vv
This i s th e cas e o f Theore m 5.1 2 fo r th e Rieman n zet a function .
The strateg y goe s a s follows . Firs t w e localiz e th e 7-terms , sa y 7 nea r t, b y
choosing a suitabl e tes t functio n h(r) = /i(r , £), wher e t i s a paramete r a t ou r
disposal (h(r) ha s t o b e a n entir e function) . O f course, b y th e uncertaint y principl e
of harmoni c analysis , suc h a localizatio n canno t b e exact ; a t bes t on e ca n se e r
near t a t th e distanc e c/ lo g t, wher e c is a positive constant . Computin g a properl y
weighted Z/2-nor m o f suc h 7-su m wit h respec t t o th e paramete r t involve d i n th e
test functio n ^(7 , t) on e pick s u p term s whic h ar e clos e t o th e diagonal , i.e. , on e
gets a su m ove r pair s o f zero s 7,7 ' wit h 7 — 7 ' bein g quit e small . Then , b y a n
approximation argument , o r b y Fourie r inversio n i n anothe r paramete r whic h i s
25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N 565
built i n th e tes t function , on e create s a su m whos e term s ar e th e desire d functio n
of th e differenc e 7 — 7'.
The origina l approac h o f Montgomer y wa s somewha t special , an d i t i s stil l
being use d b y man y researchers . Sinc e hi s choic e o f tes t function s i s quit e natura l
we use them first. Montgomer y [Mo4 ] introduce d th e function ( a Fourier transfor m
of th e gap s )
(25.11) F(a,T) = — J- J2J2 Ml-l')T^-^
0< 7 ,7'^T
for an y rea l a an d T ^ 2 , where w(u) i s a suitabl e localizin g function . Specificall y
he take s
(25.12) w(u) = 4 ( 4 + i^2 )\ - l
Note tha t F ( a , T) i s real an d F(a, T ) = F(—a, T ) ^ 0 , becaus e th e Fourie r trans -
form o f w i s positive. Precisely , w(v) = 27re _47r'vi, s o
2 2
(25.13) F(a,r) = - ^ re - M |£ e ^ + Q . o S T ) dv.
/ i o y1 1 _ _
0<7^T
By trivia l estimation , usin g (25.2) , on e derive s
(25.14) F(a,T) < F(0,T ) < logT .
Our goa l i s to giv e a n asymptoti c formul a fo r F ( a , T) a s T— > 0 0 which i s unifor m
in a i n range s a s larg e a s possible .
THEOREM 25. 1 (MONTGOMERY) . For 0 ^ a ^ 1 an d T ^2 we have
(25.15) F ( a , T) = a + T~ 2 a log T + 0 ( a T a ~ 1 + T ~ a log2T a + (logT)- 1 )
where the implied constant is absolute.
REMARKS. Fo r a wit h 2(lo g logT)/ log T ^ a ^ 1 - l/(loglogT ) lo g T w e have
(25.16) T(a , T) - a , a s T - > oc .
The asymptoti c formul a (25.1 5 ) hold s for al l a ^ 0 but i t lose s its meaning i f a ^ 1 ,
because th e erro r ter m 0(aT oc~1 ) exceed s th e leadin g ter m a.
In provin g Theore m 25.1 , Montgomery appeale d t o th e explici t formul a
(25.17) L(x,t) = R(x,t)
where L(x,i) i s a specia l su m ove r th e zero s o f £(s ) localize d nea r t , namel y
ri7
(25.18) L(x,t) = 2Y,j
+ (*-7)2'
and R(x, t) i s the correspondin g su m over primes, includin g th e infinit e plac e terms ;
see Sectio n 5.5 . W e hav e
00
A ( \ it
(25.19) R( X)t) = - Y ^ £ l ( -) minf-,- ) + x~ 1 +lt log(t + 2) + E(x,t)
566 25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N
where th e las t ter m satisfie s
(25.20) £ ( M ) < -+ - ^
xt +V
if x ^ 1 and t > 0 , th e implie d constan t bein g absolute . Thi s particula r equatio n
(25.17) ca n b e deduce d fro m th e formul a
(25.21) £ A(„)„ - = -L W = l - Z1 - + X*
n^x p 1
which holds for x > 1 , x ^ p n an d s ^ 1 , p, —2n, or directly by a contour integratio n
of
s
(25.22) - ^ ( s )= ^A(n)rT
^l
and th e functiona l equatio n
(25.23) - £ ( S ) - £ ( 1 -s) = ^(s) + ^ ( 1- S ) = log(| S | + 3 ) + 0(1 ) .
We squar e th e equatio n (25.1 7 ) an d integrat e (thi s i s a wa y o f pickin g smal l
differences 7 — 7')? gettin g
2 2
(25.24) I \L(x,t)\ dt = [ \R{x,t)\ dt.
Jo Jo
The lef t sid e i s equa l t o
[T \L(x,t)\ 2dt = ^ ^ x ^ -^[ T
(l + (t->y) 2)-1 (l + (t-<y') 1
2
)- dt
Jo 7 y Jo
where the summatio n i s over al l the critica l zero s p — \ +27 , p' = \ Jri^' (wit h 7,7 '
positive an d negative) . Usin g (25.2 ) w e reduce th e abov e expressio n a s follows :
4 1
E E xih
~Y) A + (* - ^) 2)'1 (1 + (* - V) 2)_1 dt + 0(log3 T)
OO
(1 + (t - 7) 2 )" 1 (1 + (t - ify'dt + 0(log 3 T)
/
u^7 ,7'^i -°°
i(7 7 )
= 2T T ^^ x - ' tt;(7-7/) + 0(log 3r).
0<7,7'^ T
Putting x = T a wit h a ^ O w e arriv e a t
a
(25.25) / \L(T ,t)\2dt = F(a,T)T\ogT + 0(\og 3
T)
Jo
where th e implie d constan t i s absolute .
25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N 56 7
Next w e evaluat e th e othe r sid e o f (25.24) . T o thi s en d w e us e th e followin g
form o f Parseval' s formul a fo r Dirichle t serie s (se e [MVl] , o r refin e th e proo f o f
Theorem 9.1 )
u 2
(25.26) / \Va nn- \ dt = Y\{T + 0{n))\a n\ .
Jo I 1 I !
Write (25.20 ) a s R(x,t) = C(t) + D(t) + E(t), respectively . The n
\R(x,t)\2 = \C(t)\ 2 + \D(t)\ 2 + \E(t)\ 2 + 0(\C(t)D(t)\ + \D(t)E(t)\ + \E(t)C(t)\).
Integrating i n 0 < t ^ T w e deriv e b y th e Cauchy-Schwar z inequalit y
(25.27) / \R(x, t)\ 2dt = C 2 + D 2 + E 2 + 0{CD + DE + EC),
Jo
with C 2 = / \C(t)\ 2
dt an d D 2,E2 define d similarly . B y (25.26 ) w e obtai n
Jo
C--E(r + o(»„^nta»(=,H).
By th e Prim e Numbe r Theore m I/J(X) = x - f 0{x/\og2x) thi s is
2
(25.28) C = Tlogx + 0 ( T + xlogx) .
For £> 2 w e ge t simpl y
(25.29) L > 2 = £ - 2 / log 2
(t + 2)d t = x - 2 T l o g 2 T + 0 ( x - 2 T l o g T ) ,
Jo
and fro m th e erro r ter m (25.20 ) w e ge t
2 2
(25.30) E <^x~ T-{-x.
Inserting (25.28) , (25.29) , (25.30 ) int o (25.27 ) w e arriv e a t
2
(25.31) / \R(x, i) \2dt = T log x + x~ 2T log T + B(x,T)
/o
Jo
where
(25.32) B(x,T) <z:T + xlogx + x-- i l
(log2x)T\ogT.
Comparing (25.31 ) fo r x = T a wit h (25.25 ) w e complet e th e proo f o f (25.1 5) .
Now w e deriv e a fe w consequence s o f Theore m 25.1 .
568 25. TH E SPACIN G O F ZERO S O F TH E RIEMAN N ZETA-FUNCTIO N
THEOREM 25.2 . Let f(x) be a Schwartz function on R such that its Fourier
transform
oo
f(x)e(-xy)dx
/ -OO
is of class C1 with s u p p / C (—1 ,1 ) . Then
£$>(7-Y)/((7-y)^)
0<
(25.34) ™'^T
ii
f(a)\a\da +
}
f(0)}— lo
2?r
g T + O(T)
w/iere £/i e implied constant depends only on f.
PROOF. Fo r an y / o f Schwart z clas s w e hav e
(25.35) 5 ^ X T M 7 - 7 ' ) / ( ( 7 - 7 ' ) - ^ - ) = ^ ( l o g T ) / f(a)F(a,T)da.
Here th e integra l equal s
/ ( / > ) + / V ^ ) ) {« + T -2 a l o g T 4 - 0 ( T - - 1+ T - - l o g 2 r a + (logT)- 1 )}da.
JO
The firs t ter m agree s wit h tha t o n th e right-han d sid e of (25.34). Th e secon d ter m
is
2(logT) / (/(0 ) + 0{a))T-2«da = /(0) + O^logT)- 1
Jo
and th e erro r ter m i s triviall y estimate d b y 0 ( 1 / log T). Thi s complete s th e proo f
of (25.34) . •
In particular , choosin g f(x) = ((sin7rax)/7rax)2 w e ge t
COROLLARY 25.3 . IfO < a < 1 is fixed, then as T -^ oo,
0<7,7'^T \ w / /2o /
From thi s Montgomer y deduce d a lower boun d fo r th e numbe r o f simpl e zero s
ofC(s),
(25.37) JVi(T ) = |{ 0 < 7 ^ T ; p = | + 27 i s simple}| .
(recall w e assum e th e Rieman n Hypothesis) .
COROLLARY 25. 4 (MONTGOMERY) . As T -^ 00, we have
(25.38) j Vl (T)>(?+o(l))^logT,
i.e., at least 2/3 of the zeros are simple.
25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N 569
PROOF. Discardin g al l term s i n (25.36 ) excep t fo r 7 = 7 ' w e ge t
E „ ; < (i + i + < ) ( 1 ) )£ l o g r ,
where ra 7 i s the multiplicity of p = \+i')- Thi s holds for any fixed a wit h 0 < a < 1 ,
the bes t boun d bein g | + o(l ) fo r a — > 1 —. Hence w e deriv e tha t
Wi(T)^ J ] ( 2 - m 7 ) m 7 > ( 2 - | + o(l))^log T
completing th e proo f o f (25.38) . •
REMARK. Th e tes t functio n whic h wa s use d i n (25.36 ) i s no t optima l fo r th e
purpose o f estimating N\(T) alon g th e abov e lines , s o the percentag e 2/ 3 o f simpl e
zeros ca n b e slightl y increased .
Another consequenc e o f Theore m 25. 2 (whic h w e d o no t dra w here ) i s tha t
7n+i — In ca n b e smalle r tha n it s averag e 27r/log7 n b y a facto r A < 1 infinitel y
often, i.e. , limin f 5 n ^ A . Montgomer y succeede d i n showin g thi s wit h A = 0.68 .
The bes t publishe d resul t is A = 0.51 7 1 whic h i s du e t o Conrey , Ghos h an d Gone k
[CGG], whil e a ne w refinemen t point s t o A = 0.51 69... . Recentl y i t wa s show n
that i f 5 n ^ \ — £ sufficiently often , the n th e clas s numbe r h(—D) o f the imaginar y
quadratic field Q(y/—D) i s bounde d belo w b y c{e)D^ (log D)~2 fo r som e constan t
c(e) effectivel y computabl e i n term s o f e\ se e [CI2] .
The formul a (25.1 6 ) hold s uniforml y fo r e < a ^ 1 — e. Wit h a littl e mor e
effort on e coul d sho w tha t (25.1 6 ) hold s uniforml y fo r e ^ a ^ 1 . On e ma y wonde r
what i s th e tru e behavio r o f F(a,T) fo r a ^ 1 ? Montgomer y showe d tha t (25.1 6 )
cannot hol d fo r a > 1 . Moreover , h e describe d heuristi c argument s whic h sugges t
that
(25.39) F(a, T) ~ 1 , a s T - > 00
uniformly i n bounde d interval s 1 ^ a ^ A. B y applyin g thi s t o (25.35 ) wit h
appropriate tes t function s / o f Schwartz class , Montgomery wa s led to the followin g
PAIR CORRELATIO N CONJECTURE . For a < f3 put
iV(a,/3;T) = | { m ^ n ; 0 < 7 m,7n^T,^ < lrn - ln < |^}| .
Then, as T—> • 0 0 we have
2.
(25.40) N(a, /? ; T) ~ N{T) j f ( l - ( ^ )) du.
REMARKS. Th e assertions (25.39 ) for a ^ 1 together with (25.1 6 ) for 0 < a ^ 1
are essentiall y equivalen t t o (25.40) . B y (25.39 ) th e asymptoti c formul a (25.36 )
extends t o al l a ^ 1 with th e functio n a~ 1 + a / 3 replace d b y 1 + l / 3 a 2 . Henc e
letting a —> 0 0 i t follow s b y th e forme r argumen t tha t almos t al l zero s o f £(s ) ar e
simple.
570 25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N
25.3. Th e n-leve l correlatio n functio n fo r consecutiv e spacing .
The Pai r Correlatio n Conjectur e o f Montgomer y indicate s tha t th e zero s o f
((s) behav e muc h lik e the eigenvalue s o f large comple x Hermitia n rando m matrice s
whose statistica l distributio n i s known a s the Gaussia n Unitar y Ensemble . Furthe r
support fo r thi s behavio r wa s provide d b y Z . Rudnic k an d P . Sarna k [RS ] wh o
extended Montgomery' s result s t o th e n-leve l correlatio n sum s fo r th e normalize d
zeros o f £(s) . I n thi s sectio n w e formulate th e mai n result s o f Rudnic k an d Sarna k
and w e give a fe w comment s abou t thei r arguments .
Recall tha t ( m denot e th e normalize d ordinate s o f th e zero s p m = | + ij m o f
((s) give n by (25.4) , they ar e ordered i n increasing sequenc e . . . < £_ 2 ^ C- i < 0 <
Ci ^ C 2 ^ • • •, an d C n hav e uni t mea n spacing . Th e Pai r Correlatio n Conjectur e
answers th e questio n o f ho w ofte n th e difference s £ m i — £ m2 fal l i n a give n interval .
One may inquir e abou t pair s o f differences ( mi — C™25 Cm 2~~ Cm3 falling int o a give n
rectangle, th e tripl e o f difference s i n a box , an d s o on . T o contro l th e proble m w e
restrict ou r consideration s t o th e firs t M point s £ m , 1 ^ m < M , wher e M i s a
large number . Le t n ^ 2 and B C R n _ 1 b e a bo x o f dimensio n n — 1 . Pu t
Rn(M]B) = — :|{1 ^ m i , . . . , m n ^ M al l distinct ;
( C m i~~ ~ Qm2i Cm 2~ ~ Cm 3 • • • > Qm n-1 ~ Qm n) £ B\\.
For n = 2 and B = (a , /3) this i s simpl y
R2{M-B) = ^ | { 1 ^ rm ^ m 2 < M ; a < £ m i - C™ 2 < P}\.
For M = N(T) thi s i s very clos e t o N(a, /? ; T)/N(T). A s i n (25.41 ) w e expect tha t
there i s a functio n R n(B) suc h tha t
(25.41) R m(M; B) ~ R n(B), a s M - > o o .
Our goa l i s t o determin e R n(B) fo r an y n ^ 2 .
To simplify th e Fourie r analysi s her e we introduce tes t function s / ( # i , . . . , x n)
of th e followin g type :
TF1. / ( # i , . . . , £n ) i s symmetric ,
TF2. / O n + £ , . . . , s n + * ) = / ( * ! , . . . ,x n) fo r an y t e R ,
TF3. / ( # i , . . . , xn )—> > 0 rapidly a s |x |— > 0 on th e plane s x\ + • • • -f - x n = 0 .
REMARKS. Th e conditio n TF 2 say s tha t / ( # i , . . . , x n ) depend s onl y o n th e
successive differences . Thi s togethe r wit h TF 3 implie s tha t / ( # i , . . . ,x n ) decay s
rapidly o n an y hyperplan e x\-\ \-x n = t. Fo r n = 2 such a functio n i s given b y
f{x\,X2) — f(x\ — X2) where f(x) i s a n eve n functio n o n R whic h decay s rapidl y
to zer o a s \x\ —> 00 .
For an y tes t functio n a s abov e w e pu t
(25.42) R n(M;f) = ^ £ • • • £ /(U>-- - >CmJ ,
all distinc t
25. TH E SPACIN G O F ZERO S O F TH E RIEMAN N ZETA-FUNCTIO N 571
and see k a n asymptoti c formul a
(25.43) R n(M; f) ~ R n(f), a s M - + oo.
The multipl e su m Rn(M; f) ca n be smoothed furthe r b y replacing th e range of
summation 1 < m i , . . . , mn ^ M b y a smoot h (rapidl y decreasing ) cut-of f function .
Thus w e also conside r
1 1
(25.44) R n(T; / , ft) = £ \ • • E Kl^T' )... h ^ T " ) / ^ ^ , .. . , 7 m nL )
mi,... ,m n
all distinc t
where h(y) i s the cut-off functio n i n question and
(25.45) L - ^ l o g T .
REMARK. Ther e i s no reason t o use a cut-off functio n h(yi, ... , yn) mor e gen -
eral tha n th e produc t h{y\) ... h(y n) becaus e w e wish th e point s 7 m i , . . . , 7mn t o
be clos e t o each other .
In 1 962 , Dyson [D ] considered a statistical distributio n o f energy level s of some
complex system s i n whic h contex t h e determine d th e n-leve l correlatio n densit y
Wn(x\,... , xn) fo r the GUE model. H e showed tha t
(25.46) W n(xu • • • , xn) = det(K(x 2 - Xj )),
where
(25.47) K(x) = ^ .
7TX
One ca n show th e following propertie s o f W n(x):
(25.48) 0 < W n(x) < 1 ,
(25.49) W n{x) = 0 if and only i f xi = Xj fo r som e i ^ j ,
n
(25.50) W n{x) = 1 if and only i f x G Z an d x^ 7^ Xj for i ^ j .
Let u s compute th e density functio n fo r n = 2 . W e get
/ /i f (0) i f (xi - s 2 ) \ = / s i n 7 r ( x 1 -x2)N 2
W 2 (xi,x 2 ) = de t „ , v „ ,v 1 = - .- -
which agree s wit h tha t i n (25.40) .
T H E O R E M 25. 5 (RUDNICK-SARNAK) . Suppose that the cut-off function h(r) is
given by the Fourier integral (25.9) with some smooth compactly supported function
g(u). Suppose the Fourier transform of the test function f{x),
HO = I f{x)e{S • *)dx
is supported in the polyhedral domain |£i | + • • • + |£ n | ^ 2 . Then, as T—> • o c we
have
(25.51) R n(T;f,h) ~ N(T)( [°° h(r) ndr) [ f(x)W n(x)s(-(x
1 + -^x n))dx
572 25. T H E SPACIN G O F Z E R O S O F T H E R I E M A N N Z E T A - F U N C T I O N
where 5(x) is the Dirac distribution at point 0 .
By makin g an appropriate approximatio n t o the cut-off functio n on e can deriv e
from (25.51 ) th e following
COROLLARY 25.6 . Suppose the test function f(x) has its Fourier transform
/(£) supported in |£i | H + |£ n | ^ 2 . Then we have (25.43) with
(25.52) R n(f)= l f(x)W n(x)6(-(x
1 -h-^xn))dx
where x = (xi , ...x n). This can be written as
(25.53) R n(f) = n / f(x 11,...,xn1
)Wn(x 1
,...,xn)dx ...dxn-
JRn-l
where x n — —x\ — .. . — xn.
REMARKS. I n particular, w e hav e
Mf) = 2J f(x,-x)W 2(x,-x)dx = Jf(x)(l- ( ~ ^ ) )dx
where f(x) = / ( x , 0 ) , whic h agree s wit h th e Montgomery pai r correlatio n result .
25.4. Low-lyin g zero s o f L-functions .
A somewha t differen t proble m fro m th e consecutive spacin g o f zeros o f an L-
function concern s th e zeros nea r th e central point , i.e. , in diminishing distanc e t o
s = | . Suc h a problem , o f course, depend s o n the scaling o f zeros. Conside r a n
L-function o f degree d (stil l satisfyin g GRH ) a s in Chapter 5 , given by
oo
(25.54) L ( / , 5) = ^A/(n)n- s,
l
with conducto r Cf > 1 (see (5.7), (5.8)) . Th e number o f zeros of L ( /, s)
(25.55) Pf = \+ilf
with 1 7/ 1 ^ T satisfie s
dT
(25.56) N (T,f)~— logc fT
as T—> • o o by Theorem 5.8 . Thi s suggests that th e proper scalin g factor fo r the low-
lying zero s o f L(/, s) shoul d b e ^ logc/ . No w th e question is : ho w does N(T, f)
behave fo r small T?
To make the question mor e interesting we take a test functio n 0 in the Schwart z
class on R and define th e low-zeros sum
(25.57) W* ) = I>(7 'llr)'
If
where 7 / run s ove r th e ordinates o f zero s counte d wit h multiplicity . Fo r</> with
rapid deca y thi s su m concentrates o n zero s whic h ar e within 0 ( 1 / log Cf) o f the
point s = \.
25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N 573
Because, fo r a singl e L-function , w e ar e effectivel y countin g ver y fe w zeros ,
classical harmoni c analysi s canno t detec t thei r distribution . Therefor e w e mus t
take a sufficientl y larg e famil y o f L-function s int o consideration . Suppos e T i s a n
infinite famil y ordere d b y th e conductor . Pu t
(25.58) T{Q) = {feT;c f^ Q}.
Now w e want t o kno w th e asymptoti c behavio r o f th e averag e densit y
(25.59) AD(F-A,Q) =-±— £ D(f,</>).
If J 7 i s a complet e famil y i n a certai n spectra l sense , on e shoul d expec t tha t
(25.60) AD(T; 0 , Q) ~ / <j>(x)W(T)(x)dx
JR
where W(J r)(x) i s a densit y functio n characterize d b y J-'.
Some investigation s i n thi s directio n wer e undertake n i n th e 1 970' s (se e M .
Jutila [Ju5] , H . L . Montgomer y [Mo4] ) fo r th e famil y o f Dirichle t L-function s
L(s,x)- However , th e recen t studie s b y N . Kat z an d P . Sarna k [KS1 ] , [KS2 ]
expanded fa r beyon d thi s family . B y examinin g variou s familie s T the y reveale d
that th e low-lyin g zero s are alway s governed b y a symmetry (o r monodromy) grou p
G{F) associate d wit h T. Thei r assertion s ar e particularl y convincin g fo r zet a an d
L-functions ove r finite fields, no t onl y becaus e th e Rieman n Hypothesi s i s know n
to b e true , bu t als o du e t o connection s wit h geometr y whic h provide s additiona l
intuition (th e zeros are then eigenvalues of the Frobenius operator, se e Section 1 1 .1 1
and [KS1 ]) .
There ar e als o attractiv e result s fo r familie s o f global L-functions , bu t progres s
is no t ye t a s dee p a s i n th e cas e o f loca l L-functions . I n thi s sectio n w e presen t a
few suc h results .
First w e inspec t th e famil y o f Dirichle t L-function s wit h rea l primitiv e char -
acters Xd- Her e L(s,Xd) ha s th e conducto r Cd = \d\. Sinc e L(s,Xd) hav e eve n
functional equations , w e ma y pu t thei r zero s Pd \ + i^ int o th e sequenc e
(25.61) . . . < 7 !f2) ^ 7^< 0 < 7 (J) < 7 f< • • •
with 7^ ~ — 7^ j fo r £ G Z\{0} (i t i s believed tha t L ( | , Xd) i s never zero) . Le t
(25.62) F{Q) — {d fundamenta l discriminant ; \d\ ^ Q}.
CONJECTURE ( K A T Z - S A R N A K ) . AS Q -> oo, we have
1 V
^ ; | der(Q)t>o JR
where the density function is given by
(25.64) W s p ( ; E ) = i_!^ .
574 25. TH E SPACIN G O F ZERO S O F TH E RIEMAN N ZETA-FUNCTIO N
REMARKS. Kat z an d Sarnak [KS2 ] proved thi s conjecture fo r any test functio n
(j) whos e Fourie r transfor m
(25.65) 0 ( 0 = / <l>(x)e(-£x)dx
is supporte d i n (—2 , 2) (o f course, subjec t t o the validity o f the GRH).
The cas e o f L(s,Xd) * s obtained b y twisting ((s) wit h Xd- On e may generaliz e
this cas e b y startin g wit h an y L(f, s) i n plac e o f £(s ) an d takin g th e famil y o f
L-functions L(f 0 Xd, s) twiste d b y the real character s Xd> Fo r example, choos e a n
elliptic curv e E/Q, an d le t E^ /Q b e th e correspondin g quadrati c twists . The n
the associate d L-function s L(E^ d\s) ar e obtaine d fro m L(E,s) b y twistin g wit h
Xd> The correspondin g conjectur e o f Kat z an d Sarna k predict s tha t th e densit y
function fo r the low-lying zero s o f L(E^ d\ s) i s given by
TTr+/ v . . sin27r x
(25.66) W+ (x) = 1 + — .
Z7TX
Many interesting examples of families o f L-functions ar e offered b y the theory of
automorphic forms , an d a variety o f these ar e studied b y Iwaniec, Lu o and Sarna k
[ILS]. W e are going t o describ e som e o f their results . Le t T = 5Z/2(Z ) an d k ^ 2
be a n eve n integer . Le t Sk(T) b e the linea r spac e o f cusp form s fo r T o f weight k.
Let iffc(r ) b e the Hecke basi s o f primitive form s o f Sfc(r). T o any
oo
we associat e th e Hecke L-functio n
L(f, s) = £ \ f(n)n-s = JJ(1 - \ f(p)p-> + p-
1p
As describe d i n Chapte r 1 4 , the completed functio n
A(/ ) S ) = (27r)-T( S + ^ i ) L ( / , S )
is entir e an d satisfie s th e functiona l equatio n A(/ , s) = £/A(/ , 1 — s) wit h roo t
number give n b y ej — i k — ± 1 (not e Sf depend s onl y o n k but no t on / ). Becaus e
the sig n of the functional equatio n ha s some impact o n the distribution o f low-lying
zeros w e separate th e families o f even an d odd L-functions . Pu t
H
(25.67) M+(tf ) = J2 \ kF)\,
k^K
/c=0(mod 4 )
ff
(25.68) M~(K)= J2 l *(r)l-
k^h
k^K
k=2(mo 4 )
/c=2(mod
The correspondin g densitie s tur n ou t t o be
s
(25.69) W +(x) = 1 + E^i,
Z7TX
25. T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N 57 5
(25.70) W -(x) = i-*^E +So(x).
Zirx
THEOREM 25. 7 (IWANIEC-LUO-SARNAK) . Suppose </> is a smooth function on
R compactly supported in } — 2 , 2[. Then, as K —> oo , we have
(25.71)
77^ E E D{fA)~ U{x)W+{x)dx,
\ * h^ k^Kif feH
fan. fr\ ^^
k(T)
fc=0(mod 4 )
(25.72)
=7jn E E D{fri)~ I <l>{x)W-{x)dx.
M- R
y ) iris
k^K feH tJrr^s
fe=2(mod 4 )
k(T) JR
Let L(Sy m / , s) b e th e symmetri c squar e L- function associate d wit h / . Recal l
from Sectio n 5.1 2 tha t i t i s the L-functio n o f degre e 3 given b y
oo
L(Sym 2 /, s) = ((2s) £ A/(n 2 )n" s
1
(25.73) = FJ( 1 - A/(P2)P-S + Xf( P2)p-2s - p-'T 1
-
P
As show n b y Shimur a [Sh2] , th e complete d functio n
2
(25.74) A(Sym /,s) = ^ 3 ^T{^)T(^^)T(^)L(Sym2 f,s)
is entire an d satisfie s th e functiona l equatio n A(sym 2 /, s) = A(Sym 2 / , 1 — s) (not e
that th e roo t numbe r i s alway s one) . I n thi s cas e w e hav e
THEOREM 25. 8 (IWANIEC-LUO-SARNAK) . Suppose 4> is a smooth function on
E compactly supported in } — | , | [ . Then, as K— > oo ,
(25J5)
MK) ^ feH£ (T) D{Sym2
^ ' fc^tf k
JR
/; 0)
~ / ^^M*)***
/e=0(mod 2 )
w/iere Af(if ) = M + ( X ) + M~(K) and W sp(x) is given by (25.64).
REMARKS. W e emphasiz e tha t th e condition s o n th e tes t functio n 0 i n The -
orem 25. 7 an d Theore m 25. 8 allo w th e suppor t o f th e Fourie r transfor m <j) (se e
(25.65)) t o b e large r tha n [—1 ,1 ] . T o appreciate thi s fac t w e writ e
/ <t>{x)W{x)dx = / 4){y)W(y)dy
by Plancherel' s theorem . Her e th e Fourie r transform s o f th e correspondin g densit y
distributions are :
W+(y) = S0(y) + lv(y),
W-(y)=50(y)-^(y) + h
wsp(y) = s0(y) -\ n{y)
576 25 . T H E SPACIN G O F ZERO S O F T H E R I E M A N N Z E T A - F U N C T I O N
where So(y) i s th e Dira c distributio n an d rj(y) i s th e characteristi c functio n o f th e
segment [—1 ,1 ] . Thes e Fourie r transform s hav e discontinuit y a t y — 1 and y = — 1,
therefore th e result s ar e distinguishabl e onl y i f th e suppor t o f 4> cover s th e point s
y = 1 and y = — 1.
http://dx.doi.org/10.1090/coll/053/27
CHAPTER 2 6
CENTRAL VALUE S O F L-FUNCTION S
26.1. Introduction .
We hav e alread y mentione d man y time s th e Gran d Rieman n Hypothesi s fo r
//-functions an d discussed man y aspect s o f the fascinating zero s of L-functions (se e
Chapters 5 or 2 4 for instance ) i n various aspects .
Recently, muc h o f the analyti c theor y o f L-function s focuse d o n estimate s fo r
the value s o f L-function s a t th e centra l critica l poin t s — |. Thes e specia l value s
appear i n various context s an d vanishing o r non-vanishing ar e the mai n questions .
For example , algebrai c technique s develope d b y Mazu r t o stud y rationa l point s
on certai n modula r curves , wit h application s t o diophantin e equations , requir e fo r
full effec t t o prov e tha t ther e exist s a modula r for m o f a certai n typ e whic h doe s
not vanis h a t th e centra l poin t (se e [Ell ] fo r a recen t example) . Anothe r ver y
different situatio n i s the surprising lin k discovere d b y Iwaniec an d Sarna k betwee n
the proportio n o f non-vanishin g o f variou s familie s o f L-function s an d th e Gaus s
Class Numbe r Proble m (se e Theorem 26. 1 below).
Here i s a shor t panoram a o f selected accomplishment s i n recent years , empha -
sizing th e strongest result s fro m analyti c numbe r theory .
The firs t i s the Iwaniec-Sarnak resul t mentione d abov e [IS2] . Amon g differen t
variants, w e choose th e following :
THEOREM 26.1 . For k = 0 (mo d 4), let Hk be the set of weight k Heche forms
on 5L(2,Z) . For any real primitive character if; modulo q with q ^ k°, we have
(26.1) \{f€Hk | |L(/®V,I)|^^-i^}|^(i- 0 (l))|Ffc|
as k -» +oo and i?—> 0.
They als o sho w tha t
J2 L(/,|)L(/®x,|)~i(x , 1 )1 ^ 1
feHk
for an y rea l primitiv e characte r x modul o D , uniforml y fo r D < k' 0 a s k - » +o o
and # —> 0. Hence , i f the constant \ i n (26.1 ) fo r ip — 1 could b e replaced b y \ + S
for an y 5 > 0, with a n effectiv e implie d constant , i t woul d follo w tha t
L ( 1 x ) > >
' ( ! ^
for an y real primitiv e characte r x modul o D wit h a n absolute an d effective implie d
constant.
577
578 26. C E N T R A L VALUE S O F L - F U N C T I O N S
Note i t i s importan t t o hav e a lowe r boun d a s i n (26.1 ) instea d o f simpl y
•^(/J \) 7 ^ 0> so i t i s conceivabl e tha t al l specia l value s ar e non-zero , ye t to o smal l
to solv e th e Clas s Numbe r Proble m alon g thes e lines . Fro m th e poin t o f vie w o f
analytic numbe r theory , a lowe r boun d a s i n (26.1 ) an d simpl e non-vanishin g ar e
usually indistinguishable .
In par t becaus e o f th e relationshi p t o th e Birc h an d Swinnerton-Dye r Conjec -
ture, th e orde r o f vanishing o f weight 2 cusp forms a t 5 = \ ha s attracte d particula r
attention. Recal l tha t i n on e instanc e i t i s easy t o guarante e tha t th e centra l poin t
is a zero , namel y i f L(/ , s) i s a self-dua l L-functio n an d th e sig n o f th e functiona l
equation i s —1 . I t i s reasonabl e t o assum e tha t th e orde r o f vanishin g wil l b e th e
smallest compatibl e wit h thi s restriction , namel y a "generic " eve n for m wil l hav e
L{f, \) 7 ^ 0 an d a generi c od d for m ! / ( / , \) ^ 0 . Brume r (an d independentl y
Murty) conjecture d thi s an d use d GR H t o giv e som e evidence .
Iwaniec an d Sarna k prove d tha t hal f th e specia l value s o f eve n form s d o no t
vanish (an d ar e no t to o small , a s i n (26.1 )) . Kowalsk i an d Miche l [KM1 ] , an d
independently VanderKa m [vdK ] considere d highe r derivatives . W e quot e tw o
results:
T H E O R E M 26. 2 (Kowalski-Michel) . Let 5 2 (g)* be the set of primitive weight 2
forms of level q. We have
\{f e S 2(q)* I e f = - 1 and L'(f, \) + 0} | > ( ^ - o(l) ) |5 2 (g)*|
for primes q —> +00 .
THEOREM 26. 3 (Kowalski-Michel-VanderKam) . For any fixed k ^ 0 we have
\{f € S 2{qY I e f = (-l) f e and L^ k\f, § ) ^ 0} | > |(p fc - o(l))\S 2(q)*\
for primes q — > -hoO ; where po = 1 /4 , p\ — 1 /l§, P2 — 0.4825; p% = 0.495 ; and
moreover, p k = \ - ^ + O(^) .
Although th e erro r ter m depend s o n /c , it i s possible to combine this result wit h
an averag e boun d
2
Y, (ordL(/,,)) «l
s
fes2(q)* =2
for q prime an d deriv e (se e [KMV3 ] fo r bot h results )
COROLLARY 26.4 . We have
Y oTdL(f,8)^(c + o(l))\S 2(qy\
fes2(qys=2
for primes q —>• +0 0 with c — 1.1891.
This i s particularl y interestin g becaus e 1 .1 89 1 < | , whic h wa s th e valu e ob -
tained b y Brume r o n GRH .
In th e followin g section s w e will giv e considerabl e detail s o f the proo f o f Theo -
rem 26.2 . Thi s result expand s a n earlier on e of B. Duke [Du3 ] where the proportio n
was g/(logg) 4 .
Those non-vanishin g result s hav e nic e interpretatio n i n term s o f arithmeti c
geometry, a s alread y mentione d i n Sectio n 5.1 4 . Indee d associate d t o th e curv e
Xo(q) = To(q)\M i s a n abelia n variety , it s jacobia n variety , denote d Jo(q), whic h
26. CENTRA L VALUE S 579
is o f dimensio n di m Jo (g) = di m S2(q) = \S2(q)*\ (th e latte r onl y fo r q prime) .
The jacobian i s defined ove r Q an d it s grou p o f rational point s i s finitely generate d
by th e Mordell-Wei l theorem . Eichle r an d Shimur a compute d th e Hasse-Wei l zet a
function o f Jo(q) i n term s o f modula r forms : fo r q prime i t take s th e for m
L(J0(q),s)= J ]L (f>8)'
fes2(q)*
If th e Birc h an d Swinnerton-Dye r Conjectur e fo r abelia n varietie s hold s fo r J 0 (g) ,
we hav e
r a n k J 0 ( g ) ( Q ) = or d L(J 0(q),s)= V or d L(/,s) ,
/
s=l 2 —^ s=l/2
so lower bound s fo r th e orde r o f vanishin g giv e lowe r bound s fo r th e ran k o f Jo{q)-
In fact , b y th e genera l for m o f th e Gross-Zagie r formul a (Theore m 23.4 ) fo r a n
arbitrary weigh t 2 cusp form , on e show s tha t i f th e orde r o f vanishin g i s exactl y 1 ,
then / contribute s 1 to th e ran k (b y conjugate s o f Heegne r points) . S o Theore m
26.2 implie s
THEOREM 26.5 . For q prime, q -> +oc , we have
rank J 0(q)(Q) > ( — - o(l) ) di m J 0(q).
On th e Birc h an d Swinnerton-Dye r Conjecture , Corollar y 26. 4 give s a corre -
sponding fairl y goo d uppe r boun d fo r ran k Jo(q)- Ther e i s currently n o comparabl e
unconditional bound .
Recall fro m Propositio n 5.2 1 that , o n GRH , th e maxima l orde r o f vanishing o f
an L-functio n o f degre e d wit h conducto r q a t th e centra l critica l poin t i s abou t
(log g) (log | l o g g ) _ 1 . Assumin g th e Birc h an d Swinnerton-Dye r Conjecture , thi s
translates t o a conjectural uppe r boun d fo r th e maximum ran k o f an abelia n variet y
over Q . On e ca n se e i t i s bes t possibl e b y takin g power s o f a n ellipti c curv e wit h
positive rank . However , mor e convincing i s the cas e of Jo(q) (whic h i s conjecturall y
"almost" irreducible) . Fo r q prim e on e get s b y GR H an d th e Petersso n formul a
the slightl y bette r estimat e (5.94) . Th e conducto r i s N q = g dimJ o(g) 5 henc e b y
dim Jo(q) ~ #/1 2 w e hav e di m Jo(q) ~ (logN q)(\og\ogNq)'1 . S o Theore m 26. 5
shows tha t th e uppe r boun d o n GR H canno t b e improve d (excep t fo r th e implie d
constant).
Of cours e on e ca n als o stud y classica l Dirichle t L- functions. Miche l an d Van -
derKam [MvdK ] studie d th e analogu e o f Theorem 26. 3 for th e famil y o f primitiv e
characters modul o q. A different challeng e i s to conside r onl y rea l character s mod -
ulo q ^ Q. Ther e Soundararaja n [Sou ] ha s prove d
THEOREM 26.6 . Let Q be the set of odd squarefree integers. For q E Q, let
n n
X8q( ) — (&q/ ) be the even quadratic character modulo Sq. Then
\{q<Q\ q£QandL(lx8 q)^0}\^Q-o(l))\{q^Q \ q € Q}\
as Q —> • + 0 0 .
Notice the coincidence of the proportio n 7/ 8 wit h Theorem 26. 2 (wher e a facto r
I reall y come s fro m consideratio n o f od d forms) . I n fact , Soundararajan' s proo f
580 26. CENTRA L VALUE S O F L-FUNCTION S
uses a mollifie r M{\) a s i n Theore m 26. 2 (se e Sectio n 26. 2 an d following) , an d
remarkably fo r a mollifier o f length Q A , the proportion obtaine d i s 1 — (2 A + 1)~3
exactly a s in Theorem 26.9 ! Th e Katz-Sarnak philosoph y o f monodromy group s of
a famil y o f L-functions ca n nicely explai n thi s phenomenon .
This concep t o f monodrom y grou p o f families , althoug h lackin g a forma l def -
inition yet , ha s alread y bee n use d t o explai n ver y strikin g results . Fo r example ,
Conrey an d Soundararaja n [CoSo ] hav e bee n abl e t o prov e fo r th e first tim e tha t
there ar e infinitel y man y L-function s o f rea l character s wit h n o rea l zero s i n th e
critical strip (no t only at s — \). Th e success of their approac h (whic h depends on a
constant turnin g ou t t o be positive) wa s "predicted " b y the conjectured symmetr y
of th e family o f real characters .
THEOREM 26.7 . Let Q be as in Theorem 26.6. We have
\{q^Q\ L(x-8 9, s) ± 0 for s G [0, 1]}| ^ Q - o(l))\{q ^ Q \ q G Q}\
as Q -» -foe.
We conclud e wit h a stil l challengin g problem : prov e that , give n a modula r
form / (sa y of weight 2 and level q), there exist s a positive proportio n o f quadrati c
characters \ suc h tha t L(f 0 x ? |) 7 ^ 0- Thi s i s a conjectur e o f Goldfel d tha t stil l
resists proof . Th e best know n resul t i s due to Ono [Ono] , usin g algebrai c method s
(congruences an d Fourier coefficient s o f half-integral weigh t modula r forms) , losin g
a facto r (logQ) 6 fo r som e S G (0,1). Analytically , usin g Heath-Brown' s larg e siev e
inequality fo r rea l character s (Theore m 7.20) , Perell i an d Pomykal a [PP ] have th e
best result , wit h a facto r Q~ e lost .
REMARK. Othe r specia l point s o n th e critica l lin e ar e thos e Sj = \ + itj as -
sociated t o a n eigenvalu e o f th e Laplac e operato r o f a Maas s cus p for m <pj fo r a
congruence subgroup . Th e deformatio n theor y o f Phillip s an d Sarna k show s tha t
the vanishing of the special value of the Rankin-Selberg L- function L(f<g>(pj, \+itj)
(where / i s a weigh t 4 holomorphic cus p form ) ha s an interpretatio n fo r th e ques -
tion o f th e existenc e o f cus p form s i n non-arithmeti c groups . W . Lu o [Luo ] has
shown tha t fo r fixed / G S^p)*, wher e p i s prime, a positiv e proportio n o f specia l
values L(f <g > ipj, \ - h itj) ar e non-zero.
26.2. Principl e o f th e proo f o f Theore m 26.2 .
Since q is prime and k — 2 < 1 2 , we can apply the Petersson formula t o the basis
5*2(q)* (properl y normalized ) o f primitive form s o f £2(3), a s describe d i n Corollar y
14.23 and afterwards . Le t / G S2(q)*• W e denote it s Fourier expansio n
f(z) = X^M^vW^)
and recal l its multiplicative propertie s (1 4.50) , (1 4.51 ) . T o prove Theore m 26.2 , w e
first deriv e th e weighted analogue :
THEOREM 26.8 . We have
feS2(q)* od d
L/(/,l/2)^0
26. CENTRA L VALUE S 581
for primes q —> +00 .
Recall fro m (1 4.60 ) tha t th e h i n superscrip t indicate s tha t th e normalizin g
factor (47r) _ 1 ||/|| - 2 i s inserted .
The metho d o f proo f i s base d o n compariso n o f th e firs t an d secon d moment s
of specia l value s Z/(/ , 1 /2) . Sinc e "larg e values " o f ! / ( / , 1/2) hav e a larg e effec t o n
the secon d moment , w e us e mollification , a s i n th e proo f o f Selberg' s theore m i n
Chapter 24 . Le t
M
i = E * W-ef)MU)L'U,\)
f£S2(q)*
and
k 2
M2= J2 !(l-e/)|M(/)LU|)|
f£S2(q)*
for som e M(f) G C. Compariso n o f a n uppe r boun d fo r M 2 an d a lowe r boun d fo r
Mi yield s b y Cauchy' s inequalit y a lowe r boun d fo r th e quantit y w e stud y
Ml
(26.2 ) E* 1> M
2
*/ = -!
£'(/,§)*o
In orde r t o achiev e th e bes t possibl e estimate , w e will fin d asymptotic s fo r M i
and M 2 . Not e tha t i f th e mollifie r i s ignore d (tak e M(f) = 1 ) , a facto r log< ? is los t
in th e fina l estimate .
To ge t sum s amenabl e t o Petersson' s formula , w e loo k fo r mollifier s a s linea r
forms i n A/(m) :
A
(26.3) M(/ ) = J2 % /M
for som e M = q A wit h 0 < A < \ an d rea l coefficient s x m supporte d o n squarefre e
numbers < M , suc h tha t
(26.4) x^ <^r(m) (logqf
for som e absolut e implie d constant .
Theorem 26. 8 follow s b y lettin g A —> • \ i n th e followin g
THEOREM 26.9 . For 0 < A < 1 /2 , we have
1 > 1
^ 2\ (2 A + 1)0
fes2(qy od d
L'(/,l/2)=#)
for all primes q large enough in terms of A.
582 26. CENTRA L VALUE S O F L-FUNCTION S
26.3. Formula s fo r th e firs t an d th e secon d moment .
The moment s M i an d M 2 ca n bot h b e reduce d t o combinations o f averages o f
! / ( / , \) o r Z/(/, \) 2 twiste d b y Heck e eigenvalues . Fo r m ^ 1 , le t
(26.5) D(m)= Yl k W-e f)Xf(m)L\f^),
fes2(q)*
(26.6) H(m)= Y? |(l-^/)A / (m)L / (/,i) 2 .
fes2(q)*
We hav e
(26.7) M l =£^kD(m)
and b y (26.3 )
A
M2= E " L'u,\f E 7 = = /^)A/^)
Sf = — 1 rai,ra2 ^
(26.8) = V j V ^bm^brn , R( y
b mi,m2
We will obtain asymptoti c formula s fo r D(m) an d H(m) valid if ra is small enoug h
with respec t t o q.
We first us e contour integratio n an d th e functiona l equatio n t o expres s L'(f, | )
and ! / ( / , | ) 2 a s sums o f rapidly convergen t series . Thi s coul d b e derived fro m
(5.12) b y differentiation , bu t w e redo th e computation s anyway .
Let N ^ 2 and G be a fixed polynomia l suc h tha t G(—s) — G(s), G(—N) =
. . . = G ( - l) = 0 an d G(0 ) = 1 . I n particular, G'(0 ) = G< 3>(0) = 0. Conside r
/=
i/A^+^f-
(2)
We ca n shif t th e contou r o f integration t o the lin e R e (s) = —2, picking u p a
double pol e a t s = 0, henc e
ds
I=resA(f,s+^ + ^- J A(f,s+l)G(s
(-2)
By th e functiona l equatio n (Theore m 1 4.1 7) , wit h sig n—7 ? = sj, th e integral on
Re (s) = — 2 is equal to £//, so
(l-£/)/=resA(/)S + i ) ^ .
Computing th e residu e b y writin g th e Taylo r expansion s aroun d 0 , we obtai n
l2
2{l-ef)I = {l-e s)(^) ' L'{f,\).
26. C E N T R A L VALUE S 583
On the othe r hand , w e can comput e / by expanding th e L- function i n an absolutel y
convergent Dirichle t serie s o n th e lin e R e (s) = 2 , which give s
'-(£)"*£
V2W
U ^WV £)
^
where
3ds
(3/2)
Hence, comparin g bot h expressions , w e hav e
(26.9) ( i _ e / ) L U ! ) = 2 (l- e/ )g^v(^).
We estimat e V easil y b y shiftin g th e contou r t o th e left , o r right : w e hav e
(26.10) V(y) = -logy- 7 + 0(y),
J
(26.11) V(y)<^y~ forallj^l .
Similarly, w e conside r th e integra l
J
'hSK{,-'+^G{4
(2)
and procee d t o evaluat e i t i n th e sam e fashio n a s fo r I. Shiftin g th e contou r t o
Re (s) — —2 and applyin g th e functiona l equatio n fo r th e squar e o f th e L-functio n
A(/,S)2 = A ( / , 1 - S ) 2
(where th e sig n i s ei = 1), we ge t
2J=resA(/,s+i)2^.
5 = 0 S°
Further, fro m th e multipHcativit y o f th e Heck e eigenvalue s (se e (1 4.50) ) w e
derive th e Dirichle t serie s expansio n
L(/,s)2 = C 9(25)^r(n)A/(n)n-s,
so th e ter m b y ter m integratio n yield s
27r
SI ^ v
«J
where
2
(26.12) W(y) = ± J ( q(l + 2s)T(s) G(s)y-s^.
(1/2)
Hence
n>l
584 26 . C E N T R A L VALUE S O F L - F U N C T I O N S
For od d /, w e have L ( / , \) = A(/ , \) — 0 so the above residu e equal s A'(/ , \) 2 =
2%Lf(f, \)2 fro m th e Taylor expansion s of A(/, s + \) an d G(s). Therefor e fo r odd
forms w e have
2
(26.13) ! U i ) = 2 j : f r W p ) .
The functio nV K satisfies th e bound
(26.14) y l
WU)(y) < log 3(?/ + y" 1 ), fo r all i^j^0,
(i)
(26.15) 2/W (y) < C y_J ', fo r all i ^ 0 , j^ 1 .
(the implie d constan t dependin g o n z and j) an d there exist s a polynomia l P ,
independent o f g, of degree a t most 2 , such tha t
(26.16) W(y) = -^(logy) 3 + P(logy) + O ^ - ^ l o g y ) 2 + y).
This las t fac t follow s b y writing
G W r w ^ i + i!;)
by shiftin g th e contour , an d computing th e residue.
Using (26.5 ) an d (26.9) w e get
where
A , (I,m) = ^ ( l - ^ ) A / ( 0 A / ( m ) .
/
We no w appeal t o (1 4.65 ) o f Corollar y 1 4.2 6 which show s tha t A ' i s a stron g
approximation t o the Kronecker delt a
A%rn) = S(lM + 0^^(^)\lo gq)\).
One coul d us e also (1 4.64) , keepin g th e Kloosterman sums , bu t this wil l no t b e
necessary fo r the application t o M\ an d Theorem 26.2 . Henc e w e have b y (26.10)
and (26.1 1 )
3 81 4
D(m) = 4=VC-^) +0(g- / m / (logg)2)
with a n absolute implie d constant . B y (26.7), (26.1 0 ) an d (26.4) w e get
PROPOSITION 26.1 0 . We have for m^l,
(26.17) D{m) = - i= (log ^-) + 0(<T3/8 m 1/4 (log q) 2 + mq'^ 2
)
where q = y /g/(27re7), and if M = qA, then
s 7
(26.18) M 1 = ] T ^(logl)+0( q- (logq) ).
26. CENTRA L VALUE S 585
where 6 = | ( \ - A) .
In th e following , whe n w e write a n erro r ter m o f the for m 0(q~ 8)1 i t i s implie d
that 5 > 0, 6 depends onl y o n A , an d th e valu e o f S may chang e fro m lin e t o line .
We no w procee d t o get a n expressio n fo r M 2 a s a quadrati c for m i n th e xm.
The ide a i s th e sam e a s above , bu t w e canno t simpl y estimat e th e contributio n o f
the su m o f Kloosterma n sum s i n th e Petersso n formul a b y Weil' s bound : w e ope n
the Kloosterma n sum s an d transfor m furthe r t o deriv e th e require d expression .
Let m ^ 1 be fixed, m ^ q 2A < q. W e ar e no w read y t o comput e M 2. B y
(26.6) an d (26.1 3 ) w e hav e
H(m) = £ T -§W(^) ^ ( 1 -e f)Xf{m)Xf(n)
r(n) /47r. 22,n\ ,
E - WT T — )A'(m,n) . Q
Our previou s approximatio n t o A'(ra , n) b y 5(m, n) i s not sufficientl y strong . Pro m
Corollary 1 4.2 6 w e have th e mor e precis e formul a
A , (m,n) = ( 5 ( m , n ) - - ^ ] T -S(mq,n]r)J 1 ^J'^>j
v (r,g ) =l
1
+ 0 (r3((m,n))^)i(logg))
Hence b y (26.1 4) , (26.1 5 )
r(m) /47r 2mN
(26.19) H( ? n) = ^ H / ( ^ ! ! ! ) + X ( m . ) + o((^)i(log^)
9
where X(m) i s a su m o f serie s o f Kloosterma n sum s
1
(26.20) X{m) = - ^ ^ T r- Xr(m),
^ (r, 9)=i
(26.21) X r (m) = - £ ^ S ( m 5 , n ; r ) j1 ( ^ ^ ) W ( ^ ) f ( „
For technica l reason s w e have inserte d i n th e summatio n th e facto r £(n ) wher e £ is
a fixed functio n £ : R+ — » [0,1 ] , whic h i s C°° an d satisfie s
£(z) = 0 , 0 ^ x ^ \, £(x) = 1 , x ^ 1.
Obviously thi s extr a facto r doesn' t affec t th e su m (al l positiv e integer s ar e a t leas t
1!), bu t i t wil l b e usefu l t o gai n convergenc e i n som e serie s appearin g later .
We first estimat e th e contributio n o f thos e term s wit h r > R i n (26.20) , wher e
R > 0 will b e chose n late r (R = q 2). Usin g J\(x) < C x, Weil' s boun d fo r Klooster -
man sums , (26.1 4 ) an d (26.1 5) , w e ge t
2T T
(r,g) = l
586 26 . C E N T R A L VALUE S O F L - F U N C T I O N S
We denot e no w X f(m) th e remainin g par t o f th e su m i n X(ra) , namel y
r1
_
(26.22) X'(ra ) = ^F Yl ^(m).
(r,g) = l
Let r ^ R. Th e expressio n X r(m) ca n b e compute d usin g th e summatio n
formula (4.56 ) fo r sum s o f Kloosterma n sums . Thi s yield s
2 C°° fx
XJm) = — 5 ( m , 0 ; r ) / (lo g ^— + i)t{x)dx
r Jo r
+ — 2jr(/i)S(/i<7-m,0;r ) / * o ( — Jt(x)d x
- - 5 ^ T ( / i ) S ( / i g + m,0;r ) / ° ° lfo( ft *)*(s)ris
ft^i
where
2
47r jmx\ (4TT X\ £(x)
(26.23) t(l )= J l ( _ ^ _ ) W ( _ ) .
r V q J \ q / y/x
Hence
(26.24) X'(m ) = ^ £ ^S(m,0;r)L(r )
(r,<jr) = l
+ ^FE \Y,T(h)S(hq-m,0;r)y(h)
(r,<j) = l
- ^ E ^E^)^+™>^ w + o ( ^)
(r,g) = l
with
(26.25) L(r ) = y o (lo g ^- ^ (- y j T ) W (—)-/= l
(26.26) y(/i ) - J Yo(—y-)t(x)dx,
(26.27) k{h)= / Xo(^^)t(^)dx ,
the erro r ter m comin g fro m removin g th e functio n £(x) fro m th e firs t ter m usin g
t(x) < (logqfx- 1 /2 fo r 0 < x < 1 , and |S(ra,0;r) | ^ r .
26. CENTRA L VALUE S 58 7
Let X"(m) b e th e firs t ter m i n X'(m). W e hav e thu s b y changin g th e integra -
variable x into ij^r ,
X"(m) = -2 £ \s{m^T) j ^ {\og^-+ 1 )J1 {2V^)W{r2x)~,
(r,q) = l
=
in J Z m(l + 25)Cg(l + 25 ),- 1r( 5 ) 2 G( 5 )L( 5 )^,
(1/2)
by (26.1 2) , wit h
Z*(s) = ] T 5 ( m , 0 ; r ) r " s ,
(r, 9 ) = l
1 8 /2
L(s) = -2 / (lo g ^+ 7) J 1 (2y/mx)x'- - dx.
Jo 2 7 r
Extending th e su m ove r r to al l r ^ 1 , we ge t
d|ra
if R e (s) — 2 by th e formul a (3.2 ) fo r th e Ramanuja n sum . Moreover , fo r al l s with
\ < Re (s) < 1 , we hav e
(26.28) L(s) = m s " 1 / 2 r ( - 5 ) r ( 5 ) - 1 flo g — + 27 + ^( 1 + s) + ^(1 - 5) )
\m /
where ^ = T'/ T an d Q = q/An 2. Indee d formul a 1 4.561 .1 4 o f [GR] give s (fo r
- 2 <Re(s ) < - | )
r°° r r(i(l + f)
y j!(x)^ x = 2s ^ 2
/° ° (log*) Mx^dx = 2 ' B i ± || (lo g 2 + ^ ( l + '-) + i t f (l - 0)
and a simple computatio n give s (26.28) . Consequentl y w e hav e
x {m) =
" 1
hi I (- 2)^-2sM5-1 r(5)2G(.)L(^ + o ( ^ i o g g )
(1/2)
since (o n R e (5) = -|)
C,(l + 2S )r( S ) 2 G( S )L( S ) « \r(s)r(-s)s- 1 \(logq+ |^( 1 + S) | + Ml - a )|).
The functio n F(s) bein g integrate d ca n b e writte n
9.
F(s) - m - 1 / 2 5 r / s ( m ) - 1 G ( 5 ) r ( 5 ) r ( - s ) flo g " + 27 + ^( 1 + 5) + ^(1 - s))
m
where
V.(m)= J2 (£)*
ab=m
588 26. C E N T R A L VALUE S O F L - F U N C T I O N S
(Fourier coefficient s o f the non-holomorphic Eisenstei n serie s fo r 5X(2,Z)). Thus ,
F(s) i s an odd function o f s. Moreover , i t is holomorphic i n the strip |Re(s) | < 1,
except fo r a tripl e pol e a t s = 0 , an d decrease s exponentiall y i n vertica l strips .
Shifting th e contour t o Re (s) — —\ and then changin g s into — s, w e conclude tha t
r(ra )
X"(m) = f r e s F ( 8 ) + O p p i o g g )
Expanding aroun d s = 0 we have
1
s- r(s)r(-s) = -s"3 + (72 - r"(i)) s-1 + o(s),
2
G(s ) = l + iG"(0)s + O(s 3 ),
2 7 + ^ (1 + s) + V(l - s) = V>"(0)s 2 + 0 ( s 4 ) ,
2 3
% ( m ) = -r(m ) + | T ( m ) s + 0 ( s )
where
(26.29) T(m)= £ (log^)* .
Combining those , w e obtain therefor e
(26.30) X"( TO ) = -^(log^) +a^(log^)+o(^log,),
4y/m V m) y/m \ m/ \ R /
where a = | ( 7 2 - r " ( l) - §G"(0 ) - V"(0)).
Coming to the other tw o terms i n (26.24), we claim tha t fo r q < m an d R ^ q 2
x
(26.31) " 7 E* 4 E r(h)S(hq - m, 0; r)y(ft) « m '\~^\
(26.32) ±- Y^ ^ Er(ft)S(fe<7 +™> 0; r)k(h) « (m 1 /^" 1 + <T1/2)<f
for an y £ > 0, the implied constan t dependin g onl y o n e.
We choose R = g2. B y (26.20) , (26.22) , (26.24) , (26.30) , (26.31 ) , (26.32 )
together, w e deduce
(26.33) X M = - ^ ( l o g 5 )+ a I ^ ( I o g 4 ) + o ( ( ^ + 4 = y ) .
4>/m \ mJ y/m V m /V V q yfqJ J
Inserting thi s i n (26.19) an d appealing t o (26.1 6) , we obtain
PROPOSITION 26.1 1 . The twisted mean-value (26.6) is given by
(26.34)
1 r(m) / Q \ 3 \T(m) ( Q\ r(m) / Q\ 1 / 4 _ 1 / 4 +££
H{m) = —-^jJ-[log--) - - — )=+ lo g — ) + — =• Pi lo g — )+0{ml/*q i / 4 +
12 y/m V m / 4 y/m \ mJ y/m \ mJ
where Q = q/Air 2 and P^X) = P(X) + aX for 1 ^ m ^ q2A with A < 1 /2 . The
implied constant depends only on e and A.
26. C E N T R A L VALUE S 589
R E M A R K . I n particular , fo r m = 1, (26.1 7 ) an d (26.34 ) giv e asymptoti c for -
mulas fo r th e firs t an d secon d moment s o f the specia l value s o f the derivative s
£'(/>!):
k
(26.35) J2 \{1 -ef)L'U,\) = \\ogq-1\og2ir + 0{q-ll2),
/eS2(g)*
(26.36)
J2k |( 1 - ef)LU \f = ±(log ^ )3 + A (log J L) 4 - 0(g-^«).
Finally, th e expression fo r the second momen t o f special value s a s quadrati c
form i n the mollifier coefficient s follow s fro m (26.8 ) an d (26.34).
THEOREM 26.1 2 . Assume M = qA with A < \. Then
s
M2 = —M 2i - -M 22 + M2 3 + 0(q- )
where M21 , M22 and M23 ar e quadratic forms in the variables x m given by
3
(26.37) M 12 = j: \ £ ^ ^ ^ (lo g - $ - ) ,
1
1
6 771,77 2
T
(26.38) M 22 = Vj V (mim2)x^A - Q - ) ,
2 L
^ 0 ^ vfi\Vfi2 V mim 2/
6 77li,77l 2
(26.39) M 23 = £ I £ I^l!^) XbBiStmajPi (l og _ « _ ),
1
1
6 771,77 2
and 6 is a positive number depending only on A. Recall that Q = g/(47r 2).
R E M A R K . I n fact ou r proof i s complete onl y for A < |. Thi s limitatio n come s
from th e error ter m i n (26.1 9) , al l other part s bein g adequat e fo r A < \. Th e full
extension t o this cas e ca n be established b y arguments essentiall y th e same a s i n
the othe r parts , s o we omit th e details.
The proof s o f (26.31) an d (26.32) requir e onl y uppe r bound s fo r the integrals
y(h) an d k(h). Thes e ar e estimated b y standard integratio n b y parts (a s in Section
4.4). Combinin g wit h the bound for Ramanujan sum s \S(hq±m, 0 ; r)| < (hq±m, r)
one derive s th e results (fo r details se e the original pape r [KM1 ]) .
26.4. Optimizin g th e mollifier .
Now (26.1 8 ) expresse s Mi as a linear form , an d Theorem 26.1 2 expresse s M 2
as a quadratic form , i n the coefficients x m. Th e optimal choic e of xm t o maximiz e
M1/M2 (se e (26.2)) i s not easy to express explicitl y becaus e th e quadratic for m is
not diagonal . Ou r strategy wil l be to decompose M 2 as a sum of quadratic forms ,
one o f which, sa y M' 2, is easily diagonalized . W e then choos e x m t o maximize
M1/M2, an d simpl y evaluat e th e othe r quadrati c form s fo r this chose n vector .
One coul d justif y afterward s tha t thi s i s asymptotically bes t possibl e eve n fo r the
full quadrati c form . I t turns ou t that bot h M 2\ an d M2 2 in Theorem 26.1 2 hav e
contributions o f the same orde r o f magnitude, wherea s M23 is smaller.
590 26. C E N T R A L VALUE S O F L - F U N C T I O N S
We separat e rai an d 777, 2 hi (26.37 ) b y mean s o f th e formul a
r r
r(m1m2) = ^ ^ W 7 W)
a|(mi,7n2)
getting (afte r relaxin g th e resultin g coprimalit y conditio n b y mean s o f th e Mobiu s
function)
r l r ^ aV ^ r(mi)r(m 2) A < 2V
6a mi,rri2
Expanding th e logarithm , M21 is a linea r combinatio n o f th e quadrati c form s
n(t,«,«,«;) = (iogQ) » 5>(%fc",H) y*
0,(™)
u
where
r m
(*) V ^ ( )n \ i
m
,<*,. \ Y. «£W.
ab=k
and t, u, v an d it ; ar e non-negativ e integer s suc h tha t t-\-u + v + w = 3. I n fac t
M23 is also such a linea r combinatio n wit h parameter s suc h tha t t + u + v + w ^ 2 ,
however.
For th e chose n vecto r (x m) i t wil l b e clea r tha t
n ( t , t i , ^ ) < < n ( o , « , ^ ) ^ —
logg
by direc t estimation , s o w e ca n restric t ou r attentio n t o U(u^v,w) = 1 1 (0 , u, v,w).
Accordingly we write i/(k) fo r uo(k). Not e that fo r k ^ M , w e have v(k) = cp(k)k~ 2.
The contributio n t o M21 of thos e II(n , v, w) i s the quadrati c for m
m 2 1 = n(3,0,0 ) - 6n(2,1 ,0 ) + 61 1 (1 ,1 ,1 ) + 61 1 (1 ,2,0 ) - 61 1 (0,1 ,2 ) - 21 1 (0,0,3 )
(using th e obviou s symmetr y IL(u, v, w) = n(u , w,v)). W e selec t th e on e quadrati c
form I I = 1 1 (3,0,0 ) a s reference an d w e choose (x m ) whic h maximize s M f /II. The n
we evaluate th e othe r quadrati c form s II(u , v, w) an d M2 2 fo r tha t vector .
By definition , I I i s alread y diagonalize d
n = (iog£) 3l>(%fc-
k
The x m ar e recovere d b y th e formul a
(26.40) x m = £ ^T-Vk,
k
where g = [i* fi i s the Dirichle t convolutio n invers e o f r. Henc e th e linea r for m M i
can b e writte n a s
M1 = £ —log-^E^ *
26. C E N T R A L VALUE S 591
where
*(*) = £ £*(«)(logf).
ab=k
Since
we hav e J2j(k)k~s = C(s + l)- 2((log<z)C(* + 1 ) + C'(* + 1))
fc>i
logg r/ ._lv
( r ) ' ( * + i)
<(* + !)
which show s tha t
(26.41) j(k) = ^-{logqk).
By Cauchy' s inequalit y w e have
with equalit y i f y& = ^ 4 fo r k ^ M . Henc e th e best choic e o f mollifier i s
^\ 0 ,i f k > M.
In particula r y^ , an d s o als o # m (se e (26.40)) , i s supporte d o n squarefre e num -
bers an d th e growt h conditio n (26.4 ) i s immediatel y verified . Sinc e j(k) i s abou t
(log k)/k an d v i s abou t A: -1 , it i s quit e clea r fro m lookin g a t th e variou s expres -
sions involve d tha t w e will get a positive (harmonic ) proportio n i f M — q A fo r any
A > 0 , with bot h M x2 and M 2 bein g o f size (logg) 6 .
The result s o f evaluating M\ an d M 2 i, M22 are as follow s
PROPOSITION 26.1 3 . Let M = q A and xm be as above. We have
Mx = A (^ + I + \)(logq) 3 + 0((\ogq) 2),
M21 = (JA 6 + f A 5 + f A 4 + f A 3 + 2A2 + \A)(\ogqf + 0((\ogqf),
M23<(logg)5(loglogg)4.
PROOF. Fo r Mi, by definition o f yk an d (26.41 ) , w e have
«, = „og0)-n = i;f^ = i:^f(^«2
and th e result follow s b y partial summatio n fro m
KJ
k<K ^
592 26. C E N T R A L VALUE S O F L - F U N C T I O N S
For M21 , we hav e
)
U(u,v,w) = (logQr^(k)y^ y^)
k
and w e expres s y£ i n term s o f yk usin g th e highe r vo n Mangold t functio n A ^ =
// • (log) 2. Precisel y w e hav e
T
(26.42) y?= Y, -f^)yke-
i^M/k
See Sectio n 1 . 4 ((1 .43 ) an d after ) fo r th e propertie s o f A * that w e need .
The sum in (26.42) is thus a sum over squarefree I having at mos t i prime factor s
(i ^ 3) . T o evaluate we can find the associate d Dirichle t serie s or more elementaril y
separate th e su m int o th e part s wit h a fixed numbe r o f prim e factors . Thos e ar e
multiple sum s ove r primes , an d the y ar e o f Merten s typ e sinc e T{£)£~ 1 = 2H~ X
for suc h £ with LJ(£) = j prim e factors , henc e eas y t o evaluat e asymptotically . W e
quote th e resul t (se e [KM1 ] , 2.5.2 , fo r details) : on e finds
2/il) = Q ^ (lo g f y(log<f +
W k ) + 0(^(lo gqy(loglogq))
for i = 1 , 2 , 3 wit h c\ — — 1, C 2 = \ an d C 3 = 0 . The n th e computatio n o f th e
various n(u , v, it;) i s a simpl e matte r o f summatio n b y parts . Fo r instanc e w e hav e
11(0,1,2) = - J £ ^!(i og ^)3(iog93M2fc)(log^MA;)
+ 0((log< ?)5(loglog<?)3)
=/ (log- ) ( l o g . f M V K l o g . f M z ) — + 0((logc ? ) 5 (lo g log< ? ) 3 )
JY\ X / X
AogM
3 5
=/ y {3logqM-y)(2\ogqM-y)dy + 0((\ogq) (log\ogq)3)
Jo
= ( ~ A 6 - ^A 5
- ^(logg) 6
+ 0((log< Z ) 5 (loglog (? ) 3 )
(recall log g = lo g y/q + O(l)) . Whe n everythin g i s computed , th e valu e o f M21 i s
obtained a s claimed . D
There remain s t o trea t th e quadrati c for m M22 defined b y (26.38) . Recal l tha t
T(n) i s define d b y (26.29) . I t follow s tha t
(2,43, • « . E ^mm)H^ym d\(mi,m,2)
26. CENTRA L VALUE S 59 3
We us e this t o separat e m i a n d 7712, which yield s t h e expressio n
z
^-^o^^ a ^ ^ mim 2V o r 77117712/
ba 77li,77l 2
0 ^m T r(mi)T(m 2) / Q \
mi,m2
- 4 V ] l / i ( f e ) V " —X 1km Xkm2'
^ *-^ mim 2
This i s rewritte n a s
(26.44) M 22 = 2(n(l,0,0)-n(0,l,0)-n(0,0,l))-4^i/ 1 (%fc^
where
Z
_ (o ) _ V ^ T ( m )
k — zk ~ 2_ > m XkT
m
T
(l) V ^ ( m ) /i \
and
ft(a,6,c) = (lo g Q ) a X ; K % i 6 ) ^ •fe
it
As befor e w e express t h e new variables i n term s o f yk b y
(26.45) Zk = 2j2 ±^y ke,
l^M/k
X)
(26.46) 4 = ]T -AJ_±i Zfc ,+ £ " ^ ^ V
£^M/k i^M/k
To se e this, notic e t h a t b y (26.40 ) w e hav e
*-E(E^-)>
rn _ , . .to-
T(m)
The Dirichle t generatin g serie s fo r t h e coefficien t o f £ is L(s + 1 ) wher e
i(5) = c(S )- 2 E T ( n ) n " s -
71
It i s easy t o se e t h a t
^2T(n)n-s = 4<C" - 2(CC') ' = 2(CC" - (C) 2),
594 26 . C E N T R A L VALUE S O F L - F U N C T I O N S
so L(s) = 2(C /C~1 )/, whic h give s (26.45) . The n w e derive (26.46 ) a s follow s
m £b=m £ m^M/£
^ r(l)A(l )^ T(£)A(£)
£^M/k £^M/k
by (26.43) .
By (26.45 ) an d (26.46 ) w e can evaluate Zk and z\. ' muc h a s before wit h y^\
and w e get
*--5^(*T)'™,""*>+0<4(1°")')
»i2)+0(-|y(l089)2(l«6l»g9)).
From thi s w e easily evaluat e th e quadratic form s tha t occu r i n (26.44 )
11(1,0,0) = - ( l o g Q ) 5 > ( % f c 2 / l 2 ) + 0((log<?) 5) = - n ( l , 2 , 0 ) + 0((log<z) 5),
k
n(o, i, o) = - £ " W ^ S f + o((iogq)5) = -n(o, 1,2) + o((iog<?)5),
k
6(0,0,1) « (lo g qf.
Moreover, b y direct estimatio n
^2^i{k)ykzk < (logg) 5 (loglogg) 3 .
k
Now (26.44 ) yields :
PROPOSITION 26.1 4 . Let m = qA. For x m as above, we have
M22 = ( - ^ A 6 - ^ A 5
- ^A 4
- i A 3 ) ( l o g g ) 6 + 0((logg) 5 (loglo g ( ? ) 3 ).
that Prom
fo r ATheorem
<\ 26.1 2 , Proposition 26.1 3 and Proposition 26.1 4 , we can comput e
M L = I f i1
_ , 0 /(loglogg) 4 \
\
3
M 2 2 V (2 A + i ) ; ^ v iog g ;
(where th e very simpl e fractio n appear s quit e miraculousl y b y partial fractio n ex-
pansion). Inequalit y (26.2 ) the n implie s Theore m 26.9 , and therefor e Theore m
26.8.
26. C E N T R A L VALUE S 595
EXERCISE. Sho w b y simila r method s tha t
fes2(qy
L(/,|)/0
for prime s q —>• +00 .
26.5. P r o o f o f t h e o r e m 26.2 .
Passing fro m Theore m 26. 8 (non-vanishin g i n "harmonic " average ) t o Theore m
26.2 (non-vanishin g i n "natural " average ) ca n b e don e i n differen t ways . W e us e
Shimura's formul a
2
(26.47) 4^||/|| = ^ | p L ( S y m 2 / , 1 ) + 0((\ogqf) (fo r q prime )
relating the Petersson nor m t o the specia l value of the symmetric squar e L- function
of / a t s = 1 . Th e latte r i s given b y
L(Sym 2 /,*) = £ Pf(n)n-° = C q(2s) ]T X 2
f(n )n-'
for R e (s) > 1 ; se e Sectio n 5.1 2 . A t s = 1 th e serie s barel y fail s t o converg e
absolutely, bu t o n average i t ca n be represented b y a very short sum . Pro m [KM2] ,
we quote a general useful resul t base d o n the large sieve type estimate for symmetri c
square //-function s (se e Theore m 7.28) .
PROPOSITION 26.1 5 . Let aj, for f e 5 2 (g)* ; be complex numbers such that
(26.48) a f < q l~5 for some S > 0,
(26.49) Y^ \&\f <: (logq) A for some A >0,
f
Then we have
with x = q K, where K is any positive number and 7 is a positive number which
depends only on 5 and K. The implied constant depends only on K, 5 and A.
We us e a mollifie r M ( / ) a s i n (26.3 ) o f lengt h M = q A wit h A < | , definin g
N,= }2 M(/)L'(/,l/2) ,
2
N2= Yl \M(f)L\fA/2)\
e / =-i
and appl y Propositio n 26.1 5 wit h a f = M(f)U(f,\) an d a = | M ( / ) L ' ( / , \)\ 2
ly/4 2
respectively. Th e convexit y boun d ! / ( / , \) < C g (logg) (us e (26.9 ) t o prov e i t
again) yield s easil y (26.48 ) an d (26.49) .
59 6 26. C E N T R A L VALUE S O F L - F U N C T I O N S
It follow s tha t
* = ^ff1 E * E ^f(*mf)i>v, h)+ ofa1 -4),
1 £
' / = ~1 ^ 2<C*
(*,<?) = !
2
C(2) ^ ^ d*
(*,<?) = !
K
with x — q wher e K > 0 can b e chose n arbitrarily .
Inserting th e formula s (26.9 ) an d (26.1 3) , an d usin g result s o f Sectio n 26.3 ,
we ar e le d t o a linea r for m expressio n fo r TV i an d a quadrati c for m fo r N2 whic h
are ver y simila r t o thos e o f Theore m 26.1 2 . I t i s importan t t o b e abl e t o tak e K
arbitrarily smal l t o retai n a n asymptoti c formula . Th e sam e principl e a s in Sectio n
26.4 applie s t o fin d th e mollifie r an d evaluat e N\ an d A^ , bu t ther e ar e man y
technical complication s du e t o lac k o f complet e multiplicativit y o f th e coefficient s
involved.
REMARK. I f on e i s willing t o accep t a slightl y weake r resul t
(26.50) | { / e S 2(q)* \e f = -l an d L'(f, \) + 0} | » \S 2(q)*\
for q prime, with n o explicit constan t (th e analogu e o f Selberg's theore m o n critica l
zeros of C( 5))? some shortcuts ar e available. First , ther e i s no reason t o optimize th e
mollifier a s in Section 26.4, and the computations ther e can be simplified quit e a bit .
Then instea d o f appealing t o Propositio n 26.1 5 and performin g th e computatio n o f
Ni an d N2 on e ca n argu e b y mean s o f Cauchy' s inequalit y
1 = 2 l
E £ 4d|/|| ^ V2^i67r 2||/||4J V E )'
fes2(qy od d fes 2(qy od d/ fes 2(q)* od d
L'(/,l/2)^0 L'(/,l/2)^ 0 L'(/,l/2)# 0
Then th e asymptoti c formul a
<26 5i) c
- £»~ «2
as q — » -foo fo r som e constan t c > 0 togethe r wit h Theore m 26. 8 impl y (26.50) .
Formula (26.51 ) i s a specia l cas e o f a resul t o f E . Roye r [Roy ] abou t moment s o f
L(Sym 2 /, 1 ) ove r / G 52(g)*. Not e tha t b y (26.47) , thi s amount s t o th e negativ e
second momen t o f L(Sym 2 /, 1 ) . Th e proo f o f (26.51 ) use s a densit y theore m fo r
zeros o f familie s o f L-function s t o restric t th e summatio n t o / satisfyin g a "quasi -
Riemann Hypothesis" .
REMARK. Beside s bein g use d t o buil d mollifiers , a s w e hav e done , th e twiste d
mean-values o f typ e (26.5) , (26.6 ) fo r familie s o f L-function s hav e a powerfu l us e
for buildin g a n "amplifier" , i.e . a su m o f th e typ e
Ak = y ZA(f)\L(f,±)\k
where A(f) > 0 , sa y
A(f) = \Y,c e\f(e)
26. C E N T R A L VALUE S 597
but thi s tim e A(f) is meant t o b e larg e fo r f = fo e T, some fixed form, s o that b y
positivity on e deduce s fro m a n uppe r boun d fo r th e mean-valu e Ak a n individua l
upper boun d
/k
(26.52) \L(f 0,±)\^Al A(f0)-\
In th e bes t circumstances , th e boun d fo r Ak ca n b e achieve d i f L is a small powe r
of |.F| , and matche s tha t fro m Lindelo f Hypothesi s an d orthogonalit y
^«m 1 + £ (£n 2 ) 1 / 2
i
so that (26.52 ) is a gain if the mean-value of Q is comparable in size to the peak value
A(fo). Thin k o f the cas e of primitive Dirichle t character s o f conductor q where on e
can tak e Q = x(fy ( s e e Propositio n 1 4.2 2 for a suitable choic e fo r classica l modula r
forms).
This amplificatio n metho d ha s bee n th e mos t successfu l fo r th e proo f o f sub -
convexity bound s fo r L- functions o f degre e ^ 2 , particularl y i n conducto r aspec t
where othe r technique s (Wey l shifts , reducin g th e averagin g family ) ar e inoperant .
The implementatio n is , however , quit e delicat e i n al l case s (requirin g off-diagona l
analysis) s o we refer t o paper s lik e [DFI2] , [DFI3] , [KMV2 ] an d th e surve y [M2 ]
for mor e informatio n o n thi s importan t subject .
This page intentionally left blank
Bibliography
[ASl] A . Adolphso n an d S . Sperber , On twisted exponential sums, Math . Ann . 29 0 (1 991 ) ,
713-726.
[AS2] A . Adolphso n an d S . Sperber , Character sums in finite fields, Compositi o Math . 5 2
(1984), 325-354 .
[AS3] A . Adolphso n an d S . Sperber , Newton polyhedra and the total degree of the L-function
associated to an exponential sum, Invent , math . 8 8 (1 987) , 555-569 .
[Ahl] L . Ahlfors , Complex Analysis, McGra w Hill , 1 978 .
[AM] M . Atiya h an d I.G . MacDonald , Introduction to commutative algebra, Addison-Wesley ,
1969.
[AL] A.O. L Atki n an d J . Lehner , Heche operators on To(m), Math . Ann . 1 8 5 (1 970) , 1 34 -
160.
[At] F . V . Atkinson , The mean value of the zeta-function on the critical line, Proc . Londo n
Math. Soc . (2 ) 4 7 (1 941 ) , 1 74-200 .
[B] A . Baker , Transcendental Number Theory, Cambridg e Univ . Press , 1 990 .
[BH] R . Bake r an d G . Harman , The difference between consecutive primes, Proc . Londo n
Math. Soc . 7 2 (1 996) , 261 -280 .
[Ba] W . Banks , Twisted symmetric-square L-functions and the nonexistence of Siegel zeros
on GL(3) , Duk e Math . J . 8 7 (1 997) , 343-353 .
[Bar] M . B . Barban , The "large sieve" method and its application to number theory, Uspeh i
Mat. Nau k 2 1 (1 966) , 51 -1 02 ; Englis h transl . i n Russia n Math . Survey s 2 1 (1 966) ,
49-103.
[BG] J . Bernstei n an d S . Gelbart , An introduction to the Langlands program, Birkhauser ,
2003.
[BL] H . Boh r an d E . Landau , Sur les zeros de la fonction f(s ) de Riemann, Compt e Rendu s
de l'Acad . de s Science s (Paris ) 1 5 8 (1 91 4) , 1 06-1 1 0 .
[Bol] E . Bombieri , Maggiorazione del resto nel "Primzahlsatz" col metodo di Erdos-Selberg.,
1st. Lombard o Accad . Sci . Lett . Rend . A 9 6 (1 962) , 343-350 .
[Bo2] E . Bombieri , Le grand crible dans la theorie analytique des nombres, S.M.F , 1 974 .
[Bo3] E . Bombieri , Counting points on curves over finite fields (d'apres S. A. Stepanov),
Lecture Note s i n Math . 38 3 (1 974) , 234-241 .
[Bo4] E . Bombieri , On exponential sums in finite fields, II, Invent , math . 4 7 (1 978) , 29-39 .
[Bo5] E . Bombieri , On the large sieve, Mathematik a 1 2 (1 965) , 201 -225 .
[BD] E . Bombier i an d H . Davenport , Some inequalities involving trigonometrical polynomi-
als, Ann . Scuol a Norm . Sup . Pis a (3 ) 2 3 (1 969) , 223-241 .
[BFI] E . Bombieri , J . Priedlande r an d H . Iwaniec , Primes in arithmetic progressions to large
moduli, Act a Math . 1 5 6 (1 986) , 203-251 .
[BI] E . Bombier i an d H . Iwaniec , Some mean-value theorems for exponential sums, Ann .
Scuola Norm . Sup . Pis a CI . Sci . 1 3 (1 986) , 473-486 .
[Bou] J . Bourgain , Remarks on Montgomery's conjectures on Dirichlet sums, Lectur e Note s
in Math . 1 46 9 (1 991 ) , 1 53-1 65 .
[BDCT] C . Breuil , B . Conrad , F . Diamon d an d R . Taylor , On the modularity of elliptic curves
overQ: wild 3-adic exercises, J . Amer . Math . Soc . 1 4 (2001 ) , 843-939 .
[BC] W . E . Brigg s an d S . Chowla , On discriminants of binary quadratic forms with a single
class in each genus, Canad . J . Math . 6 (1 954) , 463-470 .
[Bru] R . W . Bruggeman , Fourier coefficients of cusp forms, Invent , math . 4 5 (1 978) , 1 -1 8 .
[Brl] V . Brun , Uber das Goldbachsche Gesetz und die Anzahl der Primzahlpaare, Archi v fo r
Math.og Naturvid . B 3 4 (1 91 5) , no . 8 .
599
600 B I B L I O G R A P H Y
[Br2] V . Brun , Le crible d'Eratosthene et le theoreme de Goldbach, C . R . Acad . Sci . Pari s
168 (1 91 9) , 544-546 .
[Bu] D . Bump , Automorphic forms and representations, Cambridg e Univ . Press , 1 996 .
[BDHI] D . Bump , W . Duke , J . Hoffstei n an d H . Iwaniec , An estimate for the Hecke eigenvalues
of Maass forms, Internat . Math . Res . Notice s 4 (1 992) , 75-81 .
[Burl] D . A . Burgess , On character sums and L-series, I, Proc . Londo n Math . Soc . (3 ) 1 2
(1962), 1 93-206 .
[Bur2] D . A . Burgess , On character sums and L-series, II, Proc . Londo n Math . Soc . (3 ) 1 3
(1963), 524-536 .
[BH] C . J . Bushnel l an d G . Henniart , An upper bound on conductors for pairs, J . Numbe r
Theory 6 5 (1 997) , 1 83-1 96 .
[Byk] V . A . Bykovsky , Spectral expansion of certain automorphic functions and its number-
theoretical applications, Proc . Steklo v Inst . (LOMI ) 1 3 4 (1 984) , 1 5-33 ; Englis h transl .
in J . Sovie t Math . 3 6 (1 987) , 8-21 .
[Car] F . Carlson , Uber die Nullstellen der Dirichletschen Reihen und der Riemannschen
(-Funktion, Arkiv . fu r Mat . Astr . oc h Fysi k 1 5 (1 920) .
[CF] J.W. S Cassel s an d A . Frolich , Algebraic Number Theory, Academi c Press , 1 990 .
[Chi] Chamiz o an d H . Iwaniec , On the sphere problem, Rev . Mat . Iberoamerican a 1 1 (1 995) ,
417-429.
[Ch] J . R . Chen , On the representation of a larger even integer as the sum of a prime and
the product of at most two primes, Sci . Sinic a 1 6 (1 973) , 1 57-1 76 .
[CS] S . Chowl a an d A . Selberg , On Epstein's zeta-function, J . Rein e Angew . Math . 22 7
(1967), 86-1 1 0 .
[Col] B . Conrey , Zeros of derivatives of Riemann's ^-function on the critical line, J . Numbe r
Theory 1 6 (1 983) , 49-74 .
[Co2] B . Conrey , More than two fifths of the zeros of the Riemann zeta function are on the
critical line, J . Rein e Angew . Math . 39 9 (1 989) , 1 -26 .
[CGG] B . Conrey , A . Ghos h an d S . M . Gonek , A note on gaps between zeros of the zeta
function, Bull . Londo n Math . Soc . 1 6 (1 984) , 421 -424 .
[CI1] B . Conre y an d H . Iwaniec , The cubic moment of central values of automorphic In-
junctions, Ann . o f Math . (2 ) 1 5 1 (2000) , 1 1 75-1 21 6 .
[CI2] B . Conre y an d H . Iwaniec , Spacing of zeros of Hecke L-functions and the class number
problem, Act a Arithmetic a 1 0 3 (2002) , 259-31 2 .
[CoSo] B . Conrey an d K . Soundararajan , Real zeros of quadratic Dirichlet L-functions, Invent .
math. 1 5 0 (2002) , 1 -44 .
[Corl] J.G . va n de r Corput , Zahlentheoretische Abschdtzungen, Math . Ann . 8 4 (1 921 ) , 53-79 .
[Cor2] J.G . va n de r Corput , Verscharfung der Abschdtzungen beim Teilerproblem, Math . Ann .
87 (1 922) , 39-65 .
[Cor3] J . G . va n de r Corput , Sur Vhypothese de Goldbach pour presque tous les nombres
premiers, Act a Arithmetic a 2 (1 937) , 266-290 .
[Cox] D . Cox , Primes of the form x 2 + ny 2, Wiley , 1 989 .
[Cra] H . Cramer , On the order of magnitude of the difference between consecutive prime
numbers, Prac e Mat.-Fiz . 4 5 (1 937) , 51 -74 .
[Dal] H . Davenport , On some infinite series involving arithmetical functions. II, Quart . J .
Math. Oxf . 8 (1 937) , 31 3-320 .
[Da2] H . Davenport , Analytic methods for Diophantine equations and Diophantine inequali-
ties, An n Arbo r Publishers , 1 963 .
[DH1] H . Davenpor t an d H . Halberstam , The values of a trigonometrical polynomial at well
spaced points, Mathematik a 1 3 (1 966) , 91 -96 .
[DH2] H . Davenpor t an d H . Halberstam , Primes in arithmetic progressions, Michiga n Math .
J. 1 3 (1 966) , 485-489 .
[DHe] H . Davenpor t an d Heilbronn , On the class-number of binary cubic forms, I, II, J .
London Math . Soc . 2 6 (1 951 ) , 1 83-1 92 , 1 92-1 98 .
[Del] P . Deligne , La conjecture de Weil, I, Inst . Haute s Etude s Sci . Publ . Math . 4 3 (1 972) ,
206-226.
[De2] P . Deligne , La conjecture de Weil, II, Inst . Haute s Etude s Sci . Publ . Math . 5 2 (1 980) ,
137-252.
BIBLIOGRAPHY 601
[De3] P . Deligne , Cohomologie etale, SGA 4-\, Lectur e Notes . Math . 569 , Springe r Verlag ,
1977.
[DeSe] P . Delign e an d J-P . Serre , Formes modulaires de poids 1 , Ann . Sci . Ecol e Norm . Sup .
(4) 7 (1 974) , 507-530 .
[DI] J.M . Deshouiller s an d H . Iwaniec , Kloosterman sums and Fourier coefficients of cusp
forms, Invent , math . 7 0 (1 982/83) , 21 9-288 .
[DS] H . Diamon d an d J . Steinig , An elementary proof of the prime number theorem with a
remainder term, Invent , math . 1 1 (1 970) , 1 99-258 .
[Dir] P.G . Dirichlet , Demonstration d'une propriete analogue a la loi de reciprocity qui existe
entre deux nombres premiers quelconques, J . Rein e Angew . Math . 9 (1 832) , 379—389 .
[Dul] W . Duke , The dimension of the space of cusp forms of weight one, Internat . Math .
Res. Notice s (1 995) , 99-1 09 .
[Du2] W . Duke , Hyperbolic distribution problems and half-integral weight Maass forms, In -
vent. math . 9 2 (1 988) , 73-90 .
[Du3] W . Duke , The critical order of vanishing of automorphic L-functions with large level,
Invent, math . 1 1 9 (1 995) , 1 65-1 74 .
[Du4] W . Duke , Some problems in multidimensional analytic number theory, Act a Arith -
metica 5 2 (1 989) , 203-228 .
[Du5] W . Duke , Elliptic curves with no exceptional primes, C . R . Acad . Sci . Pari s Ser . I
Math. 32 5 (1 997) , 81 3-81 8 .
[DFIl] W . Duke , J . Friedlande r an d H . Iwaniec , A quadratic divisor problem, Invent , math .
115 (1 994) , 209-21 7 .
[DFI2] W . Duke , J . Friedlande r an d H . Iwaniec , Bounds for automorphic L-functions, II,
Invent, math . 1 1 5 (1 994) , 21 9-239 .
[DFI3] W . Duke , J . Friedlande r an d H . Iwaniec , The subconvexity problem for Artin In-
junctions, Invent , math . 1 4 9 (2002) , 489-577 .
[DFI4] W . Duke , J . Friedlande r an d H . Iwaniec , Bilinear forms with Kloosterman fractions,
Invent, math . 1 2 8 (1 997) , 23-43 .
[DFI5] W . Duke , J . Friedlander , H . Iwaniec, Equidistribution of roots of a quadratic congruence
to prime moduli, Ann . o f Math . (2 ) 1 4 1 (1 995) , 423-441 .
[Dull] W . Duk e an d H . Iwaniec , Bilinear forms in the Fourier coefficients of half-integral
weight cusp forms and sums over primes, Math . Ann . 28 6 (1 990) , 783-802 .
[DuI2] W . Duk e an d H . Iwaniec , Estimates for coefficients of L-functions, I, Automorphi c
forms an d analyti c numbe r theor y (Montreal , P Q , 1 989) , CRM , 1 989 , pp . 43-47 .
[DuI3] W . Duk e an d H . Iwaniec , Estimates for coefficients of L-functions, II, Proceeding s
of th e Amai n Conferenc e o n Analyti c Numbe r Theory , Univ . Salerno , Salerno , 1 992 ,
pp. 71 -82 .
[DuI4] W . Duk e an d H . Iwaniec , Estimates for coefficients of L-functions, IV, Amer . J . Math .
116 (1 994) , 207-21 7 .
[DK] W . Duk e an d E . Kowalski , A problem of Linnik for elliptic curves and mean-value
estimates for automorphic representations, Invent , math . 1 3 9 (2000) , 1 -39 .
[D] F . J . Dyson , Statistical theory of the energy levels of complex systems, I, J . Mathemat -
ical Phys . 3 (1 962) , 1 40-1 56 .
[Ell] J . Ellenberg , Galois representations attached to Q- curves and the generalized Fermat
equation A A + B 2 = C p, Amer . J . Math , (t o appear) .
[El] P.D.T.A . Elliott , On inequalities of large sieve type, Act a Arithmetic a 1 8 (1 971 ) , 4 0 5 -
422.
[EK] P . Erdo s an d M . Kac , The Gaussian law of errors in the theory of additive number
theoretic functions, Amer . J . Math . 6 2 (1 940) , 738-742 .
[Est] T . Estermann , On Goldbach's Problem: Proof that Almost All Even Positive Integers
are Sums of Two Primes, Proc . Londo n Math . Soc . (2 ) 4 4 (1 938) , 307-31 4 .
[Fon] J.M . Fontaine , II n'y a pas de varietes abeliennes sur Z , Invent , math . 8 1 (1 985) ,
515-538.
[FK] E . Fouvr y an d N . Katz , A general stratification theorem for exponential sums, and
applications, J . Rein e Angew . Math . 54 0 (2001 ) , 1 1 5-1 66 .
[FI] E . Fouvr y an d H . Iwaniec , Gaussian Primes, Acta . Arithmetic a 7 9 (1 997) , 249-287 .
[FoM] E . Fouvr y an d P . Michel , Sur le changement de signe des sommes de Kloosterman
(preprint).
602 BIBLIOGRAPH Y
[Fri] J . Priedlander , Primes in arithmetic progressions and related topics, Analyti c Numbe r
Theory an d Diophantin e problems , Birkhaiiser , 1 987 , pp . 1 25-1 34 .
[FG] J . Friedlande r an d A . Granville , Limitations to the equi-distribution of primes, III,
Compositio Math . 8 1 (1 992) , 1 9-32 .
[FI1] J . Friedlande r an d H . Iwaniec , The polynomial X 2 + Y 4 captures its primes, Ann . o f
Math. (2) 1 4 8 (1 998) , 945-1 040 .
[FI2] J . Friedlande r an d H . Iwaniec , Summation formulae for coefficients of L-functions (t o
appear).
[FI3] J . Friedlande r an d H . Iwaniec , A Note on Character Sums, Contemporar y Math . 1 6 6
(1994), 295-299 .
[FI4] J . Friedlande r an d H . Iwaniec , Incomplete Kloosterman sums and a divisor problem,
Ann. o f Math . 1 2 1 (1 985) , 31 9-350 .
[Gal] P.X . Gallagher , A large sieve density estimate near a — 1 , Invent , math . 1 1 (1 970) ,
329-339.
[Ga2] P.X . Gallagher , Primes in progressions to prime-power modulus, Invent , math . 1 6
(1972), 1 91 -201 .
[Ga3] P . X . Gallagher , BombierVs mean value theorem, Mathematik a 1 5 (1 968) , 1 -6 .
[Ga4] P . X . Gallagher , The large sieve and probabilistic Galois theory, Proc . Sympos . Pur e
Math., Vol . XXIV , Amer . Math . S o c , 1 973 , pp . 91 -1 01 .
[GM] M . L. Gaudin an d M . Mehta , On the density of eigenvalues of a random matrix, Nuclea r
Phys. 1 8 (1 960) , 420-427 .
[Ge] F . Gert h III , Extension of conjectures of Cohen and Lenstra, Expositione s Math . 5
(1987), 1 81 -1 84 .
[GeJ] S . Gelbar t an d H . Jacquet , A relation between automorphic representations of GL(2)
and GL(3), Ann . Sci . Ecol e Norm . Sup . (4 ) 1 1 (1 978) , 471 -542 .
[GJ] R . Godemen t an d H . Jacquet , Zeta functions of simple algebras, Lectur e Note s Math .
260, Springe r Verlag , 1 972 .
[Gol] D . Goldfeld , A simple proof of SiegeVs theorem, Proc . Nat . Acad . Sci . U.S.A. 7 1 (1 974) ,
1055.
[Go2] D . Goldfeld , The class number of quadratic fields and the conjectures of Birch and
Swinnerton-Dyer, Ann . Scuol a Norm . Sup . Pis a CI . Sci . (4 ) 3 (1 976) , 624-663 .
[GS] D . Goldfel d an d P . Sarnak , Sums of Kloosterman sums, Invent , math . 7 1 (1 983) , 243 -
250.
[GHL] D . Goldfeld , J . Hoffstei n an d D . Lieman , Annal s o f Math . (2 ) 1 4 0 (1 994) , 1 61 -1 81 .
[GHB] D . Goldsto n an d D . R . Heath-Brown , A note on the differences between consecutive
primes, Math . Ann . 26 6 (1 984) , 31 7-320 .
[GR] I.S . Gradhstey n an d I.M . Rizhik , Table of integrals, series and products, 6t h Edition ,
Academic Press , 2000 .
[Gl] S . Graham , An asymptotic estimate related to Selberg's sieve, J . Numbe r Theor y 1 0
(1978), 83-94 .
[G2] S . Graham , On Linnik's constant, Act a Arithmetic a 3 9 (1 981 ) , 1 63-1 79 .
[GK] S. W Graha m an d G . Kolesnik , van der Corput's method of exponential sums, Cam -
bridge Univ . Press , 1 991 .
[GRi] S . W . Graha m an d C . J . Ringrose , Lower bounds for least quadratic nonresidues, An -
alytic numbe r theor y (Allerto n Park , IL , 1 989) , Birkhauser , 1 990 , pp . 269-309 .
[Gra] A . Granville , Unexpected irregularities in the distribution of prime numbers, Pro -
ceedings o f th e Internationa l Congres s o f Mathematicians , Vol . 1 , 2 (Zurich , 1 994) ,
Birkhauser, Basel , 1 995 , pp . 388-399 .
[Gr] G . Greaves , Sieves in number theory, Ergebniss e de r Mathemati k un d ihre r Grenzge -
biete (3) , vol . 43 , Springe r Verlag , 2001 .
[GZ1] B . Gros s an d D . Zagier , Heegner points and derivatives of L-series, Invent , math . 8 4
(1986), 225-320 .
[GZ2] B . Gros s an d D . Zagier , Points de Heegner et derivees de fonctions L, C . R . Acad . Sci .
Paris Ser . I Math . 29 7 (1 983) , 85-87 .
[HaTu] G . Halas z an d P.Turan , On the distribution of roots of Riemann zeta and allied func-
tions, I, J . Numbe r Theor y 1 (1 969) , 1 21 -1 37 .
[Hal] H . Halberstam , On the distribution of additive number-theoretic functions, II, III, J .
London Math . Soc . 3 1 (1 956) , 1 -1 4 , 1 4-27 .
BIBLIOGRAPHY 60 3
[HaRi] H . Halbersta m an d H . E . Richert , Sieve methods, Academi c Press , 1 974 .
[HaLa] G . H . Hard y an d E . Landau , The lattice points of a circle, Proc . Roya l Soc . A 1 0 5
(1924), 244-258 .
[HL1] G . H . Hard y an d J . E . Littlewood , Some Problems of 'Partitio Numerorum. 7 III. On
the Expression of a Number as a Sum of Primes., Act a Math . 4 4 (1 922) , 1 -70 .
[HL2] G . H . Hard y an d J . E . Littlewood , The zeros of Riemann 's zeta function on the critical
line, Math . Z . 1 0 (1 921 ) , 283-31 7 .
[HR] G.H . Hard y an d S . Ramanujan , Asymptotic Formulae in Combinatory Analysis, Proc .
London Math . Soc . 1 7 (1 91 8) , 75-1 1 5 .
[HW] G.H . Hard y an d E.M . Wright , An introduction to the theory of numbers, 5t h edition ,
Oxford Universit y Press , 1 979 .
[HT] M . Harri s an d R . Taylor , The geometry and cohomology of some simple Shimura vari-
eties, Princeto n Univ . Press , 2002 .
[Ha] R . Hartshorne , Algebraic Geometry, Grad . Text s i n Math . 52 , Springe r Verlag , 1 977 .
[HB1] D.R . Heath-Brown , A mean value estimate for real character sums, Act a Arithmetic a
72 (1 995) , 235-275 .
[HB2] D.R . Heath-Brown , An estimate for Heilbronn's exponential sum, Analyti c numbe r
theory, Vol . 2 (Allerto n Park , IL , 1 995) , Birkhauser , 1 995 , pp . 451 -463 .
[HB3] D . R . Heath-Brown , Prime numbers in short intervals and a generalized Vaughan
identity, Canad . J . Math . 3 4 (1 982) , 1 365-1 377 .
[HB4] D . R . Heath-Brown , Zero-free regions for Dirichlet L-functions, and the least prime in
an arithmetic progression, Proc . Londo n Math . Soc . (3 ) 6 4 (1 992) , 265-338 .
[HB5] D . R . Heath-Brown , Lattice points in the sphere, Numbe r theor y i n progress , Vol . 2 ,
de Gruyter , Berlin , 1 999 , pp . 883-892 .
[HB6] D . R . Heath-Brown , Cubic forms in ten variables, Proc . Londo n Math . Soc . (3 ) 4 7
(1983), 225-257 .
[HBP] D.R . Heath-Brow n an d S.J . Patterson , The distribution of Kummer sums at prime
arguments, J . Rein e Angew . Math . 31 0 (1 979) , 1 1 1 -1 30 .
[Heel] E . Hecke , Uber eine neue Art von Zetafunktionen, Math . Zeit . 6 (1 920) , 1 1 -51 .
[Hee] K . Heegner , Diophantische Analysis und Modulfunktionen, Math . Z . 5 6 (1 952) , 227 -
253.
[H] D . Hilbert , Beweis fur die Darstellbarkeit der ganzen Zahlen durch eine feste Anzahl
n-ter Potenzen (Waringsches Problem), Math . Annale n 6 7 (1 909) , 281 -305 .
[Hil] A . Hildebrand , An asymptotic formula for the variance of an additive function, Math .
Z. 1 8 3 (1 983) , 1 45-1 70 .
[Hi2] A . Hildebrand , On the constant in the Poly a-Vinogradov inequality, Canad . Math .
Bull. 3 1 (1 988) , 347-352 .
[Hof] J . Hoffstein , On the Siegel-Tatuzawa theorem, Act a Arithmetic a 3 8 (1 980/81 ) , 1 67 -
174.
[HL] J . Hoffstei n an d P . Lockhart , Coefficients of Maass forms and the Siegel zero, Annal s
of Math . 1 4 0 (1 994) , 1 61 -1 81 .
[Ho] G . Hoheisel , Primzahlprobleme in der Analysis, S.-B . Preuss . Akad . Wiss . Phys.-Math .
Kl. (1 930) , 580-588 .
[HI] Loo-Ken g Hua , Introduction to number theory, Springer , 1 982 .
[H2] Lo o Ken g Hua , On Waring's problem, Quart . J . Math . Oxfor d 9 (1 938) , 1 99-202 .
[Hub] H . Huber , Zur analytischen Theorie hyperbolischen Raumformen und Bewegungsgrup-
pen, Math . Ann . 1 3 8 (1 959) , 1 -26 .
[Hul] M . N . Huxley , The large sieve inequality for algebraic number fields, Mathematik a 1 5
(1968), 1 78-1 87 .
[Hu2j M . N . Huxley , Large values of Dirichlet polynomials, A.cta Arithmetic a 2 4 (1 973) ,
329-346.
[Hu3] M . N . Huxley , On the differences between consecutive primes, Invent , math . 1 5 (1 972) ,
164-170.
[Hu4] M . N . Huxley , Area, lattice points, and exponential sums, Th e Clarendo n Press , 1 996 .
[HuW] M . N . Huxle y an d N . Watt , Exponential sums and the Riemann zeta function, Proc .
London Math . Soc . 5 7 (1 988) , 1 -24 .
[Ing] A . E . Ingham , On the estimation of N(a,T), Quart . J . Math . 1 1 (1 940) , 291 -292 .
604 BIBLIOGRAPH Y
[IR] K . Irelan d an d M . Rosen , A Classical Introduction to Modern Number Theory, 2n d
Edition, Grad . Text s i n Math . 84 , Springer-Verlag , 1 990 .
[Iv] A . Ivic , The Riemann zeta-function, Theory and applications, Dove r Publications ,
2003.
[II] Iwaniec , Character sums and small eigenvalues for TQ (p), Glasgo w Math . J . 2 7 (1 985) ,
99-116.
[12] H . Iwaniec , On zeros of DirichleVs L series, Invent , math . 2 3 (1 974) , 97-1 04 .
[13] H . Iwaniec , Spectral theory of automorphic functions and recent developments in ana-
lytic number theory, Proceeding s o f th e IC M Berkele y 1 986 , Amer . Math . S o c , 1 987 ,
pp. 444-456 .
[14] H . Iwaniec , Topics in Classical Automorphic Forms, A.M.S , 1 997 .
[15] H . Iwaniec , Introduction to the spectral theory of automorphic forms, 2n d edition ,
A.M.S an d R.M.I , 2002 .
[16] H . Iwaniec , Fourier coefficients of modular forms of half-integral weight, Invent , math .
8 7 (1 987) , 385-401 .
[17] H . Iwaniec , The spectral growth of automorphic L-functions, J . Rein e Angew . Math .
4 2 8 (1 992) , 1 39-1 59 .
[18] H . Iwaniec , The half-dimensional sieve, Act a Arithmetic a 2 9 (1 976) , 69-95 .
[19] H . Iwaniec , A new form of the error term in the linear sieve, Act a Arithmetic a 3 7
(1980), 307-320 .
[110] H . Iwaniec , Almost primes represented by quadratic polynomials, Invent , math . 4 7
(1978), 1 71 -1 88 .
[III] H . Iwaniec , Small eigenvalues of Laplacian for T o (AT), Act a Arithmetic a 5 6 (1 990) ,
65-82.
[112] H . Iwaniec , Prime geodesic theorem, Journa l Rein e Angew . Math . 3 4 9 (1 984) , 1 36-1 59 .
[113] H . Iwaniec , Nonholomorphic modular forms and their applications, Modula r form s
(Durham, 1 983) , Horwood , 1 984 , pp . 1 57-1 96 .
[114] H . Iwaniec , The lowest eigenvalue for congruence groups, Topic s i n geometry , Birk -
hauser, 1 996 , pp . 203-21 2 .
[ILS] H . Iwaniec , W . Lu o an d P . Sarnak , Low-lying zeros of families of L-functions, Inst .
Hautes Etude s Sci . Publ . Math . 9 1 (2001 ) , 55-1 31 .
[IM] H . Iwanie c an d P . Michel , The second moment of the symmetric square L-functions,
Ann. Acad . Sci . Fenn . Math . 2 6 (2001 ) , 465-482 .
[ISl] H . Iwanie c an d P . Sarnak , Perspectives on the analytic theory of L-functions, GAF A
Special V o l u m e G A F A 2 0 0 0 (2000) , 705-741 .
[IS2] H . Iwanie c an d P . Sarnak , The non-vanishing of central values of automorphic L-
functions and Landau-Siegel zeros, Israe l J . Math . 1 2 0 (2000) , 1 55-1 77 .
[JS] H . Jacque t an d J . Shalika , On Euler products and the classification of automorphic
representations, I and II, Amer . J . Math . 1 0 3 (1 981 ) , 499-588 , 777-81 5 .
[JPS] H . Jacquet , I . Piatetskii-Shapir o an d J . Shalika , Rankin-Selberg convolutions, Amer . J .
Math. 1 0 5 (1 983) , 367-464 .
[Jul] M . Jutila , Lectures on a method in the theory of exponential sums, Springe r Verlag ,
1987.
[Ju2] M . Jutila , On character sums and class numbers, J . Numbe r Theor y 5 (1 973) , 203-21 4 .
[Ju3] M . Jutila , On large values of Dirichlet polynomials, Topic s i n numbe r theor y (Proc .
Colloq., Debrecen , 1 974) , North-Holland , 1 976 , pp . 1 29-1 40 .
[Ju4] M . Jutila , Zero-density estimates for L-functions, Act a Arithmetic a 3 2 (1 977) , 55-62 .
[Ju5] M . Jutila , Statistical Deuring-Heilbronn phenomenon, Act a Arithmetic a 3 7 (1 980) ,
221-231.
[Kl] N . Katz , Twisted L-functions and m,onodromy, Princeto n Univ . Press , 2002 .
[K2] N . Katz , Exponential sums over finite fields and differential equations over the complex
numbers: some interactions, Bull . Amer . Math . Soc . (N.S. ) 2 3 (1 990) , 269-309 .
[K3] N . Katz , Gauss sums, Kloosterman sums and monodromy groups, Princeto n Univ .
Press, 1 988 .
[K4] N . Katz , Sommes exponentielles, S.M.F , 1 980 .
[K5] N . Katz , Sums of Betti numbers in arbitrary characteristic, Finit e Field s Appl . 7
(2001), 29-44 .
BIBLIOGRAPHY 605
[KS1] N . Kat z an d P . Sarnak , Random matrices, Frobenius eigenvalues, and monodromy,
A.M.S, 1 999 .
[KS2] N . Kat z an d P . Sarnak , Zeroes of zeta functions and symmetry, Bull . Amer . Math .
Soc. 3 6 (1 999) , 1 -26 .
[Kh] A . Y . Khinchine , Three pearls of number theory, Dove r Publications , 1 998 .
[KSa] H . Ki m an d P . Sarnak , Refined estimates towards the Ramanujan and Selberg conjec-
tures, J . Amer . Math . Soc . 1 6 (2003) , 1 75-1 81 .
[KSh] H . Ki m an d F . Shahidi , Cuspidality of symmetric powers with applications, Duk e Math .
J. 1 1 2 (2002) , 1 77-1 97 .
[Klo] H.D . Kloosterman , On the representation of numbers in the form ax 2 + by 2 + cz 3 -\-dt 2,
Acta Math . 4 9 (1 926) , 407-464 .
[Kol] E . Kowalski , Analytic problems for elliptic curves (2001 ) (preprint) .
[Ko2] E . Kowalski , On the "reducibility" of arctangents of integers, Amer . Math . Monthl y
111 (2004) , 351 -354 .
[KM1] E . Kowalsk i an d P . Michel , A lower bound for the rank of Jo(q)j Act a Arithmetic a 9 4
(2000), 303-343 .
[KM2] E . Kowalsk i an d P . Michel , The analytic rank of Jo(q) and zeros of automorphic L-
function, Duk e Math . J . 1 0 0 (1 999) , 503-542 .
[KMVl] E . Kowalski , P . Miche l an d J . VanderKam , Mollification of the fourth moment of
automorphic L-functions and arithmetic applications, Invent , math . 1 4 2 (2000) , 9 5 -
151.
[KMV2] E . Kowalski , P . Miche l an d J . VanderKam , Rankin-Selberg L-functions in the level
aspect, Duk e Math . J 1 1 4 (2002) , 1 23-1 91 .
[KMV3] E . Kowalski , P . Miche l an d J . VanderKam , Non-vanishing of high derivatives of au-
tomorphic L-functions at the center of the critical strip, J . fu r di e Rein e un d Angew .
Math. 52 6 (2000) , 1 -34 .
[Kor] N . M . Korobov , Estimates of trigonometric sums and their applications, Uspeh i Mat .
Nauk. 1 3 (1 958) , 1 85-1 92 .
[Kubl] J . Kubilius , Probability methods in number theory, Usp . Mat . Nauk . 6 8 (1 956) , 31 -66 .
[Kub2] J . Kubilius , Sharpening of the estimate of the second central moment for additive
arithmetical functions, Litovsk . Mat . Sb . 2 5 (1 985) , 1 04-1 1 0 .
[Kuz] N . V . Kuznetsov , The Petersson conjecture for cusp forms of weight zero and the
Lmnik conjecture, Mat . Sb . (N.S. ) I l l (1 980) , 334-383 ; Math . USSR-S b 3 9 (1 981 ) ,
299-342.
[LO] J . Lagaria s an d A . Odlyzko , Effective versions of the Chebotarev Density Theorem,
Algebraic numbe r fields : L-function s an d Galoi s propertie s (Proc . Sympos. , Univ .
Durham, Durham , 1 975) , Academi c Press , 1 977 , pp . 409-464 .
[Lai] E . Landau , Uber die Einteilung der positiven ganzen Zahlen in vier Klassen nach der
Mindestzahl der zu ihrer additiven Zusammensetzung erforderlichen Quadrate, Arch .
der Math . u . Phys . (3 ) 1 3 (1 908) , 305-31 2 .
[La2] E . Landau , Bemerkungen zum Heilbronnschen Satz, Act a Arithmetic a 1 (1 936) , 1 —1 8 .
[La3] E . Landau , Uber die Nullstellen der Dirichletschen Reihen und der Riemannschen
C-Funktion, Arkiv . fu r Mat . Astr . oc h Fysi k 1 6 (1 921 ) .
[La] S . Lang , Algebraic Number Theory, 2n d edition , Grad . Text s i n Math . 1 1 0 , Springer -
Verlag, 1 994 .
[LT] S . Lan g an d H . Trotter , Frobenius distribution in GLi extensions, Lectur e Note s i n
Math. 504 , Springe r Verlag , 1 976 .
[Lau] G . Laumon , Exponential sums and l-adic cohomology: a survey, Israe l J . Math . 1 2 0
(2000), 225-257 .
[Lav] A . F. Lavrik , Approximate functional equations of Dirichlet functions, Izv . Akad . Nau k
SSSR Ser . Mat . 3 2 (1 968) , 1 34-1 85 .
[Leb] N . N . Lebedev , Special functions and their applications, Dove r Publications , 1 972 .
[Lev] N . Levinson , More than one-third of the zeros of the Riemann zetafunction are on
a = 1 /2 , Adv . Math . 1 3 (1 974) , 383-436 .
[L] W . Li , L-series of Rankin type and their functional equations, Math . Ann . 24 4 (1 979) ,
135-166.
[Lil] Yu . V . Linnik , The large sieve, Dokl . Akad . Nau k SSS R 3 0 (1 941 ) , 292-294 . (i n
Russian)
606 BIBLIOGRAPH Y
[Li2] Yu . V . Linnik , The dispersion method in binary additive problems, AMS , 1 963 .
[Li3] Y u V . Linnik , Additive problems and eigenvalues of the modular operators, Proc . In -
t e r n a l Congr . Mathematician s (Stockholm , 1 962) , pp . 270-284 .
[Li4] Yu . V . Linnik , On the least prime in an arithmetic progression, I. The basic theorem,
Rec. Math . [Mat . Sbornik ] N.S . 1 5(57 ) (1 944) , 1 39-1 78 .
[Li5] Yu . V . Linnik , On the least prime in an arithmetic progression. II. The Deuring-
Heilbronn phenomenon, Rec . Math . [Mat . Sbornik ] N.S . 1 5(57 ) (1 944) , 347-368 .
[Li6] Yu . V . Linnik , On Dirichlet's L-series and prime-number sums, Rec . Math . [Mat .
Sbornik] N.S . 1 5(57 ) (1 944) , 3-1 2 .
[Li7] Yu . V . Linnik , New versions and new uses of the dispersion methods in binary additive
problems, Dokl . Akad . Nau k SSS R 1 3 7 (1 961 ) , 1 299-1 302 .
[LR] J . H . va n Lin t an d H . -E . Richert , On primes in arithmetic progressions. Act a Arith -
metica 1 1 (1 965) , 209-21 6 .
[Lit] J . E . Littlewood , On the zeros of the Riemann Zeta-function, Cambridg e Phil . Soc .
Proc. 2 2 (1 924) , 295-31 8 .
[LRW] J . va n d e Lune , H.J. J t e Riele , D.T . Winter , On the zeros of the Riemann zeta function
in the critical strip, IV, Math . Comp . 1 7 4 (1 986) , 667-681 .
[Luo] W . Luo , Nonvanishing of L-values and the Weyl law, Ann . o f Math . (2 ) 1 5 4 (2001 ) ,
477-502.
[LRS] W . Luo , Z . Rudnic k an d P . Sarnak , On Selberg's eigenvalue conjecture, Geom . Funct .
Anal. 5 (1 995) , 387-401 .
[Ma] H . Maass , Uber eine neue Art von nichtanalytischen automorphen Funktionen und
die Bestimmung Dirichletscher Reihen durch Funktionalgleichungen. Math . Ann . 1 2 1
(1949), 1 41 -1 83 .
[Mai] H . Maier , Primes in short intervals, Michiga n Math . J . 3 2 (1 985) , 221 -225 .
[Mar] G . Margulis , Discrete Subgroups of Semisimple Lie Groups, Ergebniss e de r Math , un d
ihrer Grenzgebiet e 68 , Springe r Verlag , 1 991 .
[MSD] B . Mazu r an d P . Swinnerton-Dyer , Arithmetic of Weil curves, Invent , math . 2 5 (1 974) ,
1-61.
[Mes] J-F . Mestre , Formules explicites et minorations de conducteurs de varietes algebriques,
Compositio Math . 5 8 (1 986) , 209-232 .
[Ml] P . Michel , The subconvexity problem for Rankin-Selberg L-functions and equidistribu-
tion of Heegner points, Annals , o f Math , (t o appear) .
[M2] P . Michel , Analytic number theory and families of automorphic L-functions (t o ap -
pear).
[M3] P . Michel , Autour de la conjecture de Sato-Tate pour les sommes de Kloosterman, I,
Invent. Math . 1 2 1 (1 995) , 61 -78 .
[MvdK] P . Miche l an d J . VanderKam , Non-vanishing of high derivatives of Dirichlet L-func-
tions at the central point, J . Numbe r Theor y 8 1 (2000) , 1 30-1 48 .
[Mi] T . Miyake , Modular forms, Springe r Verlag , 1 989 .
[MW] C . Moegli n an d J-L . Waldspurger , Poles des fonctions L de paires pour GL(N), app.
to Le spectre residuel de GL(N), Ann . Sci . EN S (4em e serie ) 2 2 (1 989) , 605-674 .
[Mol] G . Molteni , Upper and lower bounds at s = 1 for certain Dirichlet series with Euler
product, Duk e Math . J . I l l (2002) , 1 33-1 58 .
[Mol] H.L . Montgomery , The analytic principle of the large sieve, Bull . Amer . Math . Soc .
84 (1 978) , 547-567 .
[Mo2] H.L . Montgomery , Topics in multiplicative number theory, Lectur e Note s i n Math . 227 ,
Springer Verlag , 1 971 .
[Mo3] H . Montgomery , Zeros of L-functions, Invent , math . 8 (1 969) , 346-354 .
[Mo4] H . L . Montgomery , The pair correlation of zeros of the zeta function, Analyti c numbe r
theory (Proc . Sympos . Pur e Math. , Vol . XXIV) , Amer . Math . S o c , 1 972 , pp . 1 81 -1 93 .
[MV1] H . L . Montgomer y an d R . C. , Vaughan , The large sieve, Mathematiki a 2 0 (1 973) ,
119-134.
[MV2] H.L . Montgomer y an d R.C . Vaughan , Hilbert's inequality, J . Londo n Math.Soc . (2 ) 8
(1974), 73-82 .
[MV3] H.L . Montgomer y an d R.C . Vaughan , The exceptional set in Goldbach's problem, Act a
Arithmetica 2 7 (1 975) , 353-370 .
BIBLIOGRAPHY 60 7
[Mori] C . Moreno . Prime number theorems for the coefficients of modular forms, Bull . Amer .
Math. Soc . 7 8 (1 972) , 796-798 .
[Mor2] C . Moreno , Algebraic curves over finite fields, Cambridg e Univ . Press , 1 991 .
[Mor3] C . Moreno , Analytic proof of the strong multiplicity one theorem, Amer . J . Math . 1 0 7
(1985), 1 63-206 .
[Motl] Y . Motohashi , Spectral theory of the Riemann zeta-function, Camdridg e Univ . Press ,
1997.
[Mot2] Y . Motohashi , An induction principle for the generalization of Bombieri's prime num-
ber theorem, Proc . Japa n Acad . 5 2 (1 976) , 273-275 .
[Nak] H . Nakazato , Heegner points on modular elliptic curves, Proc . Japa n Acad . Ser . A
Math. Sci . 7 2 (1 996) , 223-225 .
[Ne] J . Nekovar , On the parity of ranks of Selmer groups, II, C . R . Acad . Sci . Pari s Ser . I
Math. 33 2 (2001 ) , 99-1 04 .
[Od] A . M . Odlyzko , Some analytic estimates of class numbers and discriminants, Invent .
math. 2 9 (1 975) , 275-286 .
[Oej J . Oesterle , Nombres de classes des corps quadratiques imaginaires, Asterisqu e 1 21 —
122 (1 985) , 309-323 .
[Ono] K . Ono , Nonvanishing of quadratic twists of modular L-functions and applications to
elliptic curves, J . Rein e Angew . Math . 53 3 (2001 ) , 81 -97 .
[PP] A . Perell i an d J . Pomykala , Averages over twisted elliptic L-functions, Act a Arith -
metica 8 0 (1 997) , 1 49-1 63 .
[PSa] Y . Petridi s an d P . Sarnak , Quantum unique ergodicity for SL2(0)\M.2> and estimates
for L-functions. Journa l o f Evolutio n Equation s 1 (2001 ) , 277-290 .
[Ph] E . Phillips , The zeta-function of Riemann; further developments of van der CorpuVs
method, Quart . J . Math . 4 (1 933) , 209-225 .
[PS] R . Phillip s an d P . Sarnak , On cusp forms for co-finite subgroups of PSL(2,R), Invent .
math. 8 0 (1 985) , 339-364 .
[Pi] N . Pitt , On shifted convolution of C s(s) with automorphic L-functions, Duk e Math . J .
77 (1 995) , 383-406 .
[Poi] G . Poitou , Sur les petits discriminants, Seminair e Delange-Pisot-Poitou , 1 8 e anne e
(1976-77).
[Pol] G . Polya , Uber die Verteilung der quadratischen Reste und Nichtreste, Nachr . Konigl .
Gesell. Wissensch . Gottingen , Math.-phys . Klass e (1 91 8) , 21 -29 .
[Pos] A . G . Postnikov , On Dirichlet L-series with the character modulus equal to the power
of a prime number, J . India n Math . Soc . (N.S. ) 2 0 (1 956) , 21 7-226 .
[PR] A . G . Postniko v an d N . P . Romanov , A simplification of A. Selberg's elementary proof
of the asymptotic law of distribution of prime numbers, Uspeh i Mat . Nau k (N.S. ) 1 0
(1955), 75-87 .
[R] H . Rademacher , On the Partition Function p(n), Proc . Londo n Math . Soc . 4 3 (1 937) ,
241-254.
[Ra] D . Ramakrishnan , Modularity of the Rankin-Selberg L-series, and multiplicity one for
SL(2), Annal s o f Math . (2) 1 5 2 (2000) , 45-1 1 1 .
[Ral] R . A . Rankin , Van der CorpuVs method and the theory of exponent pairs, Quart . J .
Math. (2 ) 6 (1 955) , 1 47-1 53 .
[Ra2] R . A . Rankin , The difference between consecutive prime numbers, V, Proc . Edinburg h
Math. Soc . (2 ) 1 3 (1 962/1 963) , 331 -332 .
[Ra3] R . A . Rankin , Contributions to the theory of Ramanujan's r function and similar
arithmetical functions, II, Proc . Camb . Phil . Soc . 3 5 (1 939) , 351 -372 .
[Re] A . Renyi , On the representation of an even number as the sum of a single prime and
single almost-prime number, Izvestiy a Akad . Nau k SSSR . Ser . Mat . 1 2 (1 948) , 57-78 .
[Rey] E . Reyssat , Quelques aspects des surfaces de Riemann, Progres s i n Math . 77 , Birk -
haiiser, 1 989 .
[Rie] B . Riemann , Uber die Anzahl der Primzahlen unter einer gegebenen Grosse, Monats -
ber. Berlin . Akad . (1 859) , 671 -680 .
[RV] F . Rodriguez-Villegas , Square root formulas for central values of Hecke L-series, II,
Duke Math . J . 7 2 (1 993) , 431 -440 .
[Rot] K.F . Roth , On the large sieves of Linnik and Renyi, Mathematik a 1 2 (1 965) , 1 -9 .
608 BIBLIOGRAPH Y
[Roy] E . Royer , Statistique de la variable aleatoire L(Sym 2 / , 1 ) , Math . Ann . 32 1 (2001 ) ,
667-687.
[Ru] W . Rudin , Real and complex analysis, McGraw-Hill , 1 987 .
[RS] Z . Rudnic k an d P . Sarnak , Zeros of principal L-functions and random matrix theory,
Duke Math . J . 8 1 (1 996) , 269-322 .
[Sal] H . Salie , Uber die Kloostermanschen Summen S(u,v,q), Math . Z . 3 4 (1 931 ) , 91 -1 09 .
[Sal] P . Sarnak , Estimates for Rankin-Selberg L-Functions and Quantum Unique Ergodicity,
J. Funct . Anal . 1 8 4 (2001 ) , 41 9-453 .
[Sa2] P . Sarnak , Class Numbers of indefinite binary quadratic forms, Journa l o f Numbe r
Theory 1 5 (1 982) , 229-247 .
[Sa3] P . Sarnak , Some applications of modular forms, Cambridg e Univ . Press , 1 990 .
[Sa4] P . Sarnak , Arithmetic quantum chaos, Bar-Ila n Univ. , 1 995 .
[Sch] W . Schmidt , Equations over finite fields. An elementary approach, Lectur e Note s i n
Math. 536 , Springe r Verlag , 1 976 .
[ST] D . B . Sear s an d E.C . Titchmarsh , Some eigenfunction formulae, Quart . J . Math . Ox -
ford 1 (1 950) , 1 65-1 75 .
[SI] A . Selberg , The general sieve method and its place in prime number theory, Proc . ICM ,
vol. 1 , Cambridge , MA. , 1 950 , pp . 286-292 .
[S2] A . Selberg , On the estimation of Fourier coefficients of modular forms, Proc . Sympos .
Pure Math. , Vol . VIII , Amer . Math . S o c , 1 965 , pp. 1 -1 5 .
[S3] A . Selberg , Lectures on sieves, Collecte d Paper s Vol . II , Springe r Verlag , 1 991 , pp. 6 6 -
247.
[S4] A . Selberg , On the zeros of Riemann's zeta-function, Skr . Norsk e Vid . Akad . Osl o I
(1942).
[S5] A . Selberg , Bemerkungen uber eine Dirichletsche Reihe, die mit der Theorie der Mod-
ulformen nahe verbunden ist, Arch . Math . Naturvid . 4 3 (1 940) , 47-50 .
[S6] A . Selberg , Harmonic analysis and discontinuous groups in weakly symmetric Rie-
mannian spaces with applications to Dirichlet series, J . India n Math . Soc . (N.S. ) 2 0
(1956), 47-87 .
[S7] A . Selberg , On the estimation of Fourier coefficients of modular forms, Proc . Symp .
Pure Math . 8 , 1 965 , pp . 1 -8 .
[S8] A . Selberg , On the zeros of the zeta function of Riemann, De r Kong . Norsk e Vidensk .
Selsk. Forhand . 1 5 (1 942) , 59-62 .
[Sel] J-P . Serre , Cours d'arithmetique, 2n d edition , P.U.F , 1 977 .
[Se2] J-P . Serre , Modular forms of weight one and Galois representations, Algebrai c numbe r
fields: L-function s an d Galoi s propertie s (Proc . Sympos. , Univ . Durham , Durham ,
1975), Academi c Press , 1 977 , pp . 1 93-268 .
[Se3] J-P . Serre , Congruences et formes modulaires (d'apres H. P. F. Swinnerton-Dyer),
Seminaire Bourbaki , 24 e anne e (1 971 /1 972) , Exp . No . 41 6 , Lectur e Note s i n Math .
317, Springe r Verlag , 1 973 , pp . 31 9-338 .
[Se4] J-P . Serre , Corps locaux, Hermann , 1 968 .
[Se5] J-P . Serre , Representations lineaires des corps finis, Hermann , 1 971 .
[Se6] J-P . Serre , Quelques applications du theoreme de densite de Chebotarev, Publ . Math .
IHES 5 4 (1 981 ) , 1 23-201 .
[Se7] J-P . Serre , Minorations de discriminants, Oeuvres , Vol . Ill , Springer-Verlag , 1 986 ,
pp. 240-243 .
[Se8] J-P . Serre , Letter to J.M. Deshouillers.
[Shah] F . Shahidi , Symmetric power L-functions for GL(2), Ellipti c curve s an d relate d topics ,
CRM Proc . Lectur e Note s 4 , Amer . Math . S o c , 1 994 , pp . 1 59-1 82 .
[Sha] D . Shanks , Class number, a theory of factorization, and genera, 1 96 9 Numbe r Theor y
Institute (Proc . Sympos . Pur e Math. , Vol . XX) , Amer . Math . S o c , 1 971 , pp. 41 5-440 .
[Shi] G . Shimura , On modular forms of half-integral weight, Annal s o f Math . 9 7 (1 973) ,
440-481.
[Sh2] G . Shimura , On the holomorphy of certain Dirichlet series, Proc . Londo n Math . Soc .
(3) 3 1 (1 975) , 79-98 .
[Sh3] G . Shimura , Introduction to the arithmetic theory of automorphic functions, Princeto n
Univ. Press , 1 971 .
BIBLIOGRAPHY 609
[Siel] C . L . Siegel , Uber die Classenzahl quadratischer Zahlkorper, Act a Arithmetic a 1 (1 936) ,
83-86.
[Sie2] C.L . Siegel , On the theory of indefinite quadratic forms, Ann . o f Math . 4 5 (1 944) ,
577-622.
[Sie3] C . L . Siegel , Lectures on quadratic forms, Tat a Institute , 1 967 .
[Sil] J . Silverman , The arithmetic of elliptic curves, Grad . Text s i n Math . 1 06 , Springer -
Verlag, 1 986 .
[Sou] K . Soundararajan , Nonvanishing of quadratic Dirichlet L-functions at s = ^ , Ann . o f
Math. (2 ) 1 5 2 (2000) , 447-488 .
[Stl] H . Stark , A complete determination of the complex quadratic fields with class-number
one, Michiga n Math . J . 1 4 (1 967) , 1 -27 .
[St2] H . Stark , A transcendence theorem for class-number problems. II, Annal s o f Math . (2)
96 (1 972) , 1 74-209 .
[St3] H . Stark , Some effective cases of the Brauer-Siegel theorem, Invent , math . 2 3 (1 974) ,
135-152.
[Ste] S . A . Stepanov , The number of points of a hyperelliptic curve over a finite prime field,
Izv. Akad . Nau k SSS R Ser . Mat . 3 3 (1 969) , 1 1 71 -1 1 81 .
[Tal] J . Tate , Fourier analysis in number fields and Heche's zeta functions, Algebrai c Num -
ber Theory , Academi c Press , 1 990 , pp . 305-347 .
[Tat] J . Tate , Number theoretic preliminaries, Proceeding s o f Symposi a i n Pur e Math . 33 ,
vol 2 , A.M.S , 1 979 , pp . 3-26 .
[TW] R . Taylo r an d A . Wiles , Ring-theoretic properties of certain Hecke algebras, Ann . o f
Math. (2 ) 1 4 1 (1 995) , 553-572 .
[Tchu] N . G . Tchudakov , Sur le probleme de Goldbach, C . R . (Dokl. ) Acad . Sci . URSS , n . Ser .
17 (1 937) , 335-338 .
[Th] A . Thue , Uber Anndherungswerte algebraischer Zahlen, J . Rein e Angew . Math . 1 3 5
(1909), 284-305 .
[Tl] E.C . Titchmarsh , The theory of functions, 2n d Edition , Oxfor d Univ . Press , 1 939 .
[T2] E . C . Titchmarsh , The theory of the Riemann zeta-function, 2n d edition , Oxfor d Univ .
Press, 1 986 .
[T3] E . C . Titchmarsh , Eigenfunction expansions associated with second-order differential
equations, Clarendo n Press , 1 962 .
[Tot] A . Toth , Roots of quadratic congruences, Internat . Math . Res . Notice s (2000) , 71 9-739 .
[Tul] P . Turan , Uber einige Verallgemeinerungen eines Satzes von Hardy und Ramanujan,
J. Lond . Math . Soc . 1 1 (1 936) , 1 25-1 33 .
[Tu2] P . Turan , Uber die Primzahlen der arithmetischen Progression, Act a Litt . Sci . Szege d
8 (1 937) , 226-235 .
[vdK] J . VanderKam , The rank of quotients of J 0(N), Duk e Math . J . 9 7 (1 999) , 545-577 .
[VdP] M . va n de r Put , Grothendieck's conjecture for the Risch equation y' = ay + b, Indag .
Mathem. N.S . 1 2 (2001 ) , 1 1 3-1 24 .
[Va] R . C . Vaughan , Sommes trigonometriques sur les nombres premiers, C . R . Acad . Sci .
Paris Ser . A- B 28 5 (1 977) , A981 -A983 .
[VI] I . M . Vinogradov , A new estimate for £( 1 + it), Izv . Akad . Nau k SSSR , Ser . Mat . 2 2
(1958), 1 61 -1 64 .
[V2] I . M . Vinogradov , On Weyl's sums, Mat . Sborni k 4 2 (1 935) , 521 -530 .
[V3] I . M . Vinogradov , A new method of estimation of trigonometrical sums, Mat . Sborni k
(1) 4 3 (1 936) , 1 75-1 88 .
[V4] I . M . Vinogradov , Perm . Univ . Fiz.-Mat . ob.-v o Z h 1 (1 91 8) , 1 8-24 .
[V5] I . M.Vinogradov , Some theorems concerning the theory of primes, Math . Sb . 2 4 4
(1937), 1 79-1 95 .
[V6] I . M . Vinogradov , Representation of an odd number as a sum of three primes, Dokl .
Akad. Nau k SSS R 1 5 (1 937) , 291 -294 .
[Vi] A . I . Vinogradov , The density hypothesis for Dirichet L-series, Izv . Akad . Nau k SSS R
Ser. Mat . 2 9 (1 965) , 903-934 .
[Vor] G . Voronoi , Sur une fonction transcendante et ses applications a la sommation de
quelques series, Ann . Sci . Ecol e Norm . Sup . (3) 2 1 (1 904) , 207-267 , 459-533. .
[W] L . Washington, Introduction to cyclotomic fields, Grad . Text s i n Math . 83 , 2nd edition ,
Springer Verlag , 1 997 .
610 BIBLIOGRAPH Y
[Wa] T . Watson , Rankin triple products and quantum chaos, PhD thesis, Princeton Univer-
sity, 2001 .
[Wei] A . Weil , On some exponential sums, Proc . Nat . Acad . Sci . U . S . A . 3 4 (1 948) , 204-207 .
[We2] A . Weil , Uber die Bestimmung Dirichletscher Reihen durch Funktionalgleichungen,
Math. Ann . 1 6 8 (1 967) , 1 49-1 56 .
[Wl] H . Weyl , Uber die Gleichverteilung von Zahlen mod. Eins, Math . Ann . 7 7 (1 91 6) ,
313-352.
[W2] H . Weyl , Zur Abschatzung von C( l + ti), Math . Zeit . 1 0 (1 921 ) , 88-1 01 .
[W] A . Wiles , Modular elliptic curves and Fermat's last theorem, Ann . o f Math . (2 ) 1 4 1
(1995), 443-551 .
[Wil] E . Wirsing , Das asymptotische Verhalten von Summen uber multiplikative Funktionen,
II, Act a Math . Acad . Sci . Hungar . 1 8 (1 967) , 41 1 -467 .
[Wi2] E . Wirsing , Elementare Beweise des Primzahlsatzes mit Restglied, II, J . Rein e Angew .
Math. 2 1 4 / 2 1 5 (1 964) , 1 -1 8 .
[Wi3] E . Wirsing , Growth and differences of additive arithmetic functions, Topic s i n classica l
number theory , Vol . I, I I (Budapest , 1 981 ) , Colloq . Math . Soc . Jano s Bolyai , 34 , North -
Holland, Amsterdam , 1 984 , pp . 1 651 -1 661 .
Index
A-process, 204 , 21 1 , 21 5 Bilinear form , 1 69 , 1 74 , 1 86 , 21 8 , 326 , 340 ,
^-process, 204 , 21 1 , 21 6 343, 346 , 350 , 42 0
L-functions o f varieties , 9 6 Binary quadrati c forms , 498 , 50 3
A 2 -sieve, 1 60 , 43 0 Birch an d Swinnerton-Dye r Conjecture , 1 48 ,
^-adic cohomology , 26 9 367, 520 , 538 , 578 , 57 9
£-adic cohomology group s with compac t sup - Brun-Titchmarsh inequality , 1 67 , 42 0
port, 30 4 Buchstab identity , 34 9
£-adic sheaf , 31 0 Burgess' bound , 53 6
j-invariant, 36 4
Canonical height , 54 1
Abel-Jacobi Theorem , 299 , 54 3 Cauchy's inequality , 1 62 , 1 69 , 22 0
Abelian variety , 1 45 , 1 4 6 Central character , 54 0
Absolute logarithmi c height , 54 1 Central critica l point , 57 7
Absolute values , 54 1 Central value , 51 4 , 52 9
Additive binar y problem , 46 8 Character sum , 74 , 1 01 , 365, 422 , 42 9
Additive character , 44 , 1 75 , 1 81 , 271 , 467 , Characters, 43 , 48 7
475 Chebotarev Densit y Theorem , 1 43 , 48 9
Additive function , 9 , 2 8 Circle method , 1 71 , 315, 44 3
Additive reduction , 36 5 Class group , 58 , 51 0
Adjoint square , 1 37 , 37 3 Class grou p character , 59 , 1 31 , 1 34 , 361 ,
Algebraic curves , 1 4 5 369, 51 6
Algebraic surface , 31 0 Class number , 58 , 1 25 , 368 , 402 , 504 , 523 ,
Almost primes , 1 5 9 537, 56 9
Ambiguous classes , 50 9 Class Numbe r Formula , 38 , 1 24 , 402 , 51 3 ,
Ambiguous form , 50 7 516
Amplification method , 379 , 59 7 Class numbe r on e problem , 1 24 , 57 7
Amplifier, 59 6 Closed point , 29 7
Analytic L-function , 1 41 , 1 43 , 1 4 9 Cohen-Lenstra heuristics , 51 0
Analytic conductor , 9 5 Combinatorial sieve , 1 6 4
Approximate functiona l equation , 97 , 25 7 Companion sums , 273 , 27 8
Arithmetic etal e fundamenta l group , 30 0 Complete L-function , 9 4
Artin L-functions , 96 , 1 26 , 1 41 , 356 Complete exponentia l sum , 45 8
Artin-Shreier covering , 30 2 Complete family , 57 3
Automorphic L-functions , 84 , 93 , 37 5 Complete sums , 269 , 31 8
Automorphic form , 61 , 93, 1 3 1 Completing technique , 31 8
Auxiliary polynomial , 28 2 Complex multiplication , 367 , 36 9
Averaging technique , 357 , 38 7 Conductor, 45 , 60, 93, 94, 1 09 , 1 30 , 1 49 , 1 90 ,
247, 365 , 539 , 572 , 579 , 59 7
Bad reduction , 36 4 Conductor exponent , 36 5
Bernoulli numbers , 6 7 Congruence subgroup , 35 4
Bernoulli polynomial , 66 , 48 4 Conjugacy classes , 39 4
Bessel function , 72 , 73 , 90, 91 , 245, 258, 358, Constant ter m i n th e Fourie r expansion , 39 0
386, 408 , 41 1 , 454, 51 2 Continuous spectrum , 7 5
Betti numbers , 305 , 30 7 Converse theorems , 37 7
612 INDEX
Convexity bound , 1 01 , 1 1 9 , 1 37 , 244 , 535 , Euler Pentagona l Number s Theorem , 449 ,
549, 553 , 59 5 456
Critical line , 1 1 3 , 547 , 56 2 Euler product , 1 1 , 41 , 45 , 60 , 94 , 1 1 3 , 1 46 ,
Critical strip , 96 , 1 01 , 1 05 , 1 1 3 , 1 4 5 297, 366 , 371 , 493, 521 , 529 , 551 , 56 4
Critical zero , 547 , 56 3 Euler-Maclaurin formula , 66 , 68 , 76 , 206 ,
Cubic Gaus s sum , 49 1 483, 48 5
Cusp, 355 , 38 7 Exceptional character , 37 , 43 4
Cusp form , 61 , 65 , 83 , 1 36 , 1 86 , 356 , 368 , Exceptional eigenvalues , 390 , 399 , 41 0 , 49 7
479, 57 4 Exceptional zero , 93, 111 , 112, 1 21 , 140, 390,
428, 43 4
Dedekind et a function , 450 , 51 6 , 54 4 Exceptional zer o repulsion , 42 8
Dedekind multiplie r system , 45 0 Exclusion-inclusion, 1 45 , 1 56 , 1 71 , 308 . 33 9
Dedekind sum , 450 , 456 , 51 6 Explicit formula , 1 08 , 1 1 8 , 1 27 , 265 , 337 ,
Dedekind zet a function , 1 25 , 51 3 410, 440 , 56 4
Degree, 9 4 Exponent pair , 21 4
Deligne bound , 291 , 357, 379 , 493 , 53 2 Exponent Pai r Hypothesis , 86 , 21 4
Density conjecture , 232 , 249 , 26 5 Exponential integrals , 206 , 20 8
Density theorem , 420 , 547 , 59 6 Exponential sums , 1 97 , 225 , 443 , 45 6
Determinant equation , 38 3
Deuring-Heilbronn phenomenon , 42 8 Family o f L-functions , 96 , 1 75 , 573 , 58 0
Difference betwee n consecutiv e primes , 26 6 Family o f exponentia l sums , 31 2
Differencing process , 201 , 204, 21 1 , 212, 220 , Farey sequence , 451 , 458, 469 , 47 7
Ford circle , 45 2
330, 46 6
Four square s theorem , 46 8
Diophantine approximation , 28 2
Fourier coefficients , 1 7 4
Dirichlet L-functions , 45 , 83 , 84 , 1 82 , 267 ,
Fourier coefficient s o f cus p forms , 31 9 , 40 4
324, 329 , 368 , 388 , 479 , 51 3 , 534 , 573 ,
Fourier coefficient s o f modular forms , 83 , 291,
579
479
Dirichlet approximatio n theorem , 1 99 , 45 7
Fourier expansio n a t infinity , 1 3 2
Dirichlet character , 45 , 1 7 9
Fourier integrals , 20 4
Dirichlet convolution , 1 2 , 59 0
Fricke involution , 83 , 366 , 36 8
Dirichlet diviso r problem , 21 , 79 , 1 98 , 21 5
Frobenius automorphism , 270 , 30 2
Dirichlet polynomial , 1 94 , 229, 253, 429, 549,
Frobenius conjugac y class , 30 2
550, 55 4
Function field, 292 , 30 1
Dirichlet series , 1 1
Functional equation , 81 , 84 , 94 , 244 , 258 ,
Discrete subgroup , 35 4
362, 366 , 368 , 375 , 377 , 51 2 , 530 , 539 ,
Discrete valuation , 292 , 30 1
547, 566 , 58 2
Discriminant, 36 3
Functional equatio n o f Eisenstei n series , 38 9
Dispersion method , 1 7 1
Functor, 30 1
Divisor function , 1 3 , 74 , 86 , 33 4
Fundamental discriminant , 52 , 124, 508, 538,
Divisor o n a curve , 29 3
540, 54 3
Dual sum , 1 70 , 46 8
Fundamental domain , 58 , 354, 396 , 499, 504 ,
Duality principle , 1 70 , 1 74 , 1 84 , 1 89 , 23 4 521
Fundamental Lemma , 1 53 , 1 59 , 43 7
Effective divisor , 293 , 29 8
Fundamental unit , 38 , 402 , 51 6
Eichler-Shimura theory , 36 7
Eisenstein series , 80 , 357, 360, 369 , 388 , 411, Gamma factor , 9 4
479, 491 , 51 1 , 58 8 Gauss circl e problem , 20 , 73 , 1 98 , 21 5
Elliptic curve , 1 45 , 1 90 , 269 , 31 3 , 363 , 403 , Gauss sum , 1 8 , 47 , 49 , 60 , 79 , 84 , 1 1 9 , 1 79 ,
520, 529 , 57 4 192, 1 99 , 239 , 274 , 321 , 347 , 456 , 474 ,
Elliptic differentia l operator , 38 6 478, 488 , 491 , 49 5
Elliptic functions , 54 3 Gauss-Bonnet formula , 35 5
Elliptic motion , 39 5 Gaussian prime , 53 , 1 30 , 29 0
Epstein zet a function , 51 3 Gaussian Unitar y Ensemble , 563 , 57 0
Equidistribution, 1 30 , 1 37 , 1 98 , 31 3 , 352 , Genus character , 362 , 51 6
487 Genus theory , 480 , 506 , 54 0
Etale covering , 30 0 Geometric Frobenius , 302 , 30 4
Euler characteristic , 30 7 Geometric fundamenta l group , 30 4
Euler Idonea l Numbe r Problem , 52 0 Goldbach problem , 1 71 , 33 9
INDEX 613
Good reduction , 36 4 Jacobian variety , 1 47 , 57 8
Grand Densit y Conjecture , 25 0
Grand Rieman n Hypothesis , 1 01 , 1 1 3 , 1 36 , Kloosterman fractions , 1 85 , 48 1
175, 1 82 , 1 94 , 249 , 324 , 41 9 , 57 7 Kloosterman sum , 1 8 , 78, 1 85 , 187, 278, 308,
GRH, 1 1 3 313, 322 , 382 , 404 , 469 , 476 , 481 , 492 ,
Gross-Zagier curve , 36 8 584
Gross-Zagier formula , 57 9 Kloosterman sum s zeta-function , 41 2
Group la w o n ellipti c curves , 29 1 Kloosterman zet a function , 27 8
Kloosterman-Salie sum , 281 , 35 8
Haar measure , 48 7 Kronecker Limi t Formula , 51 6 , 52 4
Hadamard an d d e l a Vallee Poussin , 41 , 101, Kronecker symbol , 52 , 57 , 1 24 , 506 , 508 , 53 0
306 Kuznetsov formula , 65 , 1 86 , 38 0
Hankel transform , 71 , 99
Harish-Chandra/Selberg transform , 393 , 397, Langlands functoriality , 96 , 369 , 49 3
501 Langlands program , 1 3 2
Harmonic polynomial , 36 2 Laplace operator , 89 , 1 31 , 1 32 , 1 86 , 362 ,
Harmonics, 9 , 1 74 , 239 , 31 9 , 356 , 360 , 38 3 404, 563 , 58 0
Hasse bound , 1 93 , 26 9 Large sieve , 1 64 , 1 67 , 1 74 , 1 94 , 239 , 420, 59 5
Hasse derivative , 28 2 Large siev e inequality , 1 25 , 423 , 58 0
Hasse-Weil zet a function , 99 , 1 45 , 1 46 , 272 , Lattice points , 1 9 7
366, 369 , 539 , 541 , 57 9 Least quadrati c non-residue , 1 8 2
Hecke L-function , 60 , 86 , 366 , 368 , 374 , 379 , Lefschetz trac e formula , 305 , 31 0
511, 57 4 Legendre formula , 1 55 , 34 1
Hecke basis , 57 4 Legendre symbol , 27 1
Hecke character , 56 , 59 , 1 29 , 1 41 , 1 42 , 302 , Length o f close d geodesies , 41 0
362 Length spectrum , 40 1
Hecke congruenc e groups , 35 4 Levinson's method , 55 0
Hecke eigenvalues , 1 34 , 1 92 , 51 3 , 58 2 Lindelof Hypothesis , 1 01 , 116, 1 86 , 1 94 , 214,
Hecke form , 37 2 235, 243 , 256 , 59 7
Hecke operator , 80 , 1 86 , 36 7 Linear equivalenc e classes , 29 8
Heegner point , 494 , 542-544 , 57 9 Linear forms , 497 , 49 9
Heilbronn sums , 30 0 Linear fractiona l transformation , 353 , 50 4
Higher vo n Mangold t function , 1 6 , 59 2 Linearly equivalen t divisors , 29 3
Hilbert Clas s Field , 54 4 Liouville function , 1 4
Hilbert inequality , 1 7 5 Lisse ^-adi c sheaf , 30 1
Hilbert's Theore m 90 , 29 6 Local Langland s conjecture , 1 3 8
Hybrid larg e sieve , 1 8 3 Local roo t number , 1 3 6
Hyperbola method , 22 , 31 , 37, 42 0 Local roots , 9 4
Hyperbolic conjugac y classes , 40 1 Local zet a function , 36 5
Hyperbolic geodesies , 38 4 Log-free zero-densit y estimate , 42 8
Hyperbolic laplacian , 38 5 Low-lying zero , 57 4
Hyperbolic measure , 354 , 51 4
Mobius function , 32 , 36 , 42 , 345 , 421 , 430 ,
Hyperbolic motion , 39 5
Hyperbolic plane , 38 4 443, 446 , 59 0
Hyperelliptic curve , 281 , 28 7 Mobius inversion , 1 3 , 44 , 46 , 1 5 4
Mobius Randomnes s Law , 338 , 44 4
Idoneal number , 50 8 Maass form , 1 31 , 38 7
Imaginary quadrati c field , 1 7 , 56 , 361 , 503 , Major arc , 46 7
508, 54 2 Matrix coefficients , 48 7
Incomplete characte r sum , 31 7 Mellin transform , 61 , 90, 25 7
Incomplete Eisenstei n series , 387 , 38 9 Mertens formula , 3 4
Incomplete gamm a function , 51 3 Minimal model , 36 4
Incomplete Kloosterma n sum , 47 1 Minor arc , 457 , 462 , 464 , 466 , 46 7
Invariant integra l operator , 392 , 39 3 Mixed o f weight s < w, 30 5
Isoperimetric inequality , 39 8 Modular curves , 57 7
Modular form , 1 45 , 35 6
Jacobi inversio n formula , 47 3 Modular for m o f weigh t one , 35 6
Jacobi sum , 49 , 31 1 , 49 1 Modular group , 50 3
Jacobi symbol , 52 , 47 3 Modular interpretation , 54 2
614 INDEX
Modular parameterization , 542 , 543 , 54 5 Principal divisors , 29 3
Modularity conjecture , 1 91 , 366 Principal genus , 50 6
Mollification, 550 , 562 , 58 1 Pure o f weigh t w, 30 5
Mollifier, 25 1
Monodromy group , 31 3 , 57 3 Quadratic character , 1 8 4
Mordell-Weil theorem , 57 9 Quadratic Reciprocit y Law , 5 1
Multiple Kloosterma n sum , 308 , 49 2 Quantum uniqu e ergodicit y conjecture , 49 4
Multiplicative characters , 34 , 271 , 308
Radius o f convergence , 28 9
Multiplicative function , 1 54 , 1 65 , 339 , 52 1
Ramanujan A function , 360 , 367 , 493 , 51 6
Multiplicative inverse , 1 9 , 5 5
Ramanujan r function , 37 2
Multiplicative reduction , 36 5
Ramanujan sum , 1 8 , 44 , 48 , 1 79 , 280 , 323 ,
Multiplicity on e principle , 37 3
325, 461 , 472 , 478 , 481 , 483 , 488 , 587 ,
Near-orthogonality, 1 70 , 1 9 2 589
Nebentypus, 1 3 1 Ramanujan-Petersson conjecture , 95 , 100, 101,
Negative curvature , 38 4 115, 1 31 , 1 37 , 1 46 , 378 , 391 , 48 0
Newton polyhedron , 308 , 30 9 Random matri x theory , 56 3
Norm map , 27 0 Rankin's trick , 341 , 349, 52 5
Numerus idoneus , 50 8 Rankin-Selberg L-function , 97 , 1 1 8 , 1 89 , 375,
378, 535 , 58 0
Order o f vanishin g o f a n L-function , 1 1 7 Rankin-Selberg convolution , 1 4 , 97, 1 06 , 1 1 0
Orthogonality o f characters, 35 , 46, 1 21 , 31 1 , Real character , 46 , 4 9
318, 425 , 482 , 49 2 Real primitiv e character , 38 , 46, 57 , 84 , 1 22 ,
Orthogonality relations , 44 , 45 , 1 79 , 271 , 573, 57 7
511 Real quadrati c field, 38 , 51 6
Reflection method , 243 , 25 7
Polya-Vinogradov inequality , 325 , 32 6 Regular discriminants , 51 0
Pair Correlatio n Conjecture , 266 , 56 9 Regulator, 1 2 5
Parabolic motion , 39 5 Residual spectrum , 38 8
Peter-Weyl theorem , 48 7 Resolvent o f th e laplacian , 390 , 40 5
Petersson formula , 1 36 , 1 87 , 1 88 , 380 , 404 , Riemann Hypothesis , 305 , 306, 309, 320, 337,
411, 579 , 58 0 427, 51 4 , 51 8 , 524 , 547 , 563 , 56 8
Petersson inne r product , 357 , 37 1 Riemann Hypothesi s fo r curve s ove r finite
Petersson norm , 1 38 , 542 , 59 5 fields, 1 46 , 329 , 46 3
Poincare duality , 30 5 Riemann Hypothesi s fo r Dirichle t L-functions ,
Poincare metric , 353 , 38 4 443
Poincare series , 357 , 404 , 50 0 Riemann Hypothesi s fo r ellipti c curve s ove r
Poincare uppe r half-plane , 353 , 38 4 finite fields, 36 6
Point-pair invariant , 39 1 Riemann zet a function , 1 2 , 1 1 9 , 1 97 , 204 ,
Poisson distribution , 26 6 216, 388 , 56 1
Poisson summatio n formula , 61 , 69 , 75 , 99 , Riemann-Roch Theorem , 294 , 296 , 29 8
204, 31 9 , 359 , 391 , 398, 404 , 47 3 Root number , 94 , 1 42 , 1 45 , 367 , 377 , 538 ,
Polya-Vinogradov inequality , 523 , 52 4 574
Positivity, 41 , 1 05 , 1 57 , 1 80 , 37 7 Roots o f a n exponentia l sum , 27 4
Primary element , 5 4 Roots o f quadrati c congruences , 5 5
Prime Numbe r Theorem , 25 , 31 , 1 1 0 , 264 ,
401, 41 9 , 427 , 446 , 448 , 489 , 527 , 56 7 Salie sum , 272 , 323 , 476, 477 , 49 5
Primes i n arithmeti c progression s t o larg e Sato-Tate Conjecture , 1 37 , 289 , 324 , 49 3
moduli, 26 6 Sato-Tate measure , 49 2
Primes i n shor t intervals , 264 , 26 6 Scaling matrix , 355 , 38 7
Primes splittin g completely , 52 1 Scattering matrix , 388 , 39 7
Primitive character , 46 , 48 , 70 , 79 , 85 , 1 1 9 , Selberg Eigenvalu e Conjecture , 96 , 390 , 41 5
179, 244 , 422 , 491 , 508 , 59 7 Selberg sieve , 1 6 0
Primitive conjugac y classes , 396 , 40 2 Selberg trac e formula , 65 , 39 1
Primitive cus p form , 99 , 1 34 , 370 , 373 , 374 , Self-dual L-function , 95 , 1 06 , 57 8
513, 52 9 Semistable, 36 5
Primitive Heck e character , 6 0 Separation o f variables , 326 , 35 0
Primitive ideal , 57 , 50 8 Series o f Kloosterma n sums , 38 0
Primitive root , 27 1 Shifted primes , 30 , 1 5 3
INDEX 615
Short characte r sums , 289 , 32 6 Zero-density theorem , 249 , 26 4
Siegel mas s formula , 48 1 Zero-detecting polynomials , 25 2
Siegel's bound , 1 24 , 479 , 51 4 Zero-free region , 1 05 , 1 1 0 , 1 28 , 1 35 , 1 38 ,
Siegel-Walfisz Theorem , 41 9 , 427 , 48 9 249, 264 , 336 , 347 , 428 , 42 9
Sieve dimension , 1 5 7 Zeta functio n o f a n exponentia l sum , 27 3
Sieve methods , 25 , 1 53 , 1 71 , 265 , 340 , 349 ,
430, 49 3
Sieve problem , 2 7
Sieve weights , 33 8
Sieving level , 34 9
Sifted sum , 1 5 3
Sifting range , 1 6 1
Sign o f th e functiona l equation , 95 , 53 7
Singular integral , 46 0
Singular series , 460 , 466 , 47 8
Smoothing, 40 , 73 , 74 , 1 1 1 , 1 69 , 239 , 49 7
Special value , 1 94 , 577 , 59 5
Spectral decomposition , 403 , 406, 50 1
Spectral theory , 1 9 , 34 4
Spectral theor y o f Kloosterma n sums , 40 3
Square roo t cancellation , 306 , 31 2 , 31 4
Strict diviso r function , 34 2
Strong multiplicit y on e principle , 1 39 , 37 5
Subconvexity bound , 1 01 , 204, 329, 494, 554,
597
Subdivision method , 23 6
Sums o f Kloosterma n sums , 41 0 , 58 6
Symmetric square , 1 4 , 1 37 , 1 89 , 532 , 535 ,
575, 59 5
Tate twists , 30 1
Tate-Shafarevitch group , 36 7
Tempered diviso r function , 7 5
Theta function , 1 8 , 61 , 85 , 361 , 456 , 473 ,
479, 51 3 , 54 1
Theta multiplier , 1 8 6
Trace map , 27 0
Trivial sheaf , 30 6
Trivial zeros , 9 6
Twisted modula r form , 1 3 3
Uncertainty principle , 56 4
Unramified, 94 , 1 1 8
Upper-bound sieve , 43 0
Von Mangold t function , 1 5 , 31 , 42
Voronoi formula , 39 8
Waring problem , 45 6
Weierstrass equation , 363 , 54 3
Weil bound , 79 , 1 30 , 269, 288, 381, 403, 413,
472, 58 5
Weil conjectures , 37 8
Well-factorable function , 42 0
Well-spaced, 1 75 , 21 8 , 23 2
Weyl Law , 1 86 , 39 1
Weyl shift , 21 6 , 31 4
Weyl sum , 1 98 , 201 , 33 5
This page intentionally left blank
Titles i n Thi s Serie s
53 H e n r y k Iwanie c an d E m m a n u e l Kowalski , Analyti c numbe r theory , 200 4
52 D u s a McDuf f an d D i e t m a r Salamon , J-holomorphi c curve s an d symplecti c topology ,
2004
51 A l e x a n d e r Beilinso n an d Vladimi r Drinfeld , Chira l algebras , 200 4
50 E . B . D y n k i n , Diffusions , superdiffusion s an d partia l differentia l equations , 200 2
49 Vladimi r V . C h e p y z h o v an d Mar k I . Vishik , Attractor s fo r equation s o f
mathematical physics , 200 2
48 Yoa v B e n y a m i n i an d J o r a m Lindenstrauss , Geometri c nonlinea r functiona l analysis ,
Volume 1 , 200 0
47 Yur i I . M a n i n , Frobeniu s manifolds , quantu m cohomology , an d modul i spaces , 1 99 9
46 J . Bourgain , Globa l solution s o f nonlinea r Schrodinge r equations , 1 99 9
45 Nichola s M . K a t z an d P e t e r Sarnak , Rando m matrices , Frobeniu s eigenvalues , an d
monodromy, 1 99 9
44 M a x - A l b e r t K n u s , A l e x a n d e r Merkurjev , an d Marku s R o s t , Th e boo k o f
involutions, 1 99 8
43 Lui s A . Caffarell i an d Xavie r Cabre , Full y nonlinea r ellipti c equations , 1 99 5
42 V i c t o r Guillemi n an d S h l o m o Sternberg , Variation s o n a them e b y Kepler , 1 99 0
41 Alfre d Tarsk i an d S t e v e n Givant , A formalizatio n o f se t theor y withou t variables , 1 98 7
40 R . H . Bing , Th e geometri c topolog y o f 3-manifolds , 1 98 3
39 N . Jacobson , Structur e an d representation s o f Jorda n algebras , 1 96 8
38 O . O r e , Theor y o f graphs , 1 96 2
37 N . J a c o b s o n , Structur e o f rings , 1 95 6
36 W . H . Gottschal k an d G . A . H e d l u n d , Topologica l dynamics , 1 95 5
35 A . C . SchaefFe r an d D . C . Spencer , Coefficien t region s fo r Schlich t functions , 1 95 0
34 J . L . Walsh , Th e locatio n o f critica l point s o f analyti c an d harmoni c functions , 1 95 0
33 J . F . R i t t , Differentia l algebra , 1 95 0
32 R . L . W i l d e r , Topolog y o f manifolds , 1 94 9
31 E . Hill e an d R . S . Phillips , Functiona l analysi s an d semigroups , 1 95 7
30 T . R a d o , Lengt h an d area , 1 94 8
29 A . Weil , Foundation s o f algebrai c geometry , 1 94 6
28 G . T . W h y b u r n , Analyti c topology , 1 94 2
27 S . Lefschetz , Algebrai c topology , 1 94 2
26 N . Levinson , Ga p an d densit y theorems , 1 94 0
25 Garret t BirkhofF , Lattic e theory , 1 94 0
24 A . A . A l b e r t , Structur e o f algebras , 1 93 9
23 G . Szego , Orthogona l polynomials , 1 93 9
22 C . N . M o o r e , Summabl e serie s an d convergenc e factors , 1 93 8
21 J . M . T h o m a s , Differentia l systems , 1 93 7
20 J . L . Walsh , Interpolatio n an d approximatio n b y rationa l function s i n th e comple x
domain, 1 93 5
19 R . E . A . C . P a l e y an d N . W i e n e r , Fourie r transform s i n th e comple x domain , 1 93 4
18 M . Morse , Th e calculu s o f variation s i n th e large , 1 93 4
17 J . M . W e d d e r b u r n , Lecture s o n matrices , 1 93 4
16 G . A . Bliss , Algebrai c functions , 1 93 3
15 M . H . S t o n e , Linea r transformation s i n Hilber t spac e an d thei r application s t o analysis ,
1932
14 J . F . R i t t , Differentia l equation s fro m th e algebrai c standpoint , 1 93 2
TITLES I N THI S SERIE S
13 R . L . M o o r e , Foundation s o f poin t se t theory . 1 93 2
12 S . Lefschetz , Topology , 1 93 0
11 D . Jackson , Th e theor y o f approximation , 1 93 0
10 A . B . Coble , Algebrai c geometr y an d thet a functions , 1 92 9
9 G . D . Birkhoff , Dynamica l systems , 1 92 7
8 L . P . Eisenhart , Non-Riemannia n geometry , 1 92 7
7 E . T . Bell , Algebrai c arithmetic , 1 92 7
6 G . C . Evans , Th e logarithmi c potential , discontinuou s Dirichle t an d Neuman n problems ,
1927
5.1 G . C . Evans , Functional s an d thei r applications ; selecte d topics , includin g integra l
equations, 1 91 8
5.2 O . Veblen , Analysi s situs , 1 92 2
4 L . E . Dickson , O n invariant s an d th e theor y o f number s
W . F . O s g o o d , Topic s i n th e theor y o f function s o f severa l comple x variables , 1 91 4
3.1 G . A . Bliss , Fundamenta l existenc e theorems , 1 91 3
3.2 E . Kasner , Differential-geometri c aspect s o f dynamics , 1 91 3
2 E . H . M o o r e , Introductio n t o a for m o f genera l analysi s
M. M a s o n , Selecte d topic s i n th e theor y o f boundar y valu e problem s o f differentia l
equations
E. J . Wilczynski , Projectiv e differentia l geometry , 1 91 0
1 H . S . W h i t e , Linea r system s o f curve s o n algebrai c surface s
F . S . W o o d s , Form s o n noneuclidea n spac e
E. B . Va n Vleck , Selecte d topic s i n th e theor y o f divergen t serie s an d o f continue d
fractions, 1 90 5