240 Graduate Texts in Mathematics Editorial Board S.Axler K.A.Ribet Graduate Texts in Mathematics 1 TAKEUTI/ZARING.Introduction to 34 SPITZER.Principles ofRandom Walk. Axiomatic Set Theory.2nd ed. 2nd ed. 2 OXTOBY.Measure and Category.2nd ed. 35 ALEXANDER/WERMER.Several Complex 3 SCHAEFER.Topological Vector Spaces. Variables and Banach Algebras.3rd ed. 2nd ed. 36 KELLEY/NAMIOKAet al.Linear 4 HILTON/STAMMBACH.A Course in Topological Spaces. Homological Algebra.2nd ed. 37 MONK.Mathematical Logic. 5 MACLANE.Categories for the Working 38 GRAUERT/FRITZSCHE.Several Complex Mathematician.2nd ed. Variables. 6 HUGHES/PIPER.Projective Planes. 39 ARVESON.An Invitation to C*-Algebras. 7 J.-P.SERRE.A Course in Arithmetic. 40 KEMENY/SNELL/KNAPP.Denumerable 8 TAKEUTI/ZARING.Axiomatic Set Theory. Markov Chains.2nd ed. 9 HUMPHREYS.Introduction to Lie 41 APOSTOL.Modular Functions and Algebras and Representation Theory. Dirichlet Series in Number Theory. 10 COHEN.A Course in Simple Homotopy 2nd ed. Theory. 42 J.-P.SERRE.Linear Representations of 11 CONWAY.Functions ofOne Complex Finite Groups. Variable I.2nd ed. 43 GILLMAN/JERISON.Rings of 12 BEALS.Advanced Mathematical Analysis. Continuous Functions. 13 ANDERSON/FULLER.Rings and 44 KENDIG.Elementary Algebraic Categories ofModules.2nd ed. Geometry. 14 GOLUBITSKY/GUILLEMIN.Stable 45 LOÈVE.Probability Theory I.4th ed. Mappings and Their Singularities. 46 LOÈVE.Probability Theory II.4th ed. 15 BERBERIAN.Lectures in Functional 47 MOISE.Geometric Topology in Analysis and Operator Theory. Dimensions 2 and 3. 16 WINTER.The Structure ofFields. 48 SACHS/WU.General Relativity for 17 ROSENBLATT.Random Processes.2nd ed. Mathematicians. 18 HALMOS.Measure Theory. 49 GRUENBERG/WEIR.Linear Geometry. 19 HALMOS.A Hilbert Space Problem 2nd ed. Book.2nd ed. 50 EDWARDS.Fermat's Last Theorem. 20 HUSEMOLLER.Fibre Bundles.3rd ed. 51 KLINGENBERG.A Course in Differential 21 HUMPHREYS.Linear Algebraic Groups. Geometry. 22 BARNES/MACK.An Algebraic 52 HARTSHORNE.Algebraic Geometry. Introduction to Mathematical Logic. 53 MANIN.A Course in Mathematical Logic. 23 GREUB.Linear Algebra.4th ed. 54 GRAVER/WATKINS.Combinatorics with 24 HOLMES.Geometric Functional Emphasis on the Theory ofGraphs. Analysis and Its Applications. 55 BROWN/PEARCY.Introduction to 25 HEWITT/STROMBERG.Real and Abstract Operator Theory I:Elements of Analysis. Functional Analysis. 26 MANES.Algebraic Theories. 56 MASSEY.Algebraic Topology:An 27 KELLEY.General Topology. Introduction. 28 ZARISKI/SAMUEL.Commutative 57 CROWELL/FOX.Introduction to Knot Algebra.Vol.I. Theory. 29 ZARISKI/SAMUEL.Commutative 58 KOBLITZ.p-adic Numbers,p-adic Algebra.Vol.II. Analysis,and Zeta-Functions.2nd ed. 30 JACOBSON.Lectures in Abstract Algebra 59 LANG.Cyclotomic Fields. I.Basic Concepts. 60 ARNOLD.Mathematical Methods in 31 JACOBSON.Lectures in Abstract Algebra Classical Mechanics.2nd ed. II.Linear Algebra. 61 WHITEHEAD.Elements ofHomotopy 32 JACOBSON.Lectures in Abstract Algebra Theory. III.Theory ofFields and Galois 62 KARGAPOLOV/MERIZJAKOV. Theory. Fundamentals ofthe Theory ofGroups. 33 HIRSCH.Differential Topology. 63 BOLLOBAS.Graph Theory. (continued after index) Henri Cohen Number Theory Volume II: Analytic and Modern Tools Henri Cohen Université Bordeaux I Institut de Mathématiques de Bordeaux 351,cours de la Libération 33405,Talence cedex France [email protected] Editorial Board S.Axler K.A.Ribet Mathematics Department Mathematics Department San Francisco State University University of California at Berkeley San Francisco,CA 94132 Berkeley,CA 94720-3840 USA USA [email protected] [email protected] Mathematics Subject Classification (2000):11-xx 11-01 11Dxx 11Rxx 11Sxx Library ofCongress Control Number:2007925737 ISBN-13:978-0-387-49893-5 eISBN-13:978-0-387-49894-2 Printed on acid-free paper. © 2007 Springer Science+Business Media,LLC All rights reserved.This work may not be translated or copied in whole or in part without the written permission ofthe publisher (Springer Science+Business Media,LLC,233 Spring Street, New York,NY 10013,USA),except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation,computer software,or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication oftrade names,trademarks,service marks,and similar terms,even if they are not identified as such,is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights. 9 8 7 6 5 4 3 2 1 springer.com Preface This book deals with several aspects of what is now called “explicit number theory,” not including the essential algorithmic aspects, which are for the most part covered by two other books of the author [Coh0] and [Coh1]. The central (although not unique) theme is the solution of Diophantine equa- tions, i.e., equations or systems of polynomial equations that must be solved in integers, rational numbers, or more generally in algebraic numbers. This theme is in particular the central motivation for the modern theory of arith- metic algebraic geometry. We will consider it through three of its most basic aspects. The first is thelocal aspect: the invention of p-adic numbers and their generalizationsbyK.Henselwasamajorbreakthrough,enablinginparticular the simultaneous treatment of congruences modulo prime powers. But more importantly, one can do analysis in p-adic fields, and this goes much further than the simple definition of p-adic numbers. The local study of equations is usually not very difficult. We start by looking at solutions in finite fields, where important theorems such as the Weil bounds and Deligne’s theorem on the Weil conjectures come into play. We then lift these solutions to local solutions using Hensel lifting. The second aspect is the global aspect: the use of number fields, and in particular of class groups and unit groups. Although local considerations can give a considerable amount of information on Diophantine problems, the “local-to-global” principles are unfortunately rather rare, and we will see many examples of failure. Concerning the global aspect, we will first require as a prerequisite of the reader that he or she be familiar with the standard basic theory of number fields, up to and including the finiteness of theclassgroupandDirichlet’sstructuretheoremfortheunitgroup.Thiscan be found in many textbooks such as [Sam] and [Marc]. Second, and this is lessstandard,wewillalwaysassumethatwehaveatourdisposalacomputer algebrasystem(CAS)thatisabletocomputeringsofintegers,classandunit groups,generatorsofprincipalideals,andrelatedobjects.SuchCASarenow verycommon,forinstanceKash,magma,andPari/GP,tocitethemostuseful in algebraic number theory. vi Preface ThethirdaspectisthetheoryofzetaandL-functions.Thiscanbeconsid- ered a unifying theme3 for the whole subject, and it embodies in a beautiful waythelocalandglobalaspectsofDiophantineproblems.Indeed,thesefunc- tionsaredefinedthroughthelocalaspectsoftheproblems,buttheiranalytic behaviorisintimatelylinkedtotheglobalaspects.Afirstexampleisgivenby the Dedekind zeta function of a number field, which is defined only through thesplittingbehavioroftheprimes,butwhoseleadingtermats=0contains atthesametimeexplicitinformationontheunitrank,theclassnumber,the regulator, and the number of roots of unity of the number field. A second very important example, which is one of the most beautiful and important conjecturesinthewholeofnumbertheory(andperhapsofthewholeofmath- ematics), the Birch and Swinnerton-Dyer conjecture, says that the behavior at s=1 of the L-function of an elliptic curve defined over Q contains at the same time explicit information on the rank of the group of rational points on the curve, on the regulator, and on the order of the torsion group of the group of rational points, in complete analogy with the case of the Dedekind zeta function. In addition to the purely analytical problems, the theory of L-functions contains be(cid:2)autiful results (and conjectures) on special values, of which Euler’s formula 1/n2 =π2/6 is a special case. n(cid:2)1 Thisbookcanbeconsideredashavingfourmainparts.Thefirstpartgives the tools necessary for Diophantine problems: equations over finite fields, number fields, and finally local fields such as p-adic fields (Chapters 1, 2, 3, 4, and part of Chapter 5). The emphasis will be mainly on the theory of p-adic fields (Chapter 4), since the reader probably has less familiarity with these. Note that we will consider function fields only in Chapter 7, as a tool forprovingHasse’stheoremonellipticcurves.Animportanttoolthatwewill introduceattheendofChapter3isthetheoryoftheStickelbergeridealover cyclotomic fields, together with the important applications to the Eisenstein reciprocitylaw,andtheDavenport–Hasserelations.ThroughEisensteinreci- procity this theory will enable us to prove Wieferich’s criterion for the first case of Fermat’s last theorem (FLT), and it will also be an essential tool in the proof of Catalan’s conjecture given in Chapter 16. The second part is a study of certain basic Diophantine equations or systems of equations (Chapters 5, 6, 7, and 8). It should be stressed that even though a number of general techniques are available, each Diophantine equation posesanewproblem,anditisdifficult toknowinadvance whether it will be easy to solve. Even without mentioning families of Diophantine equationssuchasFLT,thecongruentnumberproblem,orCatalan’sequation, allofwhichwillbestatedbelow,provingforinstancethataspecificequation suchas x3+y5 =z7 with x,y coprime integers hasnosolution withxyz (cid:2)=0 seems presently out of reach, although it has been proved (based on a deep theoremofFaltings)thatthereareonlyfinitelymanysolutions;see[Dar-Gra] 3 Expression due to Don Zagier. Preface vii andChapter14.NotealsothatithasbeenshownbyYu.Matiyasevich(after aconsiderableamountofworkbyotherauthors)inanswertoHilbert’stenth problem that there cannot exist a general algorithm for solving Diophantine equations. The third part (Chapters 9, 10, and 11) deals with the detailed study of analytic objects linked to algebraic number theory: Bernoulli polynomi- als and numbers, the gamma function, and zeta and L-functions of Dirichlet characters, which are the simplest types of L-functions. In Chapter 11 we alsostudyp-adicanaloguesofthegamma,zeta,andL-functions,whichhave cometoplayanimportantroleinnumbertheory,andinparticulartheGross– Koblitz formula for Morita’s p-adic gamma function. In particular, we will seethatthisformulaleadstoremarkablysimpleproofsofStickelberger’scon- gruenceandtheHasse–Davenportproductrelation.MoregeneralL-functions suchasHeckeL-functionsforGro¨ssencharacters,ArtinL-functionsforGalois representations,or L-functions attached tomodularforms,elliptic curves,or higher-dimensional objects are mentioned in several places, but a systematic exposition of their properties would be beyond the scope of this book. Much more sophisticated techniques have been brought to bear on the subject of Diophantine equations, and it is impossible to be exhaustive. Be- cause the author is not an expert in most of these techniques, they are not studiedinthefirstthreepartsofthebook.However,consideringtheirimpor- tance, I have asked a number of much more knowledgeable people to write a few chapters on these techniques, and I have written two myself, and this forms the fourth and last part of the book (Chapters 12 to 16). These chap- ters have a different flavor from the rest of the book: they are in general not self-contained,areofahighermathematicalsophisticationthantherest,and usually have no exercises. Chapter 12, written by Yann Bugeaud, Guillaume Hanrot,andMauriceMignotte,dealswiththeapplicationsofBaker’sexplicit results on linear forms in logarithms of algebraic numbers, which permit the solution of a large class of Diophantine equations such as Thue equations and norm form equations, and includes some recent spectacular successes. Paradoxically, the similar problems on elliptic curves are considerably less technical, and are studied in detail in Section 8.7. Chapter 13, written by SylvainDuquesne,dealswiththesearchforrationalpointsoncurvesofgenus greaterthanorequalto2,restrictingforsimplicitytothecaseofhyperelliptic curvesofgenus2(thecaseofgenus0—inotherwords,ofquadraticforms—is treated in Chapters 5 and 6, and the case of genus 1, essentially of elliptic curves, is treated in Chapters 7 and 8). Chapter 14, written by the author, dealswiththeso-calledsuper-Fermatequationxp+yq =zr,onwhichseveral methods have been used, including ordinary algebraic number theory, classi- calinvarianttheory,rationalpointsonhighergenuscurves,andRibet–Wiles typemethods.Theonlyproofsthatareincludedarethosecomingfromalge- braicnumbertheory.Chapter15,writtenbySamirSiksek,dealswiththeuse of Galois representations, and in particular of Ribet’s level-lowering theorem viii Preface and Wiles’s and Taylor–Wiles’s theorem proving the modularity conjecture. Themainapplicationistoequationsof“abc”type,inotherwords,equations of the form a+b+c = 0 with a, b, and c highly composite, the “easiest” applicationofthismethodbeingtheproofofFLT.Theauthorofthischapter hastriedtohideallthesophisticatedmathematicsandtopresentthemethod as a black box that can be used without completely understanding the un- derlying theory. Finally, Chapter 16, also written by the author, gives the complete proof of Catalan’s conjecture by P. Miha˘ilescu. It is entirely based onnotesofYu.Bilu,R.Schoof,andespeciallyofJ.Bo´echatandM.Mischler, andtheonlyreasonthatitisnotself-containedisthatitwillbenecessaryto assume the validity of an important theorem of F. Thaine on the annihilator of the plus part of the class group of cyclotomic fields. Warnings Since mathematical conventions and notation are not the same from one mathematical culture to the next, I have decided to use systematically un- ambiguous terminology, and when the notations clash, the French notation. Here are the most important: – Wewill systematically say that a is strictly greater thanb, or greater than orequaltob(orbisstrictlylessthana,orlessthanorequaltoa),although the English terminology a is greater than b means in fact one of the two (I don’t remember which one, and that is one of the main reasons I refuse to use it) and the French terminology means the other. Similarly, positive and negative are ambiguous (does it include the number 0)? Even though the expression “x is nonnegative” is slightly ambiguous, it is useful, and I will allow myself to use it, with the meaning x(cid:2)0. – Although we will almost never deal with noncommutative fields (which is a contradiction in terms since in principle the word field implies commu- tativity), we will usually not use the word field alone. Either we will write explicitlycommutative(ornoncommutative)field,orwewilldealwithspe- cific classes of fields, such as finite fields, p-adic fields, local fields, number fields, etc., for which commutativity is clear. Note that the “proper” way in English-language texts to talk about noncommutative fields is to call themeither skew fieldsor division algebras. Inany case this will notbe an issue since the only appearances of skew fields will be in Chapter 2, where wewillprovethatfinitedivisionalgebrasarecommutative,andinChapter 7 about endomorphism rings of elliptic curves over finite fields. – The GCD (respectively the LCM) of two integers can be denoted by (a,b) (respectively by [a,b]), but to avoid ambiguities, I will systematically use the explicit notation gcd(a,b) (respectively lcm(a,b)), and similarly when more than two integers are involved. Preface ix – An open interval with endpoints a and b is denoted by (a,b) in the En- glish literature, and by ]a,b[ in the French literature. I will use the French notation, and similarly for half-open intervals (a,b] and [a,b), which I will denote by ]a,b] and [a,b[. Although it is impossible to change such a well- entrenched notation, I urge my English-speaking readers to realize the dreadful ambiguity of the notation (a,b), which can mean either the or- dered pair (a,b), the GCD of a and b, the inner product of a and b, or the open interval. – The trigonometric functions sec(x) and csc(x) do not exist in France, so I will not use them. The functions tan(x), cot(x), cosh(x), sinh(x), and tanh(x)aredenotedrespectivelybytg(x),cotg(x),ch(x),sh(x),andth(x) inFrance,butforoncetobowtothemajorityIwillusetheEnglishnames. – (cid:3)(s)and(cid:4)(s)denotetherealandimaginarypartsofthecomplexnumber s, the typography coming from the standard TEX macros. Notation In addition to the standard notation of number theory we will use the fol- lowing notation. – We will often use the practical self-explanatory notation Z>0, Z(cid:2)0, Z<0, Z(cid:3)0,andgeneralizationsthereof,whichavoidusingexcessiveverbiage.On the other hand, I prefer not to use the notation N (for Z(cid:2)0, or is it Z>0?). – If a and b are nonzero integers, we write gcd(a,b∞) for the limit of the ultimately co(cid:3)nstant sequence gcd(a,bn) asn → ∞. We have of course gcd(a,b∞)= p|gcd(a,b)pvp(a), and a/gcd(a,b∞) is the largest divisor of a coprime to b. – If n is a nonzero integer and d | n, we writed(cid:7)n if gcd(d,n/d) = 1. Note thatthisisnot thesamethingasthecondition d2 (cid:2)n,exceptifdisprime. – If x∈R, we denote by(cid:9)x(cid:10) the largest integer less than or equal to x (the floor ofx),by(cid:11)x(cid:12)thesmallestintegergreaterthanorequaltox(theceiling ofx,whichisequalto(cid:9)x(cid:10)+1ifandonlyifx∈/ Z),andby(cid:9)x(cid:12)thenearest integer to x (or one of the two if x ∈ 1/2+Z), so that (cid:9)x(cid:12) = (cid:9)x+1/2(cid:10). We also set {x}=x−(cid:9)x(cid:10), thefractional part of x. Note that for instance (cid:9)−1.4(cid:10)=−2, and not −1 as almost all computer languages would lead us to believe. – For any α belonging to a field K of characteristic zero and any k ∈ Z(cid:2)0 we set (cid:4) (cid:5) α α(α−1)···(α−k+1) = . k k! (cid:6) (cid:7) In p(cid:6)ar(cid:7)ticular, if α∈Z(cid:2)0 we have αk =0 if k >α,(cid:6)an(cid:7)d in this case we will set α = 0 also when k < 0. On the other hand, α is undetermined for k k k <0 ifα /∈Z(cid:2)0.