ebook img

How to Print Floating-Point Numbers Accurately PDF

18 Pages·2004·2 MB·English
by  
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview How to Print Floating-Point Numbers Accurately

RETROSPECTIVE: How to Print Floating-Point Numbers Accurately Guy L. Steele Jr. Jon L White SunMicrosystemsLaboratories LispWizard 1NetworkDrive,UBUR02-311 2001TheForwardMarchofTechnologyDrive Burlington,MA 01803 USA Oceanview,Kansas 99999 USA [email protected] [email protected] 1. History had coded some variant of Taranto’s algorithm [31]—“I found it OurPLDIpaper[28]wasalmost20yearsinthemaking. inKnuth!”[18,ex.4.4–3]. Somealsodismissedexamplesofdis- How should the result of dividing 1.0 by 10.0 be printed? crepancies with hand-waving generalizations like those found in In 1970, one usually got “0.0999999” or “0.099999994”; Knuth (“::: floating point arithmetic is inherently approximate” whynot“0.1”? (Theproblemofbinary-to-decimalconversionis [18, section4.2.1])andlaterinSteele(“Ingeneral, computations amazinglytricky,evenforfixed-pointfractions,letalonefloating- with floating-point numbers are only approximate.” [29, section point numbers [31][22][20][12].) Someprogramming systems of 12.1]. But Whitesaw no reason why theprinting problem could thedayrequiredaprespecifiedlimitforthenumberofdecimaldig- notbesolvedexactly,andhesetouttofindaway. itstobeprintedforabinaryfloating-pointnumber.If,inourexam- Onesuggestedplanforobtainingtheshortestoutputthatcorrectly ple,thelimitweresettoninedecimalplaces,thentheprintedresult preserves the floating-point value was simply to produce many might well be0.099999994; but ifthe limitwereset to seven moredecimaldigitsthannecessary; thenonewould“roundback- decimal places, then poorly implemented printers would produce wards”indecimal,convertingthedecimalstringbackintobinary 0.0999999 and those that took the trouble to perform decimal ateachstep,untiltheshorteststringofdigitswasfoundthatwould roundingwouldproduceeither0.1000000or0.1. convert backintotheoriginalnumber. However, thisapproach is Whatisthecorrectnumberofdigitstoproduceiftheuserdoesn’t quiteinefficient.Itfailstotakeadvantageofinformationthatisin- specify? Ifasystemprintstoomanydigits,theexcessdigitsmay crementallyavailableduringthestepsofdigitgenerationandthat be“garbage,”reflectingmoreinformationthanthenumberactually adequately characterizes thedifference betweenwhat has already contains;ifasystemprintstoofewdigits,theresultwillbewrong (incrementally)beenoutputtedandtheoriginalnumber. inastrongersense: convertingthedecimalrepresentationbackto The first attempt to take advantage of such information was by binarymaynotrecovertheoriginalbinaryvalue. White,laterin1971.By1972,whenSteelejoinedWhiteatMITto Itreallybuggedoneofus(White)thatsystemsofthedaynotonly workonMacLisp,therewasarudimentaryversionofthealgorithm producedincorrectdecimalrepresentationsofbinarynumbers,but intheMacLispPRINTfunction, betterthanpreviousprintingal- provided no guaranteed bound on how wrong they might be. In gorithms but not yet correct in all cases. White then decided to early 1971, he began to analyze the machine-language algorithm trytrackingtheerrorpropagationbywatchingwhathappenedtoa usedintheimplementationofMacLisp[23][27]toconvertPDP-10 value equal to 12 ulp, subjected tothesame arithmeticoperations floating-pointnumbersintodecimalnotation. Healsoinvestigated thatwerebeingperformedonthefractionbeingprinted. Thisline the behavior of other language systems, both those that printed a ofinvestigationledtotheuseofthevariableMinouralgorithms. variablenumberofdigitsandthosethatprintedafixednumberof (Wecalledthisvalue“M”becauseWhitethoughtofitasdefining digitsbutfailedtoperformdecimalroundingcorrectly. a “mask,” the complement of the interval [M;1−M], that filters Using the “bignum” (high-precision integer) facilities that had (“masks”) outfractionalvaluesforwhichenough digitshadbeen recently beenadded toMacLispfor other purposes, Whiteinves- generated and passes through only fractions for which more dig- tigatedtheexactrepresentationsofcertainfloatingpointnumbers itsarerequired, namely thoselyinginthegap[M;1−M]. AsM thatseemedtoprintoutwithtoomanydecimaldigits. Hediscov- grows, themaskgrowsandtheintervalatitscentershrinks, until ered that virtually all of the systems he examined suffered from eventuallyM exceeds 12,thegapshrinksawaytonothing, andno oneormoreofthefollowingproblemsonmanyfloating-pointval- remainingfractioncanpassthroughthemask.) ues: (a) printing many more decimal digits than were necessary, When Steele left MIT in 1980 to go to Carnegie-Mellon Uni- (b)printingdecimalvaluesthatdifferedfromthebinaryvaluesby versity, MacLisp was using the algorithm Dragon2, withdouble- many units in the last place (ulps), and (c) exhibiting substantial precision floating-point prescaling of values that were large or “drift” when a value isprinted, then read back in, the new value small enough to require “E” notation. By about this time Steele printedandagainreadbackin,andsoon. had writtenaveryearlydraftof thePLDIpaper. (Hewroteitin White found that the programmers or authors of these systems TEX, so it must have been after Fall 1978, when he first ported invariablyclaimedthattheirsystemsmustbeproducing“correct” TEXfromStanfordtoMIT,anditmusthavebeenpriortothe1981 results because (1) they produced lots of decimal digits, making publicationofthesecondeditionof“KnuthVolume2”[19],which no claim of producing “shortest” representations, and/or (2) they mentionsthisunpublisheddraftintheanswertoexercise4.4–3.) Steele continued to polish the algorithm, off and on, during the early 1980s. His principal contributions were (1) to replace floating-pointprescalingwithimplicitrationalprescaling,thusin- 20 Years of the ACM/SIGPLAN Conference on Programming Language DesignandImplementation(1979-1999): ASelection,2003. troducingthescalefactorStothealgorithm(andincidentallycom- Copyright2003ACM1-58113-623-4...$5.00. ACM SIGPLAN 372 Best of PLDI 1979-1999 mitting to the use of bignum arithmetic in the printing algorithm ::: theresult [of abinary-to-decimal conversion] isexpressed itself);(2)tousetwomaskquantitiesM+ andM− ratherthanjust usingtheminimumnumberofdigits::: one; (3)toseparatethedigit-generation processfromtheformat- Thefifthrevisionof thereport [17] hassimilarwording but cites tingprocess,organizingthemascoroutines;(4)toprovideaproof [2]ratherthan[28]. ofcorrectnessforthedigit-generationprocess;and(5)tocodethe Steele had a hand in establishing conversion accuracy require- whole thing up (in Pascal), including many different formatters. ments in The Java Language Specification [11, pp. 506–507], (ThePL/I-style“picture”formatterwasomittedfromthePLDIpa- which requires that all binary-to-decimal and decimal-to-binary perforlackofspace,alas.)Steelefinishedallthisby1985. conversions becorrectlyrounded and thatbinary-to-decimal con- Duringthe1980s,Whiteinvestigatedthequestionofwhetherone versionsgenerateasfewdigitsaspossible. coulduselimited-precisionarithmeticafterallratherthanbignums. Thedescriptionofthe“basislibrary”forStandardMLsays[25]: Hehadearlierprovedbyexhaustivetestingthatjust7extrabitssuf- toDecimalshouldproduceonlyasmanydigitsasareneces- ficeforcorrectlyprinting36-bitPDP-10floating-pointnumbers,if saryforfromDecimaltoconvert back tothesame number, powersoftenusedforprescalingareprecomputedusingbignums i.e., for any Normal or SubNormal real value r, we have: androundedjustonce.Butcanonederive,withoutexhaustivetest- fromDecimal(toDecimalr)=r ::: Algorithmsforac- ing,thenecessaryamountofextraprecisionsolelyasafunctionof curatelyandefficientlyconvertingbetweenbinaryanddecimal theprecisionandexponentrangeofafloating-pointformat? This realrepresentationsarereadilyavailable,e.g.,seethetechnical problemisstillopen,andappearstobeveryhard. reportby[9]. It weighed on Steele’s mind that Knuth had shown confidence inhimbyciting—inabook!—apaperthatdidn’treallyexistyet. Soonafterthat,thelanguageLimboadoptedaccuratebasecon- EventuallySteelesummonedtheefforttocompletethepaperand versionasoneofitsimprovementsoverC[13,p.271]: submitittothe1990PLDI,justtoridhimselfofthenaggingbur- The most important numerical development at the language den. Imagine our surprise when we discovered that Will Clinger level recently has been accurate binary/decimal conversion hadwrittenapaperonthedecimal-to-binaryproblemandsubmit- [3][9][28]. Thus printing a real using %g and reading it on a ted it to the same conference! (By the way, the third edition of differentmachineguaranteesrecoveringidenticalbits. “KnuthVolume2”[21]citesourPLDIpaper,soalliswellnow.) AsforHaskell,acommentthatappearsinthecodeforthefunc- Sinceourpaperwaspublished, thereseemtohavebeenexactly tionfloatToDigitsintheHaskell98library[16, p.14] says twofollow-uppapers[9][2]ofanyconsequence.Bothprovidedim- thatthecodeisbasedonBurgerandDybvig’swork[2]. portantpracticalimprovementsonourwork;werecommendthem Borneoisanumericalprogramminglanguage[4,pp.7,9]: tointerestedreadersandespeciallytoimplementors. Althoughpublishedalgorithmsexistforbothcorrectlyrounded input [3] and output [28], conversion problems persist. Cor- 2. Influence onProgrammingLanguages rectlyroundedalgorithmsarealsoacceptablyfastforcommon Sinceourpaperwaspublished,ithashadaclearinfluenceonthe cases[2],[9]. WhileworkingontheBEEFtestsfortranscen- specificationandimplementationofmanyprogramminglanguages. dentalfunctions,itwasdiscoveredthattheTurboC1.0compiler We had coded an early version of this algorithm as part of the didnotconvert11.0exactlyintoafloatingpointnumberequal runtime system for MacLisp while we were at MIT in the late to11!::: Javarequirescorrectlyroundeddecimaltobinaryand 1970s; this version was not completely accurate because it first binarytodecimalconversion::: BorneomaintainsJava’sbase usedfloating-pointoperationstoscalethenumbertobeprintedby conversionrequirements::: a power of 10 if the number was very large or small, but it was SteeleservedasProjectEditorforthefirsteditionoftheECMA completelyaccuratefornumbersnotrequiringsuchprescaling. standard for the web browser scripting language ECMAScript The MacLisp successor Lisp Machine Lisp, also developed at (more popularly known as JavaScript), which recommends, but MIT,wasinfluencedbyourwork[32,p.281]: doesnotrequire,accurateconversions[5,p.36]: Thenumberofdigitsprintedisthe“correct”number;noinfor- NOTE:Thefollowingobservationsmaybeusefulasguidelines mationpresent intheflonumislost, andnoextratrailingdig- forimplementations,butarenotpartofthenormativerequire- itsareprintedthatdonotrepresentinformationintheflonum. mentsofthisStandard: Ifxisanynumber valueotherthan0, Feeding the [printed representation] of a flonum back to the thenToNumber(ToString(x))isexactlythesamenumbervalue readerisalwayssupposedtoproduceanequalflonum. asx.::: ImplementorsofECMAScriptmayfindusefulthepa- The documentation for the commercial version of this language, per and code written by David M. Gay for binary-to-decimal Symbolics’Zetalisp,hassimilarwording[30,p.15]. conversionoffloating-pointnumbers[9]. After publication of our paper, the idea began to spread, to the languageId,anotherMITproject,in1991[1,p.15]: A recent extension of the GNU Fortran compiler makes use of accurateconversiontechniques[26,p.319]: Utilitiesfor accurate reading and printing of double precision floating point numbers have been implemented. The floating ::: supportroutinesforperformingconversionbetweencharac- pointreader[3]hasbeenimplementedinCommonLispandId ter strings and intervals ::: weredeveloped based on routines 90,thefloatingpointprinter[28]inCommonLisp. forfloating-pointinput[9]andfloating-pointoutput[2]. andtoModula-3[14,p.15]: Finally,DavidM.Gayhimselfhasbeenworkingonanalgebraic Theideaofconvertingtodecimalbyretainingjustasmanydig- modelinglanguagecalledAMPL[10,p.3]: itsasarenecessarytoconvertbacktobinaryexactlywaspop- AMPL and its solver interface library use correctly rounded ularizedbyGuyL.SteeleJr. andJonLWhite[28]. DavidM. binary$decimalconversions,whichisnowpossibleonallma- Gaypointedouttheimportance::: ofdemandingthatthecon- chines where AMPL has run other than old Cray machines. versiontobinaryhandlemid-pointcasesbyaknownrule[9]. Details are described in Gay [9]. Part of the reason for men- andtothefourthrevisionofthereportonScheme,whichcites[28] tioningbinary$decimal conversions here istopoint out are- and[3]andalsoremarks[24,p.24]: cent extension to Gay [9] that carries out correctly rounded ACM SIGPLAN 373 Best of PLDI 1979-1999 conversionsforotherarithmeticswithpropertiessimilartobi- eds.,Beautyisourbusiness:abirthdaysalutetoEdsgerW. naryIEEEarithmetic. Thisincludescorrectdirectedroundings Dijkstra.Springer-Verlag(Berlin,1990),141–148. and rounding of a decimal string to a floating-point interval [13] Grosse,Eric.RealInferno.InBoisvert,RonaldF.,editor,The of width at most one unit in the last place, both of which are QualityofNumericalSoftware:Assessmentand obviously useful for rigorous interval computations. There is Enhancement: Proc.IFIPTC2/WG2.5WorkingConf., nopaperyetaboutthiswork, butthesourcefilesareavailable Oxford,UnitedKingdom,8–12July1996.ChapmanHallon asftp://netlib.bell-labs.com/netlib/fp/gdtoa.tgzwhichincludesa behalfofIFIP(London,1997),270–279. READMEfilefordocumentation. [14] Horning,Jim,Kalsow,Bill,McJones,Paul,andNelson, The IEEE 754 standard for floating-point arithmetic [15] has a Greg.SomeUsefulModula-3Interfaces.Memo113.Digital conversion accuracy requirement, butaliberalone. Nowthat the EquipmentCorporationSystemsResearchCenter(PaloAlto, practicality of our approach has been established, we have high CA,December1993). hopes that anupcoming revision of thatstandard willadopt are- [15] IEEEStandardforBinaryFloating-PointArithmetic, quirementoffullconversionaccuracy. ANSI/IEEEStd754-1985edition.IEEE(NewYork,1985). [16] StandardlibrariesfortheHaskell98programminglanguage. 3. Why“Dragon”? http://www.haskell.org/definition/haskell98-library.pdf, February1999. A very few readers have wondered why the principal algorithms [17] Kelsey,Richard,Rees,Jonathan,Clinger,William,etal.The inourworkarenamed“Dragon2”and“Dragon3”and“Dragon4”; revised5reportonthealgorithmiclanguageScheme.ACM this was an obscure joke (entirely Steele’s fault) alluding to the SIGPLANNotices33,9(September1998),26–76. “dragon curves” discovered and analyzed by John E. Heighway, [18] Knuth,DonaldE.SeminumericalAlgorithms. BruceA.Banks,andWilliamG.HarterandreportedonbyMartin Addison-Wesley(Reading,MA,1969). Gardner [7][8]. Theinitiallettersinthemultiworddescriptionof a “Dragon” algorithm forma sequence of letters‘F’ and ‘P’ that [19] Knuth,DonaldE.SeminumericalAlgorithms(Second representsthesequenceof(valley)Foldsand(mountain)Peaksin Edition).Addison-Wesley(Reading,MA,1981). apieceofpaperthathasbeenfoldedtoformadragoncurve. [20] Knuth,DonaldE.Asimpleprogramwhoseproofisn’t.In Feijen,W.H.J.,vanGasteren,A.J.M.,Gries,D.,andMisra, REFERENCES J.,eds.,Beautyisourbusiness:abirthdaysalutetoEdsger W.Dijkstra.Springer-Verlag(Berlin,1990),233–242. [1] Boughton,G.A.,editor.ComputationStructuresGroup [21] Knuth,DonaldE.SeminumericalAlgorithms(ThirdEdition). ProgressReport1990–91.CSGMemo337.MITLaboratory Addison-Wesley(Reading,MA,1998). forComputerScience(Cambridge,MA,June1991). [22] Matula,DavidW.In-and-outconversions.Communications [2] Burger,RobertG.,andDybvig,R.Kent.Printing oftheACM11,1(January1968),47–50. floating-pointnumbersquicklyandaccurately.InProc.ACM [23] Moon,DavidA.MacLISPReferenceManual.MITProject SIGPLAN’96Conf.Prog.Lang.DesignandImplementation. MAC(Cambridge,MA,April1974). ACM(Philadelphia,PA,June1996),108–116. [24] Rees,Jonathan,Clinger,William,etal.Therevised4report [3] Clinger,WilliamD.Howtoreadfloatingpointnumbers onthealgorithmiclanguageScheme.ACMSIGPLANLisp accurately.InProc.ACMSIGPLAN’90Conf.Prog.Lang. Pointers4,3(July–September1991),1–55. DesignandImplementation.ACM(WhitePlains,NY,June [25] Reppy,JohnH.,etal.http://cm.bell-labs.com/cm/cs/ 1990),92–101.ACMSIGPLANNotices25,6(June1990). what/smlnj/doc/basis/pages/real.html,October1997.Tobe [4] Darcy,JosephD.Borneo1.0.2:AddingIEEE754floating publishedas[6]. pointsupporttoJava.U.California,Berkeley,May1998. [26] Schulte,M.J.,Zelov,V.A.,Akkas,A.,andBurley,J.C.The [5] ECMAScriptLanguageSpecification,thirdedition.ECMA interval-enhancedGNUFortrancompiler.InCsendes,Tibor, (December1999).StandardECMA-262. ed.,DevelopmentsinReliableComputing.Kluwer [6] Gansner,EmdenR.,andReppy,JohnH.TheStandardML (Dordrecht,Netherlands,1999),311–322. BasisManual.CambridgeUniversityPress(NewYork, [27] Steele,Jr.,GuyL.,andGabriel,RichardP.Theevolutionof 2003).Notyetpublished—availablefromOctober2003. Lisp,pages233–330.InHistoryofProgramming [7] Gardner,Martin.Mathematicalgames.ScientificAmerican Languages.ACMPress(NewYork,1996),233–330. 216,3(March1967),124–125;216,4(April1967), [28] Steele,Jr.,GuyL.,andWhite,JonL.Howtoprint 118–120;217,1(July1967),115. floating-pointnumbersaccurately.InProc.ACMSIGPLAN [8] Gardner,Martin.MathematicalMagicShow.Vintage(New ’90Conf.Prog.Lang.DesignandImplementation.ACM York,1978),207–209,215–220. (WhitePlains,NY,June1990),112–126.ACMSIGPLAN [9] Gay,DavidM.CorrectlyRoundedBinary-Decimaland Notices25,6(June1990). Decimal-BinaryConversions.NumericalAnalysis [29] Steele,GuyL.,Jr.,Fahlman,ScottE.,Gabriel,RichardP., Manuscript90-10.AT&TBellLaboratories(MurrayHill, Moon,DavidA.,andWeinreb,DanielL.CommonLisp:The NJ,November1990). Language.DigitalPress(Burlington,MA,1984). [10] Gay,DavidM.Symbolic-AlgebraicComputationsina [30] ReferenceGuidetoSymbolics-Lisp.Symbolics,Inc. ModelingLanguageforMathematicalProgramming. (Cambridge,MA,March1985). TechnicalReport00-3-02.ComputingSciencesResearch [31] Taranto,Donald.Binaryconversion,withfixeddecimal Center,BellLaboratories(MurrayHill,NJ,July2000). precision,ofadecimalfraction.CommunicationsoftheACM [11] Gosling,James,Joy,Bill,andSteele,Guy.TheJavaLan- 2,7(July1959),27. guageSpecification.Addison-Wesley(Reading,MA,1996). [32] Weinreb,Daniel,andMoon,David.LISPMachineManual, [12] Gries,David.Binarytodecimal,onemoretime.InFeijen, ThirdEdition.MITArtificialIntelligenceLaboratory W.H.J.,vanGasteren,A.J.M.,Gries,D.,andMisra,J., (Cambridge,MA,March1981). ACM SIGPLAN 374 Best of PLDI 1979-1999 ACM SIGPLAN 375 Best of PLDI 1979-1999 ACM SIGPLAN 376 Best of PLDI 1979-1999 ACM SIGPLAN 377 Best of PLDI 1979-1999 ACM SIGPLAN 378 Best of PLDI 1979-1999 ACM SIGPLAN 379 Best of PLDI 1979-1999 ACM SIGPLAN 380 Best of PLDI 1979-1999 ACM SIGPLAN 381 Best of PLDI 1979-1999

Description:
amazingly tricky, even for fixed-point fractions, let alone floating- point numbers tigated the exact representations of certain floating point numbers.
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.