ebook img

Fast quantum algorithm for numerical gradient estimation PDF

0.13 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Fast quantum algorithm for numerical gradient estimation

Fast Quantum Algorithm for Numerical Gradient Estimation Stephen P. Jordan MIT Physics, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139 (Dated: February 1, 2008) Givenablackboxforf,asmoothrealscalarfunctionofdrealvariables,onewantstoestimate∇f at a given point x=(x1,x2,...,xd) with n bits of precision. On a classical computer this requires a minimum of d+1 blackbox queries, whereas on a quantum computer it requires only one query regardless ofd. Thenumberof bitsofprecision towhichf mustbeevaluatedmatchestheclassical requirement in the limit of large n. PACSnumbers: 03.67.Lx 5 In the context of many numerical calculations, black- estimationalgorithmtoperformthefunctionevaluations 0 boxquerycomplexityisanaturalmeasureofalgorithmic displaced by l/2 from x along each dimension so that 0 efficiency. For example, in optimization problems, the theregionbei±ngsamplediscenteredatx. Inthiscase2d 2 function evaluations are frequently the most time con- functionevaluationsarerequiredinsteadofd+1. ∂f/∂x 1 n suming part of the computation, and an efficient opti- willthenbegivenby(f(x +l/2,...) f(x l/2,...))/l, 1 1 a mization algorithm is therefore one which uses as few and similarly for the other partial de−rivativ−es. Inserting J function evaluations as possible [1]. the Taylor expansion for f about x into this expression 2 Here we investigate the query complexity of numeri- showsthatthequadratictermswillcancel,leavinganer- cally estimating the gradientof a blackbox function at a roroforderl2 andhigher. Onemustchoosel sufficiently 2 v given point. We find that gradients can be estimated on small that these terms are negligible. 6 a quantum computer using a single blackbox query. The Nowweconsiderthe quantumcase. Itsufficestoshow 4 algorithmwhichachievesthiscanbeviewedasageneral- how to perform a quantum gradient estimation at x = 1 izationoftheBernstein-Vazirani[2]algorithm,whichhas 0, since the gradient at other points can be obtained 5 beendescribedinothercontexts[3,4,5,6]. Theblackbox by trivially redefining f. To estimate the gradient at 0 in this algorithm has always previously been described the origin, start with d input registers of n qubits each, 4 0 asevaluatingafunctionovertheintegersratherthanap- plus a single output register of no qubits, all initialized / proximating a continuous function with finite precision. to zero. Perform the Hadamard transform on the input h Gradient finding is the first known practical variant of registers, write the value 1 into the output register and p this algorithm. In [3], the question as to whether the al- then perform an inverse Fourier transform on it. This - t gorithmcouldbe adaptedfor any task of practicalinter- yields the superposition n est was presented as an open problem, which this paper a u resolves. q The blackbox that we consider takes as its input d 1 N−1N−1...N−1 δ ... δ No−1ei2πa/No a iv: qbuinbairtys wsthriicnhgss,hoeuacldhboef sleetngttohzenr,o.alTonhge bwliatchkbnooxawnrciitlelas NdNo δX1=0δX2=0 δXd=0| 1i | di Xa=0 | i X itsoutputintotheancillabitsusingadditionmodulo2no pwhere N 2n and No 2no. In vector notation, ≡ ≡ r andpreservestheinputbits. Thisisastandardtechnique a for making any function reversible, which it must be for 1 a quantum computer to implement it. = δ ei2πa/No a NdN | i | i The purpose of the blackbox is to evaluate some func- o δ a X X tion f : Rd → R with no bits of precision on a finite Next, use thpe blackbox to compute f and add it mod- domain. It does so in fixed-point notation. That is, the uloN intotheoutputregister. Theoutputregisterisan inputsandoutputstothefunctionf,whicharerealnum- o eigenstate of addition modulo N . The eigenvalue corre- berswithinafiniterange,areapproximatedbytheinputs o sponding to addition of x is ei2πx/No. Thus by writing and outputs to the blackbox, which are integers ranging into the output register via modular addition, we obtain from 0 to 2n, and 0 to 2no, respectively, via appropriate a phase proportional to f. This technique is sometimes scaling and offset. called phase kickback. The resulting state is Fornumericalgradientestimationtowork,intheclas- sical or quantum case, f and its first partial deriva- tives must be continuous in the vicinity of the point 1 ei2πmNlf(Nl (δ−N2)) δ ei2πa/No a C(xl1a,sxsi2c,a.l.ly.,,txod)esattimwahtiech tfheatgxraidniedntdiims teonsbioenesv,aolnueatceadn. NdNo Xδ | iXa | i evaluatef atxandatd∇additionalpoints,eachdisplaced whpereNisthed-dimensionalvector(N,N,N,...),and from f along one of the d dimensions. l is the size of the region over which f is approximately Inpractice,itmaybedesirableintheclassicalgradient linear. l andNareusedtoconvertfromthe components 2 of δ, which are nonnegative integers represented by bit thatboundsforthevaluesofthecomponentsareknown, strings, to rationals evenly spaced over a small region which is a requirement for any algorithm using fixed- centered at the origin. Similarly, the blackbox output is point arithmetic. relatedtothevalueoff bya a NNof mod N . m In general the number of bits of precision necessary → ⊕⌈ ml ⌋ o is the size of the interval which bounds the components to represent a set of values is equal to log (r/δ), where 2 of f. Thisensuresproperscalingofthefinalresultinto r is the range of values, and δ is the smallest difference ∇ afixedpointrepresentation,thatis, asanintegerfrom0 in values one wishes to distinguish. Thus for classical to 2n. gradientestimationwithnbitsofprecision,oneneedsto For sufficiently small l, evaluate f to 1 ei2mπNl (f(0)+Nl (δ−N2)·∇f) δ ei2πa/No a . log max(f)−min(f) (1) ≈ NdNo Xδ | iXa | i 2" m2nl # Wpriting out the vector components, and ignoring bits of precision. global phase, the input registers are now approximately An important property of the quantum Fourier trans- in the state form is that it can correctly distinguish between expo- nentiallymanydiscreteplanewavestateswithhighprob- ability without requiring the phases to be exponentially = 1 ei2mπ δ1∂∂xf1+δ2∂∂xf2+...+δd∂∂xfd precise [7]. It is not hard to show that if each phase √Nd (cid:16) (cid:17)× is accurate to within θ then the inner product between δ1X...δd the ideal state and the actual state is at least cosθ, and δ δ ... δ . | 1i| 2i | di thereforethealgorithmwillstillsucceedwithprobability at least cos2θ. This is a product state: As shown earlier, the phase acquired by “kickback” is equalto 2πNf,andtherefore,forthephasetobeaccurate ml = 1 ei2mπδ1∂∂xf1 δ1 ... ei2mπδd∂∂xfd δd . tTohuwsi,threincal±liθn,gfthamtuNst=be2ne,valuated to within ±2mπNl θ. √Nd | i! | i! Xδ1 Xδd Fourier transform each of the registers, obtaining maxf minf n =log − . (2) o 2 ml θ " 2n 2π # N ∂f N ∂f N ∂f ... . As anexample, if θ =π/8,then the algorithmwillbe- m∂x m∂x m∂x (cid:12) 1(cid:29)(cid:12) 2(cid:29) (cid:12) d(cid:29) have exactly as in the idealized case with approximately (cid:12) (cid:12) (cid:12) Then sim(cid:12)ply measu(cid:12)re in the co(cid:12)mputational basis to 85% probability, and No will exceed the classically re- (cid:12) (cid:12) (cid:12) quired precision by four bits, for a given value l. l also obtain the components of f with n bits of precision. ∇ differs between the quantum and classical cases, as will Because f will in general not be perfectly linear, even be discussed later. Thus n differs from the classically overa small region,there also willbe nonzeroamplitude o required precision only by an additive constant which to measure other values close to the exact gradient, as depends on θ and l. Because the classical and quantum will be discussed later. precision requirements are both proportional to n, this Normally, the quantum Fourier transform is thought difference becomes negligible in the limit of large n. of as mapping the discrete planewave states to the com- Theonlyapproximationmadeinthedescriptionofthe putational basis states: quantum gradient estimation algorithm was expanding f to first order. Therefore the lowest order error term N will be due to the quadratic part of f. The behavior of 1 e2πijk/N j k the algorithm in the presence of such a quadratic term √N | i→| i j=0 providesanidea ofits robustness. Furthermore,in order X to minimize the number of bits of precision to which f where 0 < k < N. However, negative k is also easily mustbeevaluated,lshouldbechosenaslargeaspossible dealt with, since subject to the constraint that f be locally linear. The analysis of the quadratic term provides a more precise description of this constraint. N 1 e 2πijk/N N k . The series of quantum Fourier transforms on differ- − | | √N →| −| |i ent registers can be thought of as a single d-dimensional j=0 X quantum Fourier transform. Including the quadratic Thus negative components of f pose no difficulties term, the state which this Fourier transform is acting ∇ for the quantum gradient estimation algorithm provided on has amplitudes 3 The expectation values on a hypercube of uniform probability are x x = 1 δ , thus 1 l h i ji 12 ij f(δ)= exp i2π δ f + δTHδ , Nd/2 ·∇ 2mN (cid:20) (cid:18) (cid:19)(cid:21) 1 N2l2 ∂2f 2 where H is the Hessian matrix of f. After the Fourier σi2 = 12 A2ij = 12m2 ∂x ∂x . transform, the amplitudes should peak around the cor- j j (cid:18) i j(cid:19) X X rect value of f. Here we are interested in the width ∇ This quadratic dependence on N is just as expected of the peak, which should not be affected by f, so for ∇ since, at the end of the computation, the register that simplicity it will be set to 0. The Fourier transform will we aremeasuringis intendedto contain N ∂f. Therefore yield amplitudes of [8] m∂x the uncertainty in ∂f/∂x is approximately i 1 l 1 f˜(k)= Nd exp i2π 2mNδTHδ− Nk·δ . l ∂2f 2 δ (cid:20) (cid:18) (cid:19)(cid:21) (3) X 2√3vu j (cid:18)∂xi∂xj(cid:19) Ignoring global phase and doing a change of variables uX t (u=δ/N), independent of N. In the classical algorithm which uses 2d function evaluations, the cubic term introduces 1/2 ... 1/2 exp i2π NluTHu k u ddu atundeerroofrtohfirσd∼pa2rl24tiDal3dwehrievraetiDve3siosfthfe. tIyfptihceal2[n9d] mpaargtniai-l ≈ 2m − · Z−1/2 Z−1/2 (cid:20) (cid:18) (cid:19)(cid:21) derivatives of f have a magnitude of approximately D2 then the typical uncertainty in the quantum case will be Thisintegralcanbeapproximatedusingthemethodof stationaryphase. φ= Nl HT +H u kbutHessians σ lD2√d. To obtain a given uncertainty σ, ∇ 2m − ∼ 2√3 aresymmetric,so φ= NlHu k.Thus(againignoring ∇ m (cid:0) − (cid:1) global phase), 2 6σ classical D3 l  q f˜(k) Det(1NmlH) if ∃ u∈C s.t. NmlHu−k=0 ∼ D2√2√3σd quantum ≈ q0 otherwise Recalling Eq. (1)and (2), the number of bits of pre- cisionto whichf must be evaluateddepends logarithmi- where C is the region 1/2 < u < 1/2 i. So ac- cally on l. However, in the limit of large n, the number i − ∀ cordingto the stationaryphase approximation,the peak of bits will match the classical requirement. is simply a region of uniform amplitude, with zero am- The level of accuracy of the stationary phase approxi- plitude elsewhere. Geometrically, this region is what is mationcanbeassessedbycomparisontonumericalsolu- obtained by applying the linear transformation NlH to tionsofexamplecases. Inonedimension,Eq. (3)reduces the d-dimensional unit hypercube. m toσ2 = α2N2 whereα= l ∂2f. Figures2and1display Since we have set f = 0, the variance of N ∂f will the close a3greement betw2emen∂xn2umerical results and the ∇ m∂xi be analytical solution obtained using stationary phase. A two dimensional example provides a nontrivial test of the stationary phase method’s prediction of the peak 1 Nl σ2 = k2ddk where A= H shape. If the Hessian is such that i DetA i m ZD and D is the region of nonzero amplitude. Doing a N 1 1 change of variables with A as the Jacobian, H =0.1 m 1 1 (cid:20) − (cid:21) then,accordingtothestationaryphaseapproximation, 1 σi2 = DetAZC(Ak′)2iDetA ddk′ t4h5e proetaaktisohno.ulTdhbiseisaisnqureaaresoonfabsildeeaglernegemthen√1t02wliwthiththae ◦ where C is again the unit hypercube centered at the numerical result, as shown in figure 3. origin. In components, Because this algorithm requires only one blackbox query, one might expect that it could be run recursively to efficiently obtain higher derivatives. In this case, an- 2 otherinstance ofthe samealgorithmservesasthe black- σi2 =  Aijkj′ ddk′. box. However,thealgorithmitselfdiffersfromtheblack- ZC j boxinthatthe blackboxhasscalaroutput whichit adds X   4 σ2 3.5 3 40 40 2.5 20 20 2 1.5 0 0 1 -20 -20 0.5 -40 -40 N 25 50 75 100 125 150 -40 -20 0 20 40 -40 -20 0 20 40 FIG. 1: Comparison between error estimates obtained in FIG. 3: On the left, the probability, as numerically cal- thestationaryphaseapproximation(solidline)andnumerical results (points) for the one dimensional case. Here the 2nd culated, is shown. The areas of highest probability appear darkest. On the right, the region of nonzero probability, as derivative remains constant (α = 0.02), and the number of calculated in the stationary phase approximation, is shaded bits to which the gradient is being evaluated is varied. in black. σ2 the evaluation of an nth partial derivative, just as in the classical case. 20 The problem of global phase when recursing quantum 15 algorithmsaswellasthedifficultiesinherentinrecursing approximate or probabilistic algorithms are not specific 10 to gradient finding but are instead fairly general. 5 Efficient gradient estimation may be useful, for ex- ample, in some optimizationand rootfinding algorithms. α Furthermore, upon discretization, the problem of mini- 0.02 0.04 0.06 0.08 0.1 mizingafunctionalisconvertedintotheproblemofmin- imizing a function of many variables, which might ben- FIG.2: Analyticalerrorestimates(solidline)areagaintested against numerical results (thepoints) in the onedimensional efit from gradient descent techniques. A speedup in the case,varyingthe2nd derivativeinsteadofthenumberofbits. minimization of functionals may in turn enable more ef- Here N =80. ficient solution of partial differential equations via the Euler-Lagrangeequation. The analysis of the advantage which this technique can provide in quantum numerical modulo No to the existing value in the output register, algorithms remains open for further research. and it does not incur any input-dependent global phase. Anadditivescalaroutputcanbeobtainedbyminormod- The author thanks P. Shor, E. Farhi, L. Grover, J. ification to this algorithm, but the most straightforward Traub, and M. Rudner for useful discussions, and MIT’s techniquesforeliminatingtheglobalphaserequireanad- Presidential Graduate Fellowship program for financial ditionalblackboxquery,thusnecessitating2n queriesfor support. [1] W.H.Press, S.A.Teukolsky,W.T.Vetterling,andB.P. [6] C. H. Bennett, G. Brassard, P. Høyer, and U. Vazirani, Flannery,Numerical Recipes in C (Cambridge University SIAMJournal of Computing 26, 1510 (1997). Press, 1992), 2nd ed. [7] M. A. Nielsen and I. L. Chuang, Quantum Computation [2] E. Bernstein and U. Vazirani, Proceedings of the 25th and Quantum Information (Cambridge University Press, ACM Symposium on the Theory of Computing pp. 11– 2000), (See exercise 5.6). 20 (1993). [8] δ herereally representsδ−N/2. [3] M. Mosca, Ph.D. thesis, University of Oxford (1999). [9] Alternatively,wecan defineD3 and D2 as thelargest 2nd [4] P. Høyer, Physical Review A 59, 3280 (1999). and 3rd partial derivatives of f to obtain a worst case [5] R. Cleve, A. Ekert, C. Macchiavello, and M. Mosca, Pro- requirement on l. ceedings of the Royal Society of London, Series A 454, 339 (1998).

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.