Applications of Mathematics
Stochastic Modelling and Applied Probability
35

Stochastic Mechanics
Random Media
Signal Processing and Image Synthesis
Mathematical Economics and Finance
Stochastic Optimization
Stochastic Control
Stochastic Models in Life Sciences
Edited by B. Rozovskii and M. Yor

Advisory Board: D. Dawson, D. Geman, G. Grimmett, I. Karatzas, F. Kelly, Y. Le Jan, B. Øksendal, G. Papanicolaou, E. Pardoux
Harold J. Kushner G. George Yin
Stochastic Approximation
and Recursive Algorithms
and Applications
Second Edition
With 31 Figures
Harold J. Kushner
Division of Applied Mathematics
Brown University
Providence, RI 02912, USA
Harold_Kushner@Brown.edu

G. George Yin
Department of Mathematics
Wayne State University
Detroit, MI 48202, USA
gyin@math.wayne.edu

Managing Editors

B. Rozovskii
Center for Applied Mathematical Sciences
Denney Research Building 308
University of Southern California
1042 West Thirty-sixth Place
Los Angeles, CA 90089, USA
rozovski@math.usc.edu

M. Yor
Laboratoire de Probabilités et Modèles Aléatoires
Université de Paris VI
175, rue du Chevaleret
75013 Paris, France
Cover illustration: Cover pattern by courtesy of Rick Durrett, Cornell University, Ithaca, New York.

Mathematics Subject Classification (2000): 62L20, 93E10, 93E25, 93E35, 65C05, 93-02, 90C15
Library of Congress Cataloging-in-Publication Data

Kushner, Harold J. (Harold Joseph), 1933–
  Stochastic approximation and recursive algorithms and applications / Harold J. Kushner, G. George Yin.
    p. cm. — (Applications of mathematics; 35)
  Rev. ed. of: Stochastic approximation algorithms and applications, c1997.
  ISBN 0-387-00894-2 (acid-free paper)
  1. Stochastic approximation. 2. Recursive stochastic algorithms. 3. Recursive algorithms.
  I. Kushner, Harold J. (Harold Joseph), 1933– . Stochastic approximation algorithms and applications. II. Yin, George, 1954– . III. Title. IV. Series.
QA274.2.K88 2003
519.2—dc21    2003045459

ISBN 0-387-00894-2    Printed on acid-free paper.
© 2003, 1997 Springer-Verlag New York, Inc.

All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden.
The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights.

Printed in the United States of America.

9 8 7 6 5 4 3 2 1    SPIN 10922088

Typesetting: Pages created by the authors in LaTeX 2.09 using Springer's svsing.sty macro.

www.springer-ny.com

Springer-Verlag New York Berlin Heidelberg
A member of BertelsmannSpringer Science+Business Media GmbH
To Our Parents,
Harriet and Hyman Kushner
and
Wanzhen Zhu and Yixin Yin
Preface and Introduction
The basic stochastic approximation algorithms introduced by Robbins and Monro and by Kiefer and Wolfowitz in the early 1950s have been the subject of an enormous literature, both theoretical and applied. This is due to the large number of applications and the interesting theoretical issues in the analysis of “dynamically defined” stochastic processes. The basic paradigm is a stochastic difference equation such as θ_{n+1} = θ_n + ε_n Y_n, where θ_n takes its values in some Euclidean space, Y_n is a random variable, and the “step size” ε_n > 0 is small and might go to zero as n → ∞. In its simplest form, θ is a parameter of a system, and the random vector Y_n is a function of “noise-corrupted” observations taken on the system when the parameter is set to θ_n. One recursively adjusts the parameter so that some goal is met asymptotically. This book is concerned with the qualitative and asymptotic properties of such recursive algorithms in the diverse forms in which they arise in applications. There are analogous continuous-time algorithms, but the conditions and proofs are generally very close to those for the discrete-time case.
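The basic recursion is easy to sketch in code. The following minimal Python illustration (an assumption-laden sketch, not an excerpt from the book) takes a scalar θ, a hypothetical regression function whose root θ = 2 is sought, additive Gaussian noise, and the classical step-size choice ε_n = 1/(n+1):

```python
import random

def stochastic_approximation(g_noisy, theta0, n_steps):
    """Run the basic recursion theta_{n+1} = theta_n + eps_n * Y_n
    with the classical decreasing step sizes eps_n = 1/(n+1)."""
    theta = theta0
    for n in range(n_steps):
        eps = 1.0 / (n + 1)     # step sizes going to zero as n -> infinity
        Y = g_noisy(theta)      # noise-corrupted observation at theta
        theta = theta + eps * Y
    return theta

# Hypothetical example: noisy observations of g(theta) = 2 - theta,
# whose root is theta = 2.
random.seed(0)
g_noisy = lambda th: (2.0 - th) + random.gauss(0.0, 1.0)
theta_hat = stochastic_approximation(g_noisy, theta0=0.0, n_steps=20000)
print(theta_hat)   # close to 2
```

For this linear test problem the iterate is essentially a running average of the noisy observations, which is exactly the implicit averaging effect discussed later in this preface.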
The original work was motivated by the problem of finding a root of a continuous function ḡ(θ), where the function is not known but the experimenter is able to take “noisy” measurements at any desired value of θ. Recursive methods for root finding are common in classical numerical analysis, and it is reasonable to expect that appropriate stochastic analogs would also perform well.
In one classical example, θ is the level of dosage of a drug, and the function ḡ(θ), assumed to be increasing with θ, is the probability of success at dosage level θ. The level at which ḡ(θ) takes a given value v is sought. The probability of success is known only by experiment at whatever values of θ are selected by the experimenter, with the experimental outcome being either success or failure. Thus, the problem cannot be solved analytically. One possible approach is to take a sufficient number of observations at some fixed value of θ, so that a good estimate of the function value is available, and then to move on. Since most such observations will be taken at parameter values that are not close to the optimum, much effort might be wasted in comparison with the stochastic approximation algorithm θ_{n+1} = θ_n + ε_n [v − observation at θ_n], where the parameter value moves (on the average) in the correct direction after each observation. In another example, we wish to minimize a real-valued continuously differentiable function f(·) of θ. Here, θ_n is the nth estimate of the minimum, and Y_n is a noisy estimate of the negative of the derivative of f(·) at θ_n, perhaps obtained by a Monte Carlo procedure. The algorithms are frequently constrained in that the iterates θ_n are projected back to some set H if they ever leave it. The mathematical paradigms have posed substantial challenges in the asymptotic analysis of recursively defined stochastic processes.
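The dosage example can be made concrete. In the sketch below, the unknown success probability ḡ(θ) is taken to be a logistic curve, the target value is v = 0.8, the step-size constant is a tuning choice, and the iterates are projected back onto an interval H whenever they leave it; the curve, target, and interval are all assumptions for illustration only:

```python
import math
import random

def projected_dosage_search(success_prob, v, theta0, H, n_steps):
    """Stochastic approximation for the dosage example:
    theta_{n+1} = Proj_H( theta_n + eps_n * (v - outcome at theta_n) ),
    where the outcome is a Bernoulli success/failure observation."""
    lo, hi = H
    theta = theta0
    for n in range(n_steps):
        eps = 5.0 / (n + 1)   # the constant 5.0 is an illustrative tuning choice
        outcome = 1.0 if random.random() < success_prob(theta) else 0.0
        theta = theta + eps * (v - outcome)  # moves the right way on average
        theta = min(max(theta, lo), hi)      # project back onto H
    return theta

# Hypothetical increasing dose-response curve; its root for v = 0.8
# is theta = 3 + ln(4) ~ 4.386.
random.seed(1)
g = lambda th: 1.0 / (1.0 + math.exp(-(th - 3.0)))
theta_hat = projected_dosage_search(g, v=0.8, theta0=1.0, H=(0.0, 10.0),
                                    n_steps=50000)
print(theta_hat)
```

Each step uses only one success/failure observation, yet (on the average) the parameter drifts toward the sought level, in contrast to spending many observations at a fixed, possibly poor, dosage.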
A major insight of Robbins and Monro was that, if the step sizes in the parameter updates are allowed to go to zero in an appropriate way as n → ∞, then there is an implicit averaging that eliminates the effects of the noise in the long run. An excellent survey of developments up to about the mid-1960s can be found in the book by Wasan [250]. More recent material can be found in [16, 48, 57, 67, 135, 225]. The book [192] deals with many of the issues involved in stochastic optimization in general.
In recent years, algorithms of the stochastic approximation type have found applications in new and diverse areas, and new techniques have been developed for proofs of convergence and rate of convergence. The actual and potential applications in signal processing and communications have exploded. Indeed, whether or not they are called stochastic approximations, such algorithms occur frequently in practical systems for the purposes of noise or interference cancellation, the optimization of “post processing” or “equalization” filters in time-varying communication channels, adaptive antenna systems, adaptive power control in wireless communications, and many related applications. In these applications, the step size is often a small constant ε_n = ε, or it might be random. The underlying processes are often nonstationary and the optimal value of θ can change with time. Then one keeps ε_n strictly away from zero in order to allow “tracking.” Such tracking applications lead to new problems in the asymptotic analysis (e.g., when ε_n are adjusted adaptively); one wishes to estimate the tracking errors and their dependence on the structure of the algorithm.
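The tracking situation can be sketched as follows (the drift rate, noise level, and step size are hypothetical values chosen for illustration): the optimum moves slowly, and keeping ε constant lets the iterate follow it, at the price of a persistent steady-state tracking error:

```python
import random

def track_drifting_root(n_steps, eps=0.05, drift=0.001, noise_sd=0.5):
    """Constant-step-size tracking: the target theta*_n drifts slowly,
    and keeping eps_n = eps > 0 lets the iterate follow it."""
    random.seed(2)
    theta, target = 0.0, 0.0
    errors = []
    for n in range(n_steps):
        target += drift                          # slowly time-varying optimum
        Y = (target - theta) + random.gauss(0.0, noise_sd)
        theta = theta + eps * Y                  # eps kept away from zero
        errors.append(abs(theta - target))
    # report the average tracking error over the second half of the run
    return sum(errors[n_steps // 2:]) / (n_steps - n_steps // 2)

avg_err = track_drifting_root(20000)
print(avg_err)   # small but nonzero steady-state tracking error
```

The error has two competing parts: a lag term that shrinks as ε grows and a noise term that grows with ε, which is one reason the adaptive adjustment of ε_n mentioned above is of interest.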
New challenges have arisen in applications to adaptive control. There has been a resurgence of interest in general “learning” algorithms, motivated by the training problem in artificial neural networks [7, 51, 97], the on-line learning of optimal strategies in very high-dimensional Markov decision processes [113, 174, 221, 252] with unknown transition probabilities, in learning automata [155], recursive games [11], convergence in sequential decision problems in economics [175], and related areas. The actual recursive forms of the algorithms in many such applications are of the stochastic approximation type. Owing to the types of simulation methods used, the “noise” might be “pseudorandom” [184], rather than random.
Methods such as infinitesimal perturbation analysis [101] for the estimation of the pathwise derivatives of complex discrete event systems enlarge the possibilities for the recursive on-line optimization of many systems that arise in communications or manufacturing. The appropriate algorithms are often of the stochastic approximation type, and the criterion to be minimized is often the average cost per unit time over the infinite time interval.
Iterate and observation averaging methods [6, 149, 216, 195, 267, 268, 273], which yield nearly optimal algorithms under broad conditions, have been developed. The iterate averaging effectively adds an additional time scale to the algorithm. Decentralized or asynchronous algorithms introduce new difficulties for analysis. Consider, for example, a problem where computation is split among several processors, operating and transmitting data to one another asynchronously. Such algorithms are only beginning to come into prominence, due to both the developments of decentralized processing and applications where each of several locations might control or adjust “local variables,” but where the criterion of concern is global.
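A minimal sketch of iterate averaging in the Polyak–Ruppert spirit (the step-size exponent and the test problem are illustrative assumptions, not prescriptions from the book): the underlying recursion uses a step decreasing more slowly than 1/n, while a running average of the iterates evolves on the second, slower time scale:

```python
import random

def sa_with_iterate_averaging(g_noisy, theta0, n_steps):
    """Iterate averaging: run the basic recursion with a slowly
    decreasing step eps_n = n^(-2/3), and return both the final iterate
    theta_n and the average theta_bar_n = (1/n) * sum_k theta_k."""
    theta = theta0
    theta_bar = 0.0
    for n in range(1, n_steps + 1):
        eps = n ** (-2.0 / 3.0)                # decreases more slowly than 1/n
        theta = theta + eps * g_noisy(theta)   # noisy root-finding step
        theta_bar += (theta - theta_bar) / n   # running average of iterates
    return theta, theta_bar

# Hypothetical root-finding problem with root at theta = 2.
random.seed(3)
g_noisy = lambda th: (2.0 - th) + random.gauss(0.0, 1.0)
theta_n, theta_bar = sa_with_iterate_averaging(g_noisy, 0.0, 50000)
print(theta_n, theta_bar)   # both near 2; the average smooths the fluctuations
```

The raw iterate fluctuates on the scale set by the (relatively large) step, while the average damps those fluctuations, which is the sense in which averaging yields nearly optimal behavior.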
Despite their successes, the classical methods are not adequate for many of the algorithms that arise in such applications. Some of the reasons concern the greater flexibility desired for the step sizes, more complicated dependence properties of the noise and iterate processes, the types of constraints that might occur, ergodic cost functions, possibly additional time scales, nonstationarity and issues of tracking for time-varying systems, data-flow problems in the decentralized algorithm, iterate-averaging algorithms, desired stronger rate of convergence results, and so forth.
Much modern analysis of the algorithms uses the so-called ODE (ordinary differential equation) method introduced by Ljung [164] and extensively developed by Kushner and coworkers [123, 135, 142] to cover quite general noise processes and constraints by the use of weak ergodic or averaging conditions. The main idea is to show that, asymptotically, the noise effects average out so that the asymptotic behavior is determined effectively by that of a “mean” ODE. The usefulness of the technique stems from the fact that the ODE is obtained by a “local analysis,” where the dynamical term of the ODE at parameter value θ is obtained by averaging the Y_n as though the parameter were fixed at θ. Constraints, complicated state-dependent noise processes, discontinuities, and many other difficulties can be handled. Depending on the application, the ODE might be replaced by a constrained (projected) ODE or a differential inclusion. Owing to its versatility and naturalness, the ODE method has become a fundamental technique in the current toolbox, and its full power will be apparent from the results in this book.
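The idea behind the ODE method can be illustrated numerically. In this sketch (the mean dynamics ḡ(θ) = −(θ − 1) and the noise model are assumptions for illustration), the noisy iterates with a small constant step are compared, on the interpolated time scale t_n = nε, with an Euler solution of the mean ODE dθ/dt = ḡ(θ); the gap stays small because the noise averages out:

```python
import random

def compare_iterates_to_mean_ode(eps=0.001, t_final=5.0):
    """ODE-method illustration: for small eps, the iterate sequence,
    viewed on the time scale t_n = n*eps, stays close to the solution
    of the mean ODE  d theta / dt = g_bar(theta)."""
    random.seed(4)
    g_bar = lambda th: -(th - 1.0)   # hypothetical averaged dynamics
    n_steps = int(t_final / eps)
    theta = phi = 4.0                # common initial condition
    max_gap = 0.0
    for _ in range(n_steps):
        noise = random.gauss(0.0, 1.0)
        theta = theta + eps * (g_bar(theta) + noise)  # noisy iterate
        phi = phi + eps * g_bar(phi)                  # Euler step of mean ODE
        max_gap = max(max_gap, abs(theta - phi))
    return max_gap

gap = compare_iterates_to_mean_ode()
print(gap)   # the iterate path shadows the mean ODE closely
```

Note that the mean dynamics at each θ are computed as though the parameter were frozen there, which is the “local analysis” referred to above; as ε shrinks, the maximum gap over any fixed time horizon shrinks as well.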