ebook img

Local Regression and Likelihood PDF

305 Pages·1999·1.367 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Local Regression and Likelihood

Local Regression and Likelihood Clive Loader Springer Preface This book, and the associated software, have grown out of the author’s workinthefieldoflocalregressionoverthepastseveralyears.Thebookis designed to be useful for both theoretical work and in applications. Most chapterscontaindistinctsectionsintroducingmethodology,computingand practice, and theoretical results. The methodological and practice sections shouldbeaccessibletoreaderswithasoundbackgroundinstatisticalmeth- ods and in particular regression, for example at the level of Draper and Smith (1981). The theoretical sections require a greater understanding of calculus, matrix algebra and real analysis, generally at the level found in advanced undergraduate courses. Applications are given from a wide vari- ety of fields, ranging from actuarial science to sports. Theextent,andrelevance,ofearlyworkinsmoothingisnotwidelyappre- ciated,evenwithintheresearchcommunity.Chapter1attemptstoredress the problem. Many ideas that are central to modern work on smoothing: localpolynomials,thebias-variancetrade-off,equivalentkernels,likelihood models and optimality results can be found in literature dating to the late nineteenth and early twentieth centuries. The core methodology of this book appears in Chapters 2 through 5. These chapters introduce the local regression method in univariate and multivariatesettings,andextensionstolocallikelihoodanddensityestima- tion. Basic theoretical results and diagnostic tools such as cross validation are introduced along the way. Examples illustrate the implementation of the methods using the locfit software. The remaining chapters discuss a variety of applications and advanced topics: classification, survival data, bandwidth selection issues, computa- vi tionandasymptotictheory.Largely,thesechaptersareindependentofeach other, so the reader can pick those of most interest. Most chapters include a short set of exercises. These include theoretical results;detailsofproofs;extensionsofthemethodology;somedataanalysis examplesandafewresearchproblems.Buttherealtestforthemethodsis whether they provide useful answers in applications. The best exercise for every chapter is to find datasets of interest, and try the methods out! The literature on mathematical aspects of smoothing is extensive, and coverage is necessarily selective. I attempt to present results that are of most direct practical relevance. For example, theoretical motivation for standard error approximations and confidence bands is important; the reader should eventually want to know precisely what the error estimates represent, rather than simply asuming software reports the right answers (this applies to any model and software; not just local regression and loc- fit!). On the other hand, asymptotic methods for boundary correction re- ceive no coverage, since local regression provides a simpler, more intuitive and more general approach to achieve the same result. Alongwiththetheory,wealsoattempttointroduceunderstandingofthe results, along with their relevance. Examples of this include the discussion of non-identifiability of derivatives (Section 6.1) and the problem of bias estimation for confidence bands and bandwidth selectors (Chapters 9 and 10). Software Local fitting should provide a practical tool to help analyse data. This re- quires software, and an integral part of this book is locfit. This can be run either as a library within R, S and S-Plus, or as a stand-alone appli- cation. Versions of the software for both Windows and UNIX systems can be downloaded from the locfit web page, http://cm.bell-labs.com/stat/project/locfit/ InstallationinstructionsforcurrentversionsoflocfitandS-Plusarepro- videdintheappendices;updatesforfutureversionsofS-Pluswillbeposted on the web pages. The examples in this book use locfit in S (or S-Plus), which will be of use to many readers given the widespread availability of S within the statistics community. For readers without access to S, the recommended alternative is to use locfit with the R language, which is freely available and has a syntax very similar to S. There is also a stand-alone version, c-locfit, with its own interface and data management facilities. The in- terfaceallowsaccesstoalmostallthefacilitiesof locfit’sSinterface,and a few additional features. An on-line example facility allows the user to obtain c-locfit code for most of the examples in this book. vii ItshouldalsobenotedthisbookisnotanintroductiontoS.Thereader using locfit with S should already be familiar with S fundamentals, such as reading and manipulating data and initializing graphics devices. Books such as Krause and Olson (1997), Spector (1994) and Venables and Ripley (1997) cover this material, and much more. Acknowledgements Acknowledgements are many. Foremost, Bill Cleveland introduced me to the field of local fitting, and his influence will be seen in numerous places. Vladimir Katkovnik is thanked for helpful ideas and suggestions, and for providing a copy of his 1985 book. locfit has been distributed, in various forms, over the internet for sev- eral years, and feedback from numerous users has resulted in significant improvements. Kurt Hornik, David James, Brian Ripley, Dan Serachitopol and others have ported locfit to various operating systems and versions of R and S-Plus. This book was used as the basis for a graduate course at Rutgers Uni- versity in Spring 1998, and I thank Yehuda Vardi for the opportunity to teach the course, as well as the students for not complaining too loudly about the drafts inflicted upon them. Of course, writing this book and software required a flawlessly working computersystem,andmysystemadministratorDaisyNguyenrecievesthe highest marks in this respect! Manyofmyprogrammingsourcesalsodeservemention.Horspool(1986) has been my usual reference for C programming. John Chambers provided S, and patiently handled my bug reports (which usually turned out as locfit bugs; not S!). Curtin University is an excellent online source for X programming (http://www.cs.curtin.edu.au/units/). This page intentionally left blank Contents 1 The Origins of Local Regression 1 1.1 The Problem of Graduation . . . . . . . . . . . . . . . . . . 1 1.1.1 Graduation Using Summation Formulae . . . . . . . 2 1.1.2 The Bias-Variance Trade-Off . . . . . . . . . . . . . 7 1.2 Local Polynomial Fitting . . . . . . . . . . . . . . . . . . . 7 1.2.1 Optimal Weights . . . . . . . . . . . . . . . . . . . . 8 1.3 Smoothing of Time Series . . . . . . . . . . . . . . . . . . . 10 1.4 Modern Local Regression . . . . . . . . . . . . . . . . . . . 11 1.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 2 Local Regression Methods 15 2.1 The Local Regression Estimate . . . . . . . . . . . . . . . . 15 2.1.1 Interpreting the Local Regression Estimate . . . . . 18 2.1.2 Multivariate Local Regression . . . . . . . . . . . . . 19 2.2 The Components of Local Regression . . . . . . . . . . . . . 20 2.2.1 Bandwidth . . . . . . . . . . . . . . . . . . . . . . . 20 2.2.2 Local Polynomial Degree . . . . . . . . . . . . . . . 22 2.2.3 The Weight Function. . . . . . . . . . . . . . . . . . 23 2.2.4 The Fitting Criterion . . . . . . . . . . . . . . . . . 24 2.3 Diagnostics and Goodness of Fit . . . . . . . . . . . . . . . 24 2.3.1 Residuals . . . . . . . . . . . . . . . . . . . . . . . . 25 2.3.2 Influence, Variance and Degrees of Freedom . . . . . 27 2.3.3 Confidence Intervals . . . . . . . . . . . . . . . . . . 29 2.4 Model Comparison and Selection . . . . . . . . . . . . . . . 30

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.