ebook img

SUPPLEMENT TO ‘BAYESIAN LOGICAL DATA ANALYSIS FOR THE PHYSICAL SCIENCES’ PDF

142 Pages·2018·4.949 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview SUPPLEMENT TO ‘BAYESIAN LOGICAL DATA ANALYSIS FOR THE PHYSICAL SCIENCES’

SUPPLEMENT TO ‘BAYESIAN LOGICAL DATA ANALYSIS FOR THE PHYSICAL SCIENCES’ P.C.Gregory DepartmentofPhysicsandAstronomy,UniversityofBritishColumbia 23Jan.2018 Copyright P.C.Gregory2015,2016,2017,2018 i SupplementalChapter1dealswithmyFusionMarkovchainMonteCarlo(FM- CMC)algorithm,anextensionoftheworkintroducedinthelatterpartofChapter 12 of the textbook. FMCMC is a special version of the Metropolis algorithm that incorporates parallel tempering, genetic crossover operations, and an automatic simulatedannealing.Eachofthesefeaturesfacilitatethedetectionofaglobalmin- imuminchi-squaredinahighlymulti-modalenvironment.Bycombiningallthree, the algorithm greatly increases the probability of realizing this goal. It is a pow- erful general purpose tool for Bayesian model fitting which I have been using to great success in the arena of extra-solar planets. The FMCMC is controlled by a unique adaptive control system that automates the tuning of the MCMC proposal distributionsforefficientexplorationofthemodelparameterspaceevenwhenthe parametersarehighlycorrelated. Chapter1alsoincludesimportantnewworkonBayesianmodelcomparisonand introduces a new marginal likelihood estimator, called Nested Restricted Monte Carlo(NRMC),usedinthecalculationofBayes’factors.Itsperformanceiscom- pared with two other marginal likelihood estimators that depend on the posterior MCMCsamples.TherearethreesupportingappendicestoChapter1. Supplemental Chapter 2 provides an introduction to hierarchical or multilevel Bayes. It examines how to handle hidden variables and missing data. There are numerous examples of the use of fusion MCMC in Bayesian regression and re- gression involving selection effects, providing an important enhancement to the coverageofthetextbook. Thesesupplementalchaptersareintendedtomaintainandstrengthenthebook’s appeal as a text for graduate-level courses in the physical sciences. Although the main textbook provides some examples using Mathematica commands, the sup- plemental chapters are standalone and do not make use of any specific computer language. Contents 1 FusionMarkovchainMonteCarlo page1 1.1 Introduction 1 1.2 Exoplanets 2 1.3 Bayesianprimer 3 1.3.1 Twocommoninferenceproblems 4 1.3.2 Marginalization:animportantBayesiantool 7 1.4 FusionMCMC 10 1.4.1 Paralleltempering 11 1.4.2 FusionMCMCadaptivecontrolsystem 13 1.4.3 Automaticsimulatedannealing 16 1.4.4 Highlycorrelatedparameters 17 1.5 Exoplanetapplications 20 1.5.1 Exoplanetpriors 22 1.5.2 HD208487example 24 1.5.3 Aliases 31 1.5.4 Whichisthealias? 34 1.5.5 Gliese581example 37 1.6 ModelComparison 41 1.6.1 Paralleltemperingestimator 43 1.6.2 Marginallikelihoodratioestimator 47 1.6.3 NestedRestrictedMonteCarlo 49 1.6.4 Comparisonofmarginallikelihoodmethods 53 1.6.5 Bayesianfalsealarmprobability 55 1.7 ImpactofstellaractivityonRV 58 1.8 Conclusions 58 References 60 2 Hiddenvariables,missingdataandmultilevel(hierarchical)Bayes 65 Contents iii 2.1 Introduction 65 2.2 Fittingastraightlinewitherrorsinbothcoordinates 66 2.2.1 Correlatedxydataerrors 70 2.3 Linearregressionwitherrorsinbothcoordinates 70 2.3.1 Example1 74 2.3.2 Example2 81 2.4 Effectofmeasurementerror oncorrelationandregression 82 2.5 Gaussianmixturemodel 86 2.5.1 Example 88 2.6 Regressionwithmultiple independentvariables 89 2.6.1 Example:twoindependentvariables 91 2.7 Selectioneffects 92 2.7.1 Selectiononmeasuredindependentvariables 96 2.7.2 Selectiononmeasureddependentvariable 96 2.7.3 Example1:gradualcutoffinmeasuredy 97 2.7.4 Example2:gradualcutoffinmeasuredy 102 2.8 Regressionwithnondetections 104 2.9 Summary 105 References 107 AppendixA FMCMCcontrolsystemdetails 109 A.1 Controlsystemstage0 110 A.2 Controlsystemstage1 111 A.3 Controlsystemstage2 114 A.4 FurtherCSstudies 119 A.5 Choosingtemperingvalues 120 A.5.1 Procedureforselectingtemperingvalues 121 AppendixB Exoplanetpriors 125 AppendixC AccuracyofMathematicamodelRadialVelocities 131 Index 135 1 Fusion Markov chain Monte Carlo: A Powerful Tool for Bayesian Data Analysis 1.1 Introduction The purpose of this chapter is to introduce a new tool for Bayesian data analysis called fusion Markov chain Monte Carlo (FMCMC), a new general purpose tool fornonlinearmodelfittingandregressionanalysis.Itistheoutgrowthofanearlier attempt to achieve an automated MCMC algorithm discussed in Section 12.8 of mybook“BayesianLogicalDataAnalysisforthePhysicalSciences,”Cambridge UniversityPress(2005,2010)[24]. FMCMC is a special version of the Metropolis MCMC algorithm that incorpo- ratesparalleltempering,geneticcrossoveroperations,andanautomaticsimulated annealing. Each of these features facilitate the detection of a global minimum in chi-squared in a highly multi-modal environment. By combining all three, the al- gorithmgreatlyincreasestheprobabilityofrealizingthisgoal. The FMCMC is controlled by a unique adaptive control system that automates the tuning of the MCMC proposal distributions for efficient exploration of the modelparameterspaceevenwhentheparametersarehighlycorrelated.Thiscon- trolled statistical fusion approach has the potential to integrate other relevant sta- tistical tools as desired. The FMCMC algorithm is implemented in Mathematica usingparallizedcodetotakeadvantageofmultiplecorecomputers. Thepowerofthisapproachwillbeillustratedbyconsideringaparticularmodel fittingproblemfromtheexcitingareaofextra-solarplanet(exoplanets)research.In a subsequent chapter we will examine other applications in multilevel (hierarchi- cal)Bayesiananalysis.Thenextsectionofthischapterprovidessomebackground informationonexoplanetdiscoveriestomotivatetheBayesiananalysis.Section1.3 provides a brief self contained introduction to Bayesian logical data analysis ap- plied to the Kepler exoplanet problem leading to the MCMC challenge. For more detailsonBayesianlogicaldataanalysispleaseseemybook[24]. The different components of Fusion MCMC are described in Section 1.4. In 2 FusionMarkovchainMonteCarlo 800 r à a e y (cid:144) s et 600 n a pl w e 400 n f o r e b 200 m à à u à N à à à à 0à à à à à à à à à à à à à à à à à à à à 1990 1995 2000 2005 2010 2015 Year Figure1.1 ThepaceofexoplanetdiscoveriesasofDec.2014. Section 1.5, the performance of the algorithm is illustrated with some exoplanet examples. Section 1.6 deals with the challenges of Bayesian model comparison anddescribesanewmethodforcomputingmarginallikelihoodscalledNestedRe- stricted Monte Carlo (NRMC). Several appendices provide important details on FMCMCcontrolsystem,anddetailsbehindthechoicesofpriors. 1.2 Exoplanets A remarkable array of new ground based and space based astronomical tools are providing astronomers access to other solar systems. Close to 2000 planets have been discovered to date, starting from the pioneering work of [9, 66, 46, 45]. Fig.1.1illustratesthepaceofdiscovery1 uptoOct.2014. A wide range of techniques including radial velocity, transiting, gravitational micro-lensing,timing,anddirectimaging,havecontributedexoplanetdiscoveries. Becauseatypicalstarisapproximatelyabilliontimesbrighterthanaplanet,only asmallfractionoftheplanetshavebeendetectedbydirectimaging.Themajority of the planets have been detected through transits or the reflex motion of the star caused by the gravitational tug of unseen planets, using precision radial velocity (RV) measurements. There are currently 1832 planets in 1145 planetary systems (asof17October2014).469planetsareknowntobeinmultipleplanetsystems,the largestofwhichhassevenplanets[44].ManyadditionalcandidatesfromNASA’s 1 DatafromtheExtrasolarplanetEncyclopedia,Exoplanet.eu.[57] 1.3 Bayesianprimer 3 Keplertransitdetectionmissionareawaitingconfirmation.Onerecentanalysisof the Kepler data [49] estimates that the occurrence rate of Earth-size planets (radii between1-2timesthatoftheEarth)inthehabitablezone(whereliquidwatercould exist)ofsunlikestarsis22±8%. These successes on the part of the observers has spurred a significant effort to improve the statistical tools for analyzing data in this field (e.g., [42, 41, 11, 24, 25, 19, 20, 21, 70, 12, 13, 17, 1, 5, 60, 35]. Much of this work has highlighted a Bayesian MCMC approach as a way to better understand parameter uncertainties anddegeneraciesandtocomputemodelprobabilities.MCMCalgorithmsprovidea powerfulmeansforefficientlycomputingtherequiredBayesianintegralsinmany dimensions (e.g., an 8 planet model has 42 unknown parameters). More on this below. 1.3 Bayesianprimer WhatisBayesianprobabilitytheory(BPT)? BPT = extended logic. Deductive logic is based on axiomatic knowledge. In science we never know any theory of nature is absolutely true because our reasoning is based on incomplete information. Our conclusions are at best probabilities. Any extension of logic to deal with situations of incomplete information (realm of inductive logic) requires atheoryofprobability. Anewperceptionofprobabilityhasariseninrecognitionthatthemathematical rules of probability are not merely rules for manipulating random variables. They arenowrecognizedasvalidprinciplesoflogicforconductinginferenceaboutany hypothesis of interest. This view of, “Probability Theory as Logic”, was champi- oned in the late 20th century by E. T. Jaynes in his book2, “Probability Theory: TheLogicofScience,”CambridgeUniversityPress2003. It is also commonly referred to as Bayesian Probability Theory in recognition of theworkofthe18thcenturyEnglishclergymanandMathematicianThomasBayes. Logic is concerned with the truth of propositions. A proposition asserts that somethingistrue.Belowaresomeexamplesofpropositions. • A ≡“TheoryXiscorrect.” • A¯ ≡“TheoryXisnotcorrect.” • D ≡“Themeasuredredshiftofthegalaxyis0.15±0.02.” • B≡“Thestarhas5planets.” 2 Thebookwaspublished5yearafterJaynes’deaththroughtheeffortsofaformergraduatestudentDr.G.L. Bretthorst. 4 FusionMarkovchainMonteCarlo • A ≡“Theorbitalfrequencyisbetween f and f +df.” Inthelastexample, f isacontinuousparametergivingrisetoacontinuoushypoth- esisspace.Negationofapropositionisindicatedbyabarovertop,i.e, A¯. Thepropositionsthatwewanttoreasonaboutinsciencearecommonlyreferred toashypothesesormodels.Wewillneedtoconsidercompoundpropositionslike A,Bwhichassertsthatpropositions Aand Barebothtrueconditionalonthetruth ofanotherpropositionC.Thisiswritten A,B|C. RulesformanipulatingprobabilitiesandBayes’theorem Thereareonlytworulesformanipulatingprobabilities. Sumrule : p(A|C)+ p(A¯|C) = 1 (1.1) Productrule : p(A,B|C) = p(A|C)× p(B|A,C) (1.2) = p(B|C)× p(A|B,C) Bayes’theoremisobtainedbyrearrangingthetworighthandsidesoftheproduct rule. p(A|C)× p(B|A,C) Bayes′ theorem : p(A|B,C) = (1.3) p(B|C) Figure 1.2 shows Bayes’ theorem in its more usual form for data analysis pur- poses. In a well posed Bayesian problem the prior information, I, specifies the hypothesisspaceofcurrentinterest(rangeofmodelsactivelyunderconsideration) andtheprocedureforcomputingthelikelihood.ThestartingpointisalwaysBayes’ theorem.Inanygivenproblemtheexpressionsforthepriorandlikelihoodmaybe quite complex and require repeated applications of the sum and product rule to obtainanexpressionthatcanbesolved.ThiswillbemoreapparentinChapter?? dealingwithhiddenandmissingvariables. Asatheoryofextendedlogic,BPTcanbeusedtofindoptimalanswerstowell posedscientificquestionsforagivenstateofknowledge,incontrasttoanumerical recipeapproach. 1.3.1 Twocommoninferenceproblems 1. Modelcomparison(discretehypothesisspace):Whichoneof2ormoremod- els(hypotheses)ismostprobablegivenourcurrentstateofknowledge?Exam- ples: • Hypothesisormodel M assertsthatthestarhasnoplanets. 0

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.