ebook img

The evolution and distribution of species body size PDF

0.89 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview The evolution and distribution of species body size

The evolution and distribution of species body size∗ Aaron Clauset1 and Douglas H. Erwin1,2 1Santa Fe Institute, 1399 Hyde Park Road, Santa Fe NM, 87501, USA 2Departement of Paleobiology, MRC-121, National Museum of Natural History, P. O. Box 37012, Washington DC, 20013-7012, USA Thedistributionofspeciesbodysizewithintaxonomicgroupsexhibitsaheavyright-tailextending overmanyordersofmagnitude,wheremostspeciesaresignificantlylargerthanthesmallestspecies. We provide a simple model of cladogenetic diffusion over evolutionary time that omits explicit mechanisms for inter-specific competition and other microevolutionary processes yet fully explains the shape of this distribution. We estimate the model’s parameters from fossil data and find that it robustly reproduces the distribution of 4002 mammal species from the late Quaternary. The observed fit suggests that the asymmetric distribution arises from a fundamental tradeoff between 9 the short-term selective advantages (Cope’s rule) and long-term selective risks of increased species 0 body size, in thepresence of a taxon-specificlower limit on body size. 0 2 n Mosttaxonomicgroupsshowacommondistributionof We developed a generalized diffusion model of species a species body size [1, 2, 3], with a single prominent mode body size evolution, in which the size distribution is the J relativelynearbutnotatthesmallestspeciessize[4]and product of three macroevolutionary processes (Fig. 1). 2 asmoothbutheavyright-tail(oftendescribedasaright- We combine these processes, each of which has been in- skew on a log-size scale) extending for several orders of dependently studied [1, 2, 17, 20], in a single quantita- ] E magnitude (e.g., Fig. 1). This distribution is naturally tive framework, estimate its parameters from fossil data P related to a wide variety of other species characteris- onextinctterrestrialmammalsfrombeforethelateQua- . tics with which body size correlates, including habitat, ternary [19, 21], and test whether this model, or simpler o life history, life span [5], metabolism [6] and extinction variants, can reproduce the sizes of the 4002 known ex- i b risk [7]. A greater understanding of the underlying con- tantandextinctterrestrialmammalspeciesfromthelate - q straints on, and long-term trends in, body size evolu- Quaternary (Recent species) [18]. [ tion may provide information for conservation efforts [8] This model assumes that (1) species size varies over and insight about interactions between ecological and evolutionary time as a cladogenetic multiplicative diffu- 1 v macroevolutionary processes [9]. sionprocess[1,17]: thesizeofadescendantspeciesxD is 1 theproductofastochasticgrowthfactorλanditsances- 5 Studies of body size distributions have suggested that 2 the prominent mode may be indicative of a taxon- 0 specific energetically optimal body size [10, 11], which . 1 is supported by microevolutionary studies of insular 0 species[12]. However,evidenceforCope’srule[1,13,14] Diffusion and short−term 9 –theobservationthatspeciestendtobelargerthantheir selective advantages 0 ancestors – and the fact that most species are not close : v totheirgroup’spredictedoptimalsize(amongotherrea- i sons [15]), suggest that this theory may be flawed. Al- y X sit ternatively,speciesbodysizesmaydiffuseoverevolution- n e r ary time. If so, Cope’s rule alone could cause size dis- D Long−term risk a tributions to exhibit heavy right-tails [1], although size- of extinction dependentspeciationorextinctionrates[2,9,16]orsize- Lower boundary neutraldiffusionnearataxon-specificlowerlimitonbody effects size[17]couldalsoproduceasimilarshape. Furthermore, different mechanisms may drive body size evolution on 100 101 102 103 104 105 106 107 108 Body size (g) spatial and temporal scales [3], and the importance of inter-specific competition to the macroevolutionary dy- namics of species body size is not known. FIG. 1: Smoothed species body size distribution of 4002 Re- cent terrestrial mammals (datafrom [18]), showing thethree macroevolutionary processes that shape the relative abun- dances of different sizes. The left-tail of the distribution is ∗This manuscript is a pre-print version that has not undergone created by diffusion in the vicinity of a taxon-specific lower final editing. Please refer to the complete version of record, Sci- limit near2g,whilethelongright-tailisproduced bythein- ence321,399–401(2008),athttp://www.sciencemag.org/. This teractionofdiffusionoverevolutionarytime(includingtrends manuscriptmaynotbereproducedorusedinanymannerthatdoes likeCope’srule)andthelong-termriskofextinctionfromin- notfallwithinthefairuseprovisionsoftheCopyrightActwithout creased body size. theprior,writtenpermissionofAAAS. 2 A Time Fossil evidence suggests that this limit has existed since at least the Cretaceous-Tertiary boundary [19, 21, 23]. Further, a limit in this vicinity is supported by both ex- perimental [22] and theoretical work [6] on mammalian metabolism. x A Away from this limit, mammalian body size evolu- tion is governed mainly by diffusion with a bias (Cope’s x D rule) [14, 24], while its evolution near the lower limit is likely constrained by the need for relatively specialized morphologicalstructures[1]. We expectthis latter effect to appear in fossil data as a systematic intensification of Cope’s rule for very small-bodied species, i.e., increased hlogλi as xA → xmin. From ancestor-descendant size B λ data for 1106 extinct North American terrestrial mam- size, 10 mals[19],weestimatedandcomparedthreemodelsofthe y d distributionF(λ)asafunctionofancestorsize,including o b e in 2 themodelsuggestedbyAlroy[14]whichpredictsamod- ng 1 erately bi-modal distribution in body sizes. Of these, a e ch0.5 a piecewise model (Fig. 2B), with no effective optimal ativ body size, has the best empirical support (model selec- plic tion via likelihood ratio test and Bayesian information Multi0.1 Ancestor−descendant data criterion; see Appendix B2). This model includes both Estimated model (mean, var) a strengthening of Cope’s rule for small-bodied species 100 101 10A2nces1to0r3 body1 0s4ize (g1)05 106 107 (x . 32g) and a small but uniformly positive bias for larger species, resulting in an average body-size growth of 4.1±1.0% between ancestors and their descendants FIG. 2: (A) A schematic illustrating a simple cladogenetic (hlogλi=0.04±0.01). diffusionmodel(seetext)ofspeciesbodysizeevolution,where This result supports the existence of short-term selec- thesizeofadescendantspeciesxD isrelatedtoitsancestor’s tive advantagesfor increasedspecies body size,e.g., bet- size xA by a multiplicative factor λ. (B) Empirical data on tertoleranceofresourcefluctuations,betterthermoregu- 1106changesinNorthAmericanmammalianbodysize(data lation,andbetterpredatoravoidance[5],butalsoimplies from [19]), as a function of ancestor size, overlaid with the estimatedmodelofwithin-lineagechanges,wheretheaverage a more nuanced view: small-bodied species exhibit even log-change hlogλi varies piecewise as a function of body size greaterselectiveadvantagesfromincreasedsize,e.g.,be- (see AppendixB2). cause of greater morphologicalflexibility. Empirical estimates of extinction rates (or equiva- lently, speciation rates) as functions of body size are tor’ssizexA,i.e.,xD =λxA. Foreachspeciationevent,a uncertain [25], due to the bias and incompleteness of newλisdrawnfromthedistributionF(λ),whichmodels the fossil record. We partly control for this uncertainty the total influence on species size changes fromall direc- by utilizing a simplistic model of extinction risk pe(x), tions. Abiastowardlargersizes(Cope’srule)appearsas largelyestimated fromthe data,where extinctionoccurs apositiveaveragelog-changetosizehlogλi>0,andmay independently with a probability calculated only from dependontheancestor’ssize. (2) Speciesbodysizeisre- the species’ size. We specified a basal extinction rate β strictedbyataxon-specificlowerlimitx [6,22],which by assumingthatthe number ofRecentterrestrialmam- min wemodelbyrequiringthatF(λ<xmin/xA)=0,i.e.,the mal species is close to a putative carrying capacity. We largest possible decrease in size for a particular specia- thenletextinctionriskperunittimeincreaselogarithmi- tion event is λ=xmin/xA. In our computer simulations, callywithbodysize[26](seeAppendix A2). Thismodel time proceeds in discrete steps. At each step, exactly leaves only the rate ρ by which risk increases with size one new species is produced, which is the descendant of asafreeparameter,whichwaschosenbyminimizing the a randomly selected species. Finally, (3) every species statistical distance between the simulated and empirical independently becomes extinct with probability pe(x), distributions (see Appendix A3). which increases monotonically with size. A schematic Inserting these three processes, as estimated above, of the model is shown in Fig. 2A (for technical details into our computer model, we found that the model see Appendix A1). accurately predicted the distribution of Recent terres- To make this model appropriately realistic, we esti- trial mammal sizes over its seven orders of magnitude mated the form of each process from fossil data. The (Fig.3A),andwasparticularlyaccurateforsmall-bodied lower limit on mammalian body size is near 2g, close to species (x < 80g). Our sensitivity analysis further indi- the size of both the Etruscan shrew (Suncus etruscus) catedthatthispredictionwashighlyrobusttovariations and the bumblebee bat (Craseonycteris thonglongyai). inmostoftheestimatedparameters,buthighlysensitive 3 10−1 10−1 10−1 A Empirical data B C Diffusion model n10−2 n10−2 n10−2 o o o orti orti orti p p p o o o Pr Pr Pr Model assumptions: Model assumptions: Model assumptions: 10−3 ooLower limit 10−3 ooLower limit 10−3 ooLower limit oPositive bias (Cope’s rule) oPositive bias (Cope’s rule) oPositive bias (Cope’s rule) oSize−dependent extinction oSize−dependent extinction oIncreased bias at small sizes 10 0 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 Body size (g) Body size (g) Body size (g) FIG. 3: Simulated distributions of species body size (central tendency ± 95% confidence intervals from 1000 repetitions; all model parameters estimated as described in the text) and the empirical distribution of Recent terrestrial mammals. (A) The model described in the text. (B) The same model as A but with a bias hlogλi that is independent of size. (C) The same model as B but with an extinction risk that is independent of size. (For details and additional results, see AppendixD.) to the location of the lower-limiton body size. The esti- rule, hlogλi>0) shifts the mode toward slightly larger mated value of x ≈2g, however,is the most strongly sizesandslightlyincreasestheheavinessoftheright-tail. min supported of all model parameters. Thus, even large Under different conditions, these processes produce revisions to the other parameter estimates are unlikely markedly different body size distributions. For instance, to change our general conclusions (see Appendix C2). a long left-tail extending toward small-bodied species Also, although a range of ρ values produced size distri- would indicate that the risk of extinction decreases with butions that were statistically close to the empirical dis- larger size ρ < 0. Similarly, a more symmetric distribu- tribution, the model predicts a particular extinction risk tion would indicate both that extinction rates are rela- curve (Fig. S4) that could be tested with appropriate tively size-independent ρ ≈ 0 and that changes to body empirical data. size convey few selective advantages hlogλi ≈ 0. Al- To further discriminate among alternative explana- though a suitable body size distribution is not currently tions for the species size distribution, we tested sim- available for dinosaurs (but see [27]), evidence suggests pler diffusion models, each with parameters estimated that it may be more symmetric than for mammals. The from fossil data (see Appendix D), including (1) unbi- right-skewed distribution’s ubiquity, such as for insects ased diffusion with a lower boundary, (2) Cope’s rule and birds [1, 2], suggests that such circumstances are with size-dependent extinction, (3) Cope’s rule alone, rare,andthatthemammaliandistributionrepresentsthe (4) size-dependent extinction alone, and (5) a version of norm. the full model that omits the increased bias for small- Thismodelomitsexplicitmechanismsformanycanon- bodied species (x . 32g). We found that these mod- icalecologicalandmicroevolutionaryprocesses,including els all predicted size distributions that differed, some- the impact of inter-specific competition, geography, pre- times dramatically so, from the empirical distribution dation, population dynamics, and size variationbetween (Figs. 3B, 3C, S9 and S10). Additionally, we found that speciation events (anagenetic evolution), which suggests a positive bias hlogλi>0 for large-bodied species is not that their contributions to the systematic or large-scale necessaryif the extinctionrisk increasesless quickly (see character of species body size distributions can be com- Appendix C2). These results support the inclusion of a pactlysummarizedbythevaluesofcertainmodelparam- fundamentallowerlimit,thediffusionofspeciessize,and eters,e.g.,thestrengthofCope’srule hlogλiortheman- anincreasingriskofextinctionwithsize,aswellasanin- ner in which extinction risk increases with body size ρ. creased bias toward larger sizes for small-bodied species Some aspects of the body size distribution, however,are (x.32g). not explained by this model, such as the slight over- Thus, the shape of a body size distribution can be in- abundance of terrestrial mammal species around 300kg terpretedinthecontextofthesethreemacroevolutionary and the slight under-abundance around 1kg (Fig. 3A). processes. Anintermediatelocationforthedistribution’s Whether such deviations can be attributed to phyloge- mode (40g for terrestrialmammals) is mainly caused by neticallycorrelatedspeciationandextinctioneventsisan diffusioninthevicinityofthephysiologicallowerlimiton open question. A more thorough examination of these body size – which prevents the smallest species from be- macroevolutionary processes may explain their particu- ingthemostabundant. Aheavyright-tailisthencaused lar form and origin, and answer why body size is weakly primarily by diffusion in the presence of extinction risks correlated with increased extinction rates (or, decrease that increase weakly with size ρ>0. For mammals, the of speciation rates) weakly with body size, why physio- within-lineage tendency toward increased size (Cope’s logical lower limits on body size exist and are conserved 4 withinataxonomicgroups,andwhysomegroupsexhibit tions. We thank J. Alroy, A. Boyer and F. Smith for macroevolutionary trends but others do not. kindly sharing data. Supported in part by the Santa Fe Institute and the Computer Science Department at the University of New Mexico. Acknowledgments AC is grateful to A. Boyer, J. Dunne, J. Ladau, B. Olding, C. Shalizi, and J. Wilkins for helpful conversa- [1] S.M. Stanley,Evolution 27, 1 (1973). [30] D.H.Erwin,AnnualReviewofEarthandPlanetary Sci- [2] J. Kozl owski, A. T. Gawelczyk, Functional Ecology 16, ences 34, 569 (2006). 419 (2002). [31] W. H. Press, S. A. Teukolsky, W. T. Vetterling, B. P. [3] C. R. Allen, et al.,Ecology Letters 9, 630 (2006). Flannery, Numerical Recipes in C: The Art of Scientific [4] K.P. Dial, J. M. Marzluff, Ecology 69, 1620 (1988). Computing (Cambridge University Press, Cambridge, [5] J.H.Brown,Macroecology (UniversityofChicagoPress, UK, 1992). Chicago, 1995). [32] B. MacFadden, Paleobiology 12, 355 (1986). [6] G.B.West,W.H.Woodruff,J.H.Brown,Proceedingsof [33] D. Jablonski, Nature 385, 250 (1997). the National Academy of Science, USA99, 2473 (2002). [34] B. Maurer, Evolutionary Ecology 12, 925 (1998). [7] M. Cardillo, et al.,Science 309, 1239 (2005). [35] F. Bokma, Evolution 56, 2499 (2002). [8] D.O.Fisher,I.P.F.Owens,TrendsinEcologyandEvo- [36] P.M.Novack-Gottshall,M.A.Lanier,Proceedings ofthe lution 19, 391 (2004). National Academy of Science, USA 105, 5430 (2008). [9] S. M. Stanley, Proceedings of the National Academy of [37] W.J.Reed,M.Jorgensen,CommunicationsinStatistics: Science, USA 72, 646 (1975). Theory & Methods 33, 1733 (2004). [10] K.P.Sebens,Annual Review of Ecology andSystematics [38] A. Clauset, C. R. Shalizi, M. E. J. Newman, Power- 18, 371 (1987). law distribution in empirical data (2007). E-print, [11] J. H. Brown, P. A. Marquet, M. L. Taper, American arXiv:0706.1062. Naturalist 147, 1092 (1996). [39] B. Efron, R. J. Tibshirani, An Introduction to the Boot- [12] M. Lomolino, American Naturalist 125, 310 (1985). strap (Chapman & Hall, New York,NY,1993). [13] C.Deperet,Thetransformations oftheanimalworld (D. [40] Q. H. Vuong,Econometrica 57, 307 (1989). Appleton and Co., New York,1909). [41] L.Wasserman,AllofNonparametricStatistics (Springer, [14] J. Alroy,Science 280, 731 (1998). New York,NY,2006). [15] J. Kozl owski, Functional Ecology 16, 540 (2002). [42] V. M. Savage, et al.,Functional Ecology 18, 257 (2004). [16] L. VanValen, Evolution 29, 87 (1973). [43] S. C. Wang, Paleobiology 31, 191 (2005). [17] D.W. McShea, Evolution 48, 1747 (1994). [18] F. A.Smith,et al.,Ecology 84, 3403 (2003). MOM Ver- sion 3.6.1. [19] J. Alroy, North American Fossil Mammal Systematics Database(2008). Paleobiology DatabaseOnlineSystem- atics Archive3, http://paleodb.org/. [20] M.L.McKinney,EvolutionaryTrends,K.J.McNamara, ed.(University of Arizona Press, 1990), pp.75–118. [21] M. Fortelius (coordinator), Neogene of the Old World Database of Fossil Mammals (NOW) (2003). Uni- versity of Helsinki, NOW public release 030717, http://www.helsinki.fi/science/now/. [22] O.P. Pearson, Science 108, 44 (1948). [23] F.A.Smith,etal.,AmericanNaturalist 163,672(2002). [24] B.VanValkenburgh,X.Wang,J.Damuth,Science 306, 101 (2004). [25] D.Ludwig, Ecological Applications 6, 1067 (1996). [26] L. H. Liow, et al., Proceedings of the National Academy of Science, USA 105, 6097 (2008). [27] M. T. Carrano, Amniote Paleobiology, M. T. Carrano, T. J. Gaudin, R. W. Blob, J. R. Wible, eds. (University of Chicago Press, 2006), pp.225–268. [28] M. L. Boas, Mathematical Methods in the Physical Sci- ences (John Wiley & Sons, Inc., Hoboken, NJ, 2006), third edn. [29] D.Warton,I.Wright,D.Falster,M.Westoby,Biological Reviews 81, 259 (2006). 5 APPENDICES 1. Model specification Theseappendicesdocumentthetechnicaldetailsofour As described in the main text, our model combines study. three simple mechanisms related to body size evolution. Eachofthese processeshas beenpreviouslysuggestedor • Appendix A fully describes the cladogeneticmodel studied the literature, but are combined here in a coher- used to test our main hypotheses, including the ent, quantitative framework that engages directly with model’s specifications (Appendix A1), the statis- empirical data. We now briefly describe the technical tical estimation of the model parameters from the details of the three processes. mammalian fossil data (Appendix A2), and our 1. The range of possible body sizes for a particular score function for comparing the results of the higher taxon, e.g., terrestrial mammals, obeys a model to empirical data (Appendix A3). lower limit x . A limit like this was suggested min in [1] on the basis that physiological factors, e.g., • AppendixBdescribesourmodelofspeciessizevari- metabolicrequirements,constrainhowsmallapar- ation at speciation events, including a new analy- ticularbodyplancanbecomewithoutfundamental sis of the empirical evidence for Cope’s rule (Ap- innovation. (For convenience, we also assume that pendix B1) and the estimation of the distribution body size obeys an upper limit, but set this limit F(λ) of within-lineage changes to body size (Ap- at an extremely large size, x =1015g.) pendix B2). max 2. As is conventional,simulated time proceeds in dis- • Appendix C presents supplementary results from crete steps, each of which corresponds to a single simulating the model, including snapshots from a event of cladogenesis. Although realistically, each single simulation (Appendix C1), and the results cladogenetic event could produce a variable num- of our analysis of the model’s sensitivity to the es- ber of descendent species, we present results only timated parameters (Appendix C2). for the case where exactly two new species are cre- ated while the ancestor species becomes extinct. • Appendix D presents detailed comparisons of the We note that several apparently reasonable varia- model with simpler alternative diffusion models, tions on this rule, e.g., creating one or more de- severalof which have previously been suggested as scendent species while letting the ancestral species explanations of right-skewedsize distributions. continue,however,appeartoproduceequivalentre- sults. • Appendix E gives a complete Matlab-code imple- At each of these speciation events, each descen- mentation of the model. dent species’ body size xD varies from its ances- tor’sbodysizexAaccordingtoamultiplicativeran- dom walk. That is, the size of a descendent is the APPENDIX A: A CLADOGENETIC DIFFUSION product of its ancestor’s body size and a random MODEL OF BODY SIZE EVOLUTION variableλ,whichrepresentstherelativepercentage change in body size due to allcontributing factors. We then assume that the instantaneous distribu- Complex theoretical questions about the evolution of tion of changes to body size F(λ) for a givenevent body size,such as the ones we consider,are typically ex- has two main characteristics: (1) it is stable over plored with simulations. Such a choice is mainly driven evolutionarytime(i.e.,itisnotafunctionoftimet, by the fact that a mathematical analysis of branching processes is often intractable for all but the most sim- althoughit may be a function of ancestorsizexA), and(2)italwaysrespectstheaforementionedlimits ple questions. On the other hand, poorly executed sim- on body size. This latter requirement implies that ulation studies can be misleading as a result of incor- rect specification, among other reasons. We make a con- for a given ancestor body size xA, the distribution of allowed changes to size F(λ) is bounded on the certed effort to avoidsuch problems by defining a model interval [xmin, xmax]. Fig. S1B illustrates this idea, whose parameters can be estimated directly from fossil xA xA showing how the support of the distribution varies data priorto the late Quaternary,andwhose output can as a function of body size. If hlogλi =6 0, then we be validated against data from the late Quaternary(Re- say that F(λ) is “biased,” with a positive bias cor- cent species). Although these two data sources are not responding to Cope’s rule; if hlogλi = 0, we say logically independent, they are perhaps as close to inde- that F(λ) is “unbiased.” pendent as we might wish for such a macroevolutionary study. Wenotethatwhilewemainlystudythebodysize In the physics literature (see [28]), this boundary distribution of terrestrialmammals here, this framework effect is similar to an “absorbing boundary” con- can easily be adapted to other taxonomic groups, e.g., dition in a diffusion-reaction equation, i.e., we re- birds. quire that the probability density go to zero at the 6 !"" A B 10 Time 2 !"!! 1 0.5 )+,+-.!"!% 0.10 1 2 3 4 5 6 7 xA ()* &' !"!$ x D x min !"!# x ! ! ! /+0 /*1 23,-+4,+5*-+67859*0:78+08)(;.8<+=7>8! 10−1 10−1 C D 10−2 ortion10−2 ortion p p o o Pr Pr 10−3 10−3 Empirical data Diffusion model 100 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 Body size (g) Body size (g) FIG. S1: Results of modeling the evolution of body sizes for 4002 Recent terrestrial mammal species. (A) A schematic illustrating a simple cladogenetic model (see text) of species body size evolution, on the basis of a multiplicative diffusion process where the size of a descendant species xD is related to its ancestor’s size xA by a multiplicative factor λ. (B) Model of the distribution of within-lineage body size changes F(λ), where lower and upper boundaries on body size are enforced by letting setting F(λ<xmin/x)=0. Thus, as a lineage approaches xmin, the distribution increasingly favors changes in the opposite direction of the limit (inset: average change in log-body size, as a function of ancestral body size, with hlogλi = 0, xmin = 1.8g and xmax = 107g). We incorporate a model of Cope’s rule by letting the mean of this distribution µ(xA) vary as a function of xA, where µ(xA) is estimated from fossil data (see Appendix B). (C) Histogram of Recent mammal body sizesoverlaidbyanexampledistributionproducedbythemodel(inset: correspondingcomplementarycumulativedistribution functions). (D) The central tendency (with 95% confidence intervals) of the simulated distribution of species body sizes and thesmoothed empirical distribution for 4002 Recent mammal species (Gaussian kernel). boundary, s(x) = 0 at x = x . In contrast, mentation ofthese notions concernrelativelymod- min a “reflecting” or “insulating boundary” would re- ern species. As such, relatively little is known quire that the flux across the boundary be zero, about speciation and extinction rates in the fos- ds/dx=0 at x=x . Unfortunately, these same sil record [25, 30]. However, as population size min terms have different meanings in the body size lit- generally decreases with increased body size, the erature (see [17]); thus, we avoidtheir use entirely. increased extinction risk could result from popula- tions of larger sized organisms being closer to in- 3. Species become extinct independently with a viable population sizes. The result for this mech- probability pe that depends only on species anism is that one parameter – the rate at which body size. We considered two functional forms extinctionriskincreaseswithbodysizeρ–remains for how this risk of extinction varies with free in our study. body size: a power-law function of the form log10 pe(x)=ρlog10 x+log10 β, where β is the We note that an equivalent model would allow the baseline extinction rate and ρ is the rate at which speciation rate, or both extinction and speciation, the rate increases with log-body size, and a loga- to vary with body size. The absolute value of rithmic function pe(x)=ρlog10 x+β. the speciation and extinction rates is not impor- Thenotionthatextinctionriskincreaseswithbody tant [2], but rather their ratio is. For a discrete- size ρ > 0 is a conventional one in the body timemodel,size-dependentextinctionratesaresig- size literature [26], although most empirical docu- nificantly easier to work with. 7 107 100 Descendent body size (g)111111000000123456 A Density111000−−−321 B λultiplicative change in body size, 001..01215 C M 100 100 101 102Ance1st0o3r body1 0si4ze (g)105 106 107 0.M1ultiplicativ0e.5 cha1nge in2 body size,1 λ0 100 101 10A2nces1to0r3 body1 0s4ize (g1)05 106 107 FIG.S2: Analysisof1106pairsofmammalspeciesintheNorthAmericanfossilrecord[19]. (A)DescendentbodysizexDversus ancestor body size xA overlaid by the relation xD =xA, representing the null-hypothesis of no bias toward larger or smaller body sizes, i.e., hlogλi =0. The best-fit allometric relation logxD = λ˜logxA for this body size data (by standardized major axisregression[29])producesanestimatedslopeλ˜=1.02±0.1(where±indicatesthe95%confidenceinterval;r2 =0.95). (B) Estimateddensity(Gaussiankernel)ofthedistributionF(λ)ofwithin-lineagechangestospeciesbodysize(solidline;equivalent to distribution of vertical residuals in A), along with the maximum likelihood log-normal distribution (dashed). (C) Change inspeciesbodysize λasafunctionofancestorsize(circles)overlaid withthebestmodeloftheform logλ(xA)=N[µ(xA),σ2] (dashed lines). Under this model, changes in body size at speciation events are systematically biased toward larger sizes (Cope’s rule); the bias is strongest for small bodied species, but still positive [µ(xA) = 0.04] for larger species x & 32g. A likelihoodratiotestindicatesthatthismodelisabetterfittothedatathanamodelwithnobias[µ(xA)=0]forlargerspecies (p= 1.44×10−4; see Appendix B2). We note that this simple model is a more conservative one than a model that includes theheavy tails of the distribution shown in B. Only a few more words are necessary to complete our and 3, the size of the founder species, and the number specificationofthemodel. Ateachtimestep,onespecies, of species to simulate. The methodology for parameter- chosen uniformly at random from the extant set, under- izing Rule 2 is slightly more involved and is described goes cladogensis according to Rule 2. This action pro- subsequently (Appendix B). duces two daughter species, one of which is new and the Rule 1 (boundaries) requires parameters to define a otherofwhichreplacestheancestralspeciesintheextant lower limit on body size. The most direct way to esti- set. Subsequently, each extant species becomes extinct mate these values is to consider fossil [19, 21] and Re- according to Rule 3; extinct species are removed from cent [18] body size data. Each of these sources agrees the extant set. Fig. S1A illustrates this branching pro- that the minimum mammalianbody size is in the neigh- cess schematically. The model is initialized with a single borhoodof x ≈2g [e.g., both the Etruscanshrew (S. min founder species with body size x0, and proceeds for tmax etruscus) and the bumblebee bat (C. thonglongyai) are time steps (the number of steps is also the cumulative inthisrange]. Experimental[22]andtheoreticalwork[6] number of species produced). Fig. S1B illustrates the on metabolism also supports a fundamental limit in this form of Rule 2 that we use (see Appendix B for more vicinity. Theparticularsizeofthefounderspecieshaslit- details), where the largest change in body size is con- tle impact on the simulation results (see Appendix C2), strainedsothat the resultwouldbe to producea daugh- andforconveniencewechooseittobeequaltothemode ter species with size xmin. Fig. S1C shows an example of the Recent distribution, x0 =40g. of the resulting simulated distribution of species body Parameter estimates for Rule 3 (extinction rates) can sizes, where we have used the parameter values given in be partially derived from existing fossil data. We esti- Table S1, and Fig. S1D shows the central tendency of mate the baseline extinction rate β for terrestrial mam- this model. mals in the following way. If the number of Recent ter- restrialmammalsrepresentsaroughlystableequilibrium, then for each cladogenesis event in the simulation there 2. Parameter estimation must be one extinction event, on average. (This equilib- riumassumptionisnotcentraltoourresults,andcanbe To implement this model on a computer, we must relaxedwithoutimpactingthefundamentalnatureofthe choose the form of each mechanism, e.g., F(λ). Where model,solongasthetotalnumberofextantspeciesgrows possible,weestimatedboththeformandthecorrespond- slowlyrelativetotherateofspeciesturnover.) Thus,the ingparametersdirectlyfromfossildata;theonlygenuine baseline extinction rate is simply β = 1/n, where n is freeparameterinthemodelisρ,therateatwhichextinc- the expected number of species at equilibrium. We let tion risk increases with size. In this section, we describe n= 5000, although its precise value is unimportant. By our methodology for estimating parameters for Rules 1 lettingextinctionrateincreasewithbodysize,theactual 8 parameter value source Ref. lower bound xmin 1.8g [18,19, 21] λ founderbody size x0 40g [18] ze, 10 species at equilibrium n 5000 [18] si baseline extinction rate β 1/n – y d rate of extinction increase ρ 0.025 – o e in b 2 myeeaarns isnpeecqiuesililbifreituimme ντ 1.6600(1M)yMy [19[1,92]1] g n 1 logλ-intercept c1 0.33 [19] a e ch0.5 lsoygstxe-minatteirccebpiats cδ2 10..3004 [[1199]] v ati variance σ 0.63 [19] c pli power-law tail α 3.3(1) [19] Multi0.1 Ancestor−descendant data Smoothed data TABLE S1: Cladogenetic simulation parameters, their esti- 100 101 102 103 104 105 106 107 mated values and the data sources from which the estimates Ancestor body size (g) were derived. The parameters can be grouped according to mechanism: the physiological lower limit of the terrestrial mammalianbodysize(xmin);thedistributionF(λ)ofwithin- FIG.S3: ThesamedataasinFig.S2Calongwithasmoothed lineage changes to body size (c1, c2, δ, σ and α), where δ version(exponentialkernel)showingthemean±onestandard denotes the systematic bias away from smaller body sizes deviation. The smoothed trend is quite similar to the piece- (Cope’s rule) and c1 and c2 denote the additional bias for wise linear model that we fitted to the data via maximum small-bodied species; the initial conditions and duration of likelihood (see AppendixB2). thesimulation (x0, τ,ν and n). number of species at equilibrium neq will be somewhat the largest species is 56−58% larger than the basal ex- less than this number. If the true number of terrestrial tinction risk (32−34% for F(λ) with log-normal tails). mammalspeciesissubstantiallygreaterthanourcurrent When spread over six or seven orders of magnitude, this estimateofroughly5000,orifthe assumptionofequilib- causes a slight, positive dependence of extinction risk on rium is incorrect, then the extinction probability curve body size. We note that the form of this curve provides can be rescaled by lowering the baseline extinction rate, a testable prediction of the model. whichdoesnoteffectotheraspectsofthesimulationsuch as the overallshape of the distribution. We estimate the length of the simulation t by esti- 3. Scoring the quality of the model max matingthe totalnumberofmammalianspecies sincethe Cretaceous-Tertiary boundary. We estimate this num- The output of the simulation is a set of species body ber as tmax = τn/ν, where τ is the number of years of sizes. To evaluate the quality of this set relative to the equilibrium, ν is the average duration or lifetime of a empiricaldataonterrestrialmammals,weuseadistance species, and n is the number of species at equilibrium. measure for statistical distributions, the tail-weighted We let τ ≈ 60 My, although its precise value has lit- Kolmogorov-Smirnov(wKS)goodness-of-fitstatistic [31] tle impact on the results of the simulation. Estimates of the average duration of a species, however,vary quite |S(x)−P(x)| wKS=max , (A1) widely depending on the data used. In the Alroy data x P(x)[1−P(x)] set, ν =2.32(8) My (n = 1703; the parenthetical value p denotes the standarderrorinthe lastdigit), while inthe whereS(x)isthecumulativedistributionfunction(CDF) NOW data set, ν =1.52(1) My (n = 14099). We esti- ofthe simulateddataandP(x) isthe CDF ofthe empir- mateνbetheaverageofthese: ν =1.60(1) My,although ical data. This statistic is independent of any particular its exact value is not important (see Appendix C2 and binning scheme and thus gives a relatively general char- Fig. S7). acterization of the dissimilarity of two distributions by Finally,weestimatethevalueofρbynumericallymin- measuringthe maximum absolute deviationbetween the imizing the distributional distance (see Appendix A3) simulated and empirical cumulative distributions. Very between the model and the empirical data for terrestrial smallvalues (wKS <0.3) indicate a strong closeness,for mammals (Fig. S4A). In general, we report results for all values of x. In Fig. S1C, for instance, wKS ≈0.17. the power-law model of extinction risk; the fitted value Somereadersmaybefamiliarwiththemorecommonly of ρ in the logarithmic model is such that the two risk used Kolmogorov-Smirnov(KS) goodness-of-fitstatistic. curvesarealmostidentical(seeFig.S4B),indicatingthat Thetail-weightedversiondiffersbygivingequalweightto the functional form is not important – both models re- allpartsofthedistribution,andparticularlythetails. In sult in a close-to-linear increase in extinction risk with contrast, the traditional KS statistic effectively weights log-size such that the risk of extinction at each step for the area near the median of the distribution the most, 9 andthuscanunderestimatestrongdifferencesinthetails. 1 This causesthe tail-weightedversionto be more difficult tominimize–allpartsofthesimulateddistributionmust 0.9 be close to the empirical one, not just the middles of S K0.8 the distributions. We have tried using both statistics to w score the quality of the model results, and we find that e, 0.7 c n numericallyminimizingthetail-weightedversionchooses a st0.6 values of ρ that produce significantly more convincing di results for larger-bodied species, e.g., x>104g. al 0.5 n o Finally, because the model produces a dynamic equi- uti0.4 liitbsrtiyupmicianlbtheheasvpieocri,easnbdotdoypsrizeevednitsttrriabnustiieonnt,etffoeecvtsalfuroamte distrib0.3 skewing our quality scores,we averagethe wKS statistic 0.2 over regularly spaced intervals in the last 15 My of sim- 0.1 ulated time. When we evaluate the quality of a set of 0 0.01 0.02 0.03 0.04 extinction parameter, ρ parametervalues,we furtheraveragethis value oversev- eral hundred independent trials. x 10−4 3.2 β xρ, Power−law tails β xρ, Log−normal tails 3 β + log xρ, Power−law tails APPENDIX B: CHANGES TO BODY SIZE AND 10 COPE’S RULE p(x)e2.8 β + log10 xρ, Log−normal tails n, o Rule 2representsthe mannerinwhichbody sizesvary ncti 2.6 at speciation events. Phylogenetic body size data for a xti e widerangeofterrestrialmammalswouldbethepreferred ob. 2.4 way to determine the best model of within-lineage body Pr size variation, but such ancestor-descendent data is not 2.2 currently available for a sufficiently large and diverse set of terrestrialmammals. Instead, we use Alroy’s putative ancestor-descendantdata, reconstructedfrom fossil data 10 0 101 102 103 104 105 106 107 108 forNorthAmericanmammals,asaproxy. Thisdatahas Body size (g) been used in several previous studies of within-lineage variation of body size [14, 19], and details of the non- phylogenetic reconstructionprocess for the 1106 pairs of FIG. S4: (A) Estimation results for fitting the free param- terrestrial mammals species are given there. From this eter ρ in the power-law model of extinction risk, in two al- data, we estimate a parametric model for F(λ). ternative cases, one where the distribution F(λ) of within- lineage changes to body size has log-normal tails (blue), and Thenon-phylogeneticnatureofthisdata,however,im- one where the tails decay as a power (red). Similar results pliesthattherearelikelytobeseveralinversionsofances- are obtained under the logarithmic model of extinction risk. torsanddescendants,aswellasseveralincorrectpairings All other parameters take the values given in Table S1. For of ancestors with descendants. Fortunately, the statisti- clarity, we also plot a smoothed trend (exponential kernel) cal nature of our analysis implies that so long as the over the sampled data. Each point is the average goodness- number of putative pairs is relatively large, such errors of-fit hwKSi, for the last 15 My of the simulation, over 50 will not obscure the true average log-change, which is independent trials. (B) The fitted extinction-risk curves for precisely the aspect of this data most important to our models of F(λ)with power-law and non-power-law tails, and study. Further,oursensitivityanalysisindicatesthatthe for models where theextinction risk increases as a logarithm orpowerofsize(seeAppendixA1,Rule3). Thesimilarityof precisedetailsoftheinferredmodel,e.g.,theaverageand thecurvesbetweenthesetwoextinction models showsthat a variance, do not matter much with regard to our overall generally log-linear form is sufficient. conclusions (see Appendix C2), so long as a log-normal model of change is a relatively good model of the data. aslightsystematicpositivebiashlogλi>0,withdescen- dants tending to be slightly larger than their ancestors. 1. Empirical evidence for Cope’s rule In order to specify Rule 2, however, we need to know not only whether there is a positive bias or not, but how Empirical evidence for and against Cope’s rule has strongisthe biasasafunctionofancestorsize. This can been studied in a variety different taxonomic groups[14, be done by directly estimating the shape of F(λ) as a 24, 32, 33, 34, 35, 36]. For terrestrial mammals, the evi- function of ancestor size. Thus, we conduct a new anal- denceisrelativelystrong,withAlroy’sstudy[14]showing ysis of the previously studied ancestor-descendantdata. 10 10−1 505 total species 10−1 2005 total species 10−1 30005 total species 10−1 100005 total species A Empirical data B C D Simulated data Proportion10−2 10−2 10−2 10−2 10−3 10−3 10−3 10−3 10 0 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 100 101 102 103 104 105 106 107 Body size (g) Body size (g) Body size (g) Body size (g) FIG.S5: Snapshotsofthesimulatedspeciesbodysizedistribution,relativetotheempiricaldistribution,fromasinglesimulation trial, taken at n = {505,2005,30005,100005} total species (A, B, C and D, respectively). For clarity, the insets show the corresponding complementary cumulative distribution functions. this assumption; however, we note that the data are 0.6 also consistent with a log-normal double Pareto distri- bution [37] – a log-normal distribution with tails that decay as power-laws (or, that decay as exponentials in 0.5 logλ). We test this hypothesis using standard statisti- cal techniques for power-law distributions [38], and find KS0.4 that the tails of the distribution can be assumed to be d e symmetric [negative tail: α = 3.4(2), p = 0.83(3); pos- ht eig0.3 itive tail: α = 3.3(2), p = 0.79(3); both tails together: w α = 3.3(1), p = 0.96(3)]. For completeness, we consider both models of F(λ) in our sensitivity analysis,and find 0.2 relatively small differences between the results (but see Appendix D). 0.1 103 104 105 Total species 2. Our model of changes to body size In this section, we describe a model-selection analysis FIG.S6: Thetimeseries of wKS statistics for thesimulation amongthreealternativemodelsofwithin-lineagechanges in Fig. S5. The bold circles indicate the positions and scores to body size F(λ), all of which are drawn from a log- of the four snapshots. normaldistribution where the averagelog-changeto size µ depends on the ancestor’s size xA. In this way, F(λ) can model both the effect of Cope’s rule on large-bodied Fig. S2A shows descendant body size xD as a func- species and the effects of constrained evolution near the tion of ancestor body size xA, for Alroy’s fossil data on lower limit of body size on real mammalian evolution North American mammals, and illustrates that descen- (above and beyond the form imposed by respecting the dantstendtoberoughlythesamesizeastheirancestors. The best-fit allometric relation [29] logxD = λ˜logxA to lowerlimitinRule2). Thislattereffectwecallthesmall- bodied bias. For these three models, we ask which has these data yields λ˜ = 1.02±0.01 (estimate ±95% con- the best empirical support from the putative ancestor- fidence), indicating a small but systematic tendency for descendent data. descendants to be slightly larger than their ancestors. Fig. S2B shows the distribution of within-lineage 1. Model one is a piece-wise form in which a bias to- changes in body size (equivalent to the vertical residuals wardlargersizesforsmall-bodiedspeciesdecreases tothelinexD =xA inFig.S2A),withincreases(615)be- as a power of body size to a constant value δ for ing only slightly more common than decreases (488; the large-bodied species (Fig. S2C). remaining 3 cases are instances of no-change). Denoting 2. Model two is identical to model one but sets the λasthemultiplicativechangeinbodysizefromancestor large-body bias parameter δ to zero. to descendant, we find that the overall average change is toward larger sizes, with hlogλi=0.047±0.009. This 3. Modelthreeisafunctionµthatfollowsthebest-fit estimate ignores,of course, the possibility that the aver- cubic polynomial (see [14]). age change depends on the ancestor size. The conventional assumption in simulation studies of All models have the form logλ(xA)=N[µ(xA),σ2] – bodysizeevolutionisthatF(λ)followsalog-normaldis- that is, logλ is normally distributed with constant vari- tribution. We find that the data are consistent with ance σ and a mean µ that varies as a function of body

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.