ebook img

On the Height Profile of a Conditioned Galton-Watson Tree PDF

0.2 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview On the Height Profile of a Conditioned Galton-Watson Tree

On the Height Profile of a Conditioned Galton-Watson Tree ∗ Go¨tz Kersting 1 University of Frankfurt am Main† 1 0 March 1998 2 n a J 9 Abstract 1 Recently Drmota andGittenberger (1997)proveda conjecture due ] to Aldous (1991) on the height profile of a Galton-Watson tree with R an offspring distribution of finite variance, conditioned on a total size P of n individuals. The conjecture states that in distribution its shape, . h more precisely its scaled height profile coincides asymptotically with t the local time process of a Brownianexcursion of duration 1. We give a m a proof of the result, which extends to the case of an infinite variance offspring distribution. This requires a different strategy, since in the [ infinite variancecasethereis no longera relationshiptothe localtime 1 of Brownian resp. L´evy excursions. v 6 AMS classification numbers: Primary 60J80,Secondary 60F17 5 Key words: Galton-Watson trees, L´evy-excursions,functional limit theorems 6 3 . 1 1 Introduction and main result 0 1 1 In this paper we analyse the shape of a Galton-Watson tree, conditioned to : haveatotalnumberofnindividuals. Morepreciselywestudyitsasymptotic v i height profile, as n . X → ∞ By a tree t we mean a rooted, ordered, finite tree. We consider the r a vertices to represent individuals, such that t can be regarded as the family tree of the progeny of some founding ancestor (the root). It is assumed that among siblings there is an order (of birth), which allows to imbed such trees into the plane. The total number of vertices of t, its size is denoted by s(t). We shall be interested in the way, in which the size of the differentgenerations vary along thetree. Anindividualibelongs tothek’th generation, if the path from i to the root contains exactly k edges. Let z k denote the number of individuals in generation k = 0,1,.... The sequence ∗partially supported bythe German Research Foundation DFG †Fachbereich Mathematik, Postfach 11 19 32, D-60054 Frankfurt/Main 1 1 = z ,z ,z ,... is called the height profile of t (possibly after a suitable 0 1 2 renormalisation). For a Galton-Watson tree T the number of children of the different in- dividuals are assumed to be independent and identically distributed ran- dom variables. p ,x = 0,1,... denotes the probability, that an individual x possesses x children, and Z is the number of individuals in generation k. k We shall consider T, conditioned on a size s(T) = n. Such a conditioned Galton-Watson tree will be abbreviated as CGW(n)-tree. Since it contains n individuals, it is natural to consider its height profile Hn = a−1Z , u 0, u n [nu/an] ≥ rescaled with some sequence (a ) of positive numbers. The aim is to choose n (a ) such, that Hn converges in distribution. Then a gives the magnitude n n of the breadth of the CGW(n)-tree, whereas n/a indicates the order of n h(T) = max k Z > 0 , the height of the CGW(n)-tree. - We assume: k { | } Assumption A 1.) The offspring distribution (p ) has mean 1: x x ∞ xp = 1. x Xx=0 The greatest common divisor of all x with p > 0 is 1. x 2.) There are positive numbers a such that a−1(ξ +...+ξ n) converges n n 1 n− in distribution to a non-degenerate limit law ν, as n . Here ξ ,ξ ,... 1 2 → ∞ denote independent random variables with distribution (p ) . x x Thecase of an offspringdistributionwith a finitemean can betreated in much the same way and is not a genuine generalisation (see Kennedy [25]). Similar the assumption on the g.c.d. may be removed. Criticality and a g.c.d. equal to 1 are assumed just for convenience. The second assumption has been completely analysed. It has several implications, which are discussed in chapter XVII in Feller [16] and in the monograph of Gnedenko and Kolmogorov [18]. In particular (p ) is in the x domain of attraction of a stable law. Its index α belongs to (1,2], since we deal with an offspring distribution of finite mean. This implies a = o(n). n Thelimit law ν =ν is determined by αup to a scaling constant. For α= 2 α this is simply the normal law, and for 1 < α < 2 it is a one-sided stable law, because the negative tail of the offspring distribution is zero. We shall prove, that under these assumptions the height profile Hn con- verges in distribution, where the a are just the numbers, given in the as- n sumption. The limiting process H = (H ) turns out to be a functional u u of certain normalized excursions: H = ψ(Y). By a normalized excursion we understand a process Y = (Y ) , such that Y = Y = 0 and s 0≤s≤1 0 1 2 inf Y > 0 for all δ > 0. H will be obtained from Y as follows: δ≤s≤1−δ s The corresponding cumulative height profile u C = H dv, u 0, u v Z ≥ 0 is given by s dt C = sup s 1 : u . u { ≤ Z Y ≤ } 0 t H, having a.s. paths continuous from the right, is completely determined by the cumulative height process C. If dt/Y < , then h > 0 and 0+ t ∞ H > 0 for all u (0,h), also C 1, asRu h. Then we say, that H is u u ∈ → → non-degenerate. The case dt/Y = , implying C 0 and H 0, is in 0+ t ∞ ≡ ≡ the sequel of no significancRe. - The same transformation already appears in the paper [29] by Lamperti, who used it to transform L´evy-processes into continuous branching processes. Our theorem is in a way a conditioned version of Lamperti’s result. The normalized excursions are obtained as follows. There is a unique L´evy-process X = (X ) (a process with independent and stationary in- s s≥0 crements), such that X = 0 and the law ν is the distribution of X , and 0 1 to each of these processes there belongs a normalized excursion Y. This is explainedindetailinBertoin’s monograph[5](inparticular chapter VIII.4). Theorem 1 Let Hn be the scaled height profile of a CGW(n)-tree, satisfy- ing assumption A. Then, as n , Hn converges in distribution to the pro- → ∞ cess H = ψ(Y), derived from the corresponding normalized L´evy-excursion Y. H is a.s. non-degenerate. In general the excursions contain jumps. These jumps also show up in H. Thus we regard Hn and H as random elements in the space of c´adla´g- functions, endowed with the usual Skorohod J topology (compare [14]). 1 A neat case is that of an offspring distribution with finite variance: ∞ σ2 = x2p 1< . x − ∞ Xx=0 Then, choosing a = σn1/2, Y is a normalized Brownian excursion. In n this situation the limiting process allows another appealing description: (H ) =d (1L ) , where L denotes the local time process of a normal- u u 2 u/2 u ized Brownian excursion. The reason is: Up to a factor 1/2 L is related to a normalized Brownian excursion in the sameway, as H is derived above from Y, which follows from a result of Jeulin [19] (compare Biane [6], Th´eor`eme 3). This version of Theorem 1 has been observed by Aldous in special cases (as the geometric offspring distributions) and conjectured in the general fi- nite variance case, compare [3]. A first proof of the conjecture was given 3 by Drmota and Gittenberger [11]. They mastered the formidable task to obtain convergence of the finite-dimensional distributions as well as tight- ness, using generating functions and thereby generalizing work of Kennedy [25] on the one-dimensional distributions. Pitman [34] surrounded the diffi- culties by imbedding the problem into the context of convergence of strong solutions of stochastic differential equations. The limiting distribution of the maximum of the height profile, the ‘width’ of the family tree, has been found in the finite variance case already by Tak´acs [39]. The relations to stochastic analysis can be further developed in the case σ2 < . Note that the defining equation for C can be written as ∞ Cu dt u= for 0 < u h, and C = 1 for u > h, u Z Y ≤ 0 t where 1 dt h = . Z Y 0 t h is the asymptotic height of the rescaled tree, as will become clear in the next section. By differentiation with respect to u we get dC u H = = Y(C ). u u du Now a Brownian excursion Y solves the stochastic equation dY = dW + (1 Y2 )dt, with a standart Brownian motion W (compare [36], chapter IV, −1−t Y section (40.4)). Viewing C as a time-change, this leads to the stochastic equation H2 dH = √HdB + 1 dt, (cid:18) − 1 C(cid:19) − with a standart Brownian motion B. Pitman [34] also obtains this equation and discusses it in detail, therefore there is no need to trace this aspect further. The idea of using C as a time-change goes back to Lamperti [29]. The mentioned proofs all focus on the relationship to Brownian local time. In contrast we shall not rely on local times in this paper. (A short description of our proof in the finite variance case already appeared in the technical report[26].) Thereason isthatin thecase α < 2theconnection to local times breaks down. Then Y and consequently H exhibits a.s. jumps. SincelocaltimeprocessesofL´evy-processes(ifexistent)havea.s. continuous paths (compare f.e. [5], chapter V.1), they are no longer suited. This can be explained on a heuristic level, too. Consider the following construction, going back to Harris [23] and used by different people. Tra- verse the individuals of T in the following manner: From individual i pass over to its oldest child, which has not yet been visited, resp. return to its predecessor, if all children of i have already been visited. This gives a traversal through T with the root as starting and end point. Each edge is 4 passedtwice, onceforwardandoncebackward. Nextconsidertheassociated random path, which increases one unit, if we change over to a child, and decreases one unit, if we go back to a predecessor. Inthe case of a geometric offspring distribution we get a true random walk excursion, conditioned to return to zero after 2n steps for a CGW(n)-tree. This is due to the lack of memoryofthegeometric distribution. Thenumberofupcrossingsfromlevel k 1tolevel k isequaltothesizeZ ofthek’thgeneration, whichmakesthe k − relation to local times obvious in this situation. In general the random path exhibits complicated dependence properties. In the case σ2 < they are of ∞ a local natureand vanish in the limit n , as was shown by Aldous, such → ∞ that the asymptotic height profile can still be described by Brownian local time. For α < 2 however, the dependence structure survives in the limit. The combinatorics of CGW(n)-trees have been widely studied by means of generating functions (seef.e. [11,17,25,31]). Probabilistic methodshave beenintroducedinKolchin[28]andinparticularbyAldous[2,3]. Ourproof of theorem 1 is based on two probabilistic constructions, which are valid for the infinite variance case, too. Thefirst one will bedescribed in section 2, it establishes a connection between Galton-Watson trees and suitable random walk excursions. Though the relationship has been known for quite a while (compare [10]), the scope of this approach has been enlarged considerably only recently (see [4, 8, 20]). In our context it allows to reduce convergence of Hn toconvergence of excursions. Therequiredcontinuity theorem willbe developed in section 3. The second probabilistic construction, which will be presented in section 4, is size-biasing of Galton-Watson trees. We use it to describethebottom ofthetrees, whichisofsomeinterestof itsown andwill help to check a main condition of the continuity theorem. This concept goes back to Geiger [21], who developed a construction due to Lyons, Pemantle andPeres[30]. Section5addressesthequestionofconvergenceofexcursions. Thus the height profile will be considered as a functional of a random walk excursion S. Other quantities of the tree can be viewed as well as functionals of S. In this manner we can also treat the height profile of random forests, as discussed by Drmota and Gittenberger [12] and Pitman [34] in the finite variance case. We contend ourselves by stating a version of thetheorem, which is valid forinfinitevariances, too. Aconditioned random forest consists of l Galton-Watson trees, conditioned to contain altogether n individuals. Let now Z be the total number of all individuals in generation k k in one of the l trees. Suppose that l γa , as n , with γ 0. n ∼ → ∞ ≥ Then the height profile (Hn) = (a−1Z ) converges in distribution. u u n [nu/an] u The limiting process can be described as follows. Let X = (X ) be γ γ,s 0≤s≤1 a L´evy-process as above, now conditioned to hit γ at the moment s = 1 − for the first time. It is built up from the excursions Y = (Y ) = (X +η) , 0 η γ, η η,s s≤Lη s Tη≤s≤Tη+ ≤ ≤ with T = inf s : X = η , T = inf s : X < η and L = T T . η s η+ s η η+ η { − } { − } − 5 Then the limiting process is given by H , u 0, η,u ≥ 0≤Xη≤γ where H is derived similarly as above from η,u s dt C = sup s L : u . η,u η { ≤ Z Y ≤ } 0 η,t Thesumis a.s. finiteforevery u > 0, whichreflects thefact, thatalsointhe limit only finitely many trees contribute to the height profile. This result can be proved in much the same manner (and with only little additional effort), as we shall obtain Theorem 1 below. For γ = 0 we are back in the situation of Theorem 1. 2 Trees and Random Walk Excursions Ithasbeenknownforsometime,thatGalton-Watson treescanbeimbedded into random walks (compare [8] and the references therein). This is implicit in Dwass’ important paper [10]. Here we give a combinatorical treatment, which allows generalization. Let T be a tree of size n. Suppose that we label the individuals in T with the numbers 1,2,...,n such that the root gets label 1, the individuals in the first generation the labels 2,...,Z +Z 0 1 (say from left to right), the individuals in the second generation the labels Z +Z +1,...,Z +Z +Z and so forth. Using these labels we define a 0 1 0 1 2 random path S = (S(0),S(1),...,S(n)) recursively by S(0) = 1, S(i) = S(i 1)+ξ 1, i= 1,...n, i − − whereξ denotes the offspringnumberof the individualwith label i. We can i imagine that the path arises as follows: To its i’th increment individual i contributes the downward step 1, whereas each of its children contributes − one step +1 upwards. Then each individual is responsible for one upward and one downward step, except the root, which has no predecessor and thus contributes only a step downwards. Since S(0) = 1, S(n) = 0. Furthermore, individuals always have smaller labels than their offspring, therefore the upward step of an individual appears before its downward step. Clearly this implies S(i) > 0 for all i< n, i.e. S is an excursion of length n. Conversely, given such an excursion S of length n, with S(i) S(i 1) 1, we can construct a tree, fitting to the ≥ − − 6 excursion S. Namely, from S we read off ξ = S(i) S(i 1)+1, and the i − − given labelling rule allows us to grow the tree from its root. Thus there is a one-to-one correspondence between trees of size n and excursion of length n. Astotheprobabilisticaspectoftheconstruction note, thatforaGalton- Watson tree the ξ are independent random variables, such that S becomes i an ordinary random walk excursion. Likewise S is a random walk excursion of duration n, if T is a CGW(n)-tree. Remarks 1.) These considerations remain valid for other ways of labelling. One possibility is to label according to depth-first search, which hasbeenexploitedin[4,20]. Ingeneralthefollowingpropertiesarerequired: i) 1 is the label of the root. ii) Theindividualswith labels 1,...,i form asubtreefor any i< n. Inother words: Any individual has a smaller label than any of its children. iii) Given the subtree with labels 1,...,i and the numbers ξ ,...,ξ there is 1 i a rule, which specifies, which child of the individuals 1,...,i gets the label i+1. 2.) A random forest of l trees and n individuals can be described by a random walk path S with S(0) = l, S(i) > 0 for i < n and S(n) = 0. If we label the trees one after the other, then the r’th tree is represented by the part of the random walk between the hitting times of l+1 r and l r. − − In our labelling 1,...,Z + Z + ... + Z are just the members of 0 1 k−1 generation 0 to k 1. They contribute a negative step to S(Z +Z +...+ 0 1 − Z ). Theirchildren, i.etheindividualsingeneration 1tok, addapositive k−1 step. Therefore Z = S(Z +Z ...+Z ). k 0 1 k−1 This is the announced random walk representation for the height profile, which has been used by several authors (see [8]). We transform it into a differential equation. Define u Cn = a−1Z dv, u Z n [nv/an] 0 in particular 1 Cn = (Z +...+Z ). kan/n n 0 k−1 Further let for 0 s 1 ≤ ≤ Sn(s) = a−1S([ns]) n and Yn(s)= Sn(Cn ) for s [Cn ,Cn ). kan/n ∈ kan/n (k+1)an/n 7 Then the above equation translates into the ordinary differential equation d Cn = Yn(Cn), du u u which by integration leads to Cun dt u = for u h and Cn =1 for u > h , Z Yn(t) ≤ n u n 0 with 1 dt h = . n Z Yn(t) 0 h obviously is the height of the rescaled tree. — It is now our plan to n reduce the question of convergence of Hn and Cn to that of Sn and Yn. The next section provides the required continuity statement. 3 A Continuity Theorem Let D and D′ be the spaces of all c´adla´g-functions f : [0,1] R resp. → g : [0, ) R. By endowing them with the common Skorohod-distance ∞ → (compare [14]) we make them to complete metric spaces. Thus functions f n converge to f in D, if there are increasing bijections α : [0,1] [0,1], such n → that, as n , → ∞ sup α (t) t 0, sup f (t) f(α (t)) 0. n n n | − | → | − | → t t Similarlyg ginD′,ifthereareincreasingbijectionsβ :[0, ) [0, ), n n → ∞ → ∞ such that for all v < ∞ sup β (u) u 0, sup g (u) g(β (u)) 0. n n n | − |→ | − | → u≤v u≤v If g is a continuous function, we may choose β (u) = u. n We begin by collecting some analytical facts. Let D be the space of + non-negative f D. ∈ Lemma 2 Let f,f D . n + ∈ s i) For given s the mapping f dt/f(t) (with the possible value ) is 7→ 0 ∞ lower-semicontinuous (and thus mReasurable). ii) If inf f(t)> 0 for some 0 a < b 1, and if f f, then a≤t≤b n ≤ ≤ → s dt s dt sup 0. (cid:12)Z f(t) −Z f (t)(cid:12)→ a≤s≤b(cid:12) a a n (cid:12) (cid:12) (cid:12) (cid:12) (cid:12) 8 Proof Let α as above. As is well-known, it may be assumed without n loss of generality, that they are differentiable functions such that ′ sup α (t) 1 0. | n − | → t i) If f f, then for δ > 0 by Fatou’s Lemma n → s−δ dt s−δ dt s α′ (t)dt liminf liminf n Z0 f(t) ≤ n Z0 fn(α−n1(t)) ≤ n Z0 fn(t) and consequently s dt s dt liminf . Z0 f(t) ≤ n Z0 fn(t) ii) Due to uniform convergence of the integrands s dt s dt sup 0. (cid:12)Z f (t) −Z f(α (t))(cid:12)→ a≤s≤b(cid:12) a n a n (cid:12) (cid:12) (cid:12) (cid:12) (cid:12) Further, substituting α (t) = w n s dt αn(s) dw s dw = Z f(α (t)) Z α′ (t)f(w) → Z f(w) a n a n a uniformly for all s [a,b]. 2 ∈ Define now for f D a function g = φ(f) D′ by + ∈ ∈ s dt g(u) = sup s 1 u . { ≤ | Z f(t) ≤ } 0 Thus as above g(u) dt u = for u < h, and g(u) = 1 for u h, Z f(t) ≥ 0 with 1 dt h = . Z f(t) 0 g is continuous and increasing. It is everywhere differentiable from the right (since f is continuous from the right), and the derivative is given by d+ d+ g(u) = f(g(u)) for u < h and g(u) = 0 for u h. du du ≥ We denote d+ ψ(f)= φ(f). du 9 Lemma 3 Suppose f f in D , sup φ(f )(u) φ(f)(u) 0 and sdt/f(t) < for allns→< 1. Then+ψ(f ) u| ψ(fn) in D−′. | → 0 ∞ n → R 1 Proof Denote g = φ(f), g = φ(f ) and h = dt/f (t). Let α (t) n n n 0 n n be as above. The required bijections β : R R aRre defined as n + + → g−1(α (g (u))) for u h , βn(u) = (cid:26) β (h )n+n(u h ) for u ≤> hn. n n n n − Then sup g(β (u)) g (u) = sup α (g (u)) g (u) n n n n n | − | | − | u≤hn u≤hn sup α (t) t 0. n ≤ | − | → t By assumption it follows sup g(β (u)) g(u) 0. n | − | → u≤hn If h= 1dt/f(t) < , then g−1(s)= sdt/f(t) is uniformly continuous on 0 ∞ 0 [0,1], aRnd it follows R sup β (u) u = sup β (u) u 0. n n | − | | − | → u u≤hn If on the other hand h = , then h and g−1 is uniformly continuous n ∞ → ∞ on every intervall [0,s] with s < 1. In this case we may conclude sup β (u) u 0 n | − | → u≤v for every v > 0. This is one of the desired properties. Next for u > h we have 1 = g (h ) = g (u) and 1 = α (1) = n n n n n α (g (h )) = g(β (h )) = g(β (u)), therefore n n n n n n sup f (g (u)) f(g(β (u))) n n n | − | u = sup f (g (u)) f(g(β (u))) n n n | − | u≤hn = sup f (g (u)) f(α (g (u))) n n n n | − | u≤hn = sup f (t) f(α (t)) 0. n n | − | → t Since ψ(f) = f g and ψ(f ) = f g , also n n n ◦ ◦ sup ψ(f )(u) ψ(f)(β (u)) 0, n n | − | → u which proves the claim. 2 10

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.