A Ruelle Operator for continuous time Markov Chains Alexandre Baraviera ∗ Ruy Exel † Artur O. Lopes‡ 8 0 March 17, 2008 0 2 r a M Abstract We consider a generalization of the Ruelle theorem for the case of continuous 7 1 time problems. We present a result which we believe is important for future use in problems in Mathematical Physics related to C∗-Algebras. ] S We consider a finite state set S and a stationary continuous time Markov Chain D X , t ≥ 0, taking values on S. We denote by Ω the set of paths w taking values t h. on S (the elements w are locally constant with left and right limits and are also t right continuous on t). We consider an infinitesimal generator L and a stationary a m vector p . We denote by P the associated probability on (Ω,B). All functions f we 0 [ consider bellow are in the set L∞(P). FromtheprobabilityP wedefineaRuelleoperator Lt,t ≥ 0, actingonfunctions 1 v f : Ω → R of L∞(P). Given V : Ω → R, such that is constant in sets of the form 1 {X = c}, we define a modified Ruelle operator L˜t ,t ≥ 0, in the following way: 0 0 V there exist a certain f such that for each t we consider the operator acting on g 5 V 2 given by 03. L˜tV(g)(w) = [f1V Lt(eR0t(V◦Θs)(.)dsgfV)](w) 8 We are able to show the existence of an eigenfunction uand an eigen-probability 0 v: νV on Ω associated to L˜tV,t ≥ 0. We also show the following property for the probability ν : for any integrable i V X g ∈ L∞(P) and any real and positive t r a e−R0t(V◦Θs)(.)ds [(L˜tV (g)) ◦θt]dνV = gdνV Z Z This equation generalize, for the continuous time Markov Chain, a similar one for discrete time systems (and which is quite important for understandingthe KMS states of certain C∗-algebras). ∗Instituto de Matema´tica, UFRGS, Porto Alegre 91501-970, Brasil. Beneficiary of grant given by CAPES - PROCAD. †DepartamentodeMatema´tica,UFSC,Florianopolis88040-900,Brasil. PartiallysupportedbyCNPq, Instituto do Milenio, PRONEX - Sistemas Dinaˆmicos. ‡Instituto de Matema´tica, UFRGS, Porto Alegre 91501-970, Brasil. Partially supported by CNPq, Instituto doMilenio,PRONEX-SistemasDinaˆmicos,BeneficiaryofgrantgivenbyCAPES-PROCAD. fax number (55)51-33087301 1 1 Introduction We want to extend the concept of Ruelle operator to continuous time Markov Chains. In order to do that we need a probability a priori on paths. This fact is not explicit in the discrete time case (thermodynamic formalism) but it is necessary here. We consider a continuous time stochastic process: the sample paths are functions of the positive real line R = {t ∈ R: t ≥ 0} taking values in a finite set S with n elements, + that we denote by S = {1,2,...,n}. Now, consider a n by n real matrix L such that: 1) 0 < −L , for all i ∈ S, ii 2) L ≥ 0, for all i 6= j, i ∈ S, ij 3) n L = 0 for all fixed j ∈ S. i=1 ij WePpoint out that, by convention, we are considering column stochastic matrices and not line stochastic matrices (see [N] section 2 and 3 for general references). Wedenoteby Pt = etL thesemigroup generated byL. The left actionofthesemigroup can be identified with an action over functions from S to R (vectors in Rn) and the right action can be identified with action on measures on S (also vectors in Rn). The matrix etL is column stochastic, since from the assumptions on L it follows that 1 (1,...,1)etL = (1,...,1)(I +tL+ t2L2 +···) = (1,...,1). 2 It is well known that there exist a vector of probability p = (p1,p2,...,pn) ∈ Rn such 0 0 0 0 that etL(p ) = Ptp = p0 for all t > 0. The vector p is a right eigenvector of etL. All 0 0 0 entries pi are strictly positive, as a consequence of hypothesis 1. 0 Now, let’s consider the space Ω˜ = {1,2,...,n}R+ of all functions from R to S. In + principlethisseemstobeenoughforourpurposes, buttechnicaldetailsintheconstruction of probability measures on such a space force us to use a restriction: we consider the space Ω ⊂ Ω˜ as the set of right-continuous functions from R to S, which also have left limits + in every t > 0. These functions are constant in intervals (closed in the left and open in the right). In this set we consider the sigma algebra B generated by the cylinders of the form {w = a ,w = a ,w = a ,...,w = a }, 0 0 t1 1 t2 2 tr r where t ∈ R , r ∈ Z+,a ∈ S and 0 < t < t < ... < t . It is possible to endow Ω i + i 1 2 r with a metric, the Skorohod-Stone metric d, which makes Ω complete and separable ([EK] section 3.5), but the space is not compact. Now we can introduce a continuous time version of the shift map as follows: we define for each fixed s ∈ R the B-measurable transformation Θ : Ω → Ω given by + s Θ (w ) = w (we remark that Θ is also a continuous transformation with respect to s t t+s s the Skorohod-Stone metric d). For L and p fixed as above we denote by P the probability on the sigma-algebra B 0 defined for cylinders by P({w = a ,w = a ,...,w = a }) = Ptr−tr−1...Pt2−t1Pt1 pa0. 0 0 t1 1 tr r arar−1 a2a1 a1a0 0 2 For further details of the construction of this measure we refer the reader to [B]. The probability P on (Ω,B) is stationary in the sense that for any integrable function f, and, any s ≥ 0 f(w)dP(w) = (f ◦Θ )dP(w). s Z Z From now on the Stationary Process defined by P is denoted by X and all functions t f we consider are in the set L∞(P). There exist a version of P such that for a set of full measure we have that all sample elements w are locally constant on t, with left and right limits, and w is right continuous on t. We consider from now on such probability P acting on this space. From P we are able to define a continuous time Ruelle operator Lt, t > 0, acting on functions f : Ω → R of L∞(P). It’s also possible to introduce the endomorphism α : L∞(P) → L∞(P) defined as t α (ϕ) = ϕ◦Θ , ∀ϕ ∈ L∞(P). t t Werelateinthenext sectiontheconditionalexpectationwithrespect totheσ-algebras F+ with the operators Lt and α , as follows: t t [Lt(f)](Θ ) = E(f|F+). t t Given V : Ω → R, such that it is constant insets of theform{X = c} (i.e., V depends 0 only on the value of w(0)), we consider a Ruelle operator family L˜t , for all t > 0, given V by 1 L˜tV(g)(w) = [ f Lt(eR0t(V◦Θs)(.)dsgfV)](w), V for any given g, where f is a fixed function. V We are able to show the existence of an eigen-probability ν on Ω, for the family L˜t , V V for all t > 0, such that satisfies: Theorem A. For any integrable g ∈ L∞(P) and any positive t 1 Z [ eR0t(V◦Θs)(.)ds fV ] [ Lt( [ eR0t(V◦Θs)(.)dsfV] g ) ◦Θt ] dνV = Z gdνV. The above functional equation is a natural generalization (for continuous time) of the similar one presented in Theorem 7.4 in [EL1] and proposition 2.1 in [EL2] . In [EL1] and [EL2] the important probability in the Bernoulli space is an eigen- probability ν for the Ruelle operator associated to a certain potential V = logH : {1,2,..,d}N → R. This probability ν satisfies: for any m ∈ N, g ∈ C(X), λ E (λ−1g)dν = gdν, m m m Z Z where Hβ[m] Hβ(x)Hβ(σ(x))...Hβ(σn−1(x)) λ (x) = (x) = = m Λ[m] Λ[m](x) 3 elog(Hβ(x))+log(Hβ(σ(x)))+...+log(Hβ)(σn−1(x)) , Λ[m](x) and σ is the shift onthe Bernoulli space {1,2,..,d}N. Here E (f) = E(f|σ−m(B)) denotes m the expected projection (with respect to a initial probability P on the Bernoulii space) on the sigma-algebra σ−m(B), where B is the Borel sigma-algebra and Λ[m] is associated to the Jones index. We refer the reader to [CL] for a Thermodynamic view of C∗-Algebras which include concepts like pressure, entropy, etc.. We believe it will be important in the analysis of certain C∗ algebras associated to continuous time dynamical systems a characterization of KMS states by means of the above theorem. We point out, however, that we are able to show this theorem for a certain ρ just for a quite simple function V as above. In a forthcoming paper we will V consider more general potentials V. One could consider a continuous time version of the C∗-algebra considered in [EL1]. We just give an idea of what we are talking about. Given the above defined P for each t > 0, denote by s : L2(P) → L2(P), the Koopman operator, where for η ∈ L2(P) we t define (s η)(x) = η(θ (x)). t t Another important class of linear operators is M : L2(P) → L2(P), for a given fixed f f ∈ C(Ω), and defined by M (η)(x) = f(x)η(x), for any η in L2(P). We assume that f f is such defines an operator on L2(P) (remember that Ω is not compact). In this way we can consider a C∗-algebra generated by the above defined operators (for all different values of t > 0), then the concept of state, and finally given V and β we can ask about KMS states. There are several technical difficulties in the definition of the above C∗-algebra, etc... Anyway, at least formally, there is a need for finding ν which is a solution of an equation of the kind we describe here. We need this in order to be able to obtain a characterization of KMS states by means of an eigen-probability for the continuous time Ruelle operator. This setting will be the subject of a future work. This was the motivation for our result. With the operators α and L we can rewrite the theorem above as ρ (G−1E (G ϕ)) = ρ (ϕ), V T T T V for all ϕ ∈ L∞ and all T > 0, where, as usual, ρ (ϕ) = ϕdρ , E = α LT is in fact a V V T T projection on a subalgebra of B, and G : Ω → R is given by T R T G (x) = exp( V(x(s))ds). T Z0 For the map V : Ω → R, which is constant in cylinders of the form {w = i}, 0 i ∈ {1,2,...,n}, we denote by V the corresponding value. We also denote by V the i diagonal matrix with the i-diagonal element equal to V . i Now, consider Pt = et(L+V). The classical Perron-Frobenius Theorem for such semi- V group will be one of the main ingredients of our main proof. 4 As usual, we denote by F the sigma-algebra generated by X . We also denote by F+ s s s the sigma-algebra generated σ({X ,s ≤ u}). Note that a F+-measurable function f(w) u s on Ω does depend of the value w . s We also denote by I the indicator function of a measurable set A in Ω. A 2 A continuous time Ruelle Operator We consider the disintegration of P given by the family of measures, indexed by the elements of Ω and t > 0 defined as follows: first, consider a sequence 0 = t < t < ··· < 0 1 t < t ≤ t < ··· < t . Then for w ∈ Ω and t > 0 we have on cylinders: j−1 j r µw([X = a ,...,X = a ]) = t 0 0 tr r 1 Pt−tj−1 ···Pt2−t1Pt pa0 if a = w(t ),...,a = w(t ) pw(t) w(t)aj−1 a2a1 a1a0 0 j j r r 0 ( 0 otherwise. Proposition 2.1. µw is the disintegration of P along the fibers Θ−1(.). t t Proof: It is enough to show that for any integrable f fdP = f(x)dµw(x)dP(w). t ZΩ ZΩZΘ−t1(w) For doing that we can assume that f is in fact the indicator of the cylinder [X = 0 a ,...,X = a ]; then the right hand side becomes 0 tr r 1 fdµw(x)dP(w) = I Pt−tj−1 ···Pt1 pa0dP(w) = t [w(tj)=aj,...,w(tr)=ar]pw(t) w(t)aj−1 a1a0 0 Z Z Z 0 n 1 I Pt−tj−1 ···Pt1 pa0dP(w) = [w(t)=a,w(tj)=aj,...,w(tr)=ar]pw(t) w(t)aj−1 a1a0 0 a=1Z 0 X n 1 Ptr−tr−1...Ptj−tpa Pt−tj−1 ···Pt1 pa0 = Ptr−tr−1 ···Pt1 pa0 = arar−1 aja 0pa aaj−1 a1a0 0 arar−1 a1a0 0 a=1 0 X P([X = a ,...,X = a ]) = fdP. 0 0 tr r Z In the second inequality we use the fact that P is stationary. The proof for a general f follows from standard arguments. Definition 2.2. For t fixed we define the operator Lt : L∞(Ω,P) → L∞(Ω,P) as follows: Lt(ϕ)(x) = ϕ(y¯)dµx(y¯). t Zy¯∈Θ−t1(x) 5 Remark 2.3. The definition above can be rewritten as Lt(ϕ)(x) = ϕ(yx)dµx(yx), t Zy∈D[0,t) where the symbol yx means the concatenation of the path y with the translation of x: y(s) if s ∈ [0,t) xy(s) = x(s−t) if s ≥ t, (cid:26) and, D[0,t) is the set of right-continuous functions from [0,t) to S. This follows simply from the fact that, in this notation, Θ−1(x) = {yx: y ∈ D[0,t)}. t Note that the value lim y(s) do not have to be necessarily equal to x(0). s→t In order to understand better the definitions above we apply the operator to some simple functions. For example, we can see the effect of Lt on some indicator function of a given cylinder: consider the sequence 0 = t < t < .. < t < t ≤ t < ... < t and then 0 1 j−1 j r takef = I . Then, forapathz ∈ Ωsuchthatz = a ,...,z = a {X0=a0,Xt1=a1,...,Xtr=ar} tj−t j tr−t r (the future condition) we have 1 Lt(f)(z) = Pt−tj−1...Pt2−t1Pt1 pa0, pz0 z0aj−1 a2a1 a1a0 0 0 otherwise (i.e., if the path z does not satisfy the condition above) we get Lt(f)(z) = 0. Note that if t < t, then Lt(f)(z) depends only on z . For example, if f = I r 0 {X0=i0} then 1 Lt(f)(z) = I (yx)dµz(yx) = µz([X = i ]) = Pt pi0. {X0=i0} t t 0 0 pz0 z0,i0 0 Zy∈D[0,t) 0 In the case f = I , then Lt(f)(z) = Pt pi00, if z = j , and Lt(f)(z) = 0 {X0=i0,Xt=j0} z0,i0pz0 0 0 0 otherwise. We describe bellow some properties of Lt. Proposition 2.4. Lt(1) = 1, where 1 is the function that maps every point in Ω to 1. Proof: Indeed, Lt(1)(x) = 1(yx)dµx(yx) = dµx(yx) = µx([X = x(0)]) = t t t t Zy∈D[0,t) Z n n 1 µx([X = a,X = x(0)]) = Pt pa = 1. t 0 t px(0) x(0)a 0 a=1 0 a=1 X X We can also define the dual of Lt, denoted by (Lt)∗, acting on the measures. Then we get: 6 Proposition 2.5. For any positive t we have that (Lt)∗(P) = (P). Proof: For a fixed t we have that (Lt)∗(P) = (P), because for any f of the form f = I , 0 = t < t < .. < t < t ≤ t < ... < t , we get {X0=a0,Xt1=a1,...,Xtr=ar} 0 1 j−1 j r n Lt(f)(z)dP(z) = Lt(f)(z)dP(z) = Z b=1 Z{X0=b} X n 1 I (z)dP(z) Pt−tj−1...Pt2−t1Pt1 pa0 = {X0=b,Xtj−t=aj,...,Xtr−t=ar} pb baj−1 a2a1 a1a0 0 b=1 Z 0 X n 1 P({X = b,X = a ,X = a ,...,X = a }) Pt−tj−1...Pt2−t1Pt1 pa0 = 0 tj−t j tj+1−t j+1 tr−t r pb baj−1 a2a1 a1a0 0 b=1 0 X n 1 Ptr−tr−1...Ptj+1−tjPtj−tpb Pt−tj−1...Pt2−t1Pt1 pa0 = arar−1 aj+1aj ajb 0 pb baj−1 a2a1 a1a0 0 b=1 0 X f(w)dP(w). Z Proposition 2.6. Given t ∈ R , and the functions ϕ,ψ ∈ L∞(P), then + Lt(ϕ×(ψ ◦Θ ))(z) = ψ(z)×Lt(ϕ)(z). t Proof: Lt(ϕ(ψ ◦Θ ))(x) = ϕ(ix)(ψ ◦Θ )(ix)dµx(i) = t t t Zi∈D[0,t) ψ(x) ϕ(ix)dµx(i) = (ψLt(ϕ))(x) = ψ(x)Lt(ϕ)(x), t Zi∈D[0,t) since ψ ◦Θ (ix) = ψ(x), independently of i. t We just recall that the last proposition can be restated as Lt(ϕα (ψ)) = ψLt(ϕ). t Then we get: Proposition 2.7. α is the dual of Lt on L2(P). t 7 Proof: From last two propositions Lt(f)gdP = Lt(f ×(g ◦Θ ))dP = f ×(g ◦Θ )dP = fα (g)dP, t t t Z Z Z Z as claimed. We want to obtain conditional expectations in a more explicit form. For a given f, recall that the function Z(w) = E(f|F+) is the Z (almost everywhere defined) F+- t t measurable function such that for any F+-measurable set B we have Z(w)dP(w) = t B f(w)dP(w). B R RProposition 2.8. The conditional expectation is given by E(f|F+)(x) = fdµx. t t Z Proof: For t fixed, consider a F+-measurable set B. Then we have t fdµwdP(w) = (I (w) fdµw)dP(w) = t B t ZBZ Z Z (fI )dµwdP(w) = f(w)I (w)dP(w) = fdP. B t B Z Z Z ZB Now we can relate the conditional expectation with respect to the σ-algebras F+ with t the operators Lt and α as follows: t Proposition 2.9. [Lt(f)](Θ ) = E(f|F+). t t Proof: ThisfollowsfromthefactthatforanyB = {X = b ,X = b ,...,X = b }, s1 1 s2 2 su u with t < s < ... < s , we have I = I ◦Θ for some measurable A and 1 u B A t Lt(f)(Θ (w))dP(w) = I (w)Lt(f)(Θ (w))dP(w) = t B t ZB Z (I ◦Θ )(w)Lt(f)(Θ (w))dP(w) = I (w)Lt(f)(w)dP(w) = A t t A Z Z I (Θ (w))f(w)dP(w)= f(w)dP(w). A t Z ZB 8 3 The modified operator Ruelle Operator associated to V We are interested in the perturbation by V (defined above) of the Lt operator. Definition 3.1. We define G : Ω → R as t t G (x) = exp( V(x(s))ds) t Z0 Definition 3.2. We define the G-weigthed transfer operator Lt : L∞(Ω,P) → L∞(Ω,P) V acting on measurable functions f (of the form f = I ) by {X0=a0,Xt1=a1,...,Xtr=ar} Lt (f)(w) := Lt(G f) = V t n = Lt(eR0t(V◦Θs)(.)dsf ) = Lt(eR0t(V◦Θs)(.)dsI{Xt=b}f )(w). b=1 X Note that eR0t(V◦Θs)(.)dsI{Xt=b} does not depend on information for time larger than t. In the case f is such that t ≤ t (in the above notation), then Lt (f)(w) depends only on r V w(0). The integration on s above is consider over the open interval (0,t). We will show next the existence of an eigenfunction and an eigen-measure for such operator Lt . First we need the following: V Theorem (Perron-Frobenius for continuous time). ([S] page 111) Given L, p and 0 V as above, there exists a) a unique positive function u : Ω → R, constant equal to the value ui in each V V cylinder X = i, i ∈ {1,2,..,n}, (sometimes we will consider u as a vector in Rn). 0 V b) a unique probability vector µ in Rn(a probability over over the set {1,2,..,n} such V that µ ({i}) > 0, ∀i), that is, V n (u ) (µ ) = 1, V i V i i=1 X c) a real positive value λ(V), such that for any positive s e−sλ(V)u es(L+V) = u . V V d) Moreover, for any v = (v ,.,v ) ∈ Rn 1 n n lim e−tλ(V)vet(L+V) = ( v (µ ) )u , i V i V t→∞ i=1 X 9 e) for any positive s e−sλ(V)es(L+V)µ = µ . V V From property e) it follows that (L+V)µ = λ(V)µ , V V or (L+V −λ(V)I)µ = 0. V From c) it follows that u (L+V) = λ(V)u , V V or u (L+V −λ(V)I) = 0. v We point out that e) means that for any positive t we have (Pt)∗µ = eλ(V)tµ . V V V Note that when V = 0, then λ(V) = 0, µ = p0 and u is constant equal to 1. V V Now we return to our setting: for each i and t fixed one can consider the probability 0 µt defined over the sigma-algebra F− = σ({X |s ≤ t}) with support on {X = i } such i0 t s 0 0 that for cylinder sets with 0 < t < ... < t ≤ t 1 r µt ({X = i ,X = a ,...,X = a ,X = j }) = Pt−tr...Pt2−t1Pt1 . i0 0 0 t1 1 tr−1 r−1 t 0 j0ar a2a1 a1i0 The probability µt is not stationary. i0 We denote by Q(j,i) the i,j entry of the matrix et(L+V), that is (et(L+V)) . t j,i It is known ([K] page 52 or [S] Lemma 5.15) that Q(j0,i0)t = E{X0=i0}{eR0t(V◦Θs)(w)ds; X(t) = j0} = I{Xt=j0} eR0t(V◦Θs)(w)dsdµti0(w). Z For example, I{Xt=j0}eR0t(V◦Θs)(w)dsdP = Q(j0,i)tp0i. Z i=1,2,..,n X In the particular case where V is constant equal 0, then p0 = µ and λ(V) = 0. V We denote by f = f , where f(w) = f(w(0)), the density of probability µ in S with V V respect to the probability p0 in S. Therefore, fdp0 = 1. Proposition R3.3. f (w) = µV(w) = (µV)w(0), f : Ω → R, is an eigenfunction for Lt V p0(w) (p0)w(0) V V with eigenvalue etλ(V). 10