Birth-Death Processes, Lattice Path Combinatorics, Continued Fractions, and Orthogonal Polynomials

Birth-Death Processes, Lattice Path Combinatorics,
Continued Fractions, and Orthogonal Polynomials

Fabrice Guillemin

France Telecom, CNET, Lannuon, Breizh

Algorithms Seminar

February 2, 1998

[summary by Philippe Flajolet and Fabrice Guillemin]

A properly typeset version of this document is available in postscript and in pdf.

If some fonts do not look right on your screen, this might be fixed by configuring your browser (see the documentation here).

Abstract

Classic works of Karlin-McGregor and Jones-Magnus have established a fully general correspondence between birth-death processes and continued fractions of the Stieltjes-Jacobi type together with their associated orthogonal polynomials. This fundamental correspondence can be revisited in the light of the otherwise known combinatorial correspondence between weighted lattice paths and continued fractions. For birth-death processes, this approach separates clearly the formal apparatus from the analytic-probabilistic machinery and neatly delineates those parameters that are amenable to a treatment by means of continued fractions and orthogonal polynomials.

1 Birth-Death Processes

Consider a particle initially in state 0 that, at any given time, may change to another state 1 (where it stays), with rate l. This means that the probability of a state change in an interval of time of length dt is l dt. Then, the probability p₀(t) that the particle is still in state 0 at time t satisfies

p₀(t+dt)-p₀(t)=-l p₀(t) dt

or p₀'(t)=-l p₀(t), whose solution is an exponential distribution,

p₀(t)=e

-l t

Similarly, a particle initially in state 0 that may change either to state 1 with rate l or to state -1 with rate µ will satisfy (p_j(t) is the probability of being in state j at time t)

p₀(t)=e

-(l+µ)t

, p₁(t)=

l+µ

(1-e

-(l+µ)t

), p_-1(t)=

l+µ

(1-e

-(l+µ)t

The interpretation is obvious: the particle stays in state 0 for a random amount of time with an exponential distribution of rate l+µ and then changes to states -1,+1 with probabilities equal to to l/(l+µ) and µ(l+µ).

In a general birth-death process a particle can be in any state in {0,1,2,...} and when in state j, it can only change to state j+1 at rate l_j or to state j-1 at rate µ_j. By analogy with the model of an evolving population (whose size is represented by the state), the l_j are called birth rates and the µ_j death rates. The general problem is to understand the evolution of a process given values (or properties) of its birth and death rates; see [12, Ch. 4] for an excellent introduction.

Let p_n(t) be the probability of being in state n at time t. An essential rôle is played by the coefficients

p_n =

l₀l₁··· l_n-1

µ₁µ₂··· µ_n

Indeed, a classical result asserts that the process is ergodic (the expected time to return from each state to itself is finite) if and only if

n³1

p_n<¥,

n³0

l_np_n

=+¥.

(The first condition ensures the existence of an invariant measure for the embedded discrete-time Markov chain; the second one guarantees that, in the continuous-time process, the particle is not absorbed at infinity in finite time.) In that case, one has

p_n:=

lim

t®¥

p_n(t)=

p_n

n³1

p_n

where these quantities represent the long run probability of being in state n.

More puzzling is the nonstationary behaviour of the process that is described by the infinite-dimensional differential system

p'_j(t)=l _j-1p_j-1(t)-(l_j+µ_j)p_j(t)+µ_j+1p_j+1(t), p_j(0)=d_j,0. (1)

Although finite-dimensional versions are ``easy'' and reduce to combinations of exponentials, it is precisely the infinite-dimensional character of the system that renders its analysis interesting.

In a series of important papers, Karlin and McGregor [10, 11] have developed a general connection between the fundamental system (1) and an associated family of orthogonal polynomials. Later, Jones and Magnus constructed a direct continued fraction representation; see [8, 9].

This summary is an account of Guillemin's lecture (see [5, 6]), as well as of later developments. The point of view that is adopted here consists in relating the combinatorial theory of lattice paths to birth-death processes in the following way: (i) trajectories of birth-death processes are precisely lattice paths; (ii) lattice paths have generating functions expressed as continued fractions; (iii) the Laplace transform expresses the main parameters of birth-death processes as weighted lattice paths to which the combinatorial theory applies.

2 Lattice Paths and Continued Fractions

It is known that the formal theory of continued fraction expansions for power series is identical to the combinatorial theory of weighted lattice paths; see [1, 2, 4]. Define a path u=(U₀,U₁,...,U_n) to be a sequence of points in the lattice N×N such that if U_j=(x_j,y_j), then x_j=j and |y_j+1-y_j|=1. If successive points are connected by edges, then an edge can only be an ascent (a: y_j+1-y_j=+1), a descent (b: y_j+1-y_j=-1), or a level step (c: y_j+1-y_j=0). Thus a path is always nonnegative and by a horizontal translation, one may always assume that x₀=0. A path can be encoded by a word with a,b,c representing the three types of steps. What we call the standard encoding is such a word in which each step a,b,c is subscripted by the value of the y-coordinate of its associated point. For instance,

w=a₀a₁a₂b₃c₂c₂a₂b₃b₂b₁a₀c₁

encodes a path that connects the source U₀=(0,0) to the destination U₁₂=(12,1). We freely identify a path u defined as a sequence of points, its word encoding w, and the corresponding monomial.

We consider various geometric conditions that may be imposed on paths: H_k,l is the collection of all paths that connect a source at altitude k to a destination at altitude l, H^{[£ h]} denotes paths of height (maximal altitude) at most h, etc.

Theorem 1 The collection H_0,0 of all paths has generating function

H_0,0=

1-c₀-

a₀b₁

1-c₁-

a₁b₂

1-c₂-

a₂b₃

·
·
·

Proof. It suffices to observe that (1-f)^-1=1+f+f²+··· generates symbolically all the sequences with components f. For instance, in H_0,0, the expressions

1-c₀

1-c₀-a₀b₁

1-c₀-

a₀b₁

1-c₁

(2)

generate successively paths composed from c₀ level steps only, paths of height at most 1 without c₁ steps, all paths of height at most 1. The complete continued fraction representation is easily built by stages in a similar fashion.

In particular, the collection of all paths from level 0 to level 0 with height at most h is

H_0,0^{[< h]}=

P_h

Q_h

, (3)

a rational fraction, whose numerators and denominators, P_h,Q_h, each satisfy the recurrence

y_h+1=(1-c_h)y_h-a_h-1b_hy_h-1,

with Q_-1=P₀=0, Q₀=P₁=1. (Linear fractional transformations are 2×2 matrices in disguise!)

Well-known path decompositions, like those based on first or last time at which levels are reached, can then be used provided they are combinatorially ``unambiguous''. This and simple manipulations on linear fractional transformations give access to many geometric constraints in addition to (2) and (3). We cite here some representative identities from [1, 2],

H_0,h-1^[<h]=

a₀a₁··· a_h-1

Q_h

, H_0,k=

b₁b₂··· b_k

(

Q_kH_0,0-P_k

)

, (4)

H_k,l=

Q_k

a₀··· a_k-1b₁··· b_l

(

Q_lH_0,0-P_l

)

, (5)

where the latter holds provided k£ l.

The forms (2), (3) (4), (5) can be converted into bona fide counting generating functions of paths weighted multiplicatively by means of the combinatorial morphism,

c(a_k)=a_kz, c(b_k)=b_k z, c(c_k)=g_kz.

In that case, the continued fraction (2) becomes the general fraction of the J-type (for Jacobi); see [7, 9, 13].

3 The Connection

We illustrate here in its simplest form the many-faceted connection between birth-death processes and continued fractions. It was apparently first stated explicitly by Jones and Magnus but it is implicit in earlier works of Karlin and McGregor. The connection goes through the probabilities p_i,j(t) of being in state j at time t starting from state i and the Laplace transforms,

P_i,j(s) =

ó
õ

p_i,j(t) e^-st dt.

Theorem 2 The Laplace transform of the probability of return to the origin satisfies

P_0,0(s)=

l₀+s-

l₀µ₁

l₁+µ₁+s-

l₁µ₂

·
·
·

We offer here two proofs. A third proof that is based on ``uniformization of time'' can also be given but is omitted in this note.

Proof.[Proof 1]Take the Laplace transform of the fundamental system (1) (so that p_j(t)=p_0,j(t)) and use the induced relations on the ratios P_0,r/P_0,r+1. This proof is the most direct but the least illuminating from a structural standpoint. In particular, this proof does not provide an immediate grasp on the question of deciding which parameters are amenable to continued fraction representations.

Proof.[Proof 2] Examine the times at which the (continuous time) birth-death process {L_t} changes states. This defines an embedded (discrete time) Markov chain {Y_n}. Then the set of trajectories of the chain {Y_n} is exactly the family of lattice paths of Section 2. The method consists in splitting the probabilities by conditioning according to all legal trajectories.

The first observation is that, given a lattice path w=w₁w₂··· w_n, the probability p_0,0(t| w) of being back to 0 at time t having followed the path w is

Pr{L_t=0| w} =Pr ì
í
î S

q₁
+S

q₂
+... +S

q_n
£ t, S

q₁
+S

q₂
+... +S

q_n
+S

q_n+1
> t ü
ý
þ ,
where S_q_{_j} is the random variable that represents the sojourn time at the state q_j determined by w₁··· w_j, while the right-hand side involves q_n+1 that ranges over all legal ``continuations'' of w (in the case of H_0,0, one has w_n+1=a₀ and q_n+1=0). As seen already, the sojourn time at some state e is exponential with parameter (l_e+µ_e) so that its Laplace transform is (l_e+µ_e)/(s+µ_e+l_e).
The second observation is that the probability of a path in the embedded chain is the product of the individual transition probabilities, namely l_j/(l_j+µ_j) and µ_j/(l_j+µ_j).

The different sojourn times are independent by the nature of the process (the strong Markov property satisfied by {L_t}). Also, sums of independent random variables correspond to products of Laplace transforms. Thus, the Laplace transform of the probability in the continuous model of following a path w has a product form; for instance, to w=a₀a₁b₂a₁, there corresponds the transform

æ
ç
ç
è

l₀

l₀+µ₀

l₁

l₁+µ₁

µ₂

l₂+µ₂

l₁

l₁+µ₁

ö
÷
÷
ø

æ
ç
ç
è

l₀+µ₀

s+l₀+µ₀

l₁+µ₁

s+l₁+µ₁

l₂+µ₂

s+l₂+µ₂

l₁+µ₁

s+l₁+µ₁

ö
÷
÷
ø

Thus, the Laplace transform P_0,0(s) is, apart from a fudge factor of 1/(s+l₀), a sum over all paths lattice from zero to zero weighted multiplicatively by the probabilistic morphism,

c'(a_j)=

l_j

s+l_j+µ_j

, c'(b_j)=

µ_j

s+l_j+µ_j

, (6)

with c'(c_j)=0. In other words, one has P_0,0(s)=c'(H_0,0)1/s+l₀, and the statement follows.

The same method applies to the computation of transition probabilities, the analysis of maximum height, and so on. For instance, the probability of reaching state k has

P_0,k(s)=

µ₁µ₂··· µ_k

(

A_k(s)P_0,0(s)-B_k(s)

)

where A_k/B_k is the kth convergent of the continued fraction that represents P_0,0, so that A_k,B_k are simple variants of c'(P_k),c'(Q_k).

Orthogonality.

In the case of paths, the reciprocals of the Q_h polynomials, Q_h(z)=z^hc(Q)(z^-1) are formally orthogonal with respect to a measure defined its moments,

[zⁿ] º

ó
õ zⁿ dµ (z) = H_0,0,n. (7)

Formal aspects of paths and orthogonality are detailed in Godsil's book [3].

A similar orthogonality property then holds for the probabilistic counterparts A_h,B_h of the P_k,Q_k polynomials. This provides alternative expressions of various probabilistic quantities in terms of scalar products involving the measure µ of (7). One can rederive in this way, via the combinatorial theory, a number of formulæ originally discovered by Karlin and McGregor. For instance, one has

p_m,n(t)=p_n

ó
õ

e^-tx q_m(x)q_n(x) dµ(x),

where the q_k polynomials (closely related to the B_k and Q_k) satisfy the recurrence l_nq_n+1 +(x-l_n-µ_n)q_n +µ_n q_n-1=0.

4 So What?

The original motivation for the talk comes from the need to elucidate the behaviour of certain queueing systems in the context of telecommunication applications. For instance, the single server queue (M/M/1) is modelled by l_j=r, µ_j=1, while the infinite server queue (M/M/¥) corresponds to l_j=r, µ_j=j. (Models of population growth lead to considering different types of weights, like l_j=(j+1)r, µ_j=j.) More specifically, the problem is to quantify parameters of some simple statistical multiplexing scheme that describe the quality of service on an ATM link. The relevant model is that of the M/M/¥ queue and parameters are to be analysed, like the duration q of an excursion above some level c, the volume V of lost information, or the number of bursts C in a busy period.

Each parameter leads to a specific continued fraction representation. By Theorem 2, the basic continued fraction of the M/M/¥ process is

s+r-

s+1+r-

·
·
·

This is recognizable as an instance of Gauß's continued fraction associated to a quotient of contiguous hypergeometric functions. The numerator and denominator polynomials are the Poisson-Charlier polynomials that are orthogonal with respect to the Poisson measure.

The quantity V (area) leads to challenging asymptotics questions both for the M/M/¥ queue and for the M/M/1 queue. A simple modification of the basic techniques of this note shows that the bivariate Laplace transform with (s,u) ``marking'' (t,V) is obtained by the modified morphism,

c''(a_j)=

l_j

s+ju+l_j+µ_j

, c''(b_j)=

µ_j

s+ju+l_j+µ_j

In the case of area under the M/M/1 queue, quotients of continuous Bessel functions make an appearance. Stripped of its probabilistic context, the corresponding problem of tail estimation then admits a purely analytic formulation:

Problem. Let A(x) be a function whose Laplace transform is

~

A

(s)=

1

(s)^1/2

J

n(s)+1
æ
ç
ç
è

2(r)^1/2

s
ö
÷
÷
ø

J

n(s)
æ
ç
ç
è

2(r)^1/2

s
ö
÷
÷
ø

, n(s)=(1+r)/s,
with J_n a Bessel function, and r>0 a parameter. Show that, for some constants c₁,c₂, one has

ó
õ

¥

x
A(y) dy ~ c₁ x^-1/4e

-c₂(x)^1/2

, (x®+¥).

Under plausible analytic or probabilistic conjectures, precise (and useful!) quantitative conclusions can be drawn. See the papers by Guillemin and Pinchon [5, 6] for full developments.

References

[1]: Flajolet (P.). -- Combinatorial aspects of continued fractions. Discrete Mathematics, vol. 32, 1980, pp. 125--161.
[2]: Flajolet (P.), Françon (J.), and Vuillemin (J.). -- Sequence of operations analysis for dynamic data structures. Journal of Algorithms, vol. 1, 1980, pp. 111--141.
[3]: Godsil (C. D.). -- Algebraic Combinatorics. -- Chapman and Hall, 1993.
[4]: Goulden (Ian P.) and Jackson (David M.). -- Combinatorial Enumeration. -- John Wiley, New York, 1983.
[5]: Guillemin (Fabrice) and Pinchon (Didier). -- Continued fraction analysis of the duration of an excursion in an M/M/¥ system. Journal of Applied Probability, vol. 35, n°1, 1998, pp. 165--183.
[6]: Guillemin (Fabrice) and Pinchon (Didier). -- Excursions of birth and death processes, orthogonal polynomials, and continued fractions. -- Preprint, 1998. To appear in Journal of Applied Probability. 21 pages.
[7]: Henrici (Peter). -- Applied and Computational Complex Analysis. -- John Wiley, New York, 1977. 3 volumes.
[8]: Jones (William B.) and Magnus (Arne). -- Application of Stieltjes fractions to birth-death processes. In Saff (E. B.) and Varga (Richard S.) (editors), Padé and rational approximation. pp. 173--179. -- New York, 1977. Proceedings of an International Symposium held at the University of South Florida, Tampa, Fla., December 15-17, 1976.
[9]: Jones (William B.) and Thron (W. J.). -- Continued Fractions: Analytic Theory and Applications. -- Addison--Wesley, 1990, Encyclopedia of Mathematics and its Applications, vol. 11.
[10]: Karlin (S.) and McGregor (J. L.). -- The differential equations of birth-and-death processes, and the Stieltjes moment problem. Transactions of the American Mathematical Society, vol. 85, 1957, pp. 489--546.
[11]: Karlin (Samuel) and McGregor (James). -- The classification of birth and death processes. Transactions of the American Mathematical Society, vol. 86, 1957, pp. 366--400.
[12]: Karlin (Samuel) and Taylor (Howard). -- A First Course in Stochastic Processes. -- Academic Press, 1975, second edition.
[13]: Perron (Oskar). -- Die Lehre von der Kettenbrüchen. -- Teubner, 1954, vol. 2.

This document was translated from L^AT_EX by H^EV^EA.