Multivariate Lagrange Inversion

Bruce Richmond

University of Waterloo, Canada

Algorithms Seminar

May 25, 1998

[summary by Danièle Gardy]

Abstract

A new formulation of Lagrange inversion for several variables will be described which does not involve a determinant. This formulation is convenient for the asymptotic investigation of numbers defined by Lagrange inversion. Examples of tree problems where the number of vertices of degree k are counted and where vertices are 2-colored will be given. Non-crossing partitions give another example and the Meir-Moon formula for powers of an inversion is a special case.

1 Running Example

Consider a rooted plane tree whose internal vertices have two or three children and are colored green or red according to the following rules (an example of such a tree is given below):

a green vertex has three children; one is red and the other two are green;
a red vertex has two children, one of each color, and the left one is red.

Enumeration of such trees is best done by taking into account the colors of the vertices: let x₁ and x₂ mark the green and red vertices, and define w₁(x₁,x₂) and w₂ (x₁,x₂) as the functions enumerating the trees whose root is green (resp. red). These functions satisfy the system of equations

w₁ (x₁,x₂) = x₁ (1+ 3 w₁² w₂); w₂ (x₁,x₂) = x₂ (1 + w₁ w₂).

Introducing the vectors x = (x₁,x₂) and w = (w₁,w₂) and the functions f₁ (w) = 1 + 3 w₁² w₂ and f₂ (w) = 1 + w₁ w₂, one obtains the system w₁ (x) = x₁ f₁ (w); w₂ (x) = x₂ f₂ (w). Such equations are very similar to those that can be solved in one dimension by Lagrange inversion, and it is natural to try and solve them with a suitable extension.
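The first coefficients of w₁ and w₂ can be obtained directly by iterating the system on truncated power series. The following is a minimal sketch in Python; the dictionary representation of series and the helper names `mul`, `one_plus` and `solve_system` are choices made for this illustration and do not come from the talk.

```python
def mul(a, b, deg):
    """Product of two bivariate series {(i, j): coeff}, truncated at total degree deg."""
    out = {}
    for (i1, j1), c1 in a.items():
        for (i2, j2), c2 in b.items():
            i, j = i1 + i2, j1 + j2
            if i + j <= deg:
                out[(i, j)] = out.get((i, j), 0) + c1 * c2
    return out


def one_plus(a, scale=1):
    """Return the series 1 + scale * a."""
    out = {(0, 0): 1}
    for key, c in a.items():
        out[key] = out.get(key, 0) + scale * c
    return out


def solve_system(deg):
    """Coefficients of w1 and w2 up to total degree deg, by fixed-point iteration."""
    w1, w2 = {}, {}
    for _ in range(deg):  # each pass gains at least one degree of accuracy
        f1 = one_plus(mul(mul(w1, w1, deg), w2, deg), scale=3)  # 1 + 3 w1^2 w2
        f2 = one_plus(mul(w1, w2, deg))                          # 1 + w1 w2
        # multiply by x1 (resp. x2), dropping terms beyond the truncation degree
        w1 = {(i + 1, j): c for (i, j), c in f1.items() if i + j + 1 <= deg}
        w2 = {(i, j + 1): c for (i, j), c in f2.items() if i + j + 1 <= deg}
    return w1, w2


w1, w2 = solve_system(8)
print(sorted(w1.items()))  # keys are the exponents (green, red)
print(sorted(w2.items()))
```

The key `(i, j)` stands for the monomial x₁ⁱ x₂ʲ, so the value stored under `(3, 1)` in `w1` is the number of green-rooted trees with three green and one red vertex.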

2 Multivariate Lagrange Inversion

In one dimension, Lagrange inversion is used for implicit equations of the type w(x) = x f(w(x)), with f(0) ≠ 0. It relates the coefficients of a solution w(x), or of a function of w(x), as formal power series, to the coefficients of the simpler function f:

$$[x^n]\, w(x) = \frac{1}{n}\, [t^{n-1}]\, f(t)^n; \qquad [x^n]\, g(w(x)) = \frac{1}{n}\, [t^{n-1}]\, g'(t)\, f(t)^n .$$
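A small numerical check may help fix ideas. The sketch below takes f(t) = (1+t)², an example chosen here and not taken from the talk, and compares the coefficients of the solution of w = x (1+w)², computed by iteration, with (1/n) [t^{n-1}] (1+t)^{2n} = (1/n) C(2n, n-1).

```python
from fractions import Fraction
from math import comb


def series_solution(n_max):
    """Coefficients of w(x) = x (1 + w(x))^2 up to x^n_max, by fixed-point iteration."""
    w = [0] * (n_max + 1)
    for _ in range(n_max):  # each pass fixes at least one more coefficient
        p = [1 + w[0]] + w[1:]  # the series 1 + w
        # (1 + w)^2 truncated at degree n_max - 1, then multiplied by x
        sq = [sum(p[i] * p[k - i] for i in range(k + 1)) for k in range(n_max)]
        w = [0] + sq
    return w


def lagrange(n):
    """(1/n) [t^(n-1)] (1 + t)^(2n), the right-hand side of Lagrange inversion."""
    return Fraction(comb(2 * n, n - 1), n)


w = series_solution(8)
for n in range(1, 9):
    assert w[n] == lagrange(n)
print(w[1:])
```

Both computations give the Catalan numbers 1, 2, 5, 14, ... for this choice of f.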

Extensions to the multivariate case have been considered for some time; surveys can be found in the paper written some twelve years back by Gessel [6], or in the recent book by Bergeron, Labelle and Leroux [4]. The version presented below is due to Good [7]:

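Theorem 1 Let x = (x₁,..., x_d) and t = (t₁,..., t_d) be d-dimensional vectors of variables, and let g(t) and f_i(t) (1 ≤ i ≤ d) be formal power series such that f_i(0) ≠ 0. Then the equations w_i = x_i f_i(w) uniquely determine the w_i as formal power series in x, and

$$[\mathbf{x}^{\mathbf{n}}]\, g(\mathbf{w}(\mathbf{x})) = [\mathbf{t}^{\mathbf{n}}] \left( g(\mathbf{t})\, \mathbf{f}(\mathbf{t})^{\mathbf{n}} \left\| \delta_{i,j} - \frac{t_i}{f_j(\mathbf{t})}\, \frac{\partial f_j(\mathbf{t})}{\partial t_i} \right\| \right)$$

with δ_{i,j} the Kronecker symbol, ||A|| the determinant of the matrix A, f = (f₁,..., f_d), and $\mathbf{f}^{\mathbf{n}} = f_1^{n_1} \cdots f_d^{n_d}$.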
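Good's formula can be tested on the colored trees of Section 1. The sketch below is an illustration under stated assumptions: it takes g(t) = t₁, restricts to n₁, n₂ ≥ 1 so that f₁^{n₁-1} f₂^{n₂-1} is a polynomial, and uses the fact that for d = 2 the product of f₁ f₂ with the determinant expands to (f₁ - t₁ ∂f₁/∂t₁)(f₂ - t₂ ∂f₂/∂t₂) - (t₁ ∂f₂/∂t₁)(t₂ ∂f₁/∂t₂). All function names are hypothetical helpers.

```python
def mul(a, b, N):
    """Product of bivariate polynomials {(i, j): c}, keeping exponents <= N."""
    out = {}
    for (i1, j1), c1 in a.items():
        for (i2, j2), c2 in b.items():
            i, j = i1 + i2, j1 + j2
            if i <= N and j <= N:
                out[(i, j)] = out.get((i, j), 0) + c1 * c2
    return out


def power(a, n, N):
    out = {(0, 0): 1}
    for _ in range(n):
        out = mul(out, a, N)
    return out


def sub(a, b):
    out = dict(a)
    for key, c in b.items():
        out[key] = out.get(key, 0) - c
    return out


def theta(a, var):
    """Apply t_var * d/dt_var to a polynomial (var is 0 for t1, 1 for t2)."""
    return {key: c * key[var] for key, c in a.items() if key[var] != 0}


F1 = {(0, 0): 1, (2, 1): 3}  # f1 = 1 + 3 t1^2 t2
F2 = {(0, 0): 1, (1, 1): 1}  # f2 = 1 + t1 t2


def solve(N):
    """Series w1, w2 of the running example, exponents <= N in each variable."""
    w1, w2 = {}, {}
    for _ in range(2 * N + 1):
        f1, f2 = {(0, 0): 1}, {(0, 0): 1}
        for key, c in mul(mul(w1, w1, N), w2, N).items():
            f1[key] = f1.get(key, 0) + 3 * c
        for key, c in mul(w1, w2, N).items():
            f2[key] = f2.get(key, 0) + c
        w1 = {(i + 1, j): c for (i, j), c in f1.items() if i + 1 <= N}
        w2 = {(i, j + 1): c for (i, j), c in f2.items() if j + 1 <= N}
    return w1, w2


def good_coefficient(n1, n2, N):
    """[t1^n1 t2^n2] of g f1^n1 f2^n2 times the determinant, for g = t1 and n1, n2 >= 1."""
    # D = f1 * f2 * determinant, expanded for d = 2 (a polynomial)
    D = sub(mul(sub(F1, theta(F1, 0)), sub(F2, theta(F2, 1)), N),
            mul(theta(F2, 0), theta(F1, 1), N))
    p = mul(mul(power(F1, n1 - 1, N), power(F2, n2 - 1, N), N), D, N)
    return p.get((n1 - 1, n2), 0)  # the factor g = t1 shifts the t1-exponent by one


N = 5
w1, _ = solve(N)
for n1 in range(1, N + 1):
    for n2 in range(1, N + 1):
        assert good_coefficient(n1, n2, N) == w1.get((n1, n2), 0)
```

Here the determinant factor is 1 - 3 t₁² t₂ - 3 t₁³ t₂² after multiplication by f₁ f₂, so the coefficients are obtained as differences of positive terms.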

The determinant in this formula leads to trouble when one tries to get asymptotic information from it. Let us consider the univariate case to see what the problem is.

For d=1, Good's formula applied to the equation w(x) = x f(w(x)) gives an identity equivalent to the one presented above:

$$[x^n]\, w(x) = [t^{n-1}] \left( f(t)^n \left( 1 - t\, \frac{f'(t)}{f(t)} \right) \right). \qquad (1)$$

When one wishes to obtain asymptotics, a natural tool is the saddle-point method, well suited to approximating coefficients of (variations on) large powers of functions; see for example [5] for a summary of results in this area. The idea is to use Cauchy's formula $[z^n]\, F(z) = \frac{1}{2i\pi} \oint F(z)\, z^{-n-1}\, dz$, for F(z) = f(z)ⁿ (1 - z f'(z) / f(z)), with an integration path that is a circle going through the saddle-point r₀; r₀ is itself a perturbation of the saddle-point r₁ that appears in the evaluation of the simpler coefficient [xⁿ] fⁿ(x). Now r₁ is defined as the solution of the equation 1 - x f'(x) / f(x) = 0, so the factor 1 - t f'(t) / f(t) on the right-hand side of (1), and with it the integrand, vanishes at r₁, which is close to r₀!

With care, it should be possible to work this out for one variable, but the outlook for a multi-dimensional extension is not favorable, as we can expect cancellation of the determinant close to the integration paths. Instead, Bender and Richmond have proposed a new multivariate version, better suited to asymptotics; this formula uses the derivatives of a vector with respect to a directed graph.

3 Differentiating a Vector wrt a Directed Graph

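To define the partial derivative of a vector relative to a directed graph, consider all trees with vertices 0, 1,..., d and edges directed towards 0. There are (d+1)^{d-1} such trees; for example for d=2 there are three trees: T₁, with edges 1 → 0 and 2 → 0; T₂, with edges 1 → 2 and 2 → 0; and T₃, with edges 2 → 1 and 1 → 0.

Now the derivative of a (d+1)-dimensional function f according to such a tree is a product of (d+1) terms, where f_i is differentiated once with respect to x_j for each edge j → i entering the vertex labelled by i; this is best explained on the above example, with f = (f₀,f₁,f₂):¹

$$\frac{\partial \mathbf{f}}{\partial T_1} = \frac{\partial^2 f_0}{\partial x_1\, \partial x_2} \cdot f_1 \cdot f_2; \qquad \frac{\partial \mathbf{f}}{\partial T_2} = \frac{\partial f_0}{\partial x_2} \cdot f_1 \cdot \frac{\partial f_2}{\partial x_1}; \qquad \frac{\partial \mathbf{f}}{\partial T_3} = \frac{\partial f_0}{\partial x_1} \cdot \frac{\partial f_1}{\partial x_2} \cdot f_2 .$$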
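The trees involved can be enumerated by brute force for small d. The sketch below, with hypothetical helper names, represents a tree by the map sending each vertex 1,..., d to the endpoint of its outgoing edge, keeps the maps from which every vertex reaches 0, and lists for each vertex i the variables with respect to which f_i is differentiated.

```python
from itertools import product


def reaches_root(v, parent, d):
    """Follow outgoing edges from v for at most d + 1 steps; True if vertex 0 is reached."""
    for _ in range(d + 1):
        if v == 0:
            return True
        v = parent[v]
    return v == 0


def rooted_trees(d):
    """All trees on vertices 0..d with every edge directed towards 0, as parent maps."""
    trees = []
    for choice in product(range(d + 1), repeat=d):
        parent = dict(zip(range(1, d + 1), choice))
        if all(reaches_root(v, parent, d) for v in parent):
            trees.append(parent)
    return trees


def incoming(parent, d):
    """For each vertex i, the list of j with an edge j -> i (f_i is differentiated in x_j)."""
    return {i: [j for j in range(1, d + 1) if parent[j] == i] for i in range(d + 1)}


for d in range(1, 5):
    assert len(rooted_trees(d)) == (d + 1) ** (d - 1)

for tree in rooted_trees(2):
    print(tree, incoming(tree, 2))
```

For d = 2 the three parent maps correspond to the three derivatives displayed above: an empty list for vertex i means that f_i appears undifferentiated.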

4 The New Inversion Formula

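Theorem 2 Under the assumptions of the former theorem,

$$[\mathbf{x}^{\mathbf{n}}]\, g(\mathbf{w}(\mathbf{x})) = \left( \prod_{i=1}^{d} n_i \right)^{-1} [\mathbf{t}^{\mathbf{n}-\mathbf{1}}] \sum_{T} \frac{\partial (g, f_1^{n_1}, \ldots, f_d^{n_d})}{\partial T}$$

where $\mathbf{n}-\mathbf{1} = (n_1 - 1, \ldots, n_d - 1)$ and the sum is over the set of trees T with d+1 vertices.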

Proof. This result is proven in [3]; it relies on the simple formula n [xⁿ] f = [x^{n-1}] ∂f/∂x and on the expansion of a determinant. The terms are all nonnegative as soon as the functions f_i and g have nonnegative coefficients; hence the coefficient [xⁿ] g(w(x)), as a sum of (d+1)^{d-1} such terms, is itself nonnegative and there are no more cancellations.

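What do we obtain for the first values of d? For d=1, the only tree is 1 → 0 and one gets back the classical formula. For d=2, g(t₁,t₂) is a function of two variables and

$$\begin{aligned}
[x_1^{n_1} x_2^{n_2}]\, g(w_1(x_1,x_2), w_2(x_1,x_2))
&= \frac{1}{n_1 n_2}\, [t_1^{n_1-1} t_2^{n_2-1}] \sum_{T \in \{T_1, T_2, T_3\}} \frac{\partial (g, f_1^{n_1}, f_2^{n_2})}{\partial T} \\
&= \frac{1}{n_1 n_2}\, [t_1^{n_1-1} t_2^{n_2-1}] \left( \frac{\partial^2 g}{\partial t_1\, \partial t_2}\, f_1^{n_1} f_2^{n_2} + \frac{\partial g}{\partial t_2}\, f_1^{n_1}\, \frac{\partial (f_2^{n_2})}{\partial t_1} + \frac{\partial g}{\partial t_1}\, \frac{\partial (f_1^{n_1})}{\partial t_2}\, f_2^{n_2} \right) \\
&= \frac{1}{n_1 n_2}\, [t_1^{n_1-1} t_2^{n_2-1}] \left( h\, f_1^{n_1} f_2^{n_2} \right)
\end{aligned}$$

with (f₁ and f₂ are strictly positive at the saddle-points)

$$h := \frac{\partial^2 g}{\partial t_1\, \partial t_2} + n_2\, \frac{\partial g}{\partial t_2}\, \frac{\partial f_2 / \partial t_1}{f_2} + n_1\, \frac{\partial g}{\partial t_1}\, \frac{\partial f_1 / \partial t_2}{f_1} .$$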
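The two-variable formula can be checked on the colored trees of Section 1. The sketch below is only an illustration: it sums the three tree derivatives of (g, f₁^{n₁}, f₂^{n₂}) for g = t₁ and g = t₂, extracts the coefficient of t₁^{n₁-1} t₂^{n₂-1}, divides by n₁ n₂, and compares with the series w₁ and w₂ obtained by iterating the system. All helper names are hypothetical, and the check is limited to n₁, n₂ ≥ 1.

```python
from fractions import Fraction


def mul(a, b, N):
    """Product of bivariate polynomials {(i, j): c}, keeping exponents <= N."""
    out = {}
    for (i1, j1), c1 in a.items():
        for (i2, j2), c2 in b.items():
            i, j = i1 + i2, j1 + j2
            if i <= N and j <= N:
                out[(i, j)] = out.get((i, j), 0) + c1 * c2
    return out


def power(a, n, N):
    out = {(0, 0): 1}
    for _ in range(n):
        out = mul(out, a, N)
    return out


def diff(a, var):
    """Partial derivative with respect to t1 (var = 0) or t2 (var = 1)."""
    out = {}
    for (i, j), c in a.items():
        e = (i, j)[var]
        if e > 0:
            key = (i - 1, j) if var == 0 else (i, j - 1)
            out[key] = out.get(key, 0) + c * e
    return out


F1 = {(0, 0): 1, (2, 1): 3}  # f1 = 1 + 3 t1^2 t2
F2 = {(0, 0): 1, (1, 1): 1}  # f2 = 1 + t1 t2


def solve(N):
    """Series w1, w2 of the running example, exponents <= N in each variable."""
    w1, w2 = {}, {}
    for _ in range(2 * N + 1):
        f1, f2 = {(0, 0): 1}, {(0, 0): 1}
        for key, c in mul(mul(w1, w1, N), w2, N).items():
            f1[key] = f1.get(key, 0) + 3 * c
        for key, c in mul(w1, w2, N).items():
            f2[key] = f2.get(key, 0) + c
        w1 = {(i + 1, j): c for (i, j), c in f1.items() if i + 1 <= N}
        w2 = {(i, j + 1): c for (i, j), c in f2.items() if j + 1 <= N}
    return w1, w2


def theorem2(g, n1, n2):
    """(1 / (n1 n2)) [t1^(n1-1) t2^(n2-1)] of the sum over the three trees."""
    N = n1 + n2  # generous truncation: derivatives need one extra degree
    P1, P2 = power(F1, n1, N), power(F2, n2, N)
    terms = [
        mul(mul(diff(diff(g, 0), 1), P1, N), P2, N),  # T1: edges 1 -> 0 and 2 -> 0
        mul(mul(diff(g, 1), P1, N), diff(P2, 0), N),  # T2: edges 2 -> 0 and 1 -> 2
        mul(mul(diff(g, 0), diff(P1, 1), N), P2, N),  # T3: edges 1 -> 0 and 2 -> 1
    ]
    total = sum(t.get((n1 - 1, n2 - 1), 0) for t in terms)
    return Fraction(total, n1 * n2)


w1, w2 = solve(4)
for n1 in range(1, 5):
    for n2 in range(1, 5):
        assert theorem2({(1, 0): 1}, n1, n2) == w1.get((n1, n2), 0)
        assert theorem2({(0, 1): 1}, n1, n2) == w2.get((n1, n2), 0)
```

Since f₁, f₂ and g have nonnegative coefficients here, each of the three terms contributes a nonnegative amount, in contrast with the determinant form.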

For general d, there is no determinant here, but a finite (although large!) sum of terms, each of which can be evaluated individually. The asymptotic value of [xⁿ] g(w(x)) is obtained by adding the individual asymptotic values of the (d+1)^{d-1} terms.

It is possible to obtain a univariate local limit theorem for the number of red vertices in trees having a fixed number of vertices, or a bivariate local limit theorem for the joint distribution of the numbers of red and green vertices.

5 Local Limit Theorem

The usual approach towards a limit theorem is through the covariance matrix (see for example a former paper by the same authors [1]); checking the non-degeneracy of this matrix leads to intricate conditions, which the authors try to bypass by requiring instead the existence of a multivariate saddle-point. A local limit theorem holds whenever the functions g(x) and f_i(x) (1 ≤ i ≤ d) are analytic; there is also an existence condition on the exponents of the variables in the functions whose coefficients we are studying. Formally, this involves the lattice generated by the exponents k for which the coefficient of t^k in f_i is not zero; see [2] for a precise formulation.

For example, for the colored trees presented in Section 1, the only non-zero coefficients are obtained, besides k = (0,0), for k = (2,1) in f₁, and for k = (1,1) in f₂. The lattice generated by { (1,1), (2,1) } is Z², since (1,0) = (2,1) - (1,1) and (0,1) = 2 (1,1) - (2,1); hence the exponents (i₁,i₂) of the terms $t_1^{i_1} t_2^{i_2}$ appearing in the function $f_1^{k_1} f_2^{k_2}$ are not confined to a proper sublattice.

The saddle-point condition is that we should be able to solve the system of d equations $k_i = \sum_{1 \le l \le d} k_l\, \partial \log f_l / \partial \log \gamma_i$ (with $\gamma_i = e^{\sigma_i}$).

We give the equations below for two variables, to make clear what is going on, but the approach is general and applies in d dimensions.

At some point, we have to compute a coefficient $[t_1^{n_1} t_2^{n_2}]\, (h\, f_1^{n_1} f_2^{n_2})$, where h, f₁ and f₂ are functions of the variables t₁ and t₂. The way to do this is through a saddle-point approximation; more specifically we shall look at $[t_1^{k_1} t_2^{k_2}]\, (h\, f_1^{n_1} f_2^{n_2})$ for k₁ and k₂ of the same order as n₁ and n₂, but not necessarily equal. This coefficient can be written, by Cauchy's formula, as $\frac{1}{(2i\pi)^2} \oint \oint h\, e^{\eta(t_1,t_2)}\, \frac{dt_1\, dt_2}{t_1 t_2}$, with $\eta = n_1 \log f_1 + n_2 \log f_2 - k_1 \log t_1 - k_2 \log t_2$. Now the saddle-points are defined by the two equations ∂η/∂t₁ = 0 and ∂η/∂t₂ = 0, which give the two-dimensional system

$$k_1 = n_1 t_1\, \frac{\partial f_1 / \partial t_1}{f_1} + n_2 t_1\, \frac{\partial f_2 / \partial t_1}{f_2}; \qquad k_2 = n_1 t_2\, \frac{\partial f_1 / \partial t_2}{f_1} + n_2 t_2\, \frac{\partial f_2 / \partial t_2}{f_2} .$$

Applied to our running example, this gives the system in t₁ and t₂

$$k_1 = n_1\, \frac{6 t_1^2 t_2}{1 + 3 t_1^2 t_2} + n_2\, \frac{t_1 t_2}{1 + t_1 t_2}; \qquad k_2 = n_1\, \frac{3 t_1^2 t_2}{1 + 3 t_1^2 t_2} + n_2\, \frac{t_1 t_2}{1 + t_1 t_2} .$$

Define r := k₁ / k₂, with r ∈ ]1,2[. Taking (n₁,n₂) = (k₁,k₂) and solving, we get

$$t_1 = \frac{(r-1)^2}{3\,(2-r)} =: r_1; \qquad t_2 = \frac{3\,(2-r)^2}{(r-1)^3} =: r_2 .$$
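The solution can be verified numerically. The sketch below assumes (n₁,n₂) = (k₁,k₂), as in the general saddle-point condition, and substitutes the closed forms for t₁ and t₂ back into the right-hand sides of the system for the running example; the function names are illustrative.

```python
def saddle_point(r):
    """Closed-form saddle-point (t1, t2) for a ratio r = k1 / k2 with 1 < r < 2."""
    t1 = (r - 1) ** 2 / (3 * (2 - r))
    t2 = 3 * (2 - r) ** 2 / (r - 1) ** 3
    return t1, t2


def system_rhs(k1, k2, t1, t2):
    """Right-hand sides of the saddle-point system, taking (n1, n2) = (k1, k2)."""
    u = 3 * t1 ** 2 * t2  # f1 = 1 + u
    v = t1 * t2           # f2 = 1 + v
    a, b = u / (1 + u), v / (1 + v)
    return 2 * k1 * a + k2 * b, k1 * a + k2 * b


for r in (1.1, 1.5, 1.9):
    k1, k2 = r, 1.0
    t1, t2 = saddle_point(r)
    rhs1, rhs2 = system_rhs(k1, k2, t1, t2)
    assert abs(rhs1 - k1) < 1e-9 and abs(rhs2 - k2) < 1e-9
    print(r, t1, t2)
```

At the saddle-point one finds 3 t₁² t₂ = r - 1 and t₁ t₂ = (2 - r)/(r - 1), which is why r must lie strictly between 1 and 2 for t₁ and t₂ to be positive.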

This gives (k₁,k₂) = n (r / (1+r), 1/(1+r)), where n = k₁ + k₂. The covariance matrix is obtained by differentiation of log f, where $f := f_1^{n_1} f_2^{n_2}$, with f₁ and f₂ defined in Section 1. For example B_{1,1} is the value of t₁ ∂(log f)/∂t₁ + t₁² ∂²(log f)/∂t₁², taken at the point (r₁,r₂), which gives B_{1,1} = n (r-1) (4 + 2r - r²) / (r (1+r)). Similar computations give the other components of the covariance matrix:

$$B = n\, \frac{r-1}{r\,(1+r)} \begin{bmatrix} 4 + 2r - r^2 & 2 + 2r - r^2 \\ 2 + 2r - r^2 & 1 + 2r - r^2 \end{bmatrix} .$$
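These entries can be cross-checked numerically. In the sketch below, which is an illustration under the assumption (n₁,n₂) = (k₁,k₂) = n (r/(1+r), 1/(1+r)), the matrix is computed as the Hessian of log f in the coordinates s_i = log t_i (so that the first diagonal entry equals t₁ ∂/∂t₁ + t₁² ∂²/∂t₁² applied to log f), using central finite differences, and compared with the closed form. The step size and tolerance are arbitrary choices.

```python
from math import exp, log


def log_f(s1, s2, k1, k2):
    """log(f1^k1 f2^k2) in the coordinates t_i = exp(s_i), with (n1, n2) = (k1, k2)."""
    t1, t2 = exp(s1), exp(s2)
    return k1 * log(1 + 3 * t1 ** 2 * t2) + k2 * log(1 + t1 * t2)


def numeric_covariance(r, n=1.0, h=1e-4):
    """Finite-difference Hessian of log f with respect to (log t1, log t2) at the saddle-point."""
    k1, k2 = n * r / (1 + r), n / (1 + r)
    s1 = log((r - 1) ** 2 / (3 * (2 - r)))
    s2 = log(3 * (2 - r) ** 2 / (r - 1) ** 3)

    def F(a, b):
        return log_f(s1 + a, s2 + b, k1, k2)

    b11 = (F(h, 0) - 2 * F(0, 0) + F(-h, 0)) / h ** 2
    b22 = (F(0, h) - 2 * F(0, 0) + F(0, -h)) / h ** 2
    b12 = (F(h, h) - F(h, -h) - F(-h, h) + F(-h, -h)) / (4 * h ** 2)
    return [[b11, b12], [b12, b22]]


def closed_form(r, n=1.0):
    """Closed-form covariance matrix from the text."""
    c = n * (r - 1) / (r * (1 + r))
    q = 2 * r - r ** 2
    return [[c * (4 + q), c * (2 + q)], [c * (2 + q), c * (1 + q)]]


for r in (1.2, 1.5, 1.8):
    num, exact = numeric_covariance(r), closed_form(r)
    for i in range(2):
        for j in range(2):
            assert abs(num[i][j] - exact[i][j]) < 1e-6
```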

References

[1]: Bender (Edward A.) and Richmond (L. Bruce). -- Central and local limit theorems applied to asymptotic enumeration. II. Multivariate generating functions. Journal of Combinatorial Theory. Series A, vol. 34, n°3, 1983, pp. 255--265.
[2]: Bender (Edward A.) and Richmond (L. Bruce). -- Multivariate asymptotics for products of large powers with applications to Lagrange inversion. -- Technical Report n°98-10, Faculty of Mathematics, University of Waterloo, 1998.
[3]: Bender (Edward A.) and Richmond (L. Bruce). -- A multivariate Lagrange inversion formula for asymptotic calculations. Electronic Journal of Combinatorics, vol. 5, n°1, 1998, pp. Research Paper 33, 4 pp. (electronic).
[4]: Bergeron (F.), Labelle (G.), and Leroux (P.). -- Combinatorial species and tree-like structures. -- Cambridge University Press, Cambridge, 1998, Encyclopedia of Mathematics and its Applications, vol. 67, xx+457p. Translated from the 1994 French original by Margaret Readdy, With a foreword by Gian-Carlo Rota.
[5]: Gardy (Danièle). -- Some results on the asymptotic behaviour of coefficients of large powers of functions. Discrete Mathematics, vol. 139, n°1-3, 1995, pp. 189--217.
[6]: Gessel (Ira M.). -- A combinatorial proof of the multivariable Lagrange inversion formula. Journal of Combinatorial Theory. Series A, vol. 45, n°2, 1987, pp. 178--195.
[7]: Good (I. J.). -- The generalization of Lagrange's expansion and the enumeration of trees. Proceedings of the Cambridge Philosophical Society, vol. 61, 1965, pp. 499--517.
[8]: Hwang (Hsien-Kuei). -- On convergence rates in the central limit theorems for combinatorial structures. European Journal of Combinatorics, vol. 19, n°3, 1998, pp. 329--343.

1: Although the definition is more general, trees are the only graphs considered here.
