On the Width of Labeled Trees

Jean-François Marckert

Université de Nancy 1

Algorithms Seminar

April 12, 1999

[summary by Christine Fricker]

Abstract

We consider the set $A_n$ of all rooted labeled trees with $n$ nodes. We denote by $Z_i$ the number of nodes at distance $i$ from the root and by $W_n = \max_{0 \le i \le n} Z_i$ the width of the tree. The aim of the talk is to present results on the convergence of the moments of $W_n$ (suitably renormalized) to those of the maximum of the normalized Brownian excursion, and to give a tight bound for the rate of convergence. The proof relies on connections between the breadth-first search random walk on trees, random walks with Poisson increments, parking functions, and the empirical process of mathematical statistics.

The results presented in this talk were obtained jointly with P. Chassaing.

1 Introduction

A rooted labeled tree with $n$ nodes is a connected graph with $n$ vertices and $n-1$ edges in which one vertex, the root, is distinguished. Tight bounds on the width $W_n$ of rooted labeled trees with $n$ nodes are given, answering an open question of Odlyzko and Wilf [3], who showed that $E(W_n)$ lies between $C_1 \sqrt{n}$ and $C_2 \sqrt{n \log n}$. More precisely, we first recall the result of Takács [6] that $W_n/\sqrt{n}$ converges in distribution to the maximum $m$ of the normalized Brownian excursion, which has the well-known theta distribution given by

$$\Pr(m \le x) = \sum_{k \in \mathbb{Z}} (1 - 4k^2x^2)\, e^{-2k^2x^2}.$$
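The series above can be evaluated numerically. The following is a minimal sketch, not part of the talk: the function name `theta_cdf` and the truncation parameter `terms` are illustrative choices, and the truncation is only meant for moderate values of $x$ (for very small $x$ the terms cancel heavily and the result is dominated by rounding).

```python
import math

def theta_cdf(x: float, terms: int = 50) -> float:
    """Truncated series for Pr(m <= x) = sum_{k in Z} (1 - 4 k^2 x^2) exp(-2 k^2 x^2)."""
    if x <= 0:
        return 0.0
    total = 1.0  # contribution of k = 0
    for k in range(1, terms + 1):
        a = 2.0 * k * k * x * x
        # the terms for k and -k are equal, hence the factor 2;
        # note that 1 - 4 k^2 x^2 = 1 - 2 a
        total += 2.0 * (1.0 - 2.0 * a) * math.exp(-a)
    # clamp to [0, 1] to absorb rounding error of the truncated sum
    return min(1.0, max(0.0, total))
```

With this sketch, `theta_cdf(1.0)` is roughly 0.178, in line with the fact that the mean of $m$ is above 1.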

However, weak convergence does not completely answer the question of Odlyzko and Wilf. To fill this gap, we prove the following.

Theorem 1 For all $p \ge 1$, $|E(W_n^p/n^{p/2}) - E(m^p)| \le C_p\, n^{-1/4} (\log n)^{1/2}$, where $m$ is the maximum of the normalized Brownian excursion.

The moments of m are well-known and given by

$$E(m^p) = p(p-1)\,\Gamma(p/2)\,\zeta(p)\,2^{-p/2}.$$
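As an illustration, the moment formula can be evaluated with the standard library alone. This is a sketch under stated assumptions: the helper `zeta` (a partial sum with an Euler-Maclaurin tail correction) and the name `excursion_max_moment` are not from the talk, and only $p > 1$ is handled directly; the case $p = 1$ is understood as the limit $p \to 1$, where $(p-1)\zeta(p) \to 1$.

```python
import math

def zeta(s: float, cutoff: int = 1000) -> float:
    """Riemann zeta for s > 1: partial sum plus an Euler-Maclaurin tail correction."""
    if s <= 1:
        raise ValueError("this sketch only handles s > 1")
    head = sum(k ** -s for k in range(1, cutoff))
    n = float(cutoff)
    # tail sum_{k >= cutoff} k^{-s}: integral + half the first term + first correction
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12.0
    return head + tail

def excursion_max_moment(p: float) -> float:
    """E(m^p) = p (p-1) Gamma(p/2) zeta(p) 2^(-p/2), evaluated for p > 1."""
    return p * (p - 1) * math.gamma(p / 2) * zeta(p) * 2 ** (-p / 2)
```

For $p = 2$ the formula reduces to $\zeta(2) = \pi^2/6$, and for $p$ close to 1 the value approaches $\sqrt{\pi/2}$, which gives a quick sanity check of the sketch.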

To this end, we prove that there exists a sequence of normalized Brownian excursions with maxima $m_n$ such that, for all $p \ge 1$,

$$E\left( \left| \frac{W_n}{\sqrt{n}} - m_n \right|^p \right) \le C'_p\, n^{-p/4} (\log n)^{1/2},$$

using the fact that, if $q$ is defined by $1/p + 1/q = 1$ and if $X$ and $Y$ are two real random variables in $L_p$, then by Hölder's inequality,

$$|E(X^p) - E(Y^p)| \le p\, \|X - Y\|_p\, \|X + Y\|_p^{p/q}.$$
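One way to obtain this inequality is sketched below. It assumes $X, Y \ge 0$ (which holds for $W_n/\sqrt{n}$ and $m_n$) and is offered as a plausible derivation, not necessarily the argument used in the talk.

```latex
% Assumptions: X, Y \ge 0, p \ge 1, and 1/p + 1/q = 1, so that (p-1)q = p.
\begin{align*}
|x^p - y^p| &\le p\,|x-y|\,\max(x,y)^{p-1} \le p\,|x-y|\,(x+y)^{p-1},
   \qquad x, y \ge 0 \quad \text{(mean value theorem)},\\
|E(X^p) - E(Y^p)| &\le p\,E\bigl(|X-Y|\,(X+Y)^{p-1}\bigr)\\
 &\le p\,\|X-Y\|_p\,\bigl\|(X+Y)^{p-1}\bigr\|_q
   \qquad \text{(H\"older)}\\
 &= p\,\|X-Y\|_p\,\bigl(E\,(X+Y)^{p}\bigr)^{1/q}
  = p\,\|X-Y\|_p\,\|X+Y\|_p^{p/q}.
\end{align*}
```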

2 Relation Between Rooted Labeled Trees and Parking Functions

In hashing with linear probing, or parking, $n$ cars $c_i$ ($1 \le i \le n$) arrive in this order at $n+1$ places $\{0, 1, \dots, n\}$, each with a place $h_i$ chosen at random. Car $c_i$ parks at place $h_i$ if it is still empty; otherwise it tries places $h_i + 1, h_i + 2, \dots$ (mod $n+1$) in turn. We consider parking functions, i.e., sequences $(h_i)_{1 \le i \le n}$ such that place $n$ is left empty. A parking function is alternatively characterized by the sequence $(A_k)_{0 \le k \le n}$, where $A_k = \{i : h_i = k\}$ is the set of cars that arrive at place $k$, with $x_k = \operatorname{card} A_k$. If $y_k$ is the number of cars that tried, at some point, to park at place $k$,

$$y_k = y_{k-1} - 1 + x_k, \qquad y_0 = x_0.$$

The fact that place n is the empty place is given by

$$y_k \ge 1 \quad (0 \le k \le n-1), \qquad y_n = 0,$$

or equivalently

$$\sum_{i=0}^{k} x_i - k \ge 1 \quad (0 \le k \le n-1), \qquad \sum_{i=0}^{n} x_i - n = 0. \tag{1}$$
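The parking scheme and condition (1) can be checked on small cases. The sketch below is illustrative only; the function names (`park`, `arrivals`, `queue_walk`, and so on) are hypothetical and simply transcribe the definitions above.

```python
from itertools import product

def park(h, n):
    """Linear probing on the circular places 0..n; returns the set of occupied places."""
    occupied = set()
    for first_choice in h:
        place = first_choice
        while place in occupied:
            place = (place + 1) % (n + 1)
        occupied.add(place)
    return occupied

def arrivals(h, n):
    """x_k = number of cars whose first choice is place k."""
    x = [0] * (n + 1)
    for first_choice in h:
        x[first_choice] += 1
    return x

def queue_walk(x):
    """y_0 = x_0 and y_k = y_{k-1} - 1 + x_k."""
    y = [x[0]]
    for k in range(1, len(x)):
        y.append(y[-1] - 1 + x[k])
    return y

def satisfies_condition_1(x):
    """Condition (1): y_k >= 1 for 0 <= k <= n-1 and y_n = 0."""
    n = len(x) - 1
    y = queue_walk(x)
    return all(v >= 1 for v in y[:n]) and y[n] == 0

def is_parking_function(h, n):
    """True when place n is left empty."""
    return n not in park(h, n)
```

For instance, with $n = 3$ and first choices `(0, 0, 1)`, `arrivals` gives `[2, 1, 0, 0]` and `queue_walk` gives `[2, 2, 1, 0]`, so condition (1) holds and place 3 stays empty. Enumerating all sequences for small $n$ with `itertools.product` shows that the two characterizations agree.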

A labeled tree with vertices $\{0, 1, \dots, n\}$ rooted at $0$ is also characterized by a sequence of disjoint sets $(A_k)_{0 \le k \le n}$ whose union is $\{1, \dots, n\}$ and whose cardinalities $x_k = \operatorname{card} A_k$ satisfy (1). Indeed, $A_k$ ($k \ge 1$) (respectively $A_0$) is defined as the set of new neighbors of the smallest element in $A_{k-1}$ (respectively of $0$), and (1) is the condition for the tree to be connected and to have root $0$. The number of sequences $(A_k)$ with prescribed cardinalities $(x_k)$ is proportional to the product of the Poisson probabilities $e^{-1}/x_k!$. In other words, the corresponding unlabeled tree is a Galton-Watson tree with Poisson(1) progeny, conditioned to have $n+1$ nodes. Thus the sequence $y = (y_k)_{0 \le k \le n}$ is the discrete excursion of length $n$ of a random walk with increments $x_k - 1$. It is well known that $(y_{\lfloor nt \rfloor}/\sqrt{n})_{0 \le t \le 1}$ converges in distribution to $(e(t))_{0 \le t \le 1}$, where $e$ is a normalized Brownian excursion, and that $\max_k y_k/\sqrt{n}$ converges in distribution to $m = \max_{0 \le t \le 1} e(t)$, which is theta-distributed.

The random walk $(y_k)$ (introduced in [1] and [5]) gives the profile $(Z_k)$ of the tree as a subsequence of $(y_k)$:

$$Z_{k+1} = y_{l(k)}, \qquad \text{where } l(k) = \sum_{i=1}^{k} Z_i,$$

and the width $W_n$ as

$$W_n = \max_k Z_k = \max_k y_{l(k)}.$$
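The relation between the walk and the profile can be made concrete on a small tree. The sketch below assumes that $x_k$ is the number of children of the $k$-th node in breadth-first order (one reading of the construction above); the input format `children` and both function names are hypothetical.

```python
from collections import deque

def bfs_walk_and_profile(children, root=0):
    """Return (y, Z): the queue walk y_k and the profile Z_i of a rooted tree.

    `children` maps a node to the list of its children (illustrative input format).
    """
    order, depth = [], {root: 0}
    queue = deque([root])
    while queue:
        v = queue.popleft()
        order.append(v)
        for c in children.get(v, []):
            depth[c] = depth[v] + 1
            queue.append(c)
    # x_k = number of children of the k-th node visited in breadth-first order
    x = [len(children.get(v, [])) for v in order]
    y, total = [], 0
    for k, xk in enumerate(x):
        total += xk
        y.append(total - k)  # y_k = x_0 + ... + x_k - k
    Z = [0] * (max(depth.values()) + 1)
    for d in depth.values():
        Z[d] += 1
    return y, Z

def profile_from_walk(y):
    """Read the profile off the walk via Z_{k+1} = y_{l(k)}, l(k) = Z_1 + ... + Z_k."""
    Z, l = [1], 0  # Z_0 = 1 (the root) and l(0) = 0
    while y[l] > 0:
        Z.append(y[l])
        l += y[l]
    return Z
```

For the tree in which 0 has children 1 and 2, node 1 has child 3, and node 2 has children 4 and 5, the walk is `[2, 2, 3, 2, 1, 0]` and both functions give the profile `[1, 2, 3]`, so the width is 3.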

We prove in the following proposition that $W_n = \max_k y_{l(k)}$ has the same behavior as $\max_k y_k$.

Proposition 1 For each $p \ge 1$,

$$\left\| W_n - \max_k y_k \right\|_p = O\bigl(n^{1/4} (\log n)^{3/4}\bigr).$$

This result is based on the slow variation of the sequence $y = (y_k)_{0 \le k \le n}$. Indeed, let $W_c(n)$ be the set of sequences $y = (y_k)_{0 \le k \le n}$ such that, for all $k$ and $m$ with $k + m \le n$,

$$|y_{m+k} - y_m| \le c\,(k \log n)^{1/2}.$$

This set satisfies the following lemma.

Lemma 1 For all $a > 0$, there exists $c > 0$ such that, as $n \to \infty$,

$$1 - \Pr(W_c(n)) = o(n^{-a}).$$

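This can be proved using the following fact (see Petrov [4]): if $(Y_k)$ is a random walk with increments $X_k$ satisfying $E(X_k) = 0$ and $E(\exp(a|X_k|)) < \infty$ for some $a > 0$, then there exist $T, C_1, C_2 > 0$ such that

$$\Pr(|Y_k| \ge x) \le \begin{cases} \exp\bigl(-x^2/(4C_1)\bigr) & \text{if } 0 \le x \le C_1 T, \\ \exp(-C_2 x) & \text{if } x \ge C_1 T. \end{cases}$$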

It then remains to prove that $E\bigl((\max_k y_k/\sqrt{n})^p\bigr) \to E(m^p)$ and to estimate the rate of convergence. This is the subject of the next section.
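Proposition 1 can be explored numerically by sampling a random excursion $y$ and comparing $\max_k y_{l(k)}$ with $\max_k y_k$. The sketch below is an illustration under assumptions that are not in the talk: a uniform parking function is drawn by parking with uniform first choices and then rotating the places so that the empty one becomes place $n$; the function names are hypothetical.

```python
import random

def random_excursion(n, rng):
    """Queue walk y_0..y_n of a random parking function of n cars on places 0..n."""
    h = [rng.randrange(n + 1) for _ in range(n)]
    occupied = [False] * (n + 1)
    for first_choice in h:
        place = first_choice
        while occupied[place]:
            place = (place + 1) % (n + 1)
        occupied[place] = True
    empty = occupied.index(False)
    shift = n - empty  # rotate so that the empty place becomes place n
    x = [0] * (n + 1)
    for first_choice in h:
        x[(first_choice + shift) % (n + 1)] += 1
    y, total = [], 0
    for k in range(n + 1):
        total += x[k]
        y.append(total - k)  # y_k = x_0 + ... + x_k - k
    return y

def width_from_walk(y):
    """W_n = max_k y_{l(k)}, using Z_{k+1} = y_{l(k)} and l(k) = Z_1 + ... + Z_k."""
    width, l = 1, 0  # Z_0 = 1 (the root)
    while y[l] > 0:
        width = max(width, y[l])
        l += y[l]
    return width
```

A possible use is to draw `y = random_excursion(n, random.Random(0))` for a few values of `n` and compare `max(y) - width_from_walk(y)` with `n ** 0.25`; the difference is always nonnegative, since the width is a maximum over a subsequence of `y`.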

3 Parking Functions and Empirical Processes

Consider a sequence $(U_i)_{1 \le i \le n}$ of $n$ i.i.d. random variables uniformly distributed on $[0,1]$. Let $F_n$ be the empirical distribution function of $(U_i)_{1 \le i \le n}$, i.e.,

$$F_n(t) = \frac{\operatorname{card}\{i \in \{1, \dots, n\} : U_i \le t\}}{n}, \qquad 0 \le t \le 1.$$

The process $(F_n(t))$ converges to $F(t) = t$, the distribution function of the uniform distribution. In particular, the empirical process $(a_n(t))_{0 \le t \le 1} = (\sqrt{n}\,(F_n(t) - F(t)))_{0 \le t \le 1}$ converges in distribution to the Brownian bridge $(b(t))_{0 \le t \le 1}$.
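The definition of the empirical process translates directly into code. The following is a small sketch; the name `empirical_process` is an illustrative choice.

```python
import bisect
import math

def empirical_process(sample):
    """Return the function t -> a_n(t) = sqrt(n) * (F_n(t) - t) for a sample in [0, 1]."""
    data = sorted(sample)
    n = len(data)

    def a_n(t):
        # F_n(t) = card{i : U_i <= t} / n, computed by binary search
        F_n = bisect.bisect_right(data, t) / n
        return math.sqrt(n) * (F_n - t)

    return a_n
```

For the sample `[0.25, 0.75]`, the process vanishes at $t = 0$, $t = 0.5$ and $t = 1$, and equals $\sqrt{2}/4$ at $t = 0.25$.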

There is a precise connection between parking functions and empirical processes. Indeed, consider the sequence $(U_i)_{1 \le i \le n}$ of i.i.d. random variables uniformly distributed on $[0,1]$ and realize the parking scheme in the following way: if $U_i \in [\frac{k-1}{n+1}, \frac{k}{n+1})$, then car $c_i$ first tries to park at place $h_i = k$. The place $V$ that remains empty at the end is given in terms of the empirical process.

Proposition 2 There is a unique $T(n)$ in $\{0, 1, \dots, n\}$ such that

$$a_n\!\left(\frac{T(n)}{n+1}\right) = \min_{1 \le j \le n} a_n\!\left(\frac{j}{n+1}\right).$$

Moreover, $T(n) = V$.
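Proposition 2 can be tested by simulation. The sketch below relies on two assumptions that the text leaves implicit: the first choice $k = n+1$ is identified with place $0$, and the minimum is also taken over $j = 0$ (where $a_n$ vanishes), so that $T(n)$ can equal $0$. The function name is hypothetical, and integer arithmetic replaces $a_n$ by a positive multiple of it to avoid rounding ties.

```python
import random

def empty_place_and_argmin(uniforms):
    """Return (V, T) for a sample U_1..U_n: the empty place and the argmin index."""
    n = len(uniforms)
    # U_i in [(k-1)/(n+1), k/(n+1)) gives the first choice k in {1, ..., n+1};
    # assumption of this sketch: k = n+1 is identified with place 0.
    k_values = [int(u * (n + 1)) + 1 for u in uniforms]
    occupied = [False] * (n + 1)
    for k in k_values:
        place = k % (n + 1)
        while occupied[place]:
            place = (place + 1) % (n + 1)
        occupied[place] = True
    V = occupied.index(False)
    # a_n(j/(n+1)) is a positive multiple of (n+1) * N_j - n * j,
    # where N_j = card{i : k_i <= j}; the value at j = 0 is 0.
    counts = [0] * (n + 2)
    for k in k_values:
        counts[k] += 1
    best_j, best_val, N_j = 0, 0, 0
    for j in range(1, n + 1):
        N_j += counts[j]
        val = (n + 1) * N_j - n * j
        if val < best_val:
            best_j, best_val = j, val
    return V, best_j
```

On samples drawn with `random.Random(seed).random()`, the two returned values coincide in this setup, which matches the statement $T(n) = V$ under the conventions above.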

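It is easy to deduce that

$$\left| \frac{\max_k y_k}{\sqrt{n}} - \Bigl( \sup_{0 \le t \le 1} a_n(t) - \inf_{0 \le t \le 1} a_n(t) \Bigr) \right| \le \frac{1 + 2e_n}{\sqrt{n}},$$

where $e_n = \sqrt{n}\, \sup_{0 \le t \le 1} \left| a_n\!\left(\frac{\lfloor (n+1)t \rfloor}{n+1}\right) - a_n(t) \right|$ satisfies the following proposition.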

Proposition 3 There exist $A$, $C$ and $K$ such that, for all $x$ and $n$,

$$\Pr(e_n \ge C \log n + x) \le A\, n^{1 - KC} e^{-Kx}. \tag{2}$$

Then $a_n(t)$ is replaced by a Brownian bridge $b_n(t)$ using the following result of Komlós, Major and Tusnády [2].

Theorem 2 There exist a sequence of Brownian bridges $(b_n)_{n \ge 1}$ and constants $A, M, \mu > 0$ such that, for all $n$,

$$a_n(t) = b_n(t) + \frac{c_n(t)}{\sqrt{n}},$$

where $C_n = \sup_{0 \le t \le 1} |c_n(t)|$ satisfies, for all $x$,

$$\Pr(C_n \ge A \log n + x) \le M e^{-\mu x}. \tag{3}$$

Then

$$\left| \frac{\max_k y_k}{\sqrt{n}} - \Bigl( \sup_{0 \le t \le 1} b_n(t) - \inf_{0 \le t \le 1} b_n(t) \Bigr) \right| \le \frac{1 + 2(e_n + C_n)}{\sqrt{n}},$$

where $C_n$ is introduced in Theorem 2. Using the fact that, if $T$ is the almost surely unique point such that $b(T) = \min_{0 \le t \le 1} b(t)$, then $e = (e(t))_{0 \le t \le 1}$, defined by $e(t) = b((T + t) \bmod 1) - b(T)$, is a normalized Brownian excursion independent of $T$, one has

$$\left| \frac{\max_k y_k}{\sqrt{n}} - \sup_{0 \le t \le 1} e_n(t) \right| \le \frac{1 + 2(e_n + C_n)}{\sqrt{n}}. \tag{4}$$
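The rotation $e(t) = b((T + t) \bmod 1) - b(T)$ has a simple discrete analogue, sketched below on a list of bridge values. The function name `rotate_at_minimum` is illustrative. The sketch shows why the supremum of the rotated path equals $\sup b - \inf b$, which is the step that turns the previous bound into (4).

```python
def rotate_at_minimum(b):
    """Discrete analogue of e(t) = b((T + t) mod 1) - b(T).

    `b` lists the values b(0), b(1/N), ..., b(1) of a bridge, so b[0] == b[-1].
    """
    N = len(b) - 1
    T = min(range(N), key=lambda j: b[j])  # first index where the minimum is attained
    return [b[(T + j) % N] - b[T] for j in range(N + 1)]
```

For `b = [0, 1, -2, -1, 0.5, 0]` the rotated path is `[0, 1, 2.5, 2, 3, 0]`: it is nonnegative, starts and ends at 0, and its maximum 3 equals `max(b) - min(b)`.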

Relations (2) and (3) imply that $\| e_n + C_n \|_p$ is bounded by $K_p \log n$, and thus (4) gives the following result.

Theorem 3 For each $p \ge 1$,

$$\left\| m_n - \frac{\max_k y_k}{\sqrt{n}} \right\|_p = O\!\left( \frac{\log n}{\sqrt{n}} \right).$$
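The bound on $\| e_n + C_n \|_p$ used to pass from (4) to Theorem 3 can be justified along the following lines. This is a sketch under stated assumptions, not necessarily the argument of the talk: the constant $C$ in (2) is taken large enough that $KC \ge 1$ (enlarging $C$ preserves (2)), and the constant $A$ of Theorem 2 is written $A'$ to distinguish it from the constant $A$ of Proposition 3.

```latex
% Uses E(X^p) = \int_0^\infty p x^{p-1} \Pr(X \ge x)\,dx for X \ge 0,
% and Minkowski's inequality in L_p. A' denotes the constant A of Theorem 2.
\begin{align*}
E\bigl((e_n - C\log n)_+^p\bigr)
 &= \int_0^\infty p\,x^{p-1}\,\Pr(e_n \ge C\log n + x)\,dx
  \le A\,n^{1-KC}\int_0^\infty p\,x^{p-1}e^{-Kx}\,dx
  \le \frac{A\,\Gamma(p+1)}{K^p},\\
E\bigl((C_n - A'\log n)_+^p\bigr)
 &\le M\int_0^\infty p\,x^{p-1}e^{-\mu x}\,dx
  = \frac{M\,\Gamma(p+1)}{\mu^p},\\
\|e_n + C_n\|_p
 &\le (C + A')\log n
  + \Bigl(\frac{A\,\Gamma(p+1)}{K^p}\Bigr)^{1/p}
  + \Bigl(\frac{M\,\Gamma(p+1)}{\mu^p}\Bigr)^{1/p}
  \le K_p \log n \qquad (n \ge 2).
\end{align*}
```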

It is then easy to deduce Theorem 1.

References

[1]: Aldous (David). -- Brownian excursions, critical random graphs and the multiplicative coalescent. The Annals of Probability, vol. 25, n°2, 1997, pp. 812--854.
[2]: Komlós (J.), Major (P.), and Tusnády (G.). -- An approximation of partial sums of independent RV's, and the sample DF. II. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, vol. 34, n°1, 1976, pp. 33--58.
[3]: Odlyzko (Andrew M.) and Wilf (Herbert S.). -- Bandwidths and profiles of trees. Journal of Combinatorial Theory. Series B, vol. 42, n°3, 1987, pp. 348--370.
[4]: Petrov (V. V.). -- Sums of independent random variables. -- Springer-Verlag, New York-Heidelberg, 1975, x+346p. Translated from the Russian by A. A. Brown, Ergebnisse der Mathematik und ihrer Grenzgebiete, Band 82.
[5]: Spencer (Joel). -- Enumerating graphs and Brownian motion. Communications on Pure and Applied Mathematics, vol. 50, n°3, 1997, pp. 291--294.
[6]: Takács (Lajos). -- Limit distributions for queues and random rooted trees. Journal of Applied Mathematics and Stochastic Analysis, vol. 6, n°3, 1993, pp. 189--216.
