Polylogarithms and Multiple Zeta Values

Michel Petitot

University of Lille I

Algorithms Seminar

April 19, 1999

[summary by Bruno Salvy]

A properly typeset version of this document is available in postscript and in pdf.

If some fonts do not look right on your screen, this might be fixed by configuring your browser (see the documentation here).

The polylogarithm is defined by the series
L
 
x
n1-1
 
0
x1··· x
nk-1
 
0
x1
(z):=
 
å
n1>...>nk>0
z
n1
 
n
s1
 
1
··· n
sk
 
k
.
The convergence of this series at 1 is granted when s1>1, and the limit is denoted z(s1,...,sk) and is called a multiple zeta value since it extends the classical Riemann zeta function. The number åsi is called the weight of the polylogarithm or multiple zeta.

Many polynomial identities relating multiple zeta values at integers are known. For instance, reorganizing double sums yields the following identity between multiple zetas of weight 4:
z(2,2)=
 
å
n1>n2>0
1
n12n22
=
1
2
æ
ç
ç
è
 
å
n
1
n2
ö
÷
÷
ø
2



 
-
1
2
 
å
n
1
n4
=
1
2
(z(2)2-z(4)).     (1)
This could be simplified further using the well-known values of z at even integers.

This is a very active and diverse area. The reader is encouraged to consult [1, 2] for surveys of many beautiful results, generalizations and conjectures. One of the most famous conjectures is the following.
Conjecture 1  [Zagier]   The set of multiple zeta values z(s1,...,sk) with si positive integers, s1³2 and s1+...+sk£ n generates a vector space over Q whose dimension dn obeys
dn+3=dn+1+dn,     d1=0,   d2=d3=1.
Note that since even the irrationality of z(5) is still unproven, this conjecture is completely out of reach. Even a proof that this sequence gives an upper bound is still to be found.

1   Shuffle and Stuffle

The manipulation leading to identity (1) is a special case of a more general mechanism involving products of multiple sums. By considering how indices in multiple sums can be reorganized, it is natural to define the stuffle product of two words over N. (Stuffle is a contraction of ``shuffle'' and ``stuff''.) Using lowercase symbols to denote letters and capital symbols to denote words, this is the formal sum defined recursively by
e* W=W*e=W,     aS* bT=a(S* bT)+b(aS* T)+(a+b)(S* T).
This definition is motivated by the following important stuffle relation:
z(A)z(B)=
 
å
SÎ A* B
z(S).
A simple example is z(2)z(3)=z(2,3)+z(3,2)+z(5). Another one is the identity (1) which is obtained from (2)*(2)=2(2,2)+2(4).

In the same way as the stuffle product arises in the reorganization of multiple sums, multiple integrals lead to considering the shuffle product of words over the alphabet X={x0,x1}. This is defined by the same formula as the stuffle product except that the last term in the sum is omitted. A bijection between words over N* and words of X*x1 is provided by the encoding
(s1,...,sk)« x
s1-1
 
0
x1··· x
sk-1
 
0
x1.
This makes it possible to extend the shuffle product to these words. For instance,
(2) W (2)|® x0x1 W  x0x1=2x0x1x0x1+4x0x0x1x1|®2(2,2)+4(3,1),
(2) W (3)|® x0x1 W  x0x0x1=6x03x12+3x02x1x0x1+x0x1x02x1|®6(4,1)+3(3,2)+(2,3).
The following integral representation is then proved by induction
L
 
x1
(z)=log
1
1-z
= ó
õ
z


0
dt
1-t
,     Lw(z)=
ì
ï
ï
ï
ï
í
ï
ï
ï
ï
î
ó
õ
z


0
dt
t
Lw'(t),  
if w=x0w',
ó
õ
z


0
dt
1-t
Lw'(t),  
if w=x1w'.
    (2)
The recursive definition of the shuffle now reads UV=òU'V+òUV', whence the shuffle relation:
LA(z)LB(z)=
 
å
SÎ A W  B
LS(z).
Setting z=1 in these identities yields new identities concerning multiple z values. Our examples above thus lead to z(2)2=2z(2,2)+4z(3,1), z(2)z(3)=6z(4,1)+3z(3,2)+z(2,3).
Conjecture 2  All known relations concerning multiple zeta values follow from the stuffle product of multiple zetas and the shuffle product of polylogarithms specialized at 1.
This has been checked up to weight 12 [3], and the set of identities thus obtained coincides with the bound provided by Zagier's conjecture.

2   Monodromy and Consequences

A first step towards proving the conjecture above is provided by the following theorem.
Theorem 1  [[4]]   The ideal of algebraic relations between polylogarithms at z is generated by the shuffle relations.
A Lyndon word is a non-empty word which precedes its strict right factors in the lexicographic order. A classical theorem due to Radford states that the Lyndon words form a basis of the shuffle algebra. This leads to the following result.
Corollary 1  The polylogarithms indexed by Lyndon words form a transcendence basis of the polylogarithms. In particular, the classical polylogarithms Lik=Lx0kx1 are algebraically independent.
This theorem is proved for relations involving polylogarithms of weight bounded by a fixed number. Using the shuffle relations, any polynomial in polylogarithms can be reduced to a linear combination of polylogarithms. Since the shuffle relations form a Gröbner basis for the total degree order (degrevlex), any polynomial which is not in the ideal is thus reduced to a nonzero linear combination. The theorem is thus reduced to proving that the polylogarithms are linearly independent. This is done by computing the monodromy of polylogarithms as we now describe.

It turns out to be convenient to prove a more general theorem where polylogarithms with indices ending in x0 are allowed. Consistency with the shuffle relations is achieved with
L
 
x0
(z):= ó
õ
z


1
dt
t
=log z,     L
 
x0m
(z):= ó
õ
z


1
dt
t
L
 
x0m-1
(t)
=
1
m!
logmz.

At z=0, the situation is simple: a word ending with x1 corresponds to an analytic polylogarithm, whence a trivial monodromy. An easy induction on the weight shows that all words ending in x0 can be rewritten as a sum of shuffles of powers of x0 and words ending in x1. Here are the corresponding relations up to weight 3:
L
 
x1x0
=L
 
x1
L
 
x0
-L
 
x0x1
,   L
 
x12x0
=L
 
x12
L
 
x0
-L
 
x1x0x1
-L
 
x0x12
,
L
 
x0x1x0
=L
 
x0x1
L
 
x0
-2L
 
x02x1
,   L
 
x1x02
=L
 
x1
L
 
x02
-L
 
x0x1
L
 
x0
+L
 
x02x1
.
Let  M0f(z) be f(ze2ip); applying  M0-Id on the right-hand sides of these identities only affects the Lx0k. Their monodromy follows from ( M0-Id)Lx0=2ip. Another shuffle thus shows that
( M0-Id )L
 
Ux0
=2ip LU+
 
å
V
µVLV,    ( M0-Id )L
 
Ux1
=0,
where the words V in the sum all have weight smaller than the weight of U.

We now proceed to prove the analogous property at 1 with M1f(1-z):=f((1-z)e2ip):
( M1-Id )L
 
Ux1
=-2ip LU+
 
å
V
µVLV,    ( M1-Id )L
 
Ux0
=0.     (3)
At z=1, words ending with x0 correspond to polylogarithms that are analytic there, hence, have a trivial monodromy. This is the second identity. The situation is slightly more complicated than at the origin because of divergence. As above, an induction on the weight shows that all words beginning with x1 can be rewritten as a sum of shuffles of powers of x1 and words beginning with x0. The monodromy of Lx1k follows from that of the logarithm. The remaining words are those beginning with x0 and ending with x1. Consider the path consisting of a straight line from z to a circle of radius e around 1, turning around 1 in the anti-clockwise direction and coming back to z. Then Cauchy's theorem implies that
( M1-Id )L
 
x0Ux1
(z)=
 
lim
e®0
ó
õ
z


1-e
dt
t
( M1-Id )L
 
Ux1
(t) +
 
lim
e®0
ó
õ
 


|1-t|=e
dt
t
L
 
Ux1
(t).
Another induction shows that the rightmost integral tends to 0, while convergence of LUx1 at 1 reduces the first limit to
ó
õ
z


1
dt
t
( M1-Id )L
 
Ux1
(t)
.
This makes it possible to compute all the monodromies of words ending in x1 and proves (3). Here are the corresponding relations up to weight 3, using p to denote 2ip:
( M1-Id )L
 
x1k
=
k
å
j=1
L
 
x1k-j
(-p)j
j!
,     ( M1-Id )L
 
x0x1
=-pL
 
x0
,   ( M1-Id )L
 
x02x1
=-pL
 
x02
,
( M1-Id )L
 
x0x12
=-p(L
 
x0x1
-z
 
x0x1
)+
p2
2
L
 
x0
,   ( M1-Id )L
 
x1x0x1
=2L
 
x0x12
-pL
 
x1x0
-2pz
 
x0x1
.
The proof of Theorem 1 is concluded by considering the maximal weight involved in a minimal non-trivial linear combination: applying both operators ( M0-Id) and ( M1-Id) leads to linear relations of smaller weight, that have to be trivial.

3   Changes of Variables

The group of six rational functions z, 1-z, 1/z, 1/(1-z), 1-1/z, z/(1-z) permutes the singularities 0, 1, ¥. If h is an element of this group, then
LxU(h(z))= ó
õ
h(z)


0
LU(t)wx(tdt = ó
õ
z


h-1(0)
LU(h(s))wx(h(s))h'(sds.
It turns out that for all h in the group and all xÎ{x0,x1}, wx(h(s))h'(s) can be rewritten as a linear combination of ds/s and ds/(1-s). Thus by induction, all polylogarithms at h(z) can be rewritten in terms of polylogarithms at z. For the classical dilogarithm Li2=Lx1x0, we get
Li2(1-z)+Li 2(z)=L
 
x0
(z)L
 
x1
(z)+z(2), Li2(z)-Li 2(1-z-1)=L
 
x0
(z)L
 
x1
(z)+z(2)+L
 
x02
(z).
Setting z to 1/2,±f,± 1/f,1+f,1-1/f, where f is the golden ratio, yields the only known values of Li2 in closed form.

4   Noncommutative Generating Function

All the inductions mentioned here are conveniently handled by introducing the noncommutative generating function L(z)=åLW(z)W where the sum is over all words of  X={x0,x1}*. The integral representation of polylogarithms is equivalent to a linear differential equation:
d
dz
L(z)= æ
ç
ç
è
x0
z
+
x1
1-z
ö
÷
÷
ø
L(z).
A consequence of the rewriting of words ending by x0 is that all polylogarithms except Lx0k tend to 0 at the origin. This leads to the initial condition L(e)=elne x0+O(e1-d), for e®0, where d is an arbitrarily small real number. The shuffle relation then implies that this generating function is a Lie exponential. A noteworthy consequence is that it can be factored as a product of Lie exponentials indexed by Lyndon words, which turns out to yield an efficient algorithm for computing identities [3].

The inductions used in the monodromy computations translate very explicitly into
M 0L(z)=L(z)e
2ip x0
 
,     M 1L(z)=L(z)Z-1e
-2ip x1
 
Z,
where Z is very close to being the generating function of the multiple zeta values: it is the unique Lie exponential such that
(Z|x0)=(Z|x1)=0,     (Z|x0Wx1)=z
 
x0Wx1
,    WÎ X.
Similarly, the changes of variables can be interpreted at the level of L(z) [5].

References

[1]
Borwein (Jonathan M.), Bradley (David M.), Broadhurst (David J.), and Lisonek (Petr). -- Special values of multiple polylogarithms. Transactions of the American Mathematical Society, 1999. -- To appear.

[2]
Lewin (Leonard) (editor). -- Structural properties of polylogarithms. -- American Mathematical Society, Providence, RI, 1991, xviii+412p.

[3]
Minh (Hoang Ngoc) and Petitot (Michel). -- Lyndon words, polylogarithms and the Riemann z function. Discrete Mathematics, To appear.

[4]
Minh (Hoang Ngoc), Petitot (Michel), and Van der Hoeven (Joris). -- Shuffle algebra and polylogarithms. In Formal Power Series and Algebraic Combinatorics. -- 1998. Proceedings PFSAC'98, Toronto.

[5]
Minh (Hoang Ngoc), Petitot (Michel), and Van der Hoeven (Joris). -- L'algèbre des polylogarithmes par les séries génératrices. In Formal Power Series and Algebraic Combinatorics. -- 1999. Proceedings PFSAC'99, Barcelona.

This document was translated from LATEX by HEVEA.