Fast Multivariate Power Series Multiplication in Characteristic Zero

Grégoire Lecerf

Gage, École polytechnique (France)

Algorithms Seminar

June 11, 2001

[summary by Ludovic Meunier]

Abstract

Let S be a multivariate power series ring over a field of characteristic zero. The article [5] presents an asymptotically fast algorithm for multiplying two elements of S truncated according to total degree. Up to logarithmic factors, the complexity of the algorithm is optimal, in the sense that it is linear in the size of the output.

1 Introduction

Let k be a field of characteristic zero. We write S = k[[x₁,...,x_n]] for the ring of power series in the n variables x₁,...,x_n. Let I be any ideal of S. Computing at precision I in S means computing modulo the ideal I; in other words, power series in S are regarded as elements of the k-algebra S/I. We denote by m the maximal ideal (x₁,...,x_n) of S and by d any positive integer. The paper [5] addresses the problem of finding a fast algorithm for multiplying two power series in S truncated in total degree d, that is, computed at precision m^{d+1}.

The general question of a fast algorithm for multivariate multiplication in S modulo an arbitrary ideal remains an open problem and has received very little attention in the literature. Previous works (e.g., [2]) investigated computation modulo the ideal (x₁^{d+1},...,x_n^{d+1}), that is, truncation according to the partial degree with respect to each variable x_i. The method used is called Kronecker's substitution and is briefly discussed in Section 3.

The need for multiplication routines modulo m^{d+1} arises in various fields, such as polynomial system solving [7] and the treatment of systems of partial differential equations.

The efficiency of the algorithm is measured with respect to the model of nonscalar complexity. By nonscalar complexity, we understand the number of primitive operations in the field k needed to complete the algorithm, independently of the sizes of the numbers involved (see [3]). We now introduce some notation. We denote by D = deg(m^{d+1}) the degree of the ideal m^{d+1}. D is the number of monomials in S which are not in m^{d+1}, that is, the dimension of the k-algebra S/m^{d+1}. Simple combinatorial considerations give

\[
  D = \deg(m^{d+1}) = \binom{d+n}{n}.
\]

We set C := deg(m^d) and denote by M_u(d) the complexity of the multiplication of two univariate polynomials of degree d in k[t].
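The count D can be checked by brute force for small parameters. The following is a minimal sketch, not part of the original summary; the helper names `monomials_up_to` and `degree_of_truncation` are hypothetical and chosen only for illustration.

```python
from itertools import product
from math import comb


def monomials_up_to(n, d):
    """All exponent tuples (e_1, ..., e_n) with e_1 + ... + e_n <= d.

    These are exactly the monomials that are not in m^(d+1).
    """
    return [e for e in product(range(d + 1), repeat=n) if sum(e) <= d]


def degree_of_truncation(n, d):
    """D = deg(m^(d+1)) = binomial(d + n, n)."""
    return comb(d + n, n)


# Compare the closed formula with an explicit enumeration.
for n, d in [(1, 5), (2, 3), (3, 4)]:
    assert len(monomials_up_to(n, d)) == degree_of_truncation(n, d)
```

The enumeration is exponential in n and is only meant as a sanity check of the formula.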

The next section presents the algorithm; its complexity belongs to

\[
  \mathcal{O}\bigl(D \log^3 D \, \log\log D\bigr). \tag{1}
\]

Since D is the size of the output, the algorithm is optimal, up to the logarithmic factors.

2 The Algorithm

2.1 Description

The first step of the algorithm consists in translating the multivariate problem into a univariate one. This is motivated by the fact that fast algorithms for univariate power series multiplication are known (e.g., [6]).

Let t be a new variable. We consider the substitution

\[
  \tilde{R}_t :\ S/m^{d+1} \longrightarrow k[x_1,\dots,x_n][[t]]/(t^{d+1}), \qquad
  f(x_1,\dots,x_n) \longmapsto f(x_1 t,\dots,x_n t).
\]

If f is an element of S/m^{d+1}, then R_t^~(f) is a univariate power series in the single variable t truncated at degree d. It can then be written R_t^~(f) = f₀^~ + f₁^~ t + ... + f_d^~ t^d, where each coefficient f_i^~ is a homogeneous multivariate polynomial in the variables x₁,...,x_n of total degree i. This remark on the degree suggests that:

- the substitution R_t^~ is optimal, in the sense that it provides a representation of f that retains exactly the monomials forming a basis of S/m^{d+1}. In particular, the algorithm does not suffer from any overhead caused by unnecessary terms (see Section 3);
- in view of the homogeneity of the f_i^~, keeping all of the variables x_i is redundant. The substitution defined by

\[
  R_t :\ S/m^{d+1} \longrightarrow k[x_2,\dots,x_n][[t]]/(t^{d+1}) = \bigl(k[[t]]/(t^{d+1})\bigr)[x_2,\dots,x_n], \qquad
  f(x_1,\dots,x_n) \longmapsto f(t, x_2 t,\dots,x_n t)
\]

reduces the complexity of the evaluation-interpolation step (see below): only n-1 variables, instead of n, are actually needed.
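The two substitutions can be illustrated on a sparse representation. This is a minimal sketch under assumptions that are not in the original: a truncated series is stored as a dictionary mapping exponent tuples (e₁,...,e_n) to coefficients, and the function names `R_tilde` and `R` are hypothetical.

```python
def R_tilde(f, d):
    """f(x1*t, ..., xn*t) mod t^(d+1).

    Returns a list indexed by the power of t; entry i is a dict
    {exponents of (x1, ..., xn): coeff} that is homogeneous of degree i.
    """
    out = [dict() for _ in range(d + 1)]
    for e, c in f.items():
        if sum(e) <= d:          # monomials in m^(d+1) are discarded
            out[sum(e)][e] = c
    return out


def R(f, d):
    """f(t, x2*t, ..., xn*t) mod t^(d+1).

    Same as R_tilde, but the exponent of x1 is dropped: it can be
    recovered as i - (e2 + ... + en) from the power i of t.
    """
    out = [dict() for _ in range(d + 1)]
    for e, c in f.items():
        if sum(e) <= d:
            out[sum(e)][e[1:]] = c
    return out


# f = 1 + 2*x1 + 3*x1*x2 + 5*x2^2, with n = 2 and d = 2
f = {(0, 0): 1, (1, 0): 2, (1, 1): 3, (0, 2): 5}
F = R(f, 2)
```

In this example the coefficient of t² in `F` is the polynomial 3x₂ + 5x₂², stored as `{(1,): 3, (2,): 5}`; no two monomials of f collide because the total degree is carried by t.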

The second step of the algorithm performs the multiplication. Let f and g be two power series in S/m^{d+1} and let h be the product fg in S/m^{d+1}. The equality h = fg turns into

R_t(h) = R_t(f)R_t(g). (2)

Consequently, we concentrate on a fast way to compute R_t(h). We use an evaluation-interpolation scheme. We first consider the evaluation map at the point P = (p₂,...,p_n) in k^{n-1}, defined by

\[
  E_P :\ \bigl(k[[t]]/(t^{d+1})\bigr)[x_2,\dots,x_n] \longrightarrow k[[t]]/(t^{d+1}), \qquad
  f(x_2,\dots,x_n) \longmapsto f(P).
\]

We then apply E_P to equation (2), which yields

\[
  E_P\bigl(R_t(h)\bigr) = E_P\bigl(R_t(f)\bigr)\, E_P\bigl(R_t(g)\bigr) \bmod t^{d+1}. \tag{3}
\]

Equation (3) holds for any point P and computes the product R_t(h) at P by using a univariate power series multiplication algorithm. Such an algorithm is described in [6].
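Equation (3) can be checked numerically on a small example. The sketch below is an illustration only: it assumes that R_t(f) is stored as a list of dictionaries (entry i holds the coefficient of t^i as a polynomial in x₂,...,x_n), and it uses naive quadratic products in place of the fast univariate multiplication of [6]. All function names are hypothetical.

```python
def eval_at(F, P):
    """E_P: evaluate the coefficient of each t^i at the point P."""
    values = []
    for coeff in F:
        total = 0
        for e, c in coeff.items():
            term = c
            for p, k in zip(P, e):
                term *= p ** k
            total += term
        values.append(total)
    return values


def mul_trunc(a, b, d):
    """Product of two univariate series (coefficient lists) modulo t^(d+1)."""
    out = [0] * (d + 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if i + j <= d:
                out[i + j] += ai * bj
    return out


def mul_reference(F, G, d):
    """Reference product in (k[[t]]/(t^(d+1)))[x2, ..., xn], term by term."""
    H = [dict() for _ in range(d + 1)]
    for i, Fi in enumerate(F):
        for j, Gj in enumerate(G):
            if i + j > d:
                continue
            for e1, c1 in Fi.items():
                for e2, c2 in Gj.items():
                    e = tuple(u + v for u, v in zip(e1, e2))
                    H[i + j][e] = H[i + j].get(e, 0) + c1 * c2
    return H


# R_t(f) and R_t(g) for f = 1 + 2*x1 + 3*x1*x2 + 5*x2^2 and g = 1 + x2 (n = 2, d = 2)
F = [{(0,): 1}, {(0,): 2}, {(1,): 3, (2,): 5}]
G = [{(0,): 1}, {(1,): 1}, {}]
P = (2,)
lhs = eval_at(mul_reference(F, G, 2), P)            # E_P(R_t(h))
rhs = mul_trunc(eval_at(F, P), eval_at(G, P), 2)    # E_P(R_t(f)) E_P(R_t(g)) mod t^3
assert lhs == rhs
```

Both sides equal the truncated series 1 + 4t + 30t² at this point.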

The last step of the algorithm consists in reconstructing h from a set of values of R_t(h). We regard R_t(h) as a multivariate polynomial in the variables x₂,...,x_n. There exists an interpolation map

\[
  I :\ \bigl(k[[t]]/(t^{d+1})\bigr)^{C} \longrightarrow \bigl(k[[t]]/(t^{d+1})\bigr)[x_2,\dots,x_n], \qquad
  \bigl(f(P_1),\dots,f(P_C)\bigr) \longmapsto f(x_2,\dots,x_n),
\]

which recovers R_t(h) from the C values {E_{P_1}(R_t(h)),...,E_{P_C}(R_t(h))} taken at pairwise distinct points. The evaluation points P_i, for i in 1,...,C, are chosen to be powers of distinct prime numbers, namely P_i = (p₂ⁱ,...,p_nⁱ), where the p_j are distinct prime numbers. The key point is that the characteristic of the ground field k is zero: by unique factorization, distinct monomials in x₂,...,x_n then take pairwise distinct values at (p₂,...,p_n), so that the points P_i are pairwise distinct and the interpolation problem is well posed. An implementation of both maps E_P and I is described by J. Canny, E. Kaltofen, and Y. Lakshman in [4]. Their method relies on fast univariate multipoint evaluation and interpolation (e.g., [1]).
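The role of the prime numbers can be made concrete. If a polynomial is written as a sum of terms c_e x^e, its value at P_i = (p₂ⁱ,...,p_nⁱ) is the sum of the c_e v_eⁱ, where v_e is the value of the monomial x^e at (p₂,...,p_n); the unknowns c_e therefore satisfy a Vandermonde-type linear system in the v_e. The sketch below only checks that the v_e are pairwise distinct integers for small parameters. The helper `monomial_values` is hypothetical, and restricting to monomials of total degree at most d is an assumption of this illustration.

```python
from itertools import product


def monomial_values(primes, d):
    """Value at (p2, ..., pn) of every monomial in x2, ..., xn of total degree <= d."""
    values = {}
    for e in product(range(d + 1), repeat=len(primes)):
        if sum(e) <= d:
            v = 1
            for p, k in zip(primes, e):
                v *= p ** k
            values[e] = v
    return values


vals = monomial_values((2, 3), 4)
# Unique factorization: distinct exponent vectors give distinct integers,
# so the Vandermonde-type matrix built from these values is invertible.
assert len(set(vals.values())) == len(vals)
```

Over a field of positive characteristic the integers v_e could collide after reduction, which is why the construction is stated in characteristic zero.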

Finally, we reconstruct h from R_t(h). If R_t(h) = h₀ + h₁t + ... + h_d t^d is given, h is obtained by homogenizing each h_i in degree i with respect to the variable x₁ and then evaluating at t = 1.

We are now ready to unfold the algorithm.

MultivariatePS_Mult := proc(f,g)

(1)	F ← R_t(f); G ← R_t(g);		// new representation
(2)	for i from 1 to C do		// evaluation
		F_{P_i} ← E_{P_i}(F); G_{P_i} ← E_{P_i}(G);
(3)	for i from 1 to C do		// univariate multiplication
		H_{P_i} ← F_{P_i} G_{P_i} mod t^{d+1};
(4)	R_t(h) ← I(H_{P_1},...,H_{P_C});		// interpolation
(5)	h ← homogenization of each h_i in degree i with respect to x₁		// reconstruction
	in R_t(h), then evaluation at t = 1;
	return h;
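The five steps can be traced end to end on small inputs. The following is a deliberately naive sketch and not the algorithm of [5]: it works over the rationals, replaces the fast multipoint evaluation and interpolation of [4] by a dense linear solve, and replaces the fast univariate product of [6] by a quadratic one. As an assumption of this sketch, it uses as many points as there are monomials in x₂,...,x_n of total degree at most d. All names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def exponents(nvars, d):
    return [e for e in product(range(d + 1), repeat=nvars) if sum(e) <= d]


def monomial_value(e, P):
    v = Fraction(1)
    for p, k in zip(P, e):
        v *= Fraction(p) ** k
    return v


def R(f, d):
    """Step 1: f(t, x2*t, ..., xn*t) mod t^(d+1), as a list of dicts."""
    out = [dict() for _ in range(d + 1)]
    for e, c in f.items():
        if sum(e) <= d:
            out[sum(e)][e[1:]] = Fraction(c)
    return out


def eval_at(F, P):
    return [sum((c * monomial_value(e, P) for e, c in Fi.items()), Fraction(0))
            for Fi in F]


def mul_trunc(a, b, d):
    out = [Fraction(0)] * (d + 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b[: d + 1 - i]):
            out[i + j] += ai * bj
    return out


def solve(A, b):
    """Gauss-Jordan elimination; A is assumed square and invertible."""
    n = len(A)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]
        M[col] = [x / M[col][col] for x in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]


def multiply(f, g, n, d, primes):
    """Product of f and g modulo m^(d+1); primes = (p2, ..., pn), distinct."""
    F, G = R(f, d), R(g, d)                                       # step 1
    support = exponents(n - 1, d)
    points = [tuple(p ** i for p in primes) for i in range(1, len(support) + 1)]
    H_vals = [mul_trunc(eval_at(F, P), eval_at(G, P), d) for P in points]  # steps 2-3
    A = [[monomial_value(e, P) for e in support] for P in points]
    h = {}
    for i in range(d + 1):                                        # step 4
        for e, c in zip(support, solve(A, [H[i] for H in H_vals])):
            if c != 0:
                h[(i - sum(e),) + e] = c                          # step 5
    return h


# (1 + 2*x1 + 3*x1*x2 + 5*x2^2) * (1 + x2) modulo m^3
f = {(0, 0): 1, (1, 0): 2, (1, 1): 3, (0, 2): 5}
g = {(0, 0): 1, (0, 1): 1}
h = multiply(f, g, n=2, d=2, primes=(2,))
```

In step 5 the exponent of x₁ is restored as i minus the total degree in x₂,...,x_n, which is the homogenization described in the text.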

The next section derives the complexity result claimed by (1).

2.2 Complexity

Steps 1 and 5 can be performed in O(C) operations. We examine the cost of Steps 2, 3, and 4 separately:

Step 2 evaluates the d+1 coefficients of F and G at C points. The C points P_i are chosen to be powers of the n-1 distinct prime numbers (p₂,...,p_n), namely P_i = (p₂ⁱ,...,p_nⁱ). The values of each coefficient at all C points can be computed in O(M_u(C) log C) operations, according to the algorithm for fast multipoint evaluation given in [4]. This yields an overall complexity of O(d M_u(C) log C) for Step 2.
Step 3 performs C univariate power series products. Each multiplication requires O(M_u(d)) operations. The complexity of Step 3 is then O(C M_u(d)).
Step 4 interpolates the d+1 coefficients of H. Each interpolation requires O(M_u(C) log C) operations, also using the algorithm presented in [4]. Step 4 then requires O(d M_u(C) log C) operations.

The overall complexity of the algorithm is then derived by replacing M_u(C) by its estimate O(C log C loglog C) obtained in [6] and noting that C < D log(D)/d. This yields

\[
  \mathcal{O}\bigl(D \log^3 D \, \log\log D\bigr).
\]
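The substitution behind this bound can be spelled out as follows. This is a sketch of the arithmetic only; besides the estimates quoted above, it assumes the elementary bounds C ≤ D and d ≤ D, which are not stated explicitly in the text.

```latex
\begin{align*}
T &= \underbrace{O\bigl(d\,M_u(C)\log C\bigr)}_{\text{Steps 2 and 4}}
   + \underbrace{O\bigl(C\,M_u(d)\bigr)}_{\text{Step 3}} \\
  &= O\bigl(dC\,\log^2 C\,\log\log C\bigr) + O\bigl(dC\,\log d\,\log\log d\bigr)
     && \text{using } M_u(m) = O(m \log m \log\log m) \\
  &\subseteq O\bigl(D \log D \cdot \log^2 D\,\log\log D\bigr)
     && \text{using } dC < D\log D,\ C \le D,\ d \le D \\
  &= O\bigl(D \log^3 D\,\log\log D\bigr).
\end{align*}
```

The evaluation and interpolation steps dominate; the univariate products of Step 3 contribute one logarithmic factor less.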

2.3 Generalization

We mention that van der Hoeven generalized the algorithm to the case where

\[
  I = \bigl( x_1^{d_1} \cdots x_n^{d_n} \ :\ a_1 d_1 + \dots + a_n d_n > d \bigr),
\]

where the a_i are positive integers, by using the substitution defined by

\[
  V_t :\ S/I \longrightarrow k[x_2,\dots,x_n][[t]]/(t^{d+1}), \qquad
  f(x_1,\dots,x_n) \longmapsto f(t^{a_1}, x_2 t^{a_2},\dots,x_n t^{a_n})
\]

instead of R_t. The rest of the algorithm remains unaltered.
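The weighted substitution only changes how the power of t is computed. A minimal sketch, assuming the same dictionary representation of truncated series as an illustration device (exponent tuples mapped to coefficients); the function name `V` and its signature are hypothetical.

```python
def V(f, a, d):
    """f(t^a1, x2*t^a2, ..., xn*t^an) mod t^(d+1).

    a = (a1, ..., an) are the positive integer weights. Entry w of the
    result holds the coefficient of t^w as a dict over (e2, ..., en);
    e1 is recovered as (w - a2*e2 - ... - an*en) / a1.
    """
    out = [dict() for _ in range(d + 1)]
    for e, c in f.items():
        w = sum(ai * ei for ai, ei in zip(a, e))   # weighted degree
        if w <= d:                                 # monomials in I are discarded
            out[w][e[1:]] = c
    return out


# f = x1 + x2 + x1*x2 + x2^2 with weights (1, 2) and d = 3
f = {(1, 0): 1, (0, 1): 1, (1, 1): 1, (0, 2): 1}
Vf = V(f, (1, 2), 3)   # x2^2 has weighted degree 4 and is truncated
```

With all weights equal to 1 this reduces to the substitution R_t.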

3 Appendix: Kronecker's Substitution

Kronecker's substitution is defined by the map

\[
  K_t :\ S/I \longrightarrow k[[t]]/\bigl(t^{(2d+1)^n}\bigr), \qquad
  f(x_1,\dots,x_n) \longmapsto f\bigl(t, t^{2d+1},\dots,t^{(2d+1)^{n-1}}\bigr),
\]

where I = (x₁^{d+1},...,x_n^{d+1}). This substitution truncates power series in partial degree d with respect to each variable x_i. Given a power series f in S/I, one recovers the coefficient of x₁^{e₁}...x_n^{e_n} in f by simply reading off the coefficient of t^{e₁ + (2d+1)e₂ + ... + (2d+1)^{n-1}e_n} in K_t(f). The cost of this algorithm is the cost of the multiplication of two univariate polynomials of degree (2d)ⁿ, that is, O(M_u((2d)ⁿ)). This is the lowest known complexity for multivariate power series multiplication modulo the ideal (x₁^{d+1},...,x_n^{d+1}). In particular, when addressed in this context, the algorithm presented above requires precision m^{nd+1} and yields a similar complexity.
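Kronecker's substitution amounts to writing exponent vectors in base 2d+1. The sketch below is an illustration under stated assumptions: inputs are dictionaries mapping exponent tuples to coefficients, with every partial degree at most d, and the univariate product is done naively rather than with a fast algorithm. The function names are hypothetical.

```python
def pack(f, base):
    """K_t: send x1^e1 ... xn^en to t^(e1 + base*e2 + ... + base^(n-1)*en)."""
    out = {}
    for e, c in f.items():
        k = sum(ei * base ** i for i, ei in enumerate(e))
        out[k] = out.get(k, 0) + c
    return out


def unpack_exponent(k, base, n):
    """Read the n digits of k in the given base, lowest digit first."""
    e = []
    for _ in range(n):
        k, r = divmod(k, base)
        e.append(r)
    return tuple(e)


def kronecker_multiply(f, g, n, d):
    """Product of f and g modulo (x1^(d+1), ..., xn^(d+1))."""
    base = 2 * d + 1           # partial degrees of f*g are at most 2d < base
    a, b = pack(f, base), pack(g, base)
    prod = {}
    for i, ai in a.items():    # naive univariate product
        for j, bj in b.items():
            prod[i + j] = prod.get(i + j, 0) + ai * bj
    h = {}
    for k, c in prod.items():
        e = unpack_exponent(k, base, n)
        if c != 0 and all(ei <= d for ei in e):   # truncate in partial degree
            h[e] = c
    return h


# (1 + x1 + x2) * (1 + x1*x2) modulo (x1^2, x2^2)
f = {(0, 0): 1, (1, 0): 1, (0, 1): 1}
g = {(0, 0): 1, (1, 1): 1}
h = kronecker_multiply(f, g, n=2, d=1)
```

Because each partial degree of the untruncated product is at most 2d, the base 2d+1 digits never carry, which is why the coefficients can be read off directly.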

Kronecker's substitution may be used to compute modulo m^{d+1} as well. However, it results in a significant overhead of O(2ⁿ n!), for fixed n and d ≫ n, with respect to the size of the power series.

References

[1]: Aho (Alfred V.), Hopcroft (John E.), and Ullman (Jeffrey D.). -- The design and analysis of computer algorithms. -- Addison-Wesley Publishing Co., Reading, Mass.-London-Amsterdam, 1975, x+470p. Second printing, Addison-Wesley Series in Computer Science and Information Processing.
[2]: Brent (R. P.) and Kung (H. T.). -- Fast algorithms for composition and reversion of multivariate power series (preliminary version). In Proceedings of a Conference on Theoretical Computer Science Department of Computer Science, University of Waterloo, Waterloo, Ontario (August 1977), pp. 149--158. -- 1977.
[3]: Bürgisser (Peter), Clausen (Michael), and Shokrollahi (M. Amin). -- Algebraic complexity theory. -- Springer-Verlag, Berlin, 1997, xxiv+618p. With the collaboration of Thomas Lickteig.
[4]: Canny (John F.), Kaltofen (Erich), and Lakshman Yagati. -- Solving systems of nonlinear polynomial equations faster. In Gonnet (Gaston) (editor), Symbolic and Algebraic Computation (International Symposium ISSAC'89, Portland, Oregon, USA, July 17-19, 1989). pp. 121--128. -- ACM Press, 1989. Conference proceedings.
[5]: Lecerf (Grégoire) and Schost (Éric). -- Fast multivariate power series multiplication in characteristic zero. -- Available from http://www.medicis.polytechnique.fr/~schost/, 2001.
[6]: Schönhage (A.). -- Schnelle Multiplikation von Polynomen über Körpern der Charakteristik 2. Acta Informatica, vol. 7, n°4, 1976/77, pp. 395--398.
[7]: Schost (Éric). -- Sur la résolution des systèmes polynomiaux à paramètres. -- PhD thesis, École polytechnique, Palaiseau, France, January 2001. Defended on December 7, 2000.
