Fraïssé-Ehrenfeucht Games and Asymptotics

Alan Woods

University of Western Australia

Algorithms Seminar

March 23, 1998

[summary by Julien Clément and Jean-Marie Le Bars]

Abstract

Fraïssé-Ehrenfeucht games are played on two structures, where a structure might, for example, consist of a unary function mapping a finite set into itself. Via generating series and a Tauberian theorem, it is possible to investigate the asymptotic probability of having a winning strategy for such a game, when it is played using a fixed structure, and a random structure of size n, with n going to infinity. Actually for unary functions this gives a convergence law for all properties of the structure which are definable in monadic second order logic.

1 Introduction

We consider here structures $\mathcal{A}$ based upon a set $A$ and finitely many relations $E_j$ of finite arity:

\[ \mathcal{A} = \langle A, E_1(x,y), E_2(x), E_3(x,y,z), \dots \rangle. \]

A classical example is a set of vertices $V$ and an edge relation $E(x,y)$, so that $\mathcal{V} = \langle V, E \rangle$ describes a graph. We can also think of simple structures $\mathcal{A} = \langle A, f \rangle$ consisting of a finite set $A$ and a unary function $f$ mapping this set into itself (see fig. 1). This unary function induces a binary relation $F$ defined by $F(x,y) \Leftrightarrow f(x) = y$.

Figure 1: Graphical representation of a structure $\mathcal{A} = \langle [29], f \rangle$ (where the unary function $f$ maps $\{1, 2, \dots, 29\}$ into itself).

In order to use generating functions (see the last section), we need to translate a decomposition property of structures into generating functions; this is done through the disjoint union. Let us consider two structures

\[ \mathcal{A} = \langle A, E_1^{\mathcal{A}}, \dots \rangle \quad\text{and}\quad \mathcal{B} = \langle B, E_1^{\mathcal{B}}, \dots \rangle. \]

If $A \cap B = \emptyset$ and each $E_i^{\mathcal{A}}$ has the same arity as $E_i^{\mathcal{B}}$, the disjoint union is defined as the structure whose domain is the union of the domains and whose relations are the unions of the corresponding relations:

\[ \mathcal{A} \uplus \mathcal{B} = \langle A \cup B, E_1^{\mathcal{A}} \cup E_1^{\mathcal{B}}, \dots \rangle. \]
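The definition of the disjoint union can be sketched in a few lines of Python. The encoding below (a `Structure` holding a domain and a tuple of `Relation` objects with an explicit arity) is an assumption made for illustration; none of these names come from the talk.

```python
from typing import NamedTuple


class Relation(NamedTuple):
    arity: int
    tuples: frozenset  # set of tuples over the domain


class Structure(NamedTuple):
    domain: frozenset
    relations: tuple  # tuple of Relation, in a fixed order E_1, E_2, ...


def disjoint_union(a: Structure, b: Structure) -> Structure:
    """Union of domains and of corresponding relations, as in the definition."""
    if a.domain & b.domain:
        raise ValueError("domains must be disjoint")
    if [r.arity for r in a.relations] != [r.arity for r in b.relations]:
        raise ValueError("corresponding relations must have the same arity")
    merged = tuple(
        Relation(ra.arity, ra.tuples | rb.tuples)
        for ra, rb in zip(a.relations, b.relations)
    )
    return Structure(a.domain | b.domain, merged)


# Two small graphs (hypothetical data): an edge on {1, 2} and a loop on {3}.
g = Structure(frozenset({1, 2}), (Relation(2, frozenset({(1, 2)})),))
h = Structure(frozenset({3}), (Relation(2, frozenset({(3, 3)})),))
u = disjoint_union(g, h)
```

Here `u` has domain {1, 2, 3} and a single binary relation containing both original edges.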

A class of structures has components if each structure can be uniquely decomposed into a disjoint union of structures (called component structures) from some component classes. For structures $\mathcal{A} = \langle [n], f \rangle$, where $[n]$ denotes $\{1, \dots, n\}$ and $f$ is a unary function, one can define component classes according to the size of the unique loop present in each connected component of the graph of $f$. From this point of view, the structure $\mathcal{A}$ of figure 1 has three components. The first component consists of two component structures in the first component class (the class corresponding to loops of size one, i.e. fixed points of $f$). The other two components each consist of a single component structure, in the component classes 2 and 7 respectively (classes being indexed by the size of the loop).
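To make the classification by loop size concrete, here is a minimal Python sketch that counts, for a unary function given as a dictionary, how many connected components have a loop of each size. The function name `loop_sizes` and the sample function are hypothetical; the function of figure 1 is not reproduced here.

```python
from collections import Counter


def loop_sizes(f):
    """Map each loop size to the number of connected components with that loop.

    f is a dict sending every element of a finite set to an element of the set.
    """
    n = len(f)
    seen_loops = set()
    sizes = Counter()
    for x in f:
        # After n steps we are guaranteed to sit on the loop of x's component.
        y = x
        for _ in range(n):
            y = f[y]
        # Walk once around the loop to collect its elements.
        loop = [y]
        z = f[y]
        while z != y:
            loop.append(z)
            z = f[z]
        representative = min(loop)
        if representative not in seen_loops:
            seen_loops.add(representative)
            sizes[len(loop)] += 1
    return sizes


# Hypothetical example on {1, ..., 6}: two fixed points and one loop of size 2.
f = {1: 1, 2: 1, 3: 3, 4: 5, 5: 4, 6: 4}
profile = loop_sizes(f)
```

For this sample function, two component structures fall in the class of loops of size one and one in the class of loops of size two.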
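Let us define the rank $r(\varphi)$ of a formula $\varphi$ of monadic second order logic (MSO logic for short) inductively by:

If $\varphi$ has no quantifiers, then $r(\varphi) = 0$;
If $\varphi$ is $\neg\sigma$, then $r(\varphi) = r(\sigma)$;
If $\varphi$ is obtained from $\sigma_1$, $\sigma_2$ by the application of a binary propositional connective (e.g., if $\varphi$ is $\sigma_1 \wedge \sigma_2$, $\sigma_1 \leftrightarrow \sigma_2$, etc.), then $r(\varphi) = \max\{r(\sigma_1), r(\sigma_2)\}$;
If $\varphi$ is of the form $\forall v\, \sigma$, $\exists v\, \sigma$, $\forall V\, \sigma$ or $\exists V\, \sigma$ for some element variable $v$ or set variable $V$, then $r(\varphi) = r(\sigma) + 1$.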
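The inductive definition of the rank translates directly into a recursive function. The small formula syntax tree below (`Atom`, `Not`, `Binary`, `Quantifier`) is a hypothetical encoding chosen for this sketch.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Atom:
    text: str  # a quantifier-free atomic formula, kept as plain text


@dataclass(frozen=True)
class Not:
    sub: "Formula"


@dataclass(frozen=True)
class Binary:
    op: str  # "and", "or", "iff", ...
    left: "Formula"
    right: "Formula"


@dataclass(frozen=True)
class Quantifier:
    kind: str  # "forall" or "exists"
    var: str   # element variable (v) or set variable (V)
    body: "Formula"


Formula = Union[Atom, Not, Binary, Quantifier]


def rank(phi):
    """Rank as defined above: only quantifiers increase it."""
    if isinstance(phi, Atom):
        return 0
    if isinstance(phi, Not):
        return rank(phi.sub)
    if isinstance(phi, Binary):
        return max(rank(phi.left), rank(phi.right))
    if isinstance(phi, Quantifier):
        return rank(phi.body) + 1
    raise TypeError(f"not a formula: {phi!r}")


# forall x exists y F(x, y)
phi = Quantifier("forall", "x", Quantifier("exists", "y", Atom("F(x,y)")))
# (not phi) and (exists X X(z))
psi = Binary("and", Not(phi), Quantifier("exists", "X", Atom("X(z)")))
```

Both `phi` and `psi` have rank 2: negation and conjunction leave the rank unchanged, and the set quantifier in `psi` contributes only rank 1.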

A sentence is a formula that has no free variables; it expresses a property of a structure.
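The key observation is that there are only finitely many inequivalent sentences $\xi_1, \dots, \xi_m$ of rank $r$. Hence every structure $\mathcal{A}$ satisfies exactly one of the sentences (also of rank $r$)

\[ \psi_1 = \xi_1 \wedge \cdots \wedge \xi_m, \quad \psi_2 = \neg\xi_1 \wedge \cdots \wedge \xi_m, \quad \dots, \quad \psi_{2^m} = \neg\xi_1 \wedge \cdots \wedge \neg\xi_m. \]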

Given a rank $r$ (and implicitly the sentences $\psi_1, \dots, \psi_{2^m}$), for each $i \in \{1, \dots, 2^m\}$ we define the class of structures which satisfy $\psi_i$. These classes can be viewed as equivalence classes of Fraïssé-Ehrenfeucht games.

2 Fraïssé-Ehrenfeucht Games

The goal is to see whether or not we can distinguish two structures in an $r$-move game. The game is played with two structures $\mathcal{A} = \langle A, E_1^{\mathcal{A}}, \dots \rangle$ and $\mathcal{B} = \langle B, E_1^{\mathcal{B}}, \dots \rangle$.

At move $i$, Spoil chooses $\mathcal{A}$ or $\mathcal{B}$ (let's say $\mathcal{B}$) and picks one of the following:
1. an element $b_i \in B$, or
2. a subset $B_i \subseteq B$.
Dupe responds on the other structure ($\mathcal{A}$ here), choosing one of the following:
1. an element $a_i \in A$, or
2. a subset $A_i \subseteq A$.

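Dupe wins if after $r$ moves the map $\{a_i, \dots\} \to \{b_i, \dots\}$ taking $a_i \mapsto b_i$ is an isomorphism of the induced substructures of $\langle \mathcal{A}, A_j, \dots \rangle$ and $\langle \mathcal{B}, B_j, \dots \rangle$ on these sets. We write

\[ \mathcal{A} \equiv_r \mathcal{B} \iff \text{Dupe has a winning strategy.} \]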
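For very small structures the game can be decided by exhaustive search. The sketch below is an illustration under stated assumptions, not a method from the talk: a structure is encoded as a pair (domain, set of edges) with a single binary relation, Dupe is assumed to answer an element with an element and a subset with a subset, and all function names are hypothetical. The search is exponential and only meant for domains of a few elements.

```python
from itertools import combinations


def subsets(domain):
    """All subsets of a finite domain, as frozensets."""
    items = sorted(domain)
    return [frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k)]


def consistent(A, B, elems, sets):
    """Is a_i -> b_i a partial isomorphism that also respects the chosen sets?"""
    (_, edges_a), (_, edges_b) = A, B
    for a1, b1 in elems:
        for a2, b2 in elems:
            if (a1 == a2) != (b1 == b2):
                return False
            if ((a1, a2) in edges_a) != ((b1, b2) in edges_b):
                return False
        for set_a, set_b in sets:
            if (a1 in set_a) != (b1 in set_b):
                return False
    return True


def dupe_wins(A, B, r, elems=(), sets=()):
    """True if Dupe has a winning strategy for the remaining r moves."""
    # A violated condition can never be repaired by later moves.
    if not consistent(A, B, elems, sets):
        return False
    if r == 0:
        return True
    dom_a, dom_b = A[0], B[0]
    sub_a, sub_b = subsets(dom_a), subsets(dom_b)

    def after_elem(a, b):
        return dupe_wins(A, B, r - 1, elems + ((a, b),), sets)

    def after_set(sa, sb):
        return dupe_wins(A, B, r - 1, elems, sets + ((sa, sb),))

    # Every possible move of Spoil must have at least one good reply.
    return (
        all(any(after_elem(a, b) for b in dom_b) for a in dom_a)
        and all(any(after_elem(a, b) for a in dom_a) for b in dom_b)
        and all(any(after_set(sa, sb) for sb in sub_b) for sa in sub_a)
        and all(any(after_set(sa, sb) for sa in sub_a) for sb in sub_b)
    )


# Two bare sets (no edges) with 2 and 3 elements.
two = (frozenset({0, 1}), frozenset())
three = (frozenset({0, 1, 2}), frozenset())
```

With these inputs, `dupe_wins(two, three, 2)` holds, while `dupe_wins(two, three, 3)` fails: in three moves Spoil can pick three distinct elements of the larger set, and Dupe cannot match them.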

Note that there are no draws: either Spoil or Dupe has a winning strategy. These games are the main tools for proving the following theorems:

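Theorem 1 For all structures $\mathcal{A}_1$, $\mathcal{A}_2$, $\mathcal{B}_1$, $\mathcal{B}_2$, one has

\[ \mathcal{A}_1 \equiv_r \mathcal{B}_1,\ \mathcal{A}_2 \equiv_r \mathcal{B}_2 \implies \mathcal{A}_1 \uplus \mathcal{A}_2 \equiv_r \mathcal{B}_1 \uplus \mathcal{B}_2. \]

Theorem 2 For all structures $\mathcal{A}$ and $\mathcal{B}$, one has

\[ \mathcal{A} \equiv_r \mathcal{B} \iff \text{there exists } i \text{ such that } \mathcal{A} \models \psi_i \text{ and } \mathcal{B} \models \psi_i, \]

where the sentences $\psi_i$ are defined in the first section.

Corollary 1 There are only finitely many $\equiv_r$ classes.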

Another problem consists in determining the $\equiv_r$ class of a given structure $\mathcal{A}$. It is solved once we know the number of component structures lying in each $\equiv_r$ component class (or color, if we think of $\equiv_r$ as a colouring). In figure 2, we have 4 component classes $C_1, \dots, C_4$ relative to the $\equiv_3$ relation (namely triangles, squares, cycles of odd length strictly greater than 3, and cycles of even length strictly greater than 4). The numbers of component structures of $\mathcal{A}$ in each component class are respectively $m_1 = 5$, $m_2 = 1$, $m_3 = 0$, $m_4 = 4$.

Figure 2: The component classes $C_1, \dots, C_4$ relative to $\equiv_3$ (left), the structure $\mathcal{A}$ and its four components (right).

3 Counting Structures with Components

We count either

the number $a_n$ of labelled structures with $n$ elements, or
the number $b_n$ of unlabelled structures with $n$ elements (which is also the number of nonisomorphic structures with $n$ elements).

Here we focus on counting labelled structures, so the exponential generating series

\[ a(x) = \sum_{n=0}^{\infty} \frac{a_n}{n!} x^n \]

will prove highly useful. Indeed, for a structure $\mathcal{A} = \mathcal{G} \uplus \mathcal{H}$, letting $a(x)$, $g(x)$ and $h(x)$ be the corresponding exponential generating series, we write

\[ a(x) = g(x)\, h(x) \quad\text{or}\quad a(x) = \frac{g(x)^2}{2!} \]

according to whether $\mathcal{G}$ and $\mathcal{H}$ are in different classes or in the same class. By induction, the exponential generating series associated to $\mathcal{A} = \mathcal{G}^{(1)} \uplus \cdots \uplus \mathcal{G}^{(m)}$, the disjoint union of $m$ structures $\mathcal{G}^{(1)}, \dots, \mathcal{G}^{(m)}$, is

\[ a(x) = g^{(1)}(x) \cdots g^{(m)}(x) \quad\text{or}\quad a(x) = \frac{g(x)^m}{m!} \]

respectively if $\mathcal{G}^{(1)}, \dots, \mathcal{G}^{(m)}$ are all from different classes or all in the same class. Hence the generating series $a(x)$ for structures with components in the component class $C$ is

\[ a(x) = 1 + c(x) + \frac{c(x)^2}{2!} + \cdots + \frac{c(x)^m}{m!} + \cdots = e^{c(x)}, \]

where $c(x) = \sum_n \frac{c_n}{n!} x^n$ ($c_n$ is the number of labelled structures in the component class $C$ with $n$ elements).
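The relation between $a(x)$ and $e^{c(x)}$ can be checked numerically for structures with one unary function, where $a_n = n^n$ counts all functions from $[n]$ to itself and $c_n$ counts those whose graph is connected. The sketch below counts connected functions by brute force for $n \le 5$ and exponentiates the resulting series with exact rationals; the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def is_connected(f):
    """f is a tuple with f[i] in range(n); test connectivity of its graph."""
    n = len(f)
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in enumerate(f):
        parent[find(i)] = find(j)
    return len({find(i) for i in range(n)}) == 1


def count_connected(n):
    """Number c_n of connected unary functions on n labelled points."""
    return sum(is_connected(f) for f in product(range(n), repeat=n))


def exp_series(c, N):
    """Coefficients of exp(c(x)) up to x^N, assuming c[0] == 0.

    Uses a'(x) = c'(x) a(x), i.e. n a_n = sum_{k=1..n} k c_k a_{n-k}.
    """
    a = [Fraction(1)] + [Fraction(0)] * N
    for n in range(1, N + 1):
        a[n] = sum(k * c[k] * a[n - k] for k in range(1, n + 1)) / n
    return a


N = 5
c = [Fraction(0)] + [Fraction(count_connected(n), factorial(n))
                     for n in range(1, N + 1)]
a = exp_series(c, N)
labelled_counts = [int(a[n] * factorial(n)) for n in range(N + 1)]
```

The list `labelled_counts` should then agree with $n^n$ for $n = 0, \dots, 5$, as the exponential formula predicts.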
There is a connection with monadic second order logic due to Compton [2]. Let us consider the component classes (relative to $\equiv_r$) $C_1, \dots, C_k$ (so that the generating series for the whole component class is $c(x) = \sum_{i=1}^{k} c_i(x)$). There is a unique $k$-tuple $(m_1, \dots, m_k)$ associated to each structure $\mathcal{A}$, where $m_i$ is the number of component structures of $\mathcal{A}$ lying in the $i$-th component class $C_i$. Moreover, there is an integer $R = R(r)$ such that for any two structures $\mathcal{A}$ and $\mathcal{B}$ (with $k$-tuples $(m_1, \dots, m_k)$ and $(n_1, \dots, n_k)$), if for all $i \in \{1, \dots, k\}$ either $m_i = n_i$ or $m_i, n_i \ge R$, then $\mathcal{A} \equiv_r \mathcal{B}$ (plainly speaking, once a component class contains too many component structures, their exact number can no longer be used to distinguish structures). Hence for a sentence $\varphi$ of rank $r$, whether $\mathcal{A} \models \varphi$ depends only on $(m_1, \dots, m_k)$, where each $m_i \in \{0, 1, \dots, R-1, \infty\}$ is the number of components in $C_i$ ($\infty$ stands for any number at least $R = R(r)$). Considering the exponential generating series $a_\varphi(x) = \sum_n \frac{a_n^\varphi}{n!} x^n$, where $a_n^\varphi$ is the number of labelled structures with $n$ elements satisfying $\varphi$, we can write

\[ a_\varphi(x) = \sum_{(m_1, \dots, m_k) \in S} \frac{c_1(x)^{m_1}}{m_1!} \cdots \frac{c_k(x)^{m_k}}{m_k!} \]

where $S$ is finite and $c_i(x)^\infty/\infty!$ denotes $\sum_{m=R}^{\infty} c_i(x)^m/m! = e^{c_i(x)} - \sum_{m=0}^{R-1} c_i(x)^m/m!$. The series $a_\varphi(x)$ is a finite sum of very similar terms. It is enough just to consider a series of the form

\[ a_\varphi(x) = \frac{c_1(x)^{m_1}}{m_1!} \cdots \frac{c_t(x)^{m_t}}{m_t!}\, e^{c_{t+1}(x)} \cdots e^{c_k(x)}. \]

This formula means that a structure $\mathcal{A}$ satisfying $\varphi$ has exactly $m_i$ components in the class $C_i$ for $i \in \{1, \dots, t\}$ and any number of components in the other classes. We want to know $a_n^\varphi$ or equivalently $\mu_n(\varphi) = a_n^\varphi / a_n$, the fraction of structures of size $n$ satisfying $\varphi$. We are also interested in the asymptotic probability $\mu(\varphi) = \lim_{n \to \infty} \mu_n(\varphi)$, when this limit exists.
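The bookkeeping with count tuples used in this section can be sketched as follows. Counts of component structures per class are truncated at a threshold, and two structures whose truncated tuples agree are equivalent by the sufficient condition stated earlier. The value 3 used for the threshold here is purely hypothetical, since the dependence of $R$ on $r$ is not given in this summary, and the function names are illustrative only.

```python
INFINITY = float("inf")


def truncate(counts, R):
    """Replace every count that is at least R by the marker infinity."""
    return tuple(m if m < R else INFINITY for m in counts)


def same_truncated_profile(ms, ns, R):
    """For every i: m_i == n_i, or both m_i and n_i are at least R."""
    return truncate(ms, R) == truncate(ns, R)


R = 3  # hypothetical threshold, not taken from the talk
figure_2_counts = (5, 1, 0, 4)  # (m_1, ..., m_4) as read off figure 2
profile = truncate(figure_2_counts, R)
```

With this hypothetical threshold, the tuple (5, 1, 0, 4) truncates to (∞, 1, 0, ∞), so a structure with counts (7, 1, 0, 3) has the same profile, while one with counts (5, 2, 0, 4) does not.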
Compton's idea is to use partial converses (Tauberian lemmas) to get limit laws for $\mu_n$. Here is a sample theorem whose proof is based on such lemmas.

Theorem 3 For any class with components, if $a_n/n! \sim C\, t^n/n^{\alpha}$ and $c_n/n! = O(t^n/n)$ (with $\alpha > -1$), then $\mu(\varphi) = \lim_{n \to \infty} \mu_n(\varphi)$ exists for all MSO sentences $\varphi$ and is equal to $a_\varphi(\rho)/a(\rho)$.

Due to known results about $a_n$ and $c_n$ for structures with one unary function, we also have

Corollary 2 The asymptotic probability $\mu(\varphi)$ always exists for structures with one unary function.
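As a small illustration of such an asymptotic probability, consider the sentence "f has no fixed point" on structures with one unary function. The sketch below computes the exact fraction of functions on $n$ points satisfying it by brute force for small $n$. The property and the helper names are chosen for this example and do not come from the talk.

```python
from fractions import Fraction
from itertools import product


def mu_n(n, prop):
    """Exact fraction of the n**n unary functions on range(n) satisfying prop."""
    hits = sum(1 for f in product(range(n), repeat=n) if prop(f))
    return Fraction(hits, n ** n)


def no_fixed_point(f):
    """The sentence 'there is no x with f(x) = x', checked on a tuple f."""
    return all(f[x] != x for x in range(len(f)))


# Pairs (n, fraction of functions without fixed point) for n = 1, ..., 6.
table = [(n, mu_n(n, no_fixed_point)) for n in range(1, 7)]
```

Each value $f(x)$ must avoid exactly one of the $n$ points, independently for each $x$, so the fraction equals $(1 - 1/n)^n$, which tends to $1/e$ as $n$ grows.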

References

[1]: Compton (Kevin J.). -- Application of a Tauberian theorem to finite model theory. Archiv für Mathematische Logik und Grundlagenforschung, vol. 25, n°1-2, 1985, pp. 91--98.
[2]: Compton (Kevin J.). -- A logical approach to asymptotic combinatorics. II. Monadic second-order properties. Journal of Combinatorial Theory. Series A, vol. 50, n°1, 1989, pp. 110--131.
[3]: Fagin (Ronald). -- Probabilities on finite models. Journal of Symbolic Logic, vol. 41, n°1, 1976, pp. 50--58.
[4]: Glebskii (Ju. V.), Kogan (D. I.), Liogon'kii (M. I.), and Talanov (V. A.). -- Volume and fraction of satisfiability of formulas of the lower predicate calculus. Kibernetika (Kiev), vol. 5, n°2, 1969, pp. 17--27. -- English translation Cybernetics, vol. 5 (1972), pp. 142--154.
[5]: Woods (Alan). -- Counting finite models. The Journal of Symbolic Logic, vol. 62, n°3, September 1997, pp. 925--949.
