Graph Colouring via the Probabilistic Method


Bruce Reed

Équipe Combinatoire CNRS
Université Pierre et Marie Curie, Paris, France

Algorithms Seminar

April 21, 1997

[summary by François Morain and Philippe Robert]

1 Introduction

Colouring a graph with the minimum number of colours is a classical problem in graph theory and has many applications. For instance, think of a cellular phone network in which each vertex (phone) must use a frequency different from those of its neighbours. This problem is also known to be a difficult one (see for instance [3]).

The purpose of the talk is to present a naive algorithm for colouring a certain class of graphs and to explain how to analyse it with elementary probabilistic tools, which we describe first.

2 Probabilistic tools

Throughout this section we denote by Pr(A) the probability of the event A, E(X) the expected value of the random variable X, and E(X/A₁,...,A_n) the conditional expectation of X relative to the events A₁,...,A_n.

2.1 The Lovász Local lemma

Suppose that on some probability space Ω there are n undesirable events A₁,...,A_n. We wish to know whether there is a positive probability of avoiding all of them, i.e., whether there is a positive lower bound for the quantity

\[ \Delta = \Pr\Bigl(\bigcap_{i=1}^{n} A_i^c\Bigr), \]

where A_i^c = Ω − A_i. If the events are independent, that is, if for any k-tuple 1 ≤ i₁ < ... < i_k ≤ n,

\[ \Pr\Bigl(\bigcap_{j=1}^{k} A_{i_j}\Bigr) = \prod_{j=1}^{k} \Pr(A_{i_j}), \]

then

\[ \Delta = \prod_{i=1}^{n} \bigl(1-\Pr(A_i)\bigr). \]

The problem is that in practice the events are not always completely independent but only weakly independent, in the sense that for each i there exists a subset V_i ⊂ {1,...,n} such that A_i is independent of the events A_j, j ∈ V_i^c. In other words, A_i may depend on A_j only when j lies in the "neighbourhood" V_i of i. If the cardinality of the V_i's is small, one might expect an estimate close to the one we saw for the independent case. This is the conclusion of the Lovász Local Lemma, see [1].

Lemma 1 If the events are such that, for all 1 ≤ i ≤ n,

Pr(A_i) ≤ p,
A_i is independent of the family (A_j)_{j ∉ V_i},
|V_i| ≤ d,

and if ep(d+1) < 1, then with positive probability none of the events A_i, i=1,...,n, occurs.
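As a small numerical illustration of the hypothesis of Lemma 1, the following Python sketch checks the condition ep(d+1) < 1 and searches for the largest neighbourhood size d that a given bound p tolerates. The function names `local_lemma_applies` and `max_dependency_degree` are hypothetical and chosen only for this example.

```python
import math


def local_lemma_applies(p: float, d: int) -> bool:
    """Return True when e * p * (d + 1) < 1, the hypothesis of Lemma 1."""
    return math.e * p * (d + 1) < 1


def max_dependency_degree(p: float) -> int:
    """Largest d >= 0 with e * p * (d + 1) < 1, or -1 if even d = 0 fails.

    A plain linear search: fine for illustration, slow for very small p.
    """
    d = -1
    while local_lemma_applies(p, d + 1):
        d += 1
    return d


if __name__ == "__main__":
    # With each bad event of probability at most 1%, how many
    # dependent neighbours can each event have?
    print(max_dependency_degree(0.01))
```

For p = 0.01 the search stops at d = 35, since e · 0.01 · 36 < 1 while e · 0.01 · 37 > 1.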

2.2 Azuma's inequality

Let (Y_i) be a sequence of independent random variables with the same distribution on {0,1}, with p = Pr(Y_i=1); the outcomes of successive coin tosses are a good model for such a sequence. It is well known that the time averages (1/n) Σ_{i=1}^n Y_i converge exponentially fast to p as n → +∞. Rigorously, this is Chernoff's bound

\[ \Pr\Bigl(\Bigl|\sum_{i=1}^{n} (Y_i - p)\Bigr| > a\Bigr) < 2e^{-a^{2}/(3np)}. \]

It says basically that, with high probability, [p/a²] coin tosses are sufficient to estimate p with an accuracy of the order of a. This kind of result has been extended, for independent variables, to the case of arbitrary distributions, i.e., not only with values in {0,1}, as long as they have an exponential moment. This is a part of large deviations theory, see [2].
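To get a feel for Chernoff's bound, one can compare it with a simulated tail probability. The sketch below is only an illustration: it assumes the bound in the form 2·exp(−a²/(3np)) for the deviation of Σ Y_i from np (a form usually stated for 0 < a ≤ np), and the function names and parameter values are hypothetical.

```python
import math
import random


def chernoff_bound(n: int, p: float, a: float) -> float:
    """Upper bound 2 * exp(-a^2 / (3 n p)) on Pr(|sum Y_i - n p| > a)."""
    return 2 * math.exp(-a * a / (3 * n * p))


def empirical_tail(n: int, p: float, a: float, trials: int, seed: int = 0) -> float:
    """Estimate Pr(|sum Y_i - n p| > a) by simulating `trials` runs of n tosses."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        total = sum(1 for _ in range(n) if rng.random() < p)
        if abs(total - n * p) > a:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    n, p, a = 200, 0.5, 20
    print("simulated tail:", empirical_tail(n, p, a, trials=2000))
    print("Chernoff bound:", chernoff_bound(n, p, a))
```

In such experiments the simulated tail is typically far below the bound, which is expected: the bound holds for all n and p and is not tight for any particular choice.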

Another possible generalization is to consider, instead of a sum of independent variables, some functional X of arbitrary random variables Y₁,...,Y_n with values in {0,1}. Azuma's inequality says that if the conditional expectations of X with respect to Y₁,...,Y_i do not jump sharply as i goes from 1 to n, then X is concentrated around its average value. Formally,

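Proposition 1 If for each i ≤ n,

\[ \max_{y_1,\dots,y_{i+1}\in\{0,1\}} \bigl| E(X / Y_1=y_1,\dots,Y_i=y_i,Y_{i+1}=y_{i+1}) - E(X / Y_1=y_1,\dots,Y_i=y_i) \bigr| \le c_i, \qquad (1) \]

then

\[ \Pr\bigl(|X-E(X)|>a\bigr) \le 2\exp\Bigl(-\frac{a^{2}}{2\sum_{i} c_i^{2}}\Bigr). \]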

Azuma's inequality is surprisingly sharp considering the weak hypotheses of the proposition. In the independent case, for X = Σ_{i=1}^n Y_i, condition (1) is satisfied with c_i = 1, hence the inequality is in this case

\[ \Pr\Bigl(\Bigl|\sum_{i=1}^{n} (Y_i - p)\Bigr| > a\Bigr) < 2e^{-a^{2}/(2n)}, \]

which is very close to Chernoff's bound.
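The closeness of the two bounds can be checked numerically. The following sketch assumes the usual statements of the inequalities, namely 2·exp(−a²/(2 Σ c_i²)) for Azuma and 2·exp(−a²/(3np)) for Chernoff; the function names are hypothetical.

```python
import math
from typing import Sequence


def azuma_bound(c: Sequence[float], a: float) -> float:
    """Assumed form of Azuma's bound: 2 * exp(-a^2 / (2 * sum of c_i^2))."""
    return 2 * math.exp(-a * a / (2 * sum(ci * ci for ci in c)))


def chernoff_bound(n: int, p: float, a: float) -> float:
    """Assumed form of Chernoff's bound: 2 * exp(-a^2 / (3 n p))."""
    return 2 * math.exp(-a * a / (3 * n * p))


if __name__ == "__main__":
    n, p, a = 100, 0.5, 20
    # Independent case: every c_i equals 1.
    print("Azuma:   ", azuma_bound([1.0] * n, a))
    print("Chernoff:", chernoff_bound(n, p, a))
```

With n = 100, p = 1/2 and a = 20, Azuma gives 2e^{-2} ≈ 0.27 and Chernoff gives 2e^{-8/3} ≈ 0.14. The two exponents differ only by the constant factor 3p/2, and they coincide when p = 2/3.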

3 Graph colouring

A colouring of a graph G assigns a colour to each vertex so that every pair of adjacent vertices receives different colours. The chromatic number of G, denoted χ(G), is the minimum number of colours required to colour G. It is easy to see that, if Δ(G) denotes the maximum degree of G, then χ(G) ≤ Δ(G)+1.
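The bound of Δ(G)+1 colours follows from a greedy argument: when a vertex is coloured, its at most Δ(G) neighbours block at most Δ(G) colours, so one of Δ(G)+1 colours is always free. A minimal Python sketch of this argument follows, assuming the graph is given as a dictionary mapping each vertex to the set of its neighbours (a representation chosen here for illustration).

```python
from typing import Dict, Hashable, Set


def greedy_colouring(adj: Dict[Hashable, Set[Hashable]]) -> Dict[Hashable, int]:
    """Colour vertices one at a time with the smallest colour unused by neighbours.

    Colours are the integers 0, 1, 2, ...; since a vertex has at most
    Delta neighbours, the colour chosen never exceeds Delta.
    """
    colour: Dict[Hashable, int] = {}
    for v in adj:
        used = {colour[u] for u in adj[v] if u in colour}
        c = 0
        while c in used:
            c += 1
        colour[v] = c
    return colour


if __name__ == "__main__":
    # A cycle on five vertices: maximum degree 2, chromatic number 3.
    c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
    print(greedy_colouring(c5))
```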

We can obtain good bounds for χ(G) for certain types of graphs, as explained in [4]. For fixed ε > 0, we say that a vertex v is ε-sparse if the subgraph induced by N_v, the neighbourhood of v, has at most (1−ε) Δ(Δ−1)/2 edges. A graph is ε-sparse if each of its vertices is ε-sparse.
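The sparseness condition is easy to test directly. Here is a short sketch, again assuming an adjacency dictionary of neighbour sets; the helper names are hypothetical, and Δ is taken to be the maximum degree of the input graph.

```python
from itertools import combinations
from typing import Dict, Hashable, Set

Graph = Dict[Hashable, Set[Hashable]]


def edges_in_neighbourhood(adj: Graph, v: Hashable) -> int:
    """Number of edges of the subgraph induced by the neighbourhood N_v."""
    return sum(1 for u, w in combinations(adj[v], 2) if w in adj[u])


def is_sparse_vertex(adj: Graph, v: Hashable, eps: float, max_degree: int) -> bool:
    """True if N_v spans at most (1 - eps) * Delta * (Delta - 1) / 2 edges."""
    limit = (1 - eps) * max_degree * (max_degree - 1) / 2
    return edges_in_neighbourhood(adj, v) <= limit


def is_sparse_graph(adj: Graph, eps: float) -> bool:
    """True if every vertex of the graph is eps-sparse."""
    max_degree = max(len(nb) for nb in adj.values())
    return all(is_sparse_vertex(adj, v, eps, max_degree) for v in adj)


if __name__ == "__main__":
    k4 = {i: set(range(4)) - {i} for i in range(4)}
    c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
    print(is_sparse_graph(k4, 0.5), is_sparse_graph(c5, 0.5))
```

In a complete graph every neighbourhood is a clique, so it is not ε-sparse for any ε > 0, whereas in a cycle on five vertices the neighbourhoods span no edges at all.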

Theorem 1 For Δ sufficiently large, if G has maximum degree Δ and G is ε-sparse, then χ(G) ≤ (1 − ε/(2e⁶)) Δ, where e denotes the base of the natural logarithm.

Let us indicate a rough proof of this theorem. In a first step, we construct a partial colouring C of G such that, for each vertex v, the number of coloured neighbours of v exceeds the number of colours appearing on N_v by at least (ε/(2e⁶))Δ + 1.

From this, we extend C to a (1 − ε/(2e⁶))Δ-colouring of G in a greedy manner: we colour the remaining vertices one at a time. When we come to colour v, there must be an available colour: since v has at most Δ neighbours (this is where the sparseness comes in), the number of colours appearing in N_v is bounded by

\[ \Delta - \Bigl(\frac{\varepsilon}{2e^{6}}\,\Delta + 1\Bigr). \]

Hence fewer than (1 − ε/(2e⁶))Δ colours appear in its neighbourhood.

Let us come back to the construction of C. We first assign each vertex of G a uniformly random colour from {1, 2, ..., ⌈Δ/2⌉}. If two adjacent vertices have the same colour, we uncolour both of them. The resulting partial colouring is C.
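The construction of C just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the graph is an adjacency dictionary of neighbour sets with at least one edge, and the name `random_partial_colouring` and the `seed` parameter are hypothetical.

```python
import math
import random
from typing import Dict, Hashable, Optional, Set


def random_partial_colouring(
    adj: Dict[Hashable, Set[Hashable]], seed: Optional[int] = None
) -> Dict[Hashable, int]:
    """Random colours from {1, ..., ceil(Delta / 2)}, then uncolour conflicts.

    A vertex keeps its colour only if no neighbour drew the same colour,
    so both endpoints of every monochromatic edge end up uncoloured.
    Assumes the graph has at least one edge, so that the palette is non-empty.
    """
    rng = random.Random(seed)
    max_degree = max(len(nb) for nb in adj.values())
    palette = math.ceil(max_degree / 2)
    tentative = {v: rng.randint(1, palette) for v in adj}
    return {
        v: c
        for v, c in tentative.items()
        if all(tentative[u] != c for u in adj[v])
    }


if __name__ == "__main__":
    # Circulant graph on 20 vertices, each joined to its 6 nearest vertices.
    adj = {i: {(i + k) % 20 for k in (-3, -2, -1, 1, 2, 3)} for i in range(20)}
    print(random_partial_colouring(adj, seed=1))
```

Vertices absent from the returned dictionary are the uncoloured ones; by construction the returned partial colouring is proper.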

The first thing to show is that C is not too small, which is rather easy. Then we must study, for each vertex v, the random variable Z_v which counts the number of pairs of vertices in N_v which have the same colour in C. It can be shown that, since G is sparse, the expectation of Z_v is greater than εΔ/e⁴.
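For a given partial colouring, the quantity Z_v is straightforward to compute. The sketch below assumes that the partial colouring is stored as a dictionary containing only the coloured vertices; the function name `repeated_colour_pairs` is hypothetical.

```python
from itertools import combinations
from typing import Dict, Hashable, Set


def repeated_colour_pairs(
    adj: Dict[Hashable, Set[Hashable]],
    colouring: Dict[Hashable, int],
    v: Hashable,
) -> int:
    """Z_v: number of pairs of coloured neighbours of v sharing a colour."""
    return sum(
        1
        for u, w in combinations(adj[v], 2)
        if u in colouring and w in colouring and colouring[u] == colouring[w]
    )


if __name__ == "__main__":
    # A star: centre 0 with leaves 1..4; leaf 4 is left uncoloured.
    star = {0: {1, 2, 3, 4}, 1: {0}, 2: {0}, 3: {0}, 4: {0}}
    print(repeated_colour_pairs(star, {1: 1, 2: 1, 3: 2}, 0))
```

Each pair counted by Z_v saves one colour on N_v: the more repeated colours, the fewer distinct colours appear among the coloured neighbours of v.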

Now that we have proved that many vertices in N_v are coloured, we must show that Z_v does not differ too much from its expected value. Once this is done, we use the Local Lemma to prove that every vertex will have such a property, thus proving the property on C. By a technical argument replacing Z_v with a more amenable quantity, Azuma's Inequality is used to prove the assumption on Z_v. Roughly speaking, the idea is that a colouring of v should not influence the colouring of the other parts of C, since G is sparse.

References

[1]: Alon (Noga) and Spencer (Joel H.). -- The probabilistic method. -- John Wiley & Sons Inc., New York, 1992, Wiley-Interscience Series in Discrete Mathematics and Optimization, xvi+254p.
[2]: Dembo (Amir) and Zeitouni (Ofer). -- Large deviations techniques and applications. -- Jones and Bartlett Publishers, Boston, MA, 1993, xiv+346p.
[3]: Karp (Richard M.). -- Reducibility among combinatorial problems. In Complexity of Computer Computations. pp. 85--103. -- New York, 1972. Proceedings of a Symposium held at IBM Thomas J. Watson Research Center, Yorktown Heights, N.Y., 1972.
[4]: Molloy (M.) and Reed (B.). -- Graph colouring via the probabilistic method. -- April 1997. Preprint.
