We present a few enumeration problems that arose in computational biology. We point out on several examples how symbolic enumeration methods allow for simplifying and extending previous results. We also present some asymptotics.
   W          T          W            T          W           T         W
------||--|-------|--||------||--|--------|--||------||--|-------|-||-----
        h'1        h'1         h'2         h'2        h'k         h'k
       
where  W is the set of of non-paired subsequences, and 
 T[h,b] the set of secondary structures that do not start or 
end with a pairing. Otherwise, such a pairing would be part of the external 
helix. 
In this scheme, we assume there exist k external helices of sizes 
h'1 ,... ,h'k. Two external 
helices are separated by non-paired positions, which is coded by language 
 W. We get the set decomposition:
where  A[h] is the set of couples of subsequences of the same 
size h' ³ h. 
One can prove a simple relation between  T[h,b] and 
 S[h,b]. Namely:
where  Y[b] counts sequences of length smaller than b (no structure is possible). I.e. Y[b](z) =åi=0b-1 zi. 
We plug (3) into (2) and, applying translation rules, we get 
(1) after simplification. | C- | 
| -G | 
| -C | 
| G- | 
| C- | 
| -G | 
| -C | 
| G- | 
| C- | 
| -G | 
| -C | 
| G- | 
| n+m | 
| n | 
| D1 (t) | =(1-t)2 -4t(1-t+tb) | 
| D 2 (t) | = (1-t2+tb+1)-4t(1-t+tb) . | 
| F(t) = | 
 | f(n) t n =[z10] f(z1,t/z1) . | 
| F(t) = | 
 | ó õ | 
 | dz1 = | 
 | ó õ | 
 | dz1 | 
This document was translated from LATEX by HEVEA.