We present a few enumeration problems that arose in computational biology. We point out on several examples how symbolic enumeration methods allow for simplifying and extending previous results. We also present some asymptotics.
W T W T W T W
------||--|-------|--||------||--|--------|--||------||--|-------|-||-----
h'1 h'1 h'2 h'2 h'k h'k
where W is the set of of non-paired subsequences, and
T[h,b] the set of secondary structures that do not start or
end with a pairing. Otherwise, such a pairing would be part of the external
helix.
In this scheme, we assume there exist k external helices of sizes
h'1 ,... ,h'k. Two external
helices are separated by non-paired positions, which is coded by language
W. We get the set decomposition:
where A[h] is the set of couples of subsequences of the same
size h' ³ h.
One can prove a simple relation between T[h,b] and
S[h,b]. Namely:
where Y[b] counts sequences of length smaller than b (no structure is possible). I.e. Y[b](z) =åi=0b-1 zi.
We plug (3) into (2) and, applying translation rules, we get
(1) after simplification. | C- |
| -G |
| -C |
| G- |
| C- |
| -G |
| -C |
| G- |
| C- |
| -G |
| -C |
| G- |
| n+m |
| n |
| D1 (t) | =(1-t)2 -4t(1-t+tb) |
| D 2 (t) | = (1-t2+tb+1)-4t(1-t+tb) . |
| F(t) = |
|
f(n) t n =[z10] f(z1,t/z1) . |
| F(t) = |
|
ó õ |
|
dz1 = |
|
ó õ |
|
dz1 |
This document was translated from LATEX by HEVEA.