ALCOMFT-TR-03-99

ALCOM-FT
 

Magali Lescot and Mireille Régnier
Motif statistics on plants datasets
INRIA. Work package 5. November 2003.
Abstract: Mathematical expressions to estimate the probabilities of rare or frequent words in genomic texts. have recently been obtained by different authors. We present a few extensions for the p-value computation, in the Markov probability model: double strand counting, some degenerated consensus. We compare with the results obtained by RSA-tools on some plant datasets. We will exhibit the co-occurrence of the G-box, the I-box and a third motif, very close the Ry-element, also called the Sph Box.
Postscript file: ALCOMFT-TR-03-99.ps.gz (73 kb).

System maintainer Gerth Stølting Brodal <gerth@cs.au.dk>