ALCOMFT-TR-03-99
Magali Lescot and Mireille Régnier
Motif statistics on plants datasets
INRIA.
Work package 5.
November 2003.
Abstract: Mathematical expressions to estimate the probabilities of
rare or frequent words in genomic texts have recently been obtained
by different authors. We present a few extensions for the p-value
computation in the Markov probability model: double-strand counting
and some degenerate consensus sequences. We compare our results with
those obtained by RSA-tools on some plant datasets. We exhibit the
co-occurrence of the G-box, the I-box and a third motif, very close
to the Ry-element, also called the Sph Box.
Postscript file: ALCOMFT-TR-03-99.ps.gz (73 kb).