Abstract
The annotation of transcription binding sites in new sequenced
genomes is an important and challenging problem. We have
previously shown how a regression model that linearly relates gene expression
levels to the matching scores of nucleotide patterns allows us
to identify DNA-binding sites from a collection of co-regulated genes
and their nearby non-coding DNA sequences. Our methodology uses
Bayesian models and stochastic search techniques to select transcription
factor binding site candidates. Here we show that this methodology
allows us to identify binding sites in nearby species. We present examples
of annotation crossing from Schizosaccharomyces pombe to Schizosaccharomyces
japonicus. We found that the eng1 motif is also regulating a set
of 9 genes in S. japonicus. Our framework may have an effective interest
in conveying information in the annotation process of a new species. Finally
we discuss a number of statistical and biological issues related to
the identification of binding sites through covariates of genes expression
and sequences.
Anno
2007
Autori IAC
Tipo pubblicazione
Altri Autori
Angelini C., Cutillo L., De Feis I., Lio; P., van der Wath R.
Editore
Springer
Rivista
Lecture notes in computer science