Identify regulatory sites using neighbourhood species,

Abstract

The annotation of transcription binding sites in new sequenced genomes is an important and challenging problem. We have previously shown how a regression model that linearly relates gene expression levels to the matching scores of nucleotide patterns allows us to identify DNA-binding sites from a collection of co-regulated genes and their nearby non-coding DNA sequences. Our methodology uses Bayesian models and stochastic search techniques to select transcription factor binding site candidates. Here we show that this methodology allows us to identify binding sites in nearby species. We present examples of annotation crossing from Schizosaccharomyces pombe to Schizosaccharomyces japonicus. We found that the eng1 motif is also regulating a set of 9 genes in S. japonicus. Our framework may have an effective interest in conveying information in the annotation process of a new species. Finally we discuss a number of statistical and biological issues related to the identification of binding sites through covariates of genes expression and sequences.

Anno

2007

Autori IAC

ITALIA DE FEIS

CLAUDIA ANGELINI

Tipo pubblicazione

Articolo in rivista

Altri Autori

Angelini C., Cutillo L., De Feis I., Lio; P., van der Wath R.

Editore

Springer

Rivista