Abstract
Background: At present five evolutionary hypotheses have been proposed to explain the great variability of the
genomic GC content among and within genomes: the mutational bias, the biased gene conversion, the DNA
breakpoints distribution, the thermal stability and the metabolic rate. Several studies carried out on bacteria and
teleostean fish pointed towards the critical role played by the environment on the metabolic rate in shaping the
base composition of genomes. In mammals the debate is still open, and evidences have been produced in favor
of each evolutionary hypothesis. Human genes were assigned to three large functional categories (as well as to the
corresponding functional classes) according to the KOG database: (i) information storage and processing, (ii) cellular
processes and signaling, and (iii) metabolism. The classification was extended to the organisms so far analyzed
performing a reciprocal Blastp and selecting the best reciprocal hit. The base composition was calculated for each
sequence of the whole CDS dataset.
Results: The GC3 level of the above functional categories was increasing from (i) to (iii). This specific compositional
pattern was found, as footprint, in all mammalian genomes, but not in frog and lizard ones. Comparative analysis
of human versus both frog and lizard functional categories showed that genes involved in the metabolic processes
underwent the highest GC3 increment. Analyzing the KOG functional classes of genes, again a well defined intragenomic
pattern was found in all mammals. Not only genes of metabolic pathways, but also genes involved in
chromatin structure and dynamics, transcription, signal transduction mechanisms and cytoskeleton, showed an
average GC3 level higher than that of the whole genome. In the case of the human genome, the genes of the
aforementioned functional categories showed a high probability to be associated with the chromosomal bands.
Conclusions: In the light of different evolutionary hypotheses proposed so far, and contributing with different
potential to the genome compositional heterogeneity of mammalian genomes, the one based on the metabolic
rate seems to play not a minor role. Keeping in mind similar results reported in bacteria and in teleosts, the
specific compositional patterns observed in mammals highlight metabolic rate as unifying factor that fits over a
wide range of living organisms.
Anno
2012
Autori IAC
Tipo pubblicazione
Altri Autori
Luisa Bern, Ankita Chaurasia, Claudia Angelini, Concetta Federico, Salvatore Saccone and Giuseppe D;Onofrio
Editore
BioMed Central,
Rivista
BMC genomics