ISB Home



- Article -





Volume 6


Full article

In Silico Biology 6, 0027 (2006); ©2006, Bioinformation Systems e.V.  



AMIGOS: a method for the inspection of genomic organisation or structure and its application to characterise conserved gene arrangements

Rainer Merkl

Institut für Biophysik und Physikalische Biochemie, Universität Regensburg, Universitätsstrasse 31, D-93053 Regensburg


   Email: Rainer.Merkl@biologie.uni-regensburg.de


Edited by E. Wingender; received February 16, 2006; revised April 03/May 11, 2006; accepted May 11, 2006; published June 17, 2006


Abstract

In order to identify and to characterise gene clusters conserved in microbial genomes, the algorithm AMIGOS was developed. It is based on a categorisation of genes using a predefined set of gene functions (GFs). After the categorisation of all genes of a genome and based on their location on a replicon, distances between GFs were determined and stored in genome-specific matrices. These matrices were used to identify GF clusters like those strictly conserved in 13 archeal, in 47 bacterial genomes and in the combination of the sets. Within the combined set of these 60 microbial genomes, there exist only two strictly conserved clusters harbouring two ribosomal genes each, namely those for L4, L23 and L22, L29.

In order to characterise less strictly conserved GF clusters, content of genomes i. e. matrices were analyzed pairwise. Resulting clusters were merged to (meta-) clusters if their content overlapped. A scoring system named consCL was developed. It quantifies conservedness of cluster membership for individual GFs.

For the genome of Escherichia coli it was shown that a grouping of cluster elements on consCL values dissected the clusters into smaller sets. These sets were frequently overlapped by known transcriptional units (TUs). This finding justifies the usage of consCL scores to predict TU membership of genes. In addition, consCL values provide a sound basis for non-homologous gene annotation. Based on consCL values, examples of conserved clusters containing annotated genes and single ones with unknown function are given.


Keywords: non-homologous gene annotation, functional coupling, conserved gene clusters, operon prediction