ISB Home



- Article -





Volume 2

Special Issue
GCB'01



Full article

In Silico Biology 2, 0018 (2002); ©2002, Bioinformation Systems e.V.  



AGenDA: Gene prediction by comparative sequence analysis

Oliver Rinner1,2,3 and Burkhard Morgenstern1,4

1GSF Research Center, MIPS / Institute of Bioinformatics, Ingolstädter Landstraße 1, 85764 Neuherberg, Germany
2Physiologisch-Chemisches Institut, Universität Tübingen, Hoppe-Seyler-Str. 4, 72076 Tübingen, Germany
3current address: Brain Research Institute, ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
4current address: International Graduate School for Bioinformatics and Genome Research, University of Bielefeld, Postfach 100131, 33501 Bielefeld, Germany


Edited by E. Wingender; received November 30, 2001; revised and accepted January 7, 2002; published March 15, 2002


Abstract

Comparative sequence analysis is a powerful approach to identify functional elements in genomic sequences. Herein, we describe AGenDA (Alignment-based GENe Detection Algorithm), a novel method for gene prediction that is based on long-range alignment of syntenic regions in eukaryotic genome sequences. Local sequence homologies identified by the DIALIGN program are searched for conserved splice signals to define potential protein-coding exons; these candidate exons are then used to assemble complete gene structures. The performance of our method was tested on a set of 105 human-mouse sequence pairs. These test runs showed that sensitivity and specificity of AGenDA are comparable with the best gene- prediction program that is currently available. However, since our method is based on a completely different type of input information, it can detect genes that are not detectable by standard methods and vice versa. Thus, our approach seems to be a useful addition to existing gene-prediction programs.

Keywords: gene prediction, sequence alignment, comparative genome analysis, cross-species sequence comparison, phylogenetic footprinting, genome annotation, dynamic programming