In Silico Biology 3, 0023 (2003); ©2003, Bioinformation Systems e.V.  



Functional classification of proteins using a nearest neighbour algorithm

Hans-Peter Keck¹,* and Thomas Wetter²

¹ LION bioscience AG, Waldhofer Str. 98, 69121 Heidelberg, Germany
Email: hans.peter.keck@lionbioscience.com
² Universitätsklinikum, Institut für Medizinische Biometrie und Informatik, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany

* corresponding author


Edited by E. Wingender; received October 02, 2002; revised and accepted February 12, 2003; published February 17, 2003


Abstract

With the large volume of genomic data now being analysed, automated protein classification has become essential for giving scientists a good overview of the analysed data.

The system described here is flexible and can be used with any given protein classification scheme. It must first be trained on a set of already classified proteins; afterwards, it can classify other proteins automatically. Several tests were performed to assess the quality of this classification, and they demonstrated the usefulness of the system.

The system will be available as part of a commercial software package.

Key words: protein function, classification, genome analysis, nearest neighbor algorithm, machine learning
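
As a rough illustration of the train-then-classify workflow summarised in the abstract, the following is a minimal sketch of a generic 1-nearest-neighbour classifier. It is not the system described in this article: the abstract does not specify how proteins are represented or compared, so the numeric feature vectors, the Euclidean distance, the class name `NearestNeighbourClassifier` and the category labels used here are all assumptions made purely for illustration.

```python
import math
from typing import List, Sequence, Tuple


class NearestNeighbourClassifier:
    """Toy 1-nearest-neighbour classifier (illustrative assumption only)."""

    def __init__(self) -> None:
        # Each stored example is (feature vector, class label).
        self._examples: List[Tuple[Tuple[float, ...], str]] = []

    def train(self, labelled: Sequence[Tuple[Sequence[float], str]]) -> None:
        """Store already classified examples; nearest neighbour has no other training step."""
        for features, label in labelled:
            self._examples.append((tuple(features), label))

    def classify(self, features: Sequence[float]) -> str:
        """Return the label of the stored example closest to `features`."""
        if not self._examples:
            raise ValueError("classifier has not been trained")
        # Euclidean distance is an assumption; any distance measure could be substituted.
        _, label = min(self._examples, key=lambda ex: math.dist(ex[0], features))
        return label


# Hypothetical two-dimensional feature vectors and hypothetical class labels.
training_set = [
    ((0.9, 0.1), "kinase"),
    ((0.8, 0.2), "kinase"),
    ((0.1, 0.9), "transporter"),
]

clf = NearestNeighbourClassifier()
clf.train(training_set)
prediction = clf.classify((0.2, 0.7))  # closest stored example is (0.1, 0.9)
```

Because the classifier only stores labelled examples and compares new items against them, the same code works unchanged with any classification scheme: the labels are opaque strings supplied by the training set.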