ISB Home



- Article -





Volume 4


Full article

In Silico Biology 4, 0025 (2004); ©2004, Bioinformation Systems e.V.  



Molecular chaperones: proposal of a systematic computer-oriented nomenclature and construction of a centralized database

Haïtham Sghaier1, Thuy Le Huyen Ai 1, Tokumasa Horiike2 and Takao Shinozawa3*

1 Department of Biological and Chemical Engineering, Faculty of Engineering, Gunma University, Kiryu, Gunma 376-8515, Japan
  Email: haithamsghaier@scientist.com, lhathuy@yahoo.com
2 Gene Function Research Laboratory, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan
  Email: horiike_t@hotmail.com
3 Graduate School of Science and Engineering, Major in Integrative Bioscience and Biomedical Engineering Waseda Research Park, IOC Honjo-Waseda, 1011-3 Shimokuboyama, Nishitomita,Honjo, Saitama 367-0035, Japan
  Email: shinozawa@waseda.jp

*  corresponding author


Edited by E. Wingender; received March 06, 2004; revised and accepted April 06, 2004; published April 16, 2004


Abstract

Molecular chaperones are a wide group of unrelated protein families whose role is to assist others proteins. Comparably, under environmental stress, stress proteins behave as biocatalysts of protein stabilization. Stress proteins include a large class of proteins that were originally termed heat shock proteins (HSPs) due to their initial discovery in tissues exposed to elevated temperatures. Many, but not all, stress proteins and HSPs are molecular chaperones. Moreover, not all HSPs are derivable from stress. HSPs are structurally diversified by the contribution of various domains having specific roles. HSPs have been grouped, mainly on the basis of their molecular masses, into specific families that include small HSPs (sHSPs)/α-crystallins, HSP10s, HSP40s, HSP60s, HSP70s, HSP90s, HSP100s and HSP110s. The names of these major families are historical artefacts with limited information content. Using the current databases, names and proteic domains of many molecular chaperones in different species were analyzed. Although traditional names of HSPs are trivial, it is unrealistic to suggest replacing them, because they are preferred and widely used. Here we suggest that these traditional names be chaperoned, in silico, by a systematic nomenclature. Thus, for example, with the same intent of use of [trioxygen: O3] for ozone, we propose here C7HSP70[Ehsa]ER-P11021 for GRP78 (78 kDa endoplasmic Human molecular chaperone in HSP70 superfamily with P11021 as its accession number in the database of the National Center for Biotechnology Information (NCBI)). The proposed systematic computer-oriented naming and classification method is designed for HSPs and also their partners based on the number of amino acids, domain structure, phylogenetic domain, localization in the cell and accession number as stated in the NCBI. Arabidopsis thaliana was analyzed as a model, because it contains a large number of various HSPs localized in several organelles. Overall, this naming system helps in building, optimizing and managing a novel online database entirely devoted to HSPs. The purported taxonomy, coupled with the newly constructed database, can contribute to studies involving large amounts of stored data on HSPs.

Key words: stress genes, in silico supplementary nomenclature, hierarchical centralized website