FASSM: Enhanced Function Association in whole genome analysis using Sequence and Structural Motifs
Kumar Gaurav, Nitin Gupta1,2 and Ramanathan Sowdhamini*
National Centre for Biological Sciences, UAS-GKVK campus, Bellary Road, Bangalore 560 065, India
We present an algorithm to detect remote homology, which arises through circular permutation and discontinuous domains. It is also helpful in detecting small domain proteins that are characterized by few conserved residues. The input to the algorithm is a set of multiply aligned protein sequence profiles. This method, coded as FASSM, examines the sequence conservation and positions of protein family signatures or motifs for the annotation of protein sequences and to facilitate the analysis of their domains. The overall coverage of FASSM is 93% in comparison to other validation tools like HMM and IMPALA. The method is especially useful for difficult relationships such as discontinuous domains during whole-genome surveys and is demonstrated to perform accurate family associations at sequence identities as low as 15%.
Availability: Available upon request from the authors.