| In Silico Biology 6, 0053 (2006); ©2006, Bioinformation Systems e.V. |
Department of Molecular Medicine,
Rajiv Gandhi Centre for Biotechnology,
Thiruvananthapuram,
Kerala,
India
* Corresponding author
Department of Molecular Medicine,
Rajiv Gandhi Centre for Biotechnology
Thycaud. P.O, Thiruvananthapuram- 695 014
Kerala State, India
Phone: +91- (0) 471- 2347973, Fax: +91-(0) 471- 2349303
Email: mrpillai@vsnl.com
Edited by H. Michael; received July 20, 2006; revised and accepted October 16, 2006; published November 13, 2006
Molecular docking and virtual screening based on molecular docking have become an integral part of many modern structure-based drug discovery efforts. Hence, it becomes a useful endeavor to evaluate existing docking programs, which can assist in the choice of the most suitable docking algorithm for any particular study. The objective of the current study was to evaluate the ability of ArgusLab 4.0, a relatively new molecular modeling package in which molecular docking is implemented, to reproduce crystallographic binding orientations and to compare its accuracy with that of a well established commercial package, GOLD. The study also aimed to evaluate the effect of the nature of the binding site and ligand properties on docking accuracy. The three dimensional structures of a carefully chosen set of 75 pharmaceutically relevant protein-ligand complexes were used for the comparative study. The study revealed that the commercial package outperforms the freely available docking engine in almost all the parameters tested. However, the study also revealed that although lagging behind in accuracy, results from ArgusLab are biologically meaningful. This taken together with the fact that ArgusLab has an easy to use graphical user interface, means that it can be employed as an effective teaching tool to demonstrate molecular docking to beginners in this area.
Keywords: molecular docking, ArgusLab, GOLD, docking accuracy, protein-ligand complex
The last few years have witnessed the development and implementation of a range of 'molecular docking' algorithms, based on different search methods [1, 2]. The approach, which has been shown to nicely complement high-throughput screening, has had several recent successes in drug discovery [3, 4].
Molecular docking involves the prediction of ligand (small molecule, in this study) conformation and orientation, sometimes dubbed 'pose', within the active site of the molecular target.
In this context it becomes a useful endeavor to evaluate existing docking programs, which can assist in the choice of the most suitable docking algorithm for any particular study. Indeed, several such studies, evaluating and comparing the accuracies of modern docking modules like Glide, GOLD, FLExX, ICM, DOCK, have been reported in recent literature [5, 6].
The objective of the current study was to evaluate the ability of ArgusLab 4.0, a relatively new molecular modeling package in which molecular docking is implemented, to reproduce crystallographic binding orientations and to compare its accuracy with that of a well established commercial package, GOLD. The study also aimed to evaluate the effect of the nature of the binding site and ligand properties on docking accuracy.
ArgusLab4.0, distributed freely for Windows platforms by Planaria Software, has fast become a favorite introductory molecular modeling package with academics mainly because of its user-friendly interface and intuitive calculation menus [7, 8]. The ArgusDock docking engine, implemented in ArgusLab4.0, approximates an exhaustive search method, with similarities to DOCK and Glide. Flexible ligand docking is possible with ArgusLab, where the ligand is described as a torsion tree and grids are constructed that overlay the binding site. Ligand's root node (group of bonded atoms that do not have rotatable bonds) is placed on a search point in the binding site and a set of diverse and energetically favorable rotations is created. For each rotation, torsions in breadth-first order are constructed and those poses that survive the torsion search are scored. The N-lowest energy poses are retained and the final set of poses undergoes coarse minimization, re-clustering and ranking.
Genetic Optimization for Ligand Docking, (GOLD) is distributed by Cambridge Crystallographic Data Centre. GOLD employs a genetic algorithm (GA) to explore the full range of conformational flexibility of the ligand and also the rotational flexibility of selected receptor hydrogens [9, 10]. The mechanism for ligand placement is based on fitting points. Fitting points are added to hydrogen-bonding groups on protein and ligand and the program maps acceptor points in the ligand on donor points in the protein and vice versa.
GOLD also maps ligand CH groups onto hydrophobic fitting points generated on the protein binding site cavity. GOLD also optimizes flexible ligand dihedrals, ligand ring geometries, dihedrals of protein hydroxyl and amino groups and mappings of the latter. A molecular mechanics-like scoring function that includes terms for hydrogen bond, 4-8 intermolecular van der Waals bonds and 6-12 intramolecular van der Waals bonds (for the internal energy of the ligand) is employed by GOLD to rank the docked poses.
In the present study, the ability of ArgusLab to reproduce the crystallographic binding orientations of the ligand was evaluated and the results were compared to that obtained with GOLD. The protein-ligand complexes used in the study have been derived from a well-validated set of complexes, relevant to drug discovery that was reported in recent literature [5]. The accuracy of the ArgusLab docking algorithm was also assessed taking into account key features such as the nature of the binding site and the number of rotatable bonds of the ligand.
Perola et al. had generated a test set of 150 pharmaceutically relevant protein-ligand complexes, for the evaluation of docking/scoring tools [5]. The 75 protein-ligand complexes, used in our study, was derived from that master set, after eliminating those complexes that contained water molecules in their binding sites that took part in ligand binding (see Supplementary Table). We employed this initial screening because ArgusLab totally failed to dock a few preliminary test cases that contained water molecules in their ligand-binding cavity. Swiss-pdb viewer (Deep view 3.7) was used to perform many of the following steps [11]. The entire computational work was carried out on an Intel Pentium 4 processor, 2.8 GHz.
Fig. 1A shows the percentages of the top ranking poses predicted by both GOLD and ArgusLab, within certain, defined root mean square deviation (RMSD) values from the original crystallographic pose. It can be seen that GOLD outperforms ArgusLab, in predicting the ligand pose correctly to within 2Å of the crystallographic pose. However, the surprising observation from the study is that, if one reduces the docking accuracy cutoff to 3Å RMSD value between the predicted and original poses; then, there is very little difference in the docking accuracies between ArgusLab and GOLD. This result is significant since for some research applications and for educational purposes, for which ArgusLab is primarily designed, this accuracy meets the requirements.
Fig. 1B shows the docking algorithms evaluated for their sampling accuracy. The top 10 poses predicted by GOLD and ArgusLab in each case were analyzed and compared to the crystallographic pose. The percentage of poses with RMSD within 2Å from the experimental structure was 75% for GOLD and 34% for ArgusLab. This confirms the results reported by earlier studies, GOLD seems to be highly efficient in terms of sampling. However, under less stringent conditions, i. e., docking accuracy cutoff of 3Å the performance of ArgusLab is vastly improved with 68% of the top ten poses falling within 3Å of the crystallographic pose. This indicates that the thoroughness of sampling implemented in ArgusLab, although clearly not up to state-of-the art standards, still gives biologically modest results, and does not betray its use in educational purpose demonstrations and similar situations.
Fig. 1C shows the results of yet another kind of analysis, namely, effect of a ligand parameter on docking accuracy. It is a well-known fact that as the number of rotatable bonds of the ligand increases, the docking accuracy falls since a much larger conformational space has to be sampled. The conformational space sampling methods employed by GOLD and ArgusLab are totally different. The complexes in the present study were divided into two groups, ligands with 1 to 6 rotors and those with 7 to 12 rotors. For both GOLD and for ArgusLab, the docking accuracy falls by almost half as the number of rotatable bonds increases one fold. Also in both cases, accuracy of GOLD is approximately double that of ArgusLab. However, an important consideration here is that docking times in ArgusLab is typically much shorter than that of GOLD and therefore any future improvements in this regard in ArgusLab can afford to sacrifice docking times.
The effect of degree of hydrogen bonding on docking accuracy was evaluated and the results are shown on Fig. 1D. Complexes having a DHB of 0.15 or higher are classified as hydrogen bond driven whereas complexes with DHB below 0.10 are dubbed hydrophobic-burial driven ones. In complexes with DHB values between these two cut-offs, the protein-ligand interaction is stabilized by a combination of these two major forces. Unlike GOLD which performs best when the DHB value is between 0.1 and 0.15, there is no noticeable change in the docking accuracy of ArgusLab with degree of hydrogen bonding at the active site. The incentive behind carrying out this analysis is that hydrogen bonding and hydrophobic burial are the two major forces important in stabilizing protein-ligand interactions. Protein ligand complexes, in the majority of cases can be divided into those driven by hydrogen bonds and those driven by hydrophobic burial. As can be seen from the docking results, for GOLD the docking accuracy falls drastically as the active site becomes progressively hydrophobic. ArgusLab, however, appears to be refractive to this effect.
In conclusion, this study evaluated and compared the docking accuracies of a commercial docking package, GOLD against a similar more general purpose one in the public domain, ArgusLab. The study revealed that the commercial package outperforms the freely available docking engine in almost all the parameters tested. However, the study also revealed that although lagging behind in accuracy, results from ArgusLab are biologically meaningful. This taken together with the fact that ArgusLab is offered in an intuitive, easy to use graphical user interface, suggests that it can be employed as an effective teaching tool to demonstrate molecular docking to beginners in this area. More research and development of the various docking and scoring modules of ArgusLab should definitely make this software a more popular one in educational circles.
Ramkumar Hariharan is supported by a research fellowship from the Council of Scientific and Industrial Research (CSIR), Government of India [F. No. 9/553 (14)/2003].