Rapid Design of Knowledge-Based Scoring Potentials for Enrichment of Near-Native Geometries in Protein-Protein Docking
نویسندگان
چکیده
Protein-protein docking protocols aim to predict the structures of protein-protein complexes based on the structure of individual partners. Docking protocols usually include several steps of sampling, clustering, refinement and re-scoring. The scoring step is one of the bottlenecks in the performance of many state-of-the-art protocols. The performance of scoring functions depends on the quality of the generated structures and its coupling to the sampling algorithm. A tool kit, GRADSCOPT (GRid Accelerated Directly SCoring OPTimizing), was designed to allow rapid development and optimization of different knowledge-based scoring potentials for specific objectives in protein-protein docking. Different atomistic and coarse-grained potentials can be created by a grid-accelerated directly scoring dependent Monte-Carlo annealing or by a linear regression optimization. We demonstrate that the scoring functions generated by our approach are similar to or even outperform state-of-the-art scoring functions for predicting near-native solutions. Of additional importance, we find that potentials specifically trained to identify the native bound complex perform rather poorly on identifying acceptable or medium quality (near-native) solutions. In contrast, atomistic long-range contact potentials can increase the average fraction of near-native poses by up to a factor 2.5 in the best scored 1% decoys (compared to existing scoring), emphasizing the need of specific docking potentials for different steps in the docking protocol.
منابع مشابه
DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction.
Following the formalism used for the development of the knowledge-based scoring function DrugScore, new distance-dependent pair potentials are obtained from nonbonded interactions in small organic molecule crystal packings. Compared to potentials derived from protein-ligand complexes, the better resolved small molecule structures provide relevant contact data in a more balanced distribution of ...
متن کاملStructural Prediction of Protein-RNA Interaction by Computational Docking with Propensity-Based Statistical Potentials
Despite the importance of protein-RNA interactions in the cellular context, the number of available protein-RNA complex structures is still much lower than those of other biomolecules. As a consequence, few computational studies have been addressed towards protein-RNA complexes, and to our knowledge, no systematic benchmarking of protein-RNA docking has been reported. In this study we have extr...
متن کاملDOCKGROUND protein-protein docking decoy set
UNLABELLED A protein-protein docking decoy set is built for the Dockground unbound benchmark set. The GRAMM-X docking scan was used to generate 100 non-native and at least one near-native match per complex for 61 complexes. The set is a publicly available resource for the development of scoring functions and knowledge-based potentials for protein docking methodologies. AVAILABILITY The decoys...
متن کاملPARADOCKS – a framework for molecular docking
The prediction of possible binding geometries as well as ranking of putative protein-ligand complexes according their binding affinities is the intention of so called molecular docking approaches. To evaluate complexes against each other, scoring functions are required. In recent years knowledge-based scoring functions have been evolved. They exploit the vast amount of experimentally determined...
متن کاملEvaluation of Several Two-Step Scoring Functions Based on Linear Interaction Energy, Effective Ligand Size, and Empirical Pair Potentials for Prediction of Protein-Ligand Binding Geometry and Free Energy
The performances of several two-step scoring approaches for molecular docking were assessed for their ability to predict binding geometries and free energies. Two new scoring functions designed for "step 2 discrimination" were proposed and compared to our CHARMM implementation of the linear interaction energy (LIE) approach using the Generalized-Born with Molecular Volume (GBMV) implicit solvat...
متن کامل