Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
نویسندگان
چکیده
BACKGROUND A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article. RESULTS The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r2 = 0.89. CONCLUSION A novel method to estimate synthetic accessibility of molecules has been developed. This method uses historical synthetic knowledge obtained by analyzing information from millions of already synthesized chemicals and considers also molecule complexity. The method is sufficiently fast and provides results consistent with estimation of ease of synthesis by experienced medicinal chemists. The calculated SAscore may be used to support various drug discovery processes where a large number of molecules needs to be ranked based on their synthetic accessibility, for example when purchasing samples for screening, selecting hits from high-throughput screening for follow-up, or ranking molecules generated by various de novo design approaches.
منابع مشابه
Ligand based lead generation - considering chemical accessibility in rescaffolding approaches via BROOD
In pharmaceutical industry ligand based approaches like scaffold hopping, scaffold decoration and me-too approaches, are used to generate lead structures in discovery projects. We use several tools to generate novel lead structures, such as BROOD [1]. BROOD is a software tool which explores chemical space around query molecules based on shape similarity and electrostatics, and it generates anal...
متن کاملNovel Small Molecules against Two Binding Sites of Wnt2 Protein as potential Drug Candidates for Colorectal Cancer: A Structure Based Virtual Screening Approach
Wnts are the major ligands responsible for activating Wnt signaling pathway through binding to Frizzled proteins (Fzd) as the receptors. Among these ligands, Wnt2 plays the main role in the tumorigenesis of several human cancers especially colorectal cancer (CRC). Therefore, it can be considered as a potential drug target.The aim of this study was to identify potential drug candidates ...
متن کاملNovel Small Molecules against Two Binding Sites of Wnt2 Protein as potential Drug Candidates for Colorectal Cancer: A Structure Based Virtual Screening Approach
Wnts are the major ligands responsible for activating Wnt signaling pathway through binding to Frizzled proteins (Fzd) as the receptors. Among these ligands, Wnt2 plays the main role in the tumorigenesis of several human cancers especially colorectal cancer (CRC). Therefore, it can be considered as a potential drug target.The aim of this study was to identify potential drug candidates ...
متن کاملLeadOp+R: Structure-Based Lead Optimization With Synthetic Accessibility
We previously described a structure-based fragment hopping for lead optimization using a pre-docked fragment database, "LeadOp," that conceptually replaced "bad" fragments of a ligand with "good" fragments while leaving the core of the ligand intact thus improving the compound's activity. LeadOp was proven to optimize the query molecules and systematically developed improved analogs for each of...
متن کاملMolecular Docking Based on Virtual Screening, Molecular Dynamics and Atoms in Molecules Studies to Identify the Potential Human Epidermal Receptor 2 Intracellular Domain Inhibitors
Human epidermal growth factor receptor 2 (HER2) is a member of the epidermal growth factor receptor family having tyrosine kinase activity. Overexpression of HER2 usually causes malignant transformation of cells and is responsible for the breast cancer. In this work, the virtual screening, molecular docking, quantum mechanics and molecular dynamics methods were employed to study protein–ligand ...
متن کامل