Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules

Authors

  • Daniel S. Murrell
  • Isidro Cortes-Ciriano
  • Gerard J. P. van Westen
  • Ian Stott
  • Andreas Bender
  • Therese E. Malliavin
  • Robert C. Glen
Abstract

BACKGROUND: In silico predictive models have proved valuable for optimising compound potency, selectivity and safety profiles in the drug discovery process.

RESULTS: camb is an R package that provides an environment for the rapid generation of quantitative structure-property and structure-activity models for small molecules (including QSAR, QSPR, QSAM and PCM), and is aimed at both beginner and advanced R users. camb's capabilities include the standardisation of chemical structure representation; computation of 905 one-dimensional descriptors and 14 fingerprint-type descriptors for small molecules, 8 types of amino acid descriptors and 13 whole-protein sequence descriptors; filtering methods for feature selection; generation of predictive models (using an interface to the R package caret); and the creation of model ensembles (using the R package caretEnsemble). Results can be visualised through high-quality, customisable plots (R package ggplot2).

CONCLUSIONS: Overall, camb constitutes an open-source framework to perform the following steps: (1) compound standardisation, (2) molecular and protein descriptor calculation, (3) descriptor pre-processing and model training, visualisation and validation, and (4) bioactivity/property prediction for new molecules. camb aims to speed up model generation while supporting reproducibility and tests of robustness. QSPR and proteochemometric case studies are included to demonstrate camb's application.

Graphical abstract: From compounds and data to models: a complete model building workflow in one package.
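The descriptor filtering mentioned in the abstract can be illustrated with a minimal sketch. This is not camb's API (camb is an R package). It is a plain-Python illustration of two filters commonly used in this kind of workflow: dropping near-constant descriptors and dropping descriptors highly correlated with one already kept. The function names, thresholds, descriptor names and toy values below are all assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def filter_descriptors(table, min_variance=1e-8, max_abs_corr=0.95):
    """Keep descriptors that vary and are not redundant.

    table maps a descriptor name to its values, one value per compound.
    Both thresholds are hypothetical defaults, not values from the paper.
    """
    kept = {}
    for name, values in table.items():
        # Filter 1: drop (near-)constant descriptors.
        if pvariance(values) <= min_variance:
            continue
        # Filter 2: drop descriptors highly correlated with a kept one.
        if any(abs(pearson(values, other)) > max_abs_corr
               for other in kept.values()):
            continue
        kept[name] = values
    return kept


# Toy descriptor table (made-up numbers, four hypothetical compounds).
descriptors = {
    "d1": [180.2, 151.2, 206.3, 194.2],
    "d1_scaled": [360.4, 302.4, 412.6, 388.4],  # exactly 2 * d1
    "d_const": [1, 1, 1, 1],                    # no variance
    "d2": [1.2, 0.5, 3.5, -0.1],
}

selected = filter_descriptors(descriptors)
print(sorted(selected))
```

In this toy table, `d_const` is removed by the variance filter and `d1_scaled` by the correlation filter, leaving `d1` and `d2`. The result depends on insertion order: whichever member of a correlated pair appears first is the one retained.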

Similar resources

QSPR with ’camb’ Chemically Aware Model Builder

Daniel S. Murrell, Isidro Cortes-Ciriano, Gerard J. P. van Westen, Ian P. Stott, Andreas Bender, Therese E. Malliavin, and Robert C. Glen. Unilever Centre for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom. Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3825, Structural Biology and Chemistry Department, 25-2...

Proteochemometrics (PCM) with ’camb’ Chemistry Aware Model Builder

Isidro Cortes-Ciriano, Daniel S. Murrell, Gerard J. P. van Westen, Ian P. Stott, Andreas Bender, Therese E. Malliavin, and Robert C. Glen. Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3825, Structural Biology and Chemistry Department, 25-28, rue Dr. Roux, 75 724 Paris, France. Unilever Centre for Molecular Science Informatics, Departme...

Bayesian molecular design with a chemical language model

The aim of computational molecular design is to identify promising hypothetical molecules with a predefined set of desired properties. We address the problem of accelerating materials discovery with state-of-the-art machine learning techniques. The method involves two types of prediction: forward and backward. The objective of the forward prediction is to cr...

P122: Small Molecules as Chemical and Pharmacological Tools for Neuroinflammatory Diseases Treatment (with Emphasis on Multiple Sclerosis)

Multiple Sclerosis (MS) is a neuroinflammatory disease that results in degeneration of the myelin sheaths and death of oligodendrocytes. Several strategies have so far been introduced to control the disease. Treatment with small molecules is one strategy that has recently attracted attention in the scientific community. These molecules that target epigenetic and other cellular proce...

Inventory Model for Deteriorating Items Involving Fuzzy with Shortages and Exponential Demand

This paper considers a fuzzy inventory model for deteriorating items with power demand under fully backlogged conditions. We define the factors that affect the inventory cost through the shortage costs. The aim of this paper is to study inventory modelling in a fuzzy environment. Inventory parameters, such as holding cost, shortage cost, purchasing cost and deteriorati...

Journal title:

Volume 7

Publication date: 2015