CORAL: QSPR model of water solubility based on local and global SMILES attributes.

نویسندگان

  • Andrey A Toropov
  • Alla P Toropova
  • Emilio Benfenati
  • Giuseppina Gini
  • Danuta Leszczynska
  • Jerzy Leszczynski
چکیده

Water solubility is an important characteristic of a chemical in many aspects. However experimental definition of the endpoint for all substances is impossible. In this study quantitative structure-property relationships (QSPRs) for negative logarithm of water solubility-logS (mol L(-1)) are built up for five random splits into the sub-training set (≈55%), the calibration set (≈25%), and the test set (≈20%). Simplified molecular input-line entry system (SMILES) is used as the representation of the molecular structure. Optimal SMILES-based descriptors are calculated by means of the Monte Carlo method using the CORAL software (http://www.insilico.eu/coral). These one-variable models for water solubility are characterized by the following average values of the statistical characteristics: n(sub_train)=725-763; n(calib)=312-343; n(test)=231-261; r(sub_train)(2)=0.9211±0.0028; r(calib)(2)=0.9555±0.0045; r(test)(2)=0.9365±0.0073; s(sub_train)=0.561±0.0086; s(calib)=0.453±0.0209; s(test)=0.520±0.0205. Thus, the reproducibility of statistical quality of suggested models for water solubility confirmed for five various splits.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LINGO, an Efficient Holographic Text Based Method To Calculate Biophysical Properties and Intermolecular Similarities

SMILES strings are the most compact text based molecular representations. Implicitly they contain the information needed to compute all kinds of molecular structures and, thus, molecular properties derived from these structures. We show that this implicit information can be accessed directly at SMILES string level without the need to apply explicit time-consuming conversion of the SMILES string...

متن کامل

CORAL: Quantitative structure-activity relationship models for estimating toxicity of organic compounds in rats

For six random splits, one-variable models of rat toxicity (minus decimal logarithm of the 50% lethal dose [pLD50], oral exposure) have been calculated with CORAL software (http://www.insilico.eu/coral/). The total number of considered compounds is 689. New additional global attributes of the simplified molecular input line entry system (SMILES) have been examined for improvement of the optimal...

متن کامل

Prediction of boiling point and water solubility of crude oil hydrocarbons using sub-structural molecular fragments method

The quantitative structure–property relationship (QSPR) method is used to develop the correlation between structures of crude oil hydrocarbons (80 compounds) and their boiling point and water solubility. Sub-structural molecular fragments (SMF) calculated from structure alone were used to represent molecular structures. A subset of the calculated fragments selected using stepwise regression (fo...

متن کامل

coral Software: QSAR for Anticancer Agents.

CORrelations And Logic (coral at http://www.insilico.eu/coral) is freeware aimed at establishing a quantitative structure - property/activity relationships (QSPR/QSAR). Simplified molecular input line entry system (SMILES) is used to represent the molecular structure. In fact, symbols in SMILES nomenclatures are indicators of the presence of defined molecular fragments. By means of the calculat...

متن کامل

Chem. Pharm. Bull. 55(4) 669—674 (2007)

tant molecular property, playing a large role in the behavior of compounds in many areas of interest. Given the importance of solubility, a means of prediction based solely on molecular structure should prove a useful tool, as many compounds exist for which the solubility simply is not available. The solubility of chemicals and drugs in the water phase has an essential influence on the extent o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Chemosphere

دوره 90 2  شماره 

صفحات  -

تاریخ انتشار 2013