Machine learning of free energies in chemical compound space using ensemble representations: Reaching experimental uncertainty for solvation

نویسندگان

چکیده

Free energies govern the behavior of soft and liquid matter, improving their predictions could have a large impact on development drugs, electrolytes or homogeneous catalysts. Unfortunately, it is challenging to devise an accurate description effects governing solvation such as hydrogen-bonding, van der Waals interactions, conformational sampling. We present energy Machine Learning (FML) model applicable throughout chemical compound space based representation that employs Boltzmann averages account for approximated sampling configurational space. Using FreeSolv database, FML's out-of-sample prediction errors experimental hydration free decay systematically with training set size, uncertainty (0.6 kcal/mol) reached after 490 molecules (80\% FreeSolv). Corresponding FML are also par state-of-the art physics approaches. To generate input new query compound, requires approximate short molecular dynamics runs. showcase its usefulness through analysis 116k organic (all force-field compatible in QM9 database) identifying most least solvated systems, rediscovering quasi-linear structure property relationships terms simple descriptors hydrogen-bond donors, number NH OH groups, oxygen atoms hydrocarbons, heavy atoms. accuracy maximal when temperature used simulation averaged samples same compounds. The time converges rapidly respect error.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning of Molecular Electronic Properties in Chemical Compound Space

Grégoire Montavon,1 Matthias Rupp,2 Vivekanand Gobre,3 Alvaro Vazquez-Mayagoitia,4 Katja Hansen,3 Alexandre Tkatchenko,3, 5, ∗ Klaus-Robert Müller,1, 6, † and O. Anatole von Lilienfeld4, ‡ 1Machine Learning Group, Technical University of Berlin, Franklinstr 28/29, 10587 Berlin, Germany 2Institute of Pharmaceutical Sciences, ETH Zurich, 8093 Zürich, Switzerland 3Fritz-Haber-Institut der Max-Plan...

متن کامل

Predicting Solvation Free Energies Using Parameter-Free Solvent Models.

We present a new approach for predicting solvation free energies in nonaqueous solvents. Utilizing the corresponding states principle, we estimate solvent Lennard-Jones parameters directly from their critical points. Combined with atomic solutes and the pressure corrected three-dimensional reference interaction site model (3D-RISM/PC+), the model gives accurate predictions for a wide range of n...

متن کامل

Bayesian Model Averaging for Ensemble-Based Estimates of Solvation-Free Energies.

This paper applies the Bayesian Model Averaging statistical ensemble technique to estimate small molecule solvation free energies. There is a wide range of methods available for predicting solvation free energies, ranging from empirical statistical models to ab initio quantum mechanical approaches. Each of these methods is based on a set of conceptual assumptions that can affect predictive accu...

متن کامل

Hypertension Prediction in Primary School Students Using an Ensemble Machine Learning Method

Introduction: The prevalence of hypertension in children is increasing, and this complication is considered the most important risk factor for cardiovascular diseases in older age. Early detection and control of hypertension can prevent its progress and reduce its consequences. Machine learning methods can help predict this complication promptly and reduce cost and time. This study aimed to pro...

متن کامل

Hypertension Prediction in Primary School Students Using an Ensemble Machine Learning Method

Introduction: The prevalence of hypertension in children is increasing, and this complication is considered the most important risk factor for cardiovascular diseases in older age. Early detection and control of hypertension can prevent its progress and reduce its consequences. Machine learning methods can help predict this complication promptly and reduce cost and time. This study aimed to pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Chemical Physics

سال: 2021

ISSN: ['1520-9032', '1089-7690', '0021-9606']

DOI: https://doi.org/10.1063/5.0041548