Using Alternative Multi-variate Data Analysis Methods for Predicting Physical-chemical Properties of Organic Compounds

نویسنده

  • Anna Svantesson
چکیده

In drug design, the knowledge of physical properties of organic compounds is vital. The special field of ADMET (Administration, Distribution, Metabolism, Excretion, Toxicity) involves the determination of properties governing the effect of a drug on the human body, such as aqueous solubility, intestinal absorption and permeability. Assessing these properties experimentally, however, is often laborious and time-consuming. This master’s thesis investigates the applicability of three neural network methods as computer aided means for predicting physical properties from molecular structure. The examined methods include fuzzy ARTMAP, Adaptive Network-based Fuzzy Inference System (ANFIS) and Support Vector Machines (SVM). Furthermore, the possibility of constructing a so-called consensus model, in which the results of the above mentioned models are weighted together in order to receive improved predictions, is investigated. The results from this study show that the SVM technique produced the most accurate and stable predictions on a majority of the data sets. Both fuzzy ARTMAP and ANFIS experienced difficulties in generalising to unseen data, as a consequence of over-training. This was in particular valid for the smaller data sets (< 30 compounds). Nevertheless, some satisfactory results were obtained. Fuzzy ARTMAP made several excellent test set predictions, even though the cross-validation results were genuinely poor for the small data sets. ANFIS turned out to be less appropriate for biological applications, since the number of adjustable network parameters grows exponentially with the number of molecular descriptors, owing to the network’s complex architecture. For practical use, the number of descriptors was restricted to at most ten. Finally, utilising a simple consensus model was found to be an excellent method for improving the predictions. ANVÄNDNING AV ALTERNATIVA DATAANALYSMETODER FÖR PREDIKTION AV FYSIKALISKA OCH KEMISKA EGENSKAPER HOS ORGANISKA FÖRENINGAR

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه توابع انتقالی رگرسیونی و شبکه عصبی مصنوعی در برآورد گنجایش

Measuring the cation exchange capacity (CEC) as one of the most important chemical soil properties is very time consuming and costly. Pedotransfer functions (PTFs) provide an alternative to direct measurement by estimating CEC. The objective of this study was to develop PTFs for predicting CEC of Guilan province soils using artificial neural network (ANN) and multiple-linear regression method a...

متن کامل

Investigating the Effect of Land Use and Soil’s Physio-chemical properties on Wind Erosion Threshold Velocities via Data Mining

Introduction: Wind erosion is a phenomenon that causes severe environmental changes in arid and semi-arid climates. As surface soil texture is very effective in soil erodibility, identifying soil erodibility index is important and efficient. Mismanagement greatly contributes to the development of wind erosion. The velocity that makes the first particles of soil move from the surface is called t...

متن کامل

Predicting biological effect as a function of the chemical structure and the primary physical and chemical properties of organic compounds.

The dependence of biological parameters on chemical structure was studied for a series of compounds with similar chemical structures. Computational formulas were developed on the basis of correlation and regression analysis. These formulas make it possible to predict the biological effect thresholds for a great number of organic compounds on the basis of their primary physical and chemical prop...

متن کامل

Influence of organic compost compounds on soil chemical and physical properties

This study was conducted to evaluate the impact of municipal waste compost and manure on soil chemical and physical properties quality and crop production in Sari city (north of Iran). In this study, the effect of compost and manure (cow and sheep) on the quality of soil organic material with experimental measurements was investigated. An experiment was conducted in a completely randomized desi...

متن کامل

Quantum Chemical Modeling of N-(2-benzoylphenyl)oxalamate: Geometry Optimization, NMR, FMO, MEP and NBO Analysis Based on DFT Calculations

In the present work, the quantum theoretical calculations of the molecular structure of the (N-(2-benzoylphenyl) oxalamate has been investigated and are evaluated using Density Functional Theory (DFT). The geometry of the title compound was optimized by B3LYP method with 6-311+G(d) basis set. The theoretical 1H and 13C NMR chemical shift (GIAO method) values of the title compound are calculated...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003