Combinatorial QSAR of Ambergris Fragrance Compounds

نویسندگان

  • Assia Kovatcheva
  • Alexander Golbraikh
  • Scott Oloff
  • Yun-De Xiao
  • Weifan Zheng
  • Peter Wolschann
  • Gerhard Buchbauer
  • Alexander Tropsha
چکیده

A combinatorial quantitative structure-activity relationships (Combi-QSAR) approach has been developed and applied to a data set of 98 ambergris fragrance compounds with complex stereochemistry. The Combi-QSAR approach explores all possible combinations of different independent descriptor collections and various individual correlation methods to obtain statistically significant models with high internal (for the training set) and external (for the test set) accuracy. Seven different descriptor collections were generated with commercially available MOE, CoMFA, CoMMA, Dragon, VolSurf, and MolconnZ programs; we also included chirality topological descriptors recently developed in our laboratory (Golbraikh, A.; Bonchev, D.; Tropsha, A. J. Chem. Inf. Comput. Sci. 2001, 41, 147-158). CoMMA descriptors were used in combination with MOE descriptors. MolconnZ descriptors were used in combination with chirality descriptors. Each descriptor collection was combined individually with four correlation methods, including k-nearest neighbors (kNN) classification, Support Vector Machines (SVM), decision trees, and binary QSAR, giving rise to 28 different types of QSAR models. Multiple diverse and representative training and test sets were generated by the divisions of the original data set in two. Each model with high values of leave-one-out cross-validated correct classification rate for the training set was subjected to extensive internal and external validation to avoid overfitting and achieve reliable predictive power. Two validation techniques were employed, i.e., the randomization of the target property (in this case, odor intensity) also known as the Y-randomization test and the assessment of external prediction accuracy using test sets. We demonstrate that not every combination of the data modeling technique and the descriptor collection yields a validated and predictive QSAR model. kNN classification in combination with CoMFA descriptors was found to be the best QSAR approach overall since predictive models with correct classification rates for both training and test sets of 0.7 and higher were obtained for all divisions of the ambergris data set into the training and test sets. Many predictive QSAR models were also found using a combination of kNN classification method with other collections of descriptors. The combinatorial QSAR affords automation, computational efficiency, and higher probability of identifying significant QSAR models for experimental data sets than the traditional approaches that rely on a single QSAR method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In-silico combinatorial design and pharmacophore modeling of potent antimalarial 4-anilinoquinolines utilizing QSAR and computed descriptors

There are very few studies for combinatorial library design and high throughput screening of 4-anilinoquinoline antimalarial compounds having activities against parasitic strain of P. falciparum. Therefore, an attempt has been made in the present paper to design potent lead compounds in this congener utilizing quantitative structure activity relationship utilizing theoretical molecular descript...

متن کامل

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR)

Virtual filtering and screening of combinatorial libraries have recently gained attention as methods complementing the high-throughput screening and combinatorial chemistry. These chemoinformatic techniques rely heavily on quantitative structure-activity relationship (QSAR) analysis, a field with established methodology and successful history. In this review, we discuss the computational method...

متن کامل

A framework for prioritizing fragrance materials for aquatic risk assessment.

More than 2,100 chemically defined organic chemicals are listed in the Research Institute of Fragrance Materials/Flavor and Extract Manufacturers' Association (RIFM/FEMA) Database that are used as ingredients of fragrances for consumer products. An approach was developed for prioritizing these fragrance materials for aquatic risk assessment by first estimating the predicted environmental concen...

متن کامل

An in silico skin absorption model for fragrance materials.

Fragrance materials are widely used in cosmetics and other consumer products. The Research Institute for Fragrance Materials (RIFM) evaluates the safety of these ingredients and skin absorption is an important parameter in refining systemic exposure. Currently, RIFM's safety assessment process assumes 100% skin absorption when experimental data are lacking. This 100% absorption default is not s...

متن کامل

Predicting the bioconcentration of fragrance ingredients by rainbow trout using measured rates of in vitro intrinsic clearance.

Bioaccumulation in aquatic species is a critical end point in the regulatory assessment of chemicals. Few measured fish bioconcentration factors (BCFs) are available for fragrance ingredients. Thus, predictive models are often used to estimate their BCFs. Because biotransformation can reduce chemical accumulation in fish, models using QSAR-estimated biotransformation rates have been developed. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and computer sciences

دوره 44 2  شماره 

صفحات  -

تاریخ انتشار 2004