QSAR with Few Compounds and Many Features

نویسندگان

  • Douglas M. Hawkins
  • Subhash C. Basak
  • Xiaofang Shi
چکیده

Fitting quantitative structure-activity relationships (QSAR) requires different statistical methodologies and, to some degree, philosophies depending on the "shape" of the data matrix. When few features are used and there are many compounds, it is a reasonable expectation that good feature subset selection may be made and that nonlinearities and nonadditivities can be detected and diagnosed. Where there are many features and few compounds, this is unrealistic. Methods such as ridge regression RR, PLS, and principal component regression PCR, which abjure feature selection and rely on linearity may provide good predictions and fair understanding. We report a development of ridge regression for the underdetermined case by using generalized cross-validation to choose the ridge constant and perform F-tests for additional information. Conventional regression diagnostics can be used in followup to identify nonlinearities and other departures from model. We illustrate the approach with QSAR models of four data sets using calculated molecular descriptors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a CoMSIA model for inhibition of COX-2 by resveratrol derivatives

Design of selective cyclooxygenase-2 (COX-2) inhibitors is still a challenging task because of active site similarities between COX isoenzymes. To help with this issue, we tried to generate a 3D-QSAR (3 dimensional quantitative structure activity relationship) model that might reflect the essential features of COX-2 active sites. Compounds in a series of resveratrol derivatives inhibitors with ...

متن کامل

Developing a CoMSIA model for inhibition of COX-2 by resveratrol derivatives

Design of selective cyclooxygenase-2 (COX-2) inhibitors is still a challenging task because of active site similarities between COX isoenzymes. To help with this issue, we tried to generate a 3D-QSAR (3 dimensional quantitative structure activity relationship) model that might reflect the essential features of COX-2 active sites. Compounds in a series of resveratrol derivatives inhibitors with ...

متن کامل

A Computational Study of Cytotoxicity of Substituted Amides of Pyrazine-2-carboxylic acids Using QSAR and DFT Based Molecular Surface Electrostatic Potential

Pyrazine derivatives are important class of compounds with diverse biological and cytotoxic activities and clinical applications. In this study, B3 p 86 / 6 – 31 + + G * was used to compute and map the molecular surface electrostatic potentials of a group of substituted amides of pyrazine-2-carboxylic acids to identify common features related to their subsequent cytotoxicities. Several statisti...

متن کامل

QSAR & Network-based multi-species activity models for antifungals

__________________________________________________________________________________________ Abstract. There are many pathogen microbial species with very different antimicrobial drugs susceptibility. In this work, we selected pairs of antifungal drugs with similar/dissimilar species predicted-activity profile and represented it as a large network, which may be used to identify drugs with similar...

متن کامل

A Computational Study of Cytotoxicity of Substituted Amides of Pyrazine-2-carboxylic acids Using QSAR and DFT Based Molecular Surface Electrostatic Potential

Pyrazine derivatives are important class of compounds with diverse biological and cytotoxic activities and clinical applications. In this study, B3 p 86 / 6 – 31 + + G * was used to compute and map the molecular surface electrostatic potentials of a group of substituted amides of pyrazine-2-carboxylic acids to identify common features related to their subsequent cytotoxicities. Several statisti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and computer sciences

دوره 41 3  شماره 

صفحات  -

تاریخ انتشار 2001