SCISSORS: A Linear-Algebraical Technique to Rapidly Approximate Chemical Similarities

نویسندگان

  • Imran S. Haque
  • Vijay S. Pande
چکیده

Algorithms for several emerging large-scale problems in cheminformatics have as their rate-limiting step the evaluation of relatively slow chemical similarity measures, such as structural similarity or three-dimensional (3-D) shape comparison. In this article we present SCISSORS, a linear-algebraical technique (related to multidimensional scaling and kernel principal components analysis) to rapidly estimate chemical similarities for several popular measures. We demonstrate that SCISSORS faithfully reflects its source similarity measures for both Tanimoto calculation and rank ordering. After an efficient precalculation step on a database, SCISSORS affords several orders of magnitude of speedup in database screening. SCISSORS furthermore provides an asymptotic speedup for large similarity matrix construction problems, reducing the number of conventional slow similarity evaluations required from quadratic to linear scaling.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error Bounds on the SCISSORS Approximation Method

The SCISSORS method for approximating chemical similarities has shown excellent empirical performance on a number of real-world chemical data sets but lacks theoretically proven bounds on its worst-case error performance. This paper first proves reductions showing SCISSORS to be equivalent to two previous kernel methods: kernel principal components analysis and the rank-k Nyström approximation ...

متن کامل

a New Approximate Solution Technique (Quantized Method) for Simultaneous Gas Solid Reactions

Simultaneous reactions between solids and gases are very important in the chemical and metallurgical processes. In the modeling, the chemical reaction and diffusion of gases must be considered. Therefore, a set of coupled partial differential equations is found. When the kinetic is a function of solid concentration, there is not any analytical solution for these equations. Therefore, numerical ...

متن کامل

Idempotent Functional Analysis : an Algebraical Approach

In this paper we consider Idempotent Functional Analysis, an 'abstract' version of Idempotent Analysis developed by V. P. Maslov and his collaborators. We give a review of the basic ideas of Idempotent Analysis. The correspondence between concepts and theorems of the traditional Functional Analysis and its idempotent version is discussed; this correspondence is similar to N. Bohr's corresponden...

متن کامل

Approximate solution of the stochastic Volterra integral equations via expansion method

In this paper, we present an efficient method for determining the solution of the stochastic second kind Volterra integral equations (SVIE) by using the Taylor expansion method. This method transforms the SVIE to a linear stochastic ordinary differential equation which needs specified boundary conditions. For determining boundary conditions, we use the integration technique. This technique give...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 50 6  شماره 

صفحات  -

تاریخ انتشار 2010