Fast spectral analysis for approximate nearest neighbor search

نویسندگان

چکیده

In large-scale machine learning, of central interest is the problem approximate nearest neighbor (ANN) search, where goal to query particular points that are close a given object under certain metric. this paper, we develop novel data-driven ANN search algorithm data structure learned by fast spectral technique based on s landmarks selected ridge leverage scores. We show with overwhelming probability, our returns $$(1+\epsilon /4)$$ -ANN for any approximation parameter $$\epsilon \in (0,1)$$ . A remarkable feature it computationally efficient. Specifically, learning k-length hash codes requires $$O((s^3+ns^2)\log n)$$ running time and $$O(d^2)$$ extra space, returning needs $$O(k\log time. The experimental results computer vision natural language understanding tasks demonstrate significant advantage compared state-of-the-art methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Approximate Nearest-Neighbor Search with k-Nearest Neighbor Graph

We introduce a new nearest neighbor search algorithm. The algorithm builds a nearest neighbor graph in an offline phase and when queried with a new point, performs hill-climbing starting from a randomly sampled node of the graph. We provide theoretical guarantees for the accuracy and the computational complexity and empirically show the effectiveness of this algorithm.

متن کامل

Composite Quantization for Approximate Nearest Neighbor Search

This paper presents a novel compact coding approach, composite quantization, for approximate nearest neighbor search. The idea is to use the composition of several elements selected from the dictionaries to accurately approximate a vector and to represent the vector by a short code composed of the indices of the selected elements. To efficiently compute the approximate distance of a query to a ...

متن کامل

Approximate Nearest Neighbor Search in $\ell_p$

We present a new locality sensitive hashing (LSH) algorithm for c-approximate nearest neighbor search in lp with 1 < p < 2. For a database of n points in lp, we achieve O(dn ) query time and O(dn + n) space, where ρ ≤ O((ln c)/c). This improves upon the previous best upper bound ρ ≤ 1/c by Datar et al. (SOCG 2004), and is close to the lower bound ρ ≥ 1/c by O’Donnell, Wu and Zhou (ITCS 2011). T...

متن کامل

Hardness of Approximate Nearest Neighbor Search

We prove conditional near-quadratic running time lower bounds for approximate Bichromatic Closest Pair with Euclidean, Manhattan, Hamming, or edit distance. Specifically, unless the Strong Exponential Time Hypothesis (SETH) is false, for every δ > 0 there exists a constant ε > 0 such that computing a (1 + ε)-approximation to the Bichromatic Closest Pair requires Ω (

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Machine Learning

سال: 2022

ISSN: ['0885-6125', '1573-0565']

DOI: https://doi.org/10.1007/s10994-021-06124-1