GPU-SME-kNN: Scalable and memory efficient kNN and lazy learning using GPUs

نویسندگان

  • Pablo David Gutiérrez
  • Miguel Lastra
  • Jaume Bacardit
  • José Manuel Benítez
  • Francisco Herrera
چکیده

The k nearest neighbor (kNN) rule is one of the most used techniques in data mining and pattern recognition due to its simplicity and low identification error. However, the computational effort it requires is directly related to the dataset sizes, hence delivering a poor performance on large datasets. ::: The :::: use :: of :::::::: graphics processing units (GPU) ::: has :::::::: improved :::: the :::::::: run-time : performance of the kNN rule but the computational requirements of current ::::::::: approaches ::::: limit ::: this : performance as the dataset size increases. In this paper : , : we propose a new scalable and memory efficient design for a GPU-based kNN rule, called GPU-SME-kNN, that ::::: breaks :::: the ::::::::::: dependency between dataset size and memory footprint while delivering high performance. An experimental study of GPU-SME-kNN is presented showing a high performance, even in cases that other :::::::: methods : cannot address, ::::: while ::: the :::::::::::::: computational ::::::::::: requirements :::: are : suitable for most commercial GPU devices. Our design has also been applied to kNN-based lazy learning algorithms reducing run-times in a significant way.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs

BACKGROUND The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilities (i.e. super-computers). An inexpensive solution, such as General Purpose computation based on ...

متن کامل

kNN-Borůvka-GPU: A Fast and Scalable MST Construction from kNN Graphs on GPU

Computation of the minimum spanning tree (MST) is a common task in numerous fields of research, such as pattern recognition, computer vision, network design (telephone, electrical, hydraulic, cable TV, computer, road networks etc.), VLSI layout, to name a few. However, for a large-scale dataset when the graphs are complete, classical MST computation algorithms become unsuitable on general purpo...

متن کامل

Study of K Nearest Neighbour Applications in Image Processing with Graphics Processing Unit

The GPU has always been a processor with ample computational resources. It is a powerful graphics engine and a highly parallel programmable processor having better efficiency and high speed that overshadows CPU. It is used in high performance computing system. The implementation of GPU can be done with CUDA C. Due to its highly parallel structure it is used in a number of real time applications...

متن کامل

KNN Model-Based Approach in Classification

The k-Nearest-Neighbours (kNN) is a simple but effective method for classification. The major drawbacks with respect to kNN are (1) its low efficiency being a lazy learning method prohibits it in many applications such as dynamic web mining for a large repository, and (2) its dependency on the selection of a “good value” for k. In this paper, we propose a novel kNN type method for classificatio...

متن کامل

A GPU-based Method for Computing Eigenvector Centrality of Gene-expression Networks

In this paper, we present a fast and scalable method for computing eigenvector centrality using graphics processing units (GPUs). The method is designed to compute the centrality on gene-expression networks, where the network is pre-constructed in the form of kNN graphs from DNA microarray data sets.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 373  شماره 

صفحات  -

تاریخ انتشار 2016