General Parallel Computation on Commodity Graphics Hardware: Case Study with the All-Pairs Shortest Paths Problem
نویسنده
چکیده
Programmability and IEEE-standard floating point arithmetic makes the latest commodity graphics processors (GPUs) an attractive platform for general parallel computing. In this paper we describe the implementation of the Warshall-Floyd algorithm on a class of GPUs. All-pairs shortest paths problem is relevant to many practical applications. Efficient GPU implementation of the Warshall-Floyd algorithm is challenging due to the algorithm's dynamic nature as well as limited GPU instruction set. GPU specific data organization, parallelization, and experimental results for several graphics accelerators are discussed. Algorithm implementation on the GPU utilizes interpolators, vertex and fragment pipelines, as well as vector operations to maximize performance. Speedups of up to 3x over a CPU implementation were achieved.
منابع مشابه
Estimating graph distance and centrality on shared nothing architectures
We present a parallel toolkit for pairwise distance computation in massive networks. Computing the exact shortest paths between a large number of vertices is a costly operation, and serial algorithms are not practical for billion-scale graphs. We first describe an efficient parallel method to solve the single source shortest path problem on commodity hardware with no shared memory. Using it as ...
متن کاملIn-place Recursive Approach for All-pairs Shortest Paths Problem Using Opencl
The all-pairs shortest paths (APSP) problem finds the shortest path distances between all pairs of vertices,and is one of the most fundamental graph problems. In this paper, a parallel recursive partitioning approach to APSP problem using Open Computing Language (OpenCL) for directed and dense graphs with no negative cyclesbased on R-Kleene algorithm, is presented, which recursively partitions ...
متن کاملA task parallel algorithm for finding all-pairs shortest paths using the GPU
This paper proposes an acceleration method for finding the all-pairs shortest paths (APSPs) using the graphics processing unit (GPU). Our method is based on Harish’s iterative algorithm that computes the cost of the single-source shortest path (SSSP) in parallel on the GPU. In addition to this fine-grained parallelism, we exploit the coarse-grained parallelism by using a task parallelization sc...
متن کاملLarge Graph Algorithms for Massively Multithreaded Architectures
Modern Graphics Processing Units (GPUs) provide high computation power at low costs and have been described as desktop supercomputers. The GPUs expose a general, data-parallel programming model today in the form of CUDA and CAL. The GPU is presented as a massively multithreaded architecture by them. Several high-performance, general data processing algorithms such as sorting, matrix multiplicat...
متن کاملHardware/Software Codesign for All-Pairs Shortest-Paths on a Reconfigurable Supercomputer
Rapid advances in VLSI technology have led to FieldProgrammable Gate Arrays (FPGAs) being employed in High Performance Computing systems. Applications using FPGAs on reconfigurable supercomputers involve software on the system managing computation on the reconfigurable hardware. To extract maximum benefits from a parallel FPGA kernel at the application level, it becomes crucial to minimize data...
متن کامل