ThunderGP: Resource-Efficient Graph Processing Framework on FPGAs with HLS
نویسندگان
چکیده
FPGA has been an emerging computing infrastructure in datacenters benefiting from fine-grained parallelism, energy efficiency, and reconfigurability. Meanwhile, graph processing attracted tremendous interest data analytics, its performance is increasing demand with the rapid growth of data. Many works have proposed to tackle challenges designing efficient FPGA-based accelerators for processing. However, largely overlooked programmability still requires hardware design expertise sizable development efforts developers. ThunderGP , a high-level synthesis based framework on FPGAs, hence close gap, which developers could enjoy high FPGA-accelerated by writing only few functions no knowledge hardware. adopts gather-apply-scatter model as abstraction various algorithms realizes built-in highly parallel memory-efficient accelerator template. With inputs, automatically explores massive resources multiple super-logic regions modern platforms generate deploy accelerators, well schedule tasks them. Although DRAM-based memory bandwidth bounded, recent (HBM) brings large potentials performance. system bottleneck shifts resource consumption HBM-enabled platforms. Therefore, we further propose improve efficiency utilize more HBM. We conduct evaluation seven common applications 19 graphs. provides 1.9× ∼ 5.2× improvement over state art, whereas HBM-based delivers up speedup state-of-the-art RTL-based approach. This work open sourced GitHub at https://github.com/Xtra-Computing/ThunderGP .
منابع مشابه
Data Processing on FPGAs
Computer architectures are quickly changing toward heterogeneous many-core systems. Such a trend opens up interesting opportunities but also raises immense challenges since the efficient use of heterogeneous many-core systems is not a trivial problem. In this paper, we explore how to program data processing operators on top of field-programmable gate arrays (FPGAs). FPGAs are very versatile in ...
متن کاملAccelerated image processing on FPGAs
The Cameron project has developed a language called single assignment C (SA-C), and a compiler for mapping image-based applications written in SA-C to field programmable gate arrays (FPGAs). The paper tests this technology by implementing several applications in SA-C and compiling them to an Annapolis Microsystems (AMS) WildStar board with a Xilinx XV2000E FPGA. The performance of these applica...
متن کاملStudy on Resource Efficiency of Distributed Graph Processing
Graphs may be used to represent many different problem domains – a concrete example is that of detecting communities in social networks, which are represented as graphs. With big data and more sophisticated applications becoming widespread in recent years, graph processing has seen an emergence of requirements pertaining data volume and volatility. This multidisciplinary study presents a review...
متن کاملGraphGen for CoRAM: Graph Computation on FPGAs
This paper presents a system for executing graph computations on FPGAs. It implements an optimizing FPGA backend for the GraphGen graph algorithm compiler [12] and uses the CoRAM prototype implementation [6] to support FPGAs from both Xilinx and Altera. High performance in the generated implementations is achieved through a combination of data transfer optimizations and utilizing the hardware f...
متن کاملRIPL: An Efficient Image Processing DSL for FPGAs
Field programmable gate arrays (FPGAs) can accelerate image processing by exploiting finegrained parallelism opportunities in image operations. FPGA language designs are often subsets or extensions of existing languages, though these typically lack suitable hardware computation models so compiling them to FPGAs leads to inefficient designs. Moreover, these languages lack image processing domain...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Reconfigurable Technology and Systems
سال: 2022
ISSN: ['1936-7414', '1936-7406']
DOI: https://doi.org/10.1145/3517141