HI-FFT: Heterogeneous Parallel In-Place Algorithm for Large-Scale 2D-FFT

Authors

Abstract

Fast Fourier Transform (FFT) is a fundamental operation for 2D data in various applications. To accelerate large-scale 2D-FFT computation, we propose a Heterogeneous parallel In-place 2D-FFT algorithm, HI-FFT. Our novel work decomposition method makes it possible to run our parallel algorithm on the original data (i.e., in-place), unlike prior parallel algorithms that require additional memory space (i.e., out-of-place) to guarantee independence among sub-tasks. Our work decomposition method also removes the duplicated operations of the out-of-place approaches. Using this method, we introduce an in-place heterogeneous parallel algorithm that utilizes both a multi-core CPU and a GPU simultaneously. To maximize the utilization efficiency of the computing resources, we also propose a priority-based dynamic scheduling method. We compared the performance of seven different algorithms, including ours, on large-scale 2D-FFT problems whose sizes varied from 20K² to 120K². As a result, we found that our method achieved up to 2.92 and 4.42 times higher performance than the conventional homogeneous parallel algorithms based on state-of-the-art CPU and GPU libraries, respectively. Also, our method showed up to 2.27 times higher performance than the conventional heterogeneous parallel algorithms while requiring two times less memory space. To check the benefit of HI-FFT in an actual application, we applied it to the CGH (Computer Generated Holography) process. We found that it successfully reduces the hologram generation time. These results demonstrate the advantage of our approach for large-scale 2D-FFT computation.
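To make the term "in-place" concrete, the sketch below shows the textbook row-column 2D-FFT, which overwrites the input buffer instead of allocating a second one. It is an illustration only and not the HI-FFT work decomposition, scheduling method, or CPU/GPU implementation, none of which are detailed in the abstract. The function names `fft_inplace` and `fft2_inplace`, the flat row-major layout, and the power-of-two size restriction are assumptions made for this example.

```python
import cmath


def fft_inplace(a, start, stride, n):
    """Radix-2 FFT over n elements of the flat list `a`, beginning at
    index `start` and stepping by `stride`. Overwrites the input."""
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")
    # Bit-reversal permutation so the butterflies can run in place.
    j = 0
    for i in range(1, n):
        bit = n >> 1
        while j & bit:
            j ^= bit
            bit >>= 1
        j ^= bit
        if i < j:
            p, q = start + i * stride, start + j * stride
            a[p], a[q] = a[q], a[p]
    # Butterfly passes over blocks of doubling length.
    length = 2
    while length <= n:
        w_len = cmath.exp(-2j * cmath.pi / length)
        half = length // 2
        for block in range(0, n, length):
            w = 1 + 0j
            for k in range(half):
                p = start + (block + k) * stride
                q = p + half * stride
                u, v = a[p], a[q] * w
                a[p], a[q] = u + v, u - v
                w *= w_len
        length *= 2


def fft2_inplace(a, rows, cols):
    """Row-column 2D-FFT on a row-major flat list, with no second buffer."""
    # Each row is an independent sub-task touching a disjoint slice.
    for r in range(rows):
        fft_inplace(a, r * cols, 1, cols)
    # Each column is an independent strided sub-task.
    for c in range(cols):
        fft_inplace(a, c, cols, rows)
```

In this sketch every row pass, and then every column pass, touches a disjoint set of elements, so the passes within one phase could in principle be handed to different workers. How HI-FFT actually splits and prioritizes work between the CPU and the GPU is described in the full paper, not here.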

Similar resources

3D FFT with 2D decomposition

Many scientific applications, including molecular dynamics (MD), require a fast Fourier transform (FFT). As the number of processors in high-performance computers increases, this transform has to be parallelized across a larger number of processors so that it does not become a bottleneck for the parallelization. This requires the decomposition to be changed from 1D to 2D. Such a 2D decomposed 3D FFT was implemente...

Parallel FFT-Hashing

We propose two families of scalable hash functions for collision–resistant hashing that are highly parallel and based on the generalized fast Fourier transform (FFT). FFT–hashing is based on multipermutations. This is a basic cryptographic primitive for perfect generation of diffusion and confusion which generalizes the boxes of the classic FFT. The slower FFT–hash functions iterate a compressi...

Pipelined Parallel FFT Architecture

In this paper, an optimized, efficient VLSI architecture of a pipelined fast Fourier transform (FFT) processor capable of producing the reverse output order sequence is presented. The paper presents a radix-2 multipath delay architecture for FFT calculation. The implementation of FFT in hardware is very critical because for calculation of FFT number of butterfly operations i.e. number of mult...

Massively Parallel FFT Algorithm for the NVIDIA Tesla GPU

The emergence of streaming multicore processors with multi-SIMD architectures opens unprecedented opportunities for executing many sophisticated signal processing algorithms, including FFTs, faster and within a much lower energy budget. We report on the development, implementation, and demonstration of a novel, massively parallel computational scheme for the FFT that exploits the capabilities o...

Parallel FFT with Eden Skeletons

The paper investigates and compares skeleton-based Eden implementations of different FFT algorithms on workstation clusters with distributed memory. Our experiments show that the basic divide-and-conquer versions suffer from an inherent input distribution and result collection problem. Advanced approaches like calculating FFT using a parallel map-and-transpose skeleton provide more flexibility t...

Journal

Journal title: IEEE Access

Year: 2021

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2021.3108404