FFTS ON THE VPP 500 3 Platform FFT performance rFFT
نویسندگان
چکیده
Fast Fourier transforms parallelize well but need large amounts of communication. An algorithm which concentrates all the communication in one or two transposition steps is the transpose split algorithm. Di erent transposition algorithms can be used depending on data size and communication latency. A new transpose split algorithm for real and hermitian data is presented for one, two and three dimensional transforms. This algorithm is implemented on the Fujitsu VPP 500. The Fujitsu VPP 500 is a parallel processor with a moderate number of very fast vector processors connected by a crossbar switch. Each processor has a peak performance of 1.6 G op/s and can simultaneously read and write 400 MByte/s. Very long vector length stride one implementations of multiple FFTs on one node [Hegland, Numerische Mathematik, to appear, 1994] are combined with optimized transpositions. One third of peak performance was achieved on a con guration with up to 11 processors. 1991 Mathematics Subject Classi cation. 65T,65Y.
منابع مشابه
3D FFTs on HPCx
Fast Fourier Transforms (FFTs) are an essential part of many scientific codes: from Molecular Dynamics to Climate Modelling. It is, therefore, evident that HPCx requires efficient methods for performing FFTs and related calculations. This study compares the performance of the two main FFT libraries on HPCx: IBM’s ESSL/PESSL and FFTW. Both serial and parallel (distributedmemory only) 3D complex-...
متن کاملJava Signal Processing : FFTs with
This paper investigates the possibility of using Java as a language for Digital Signal Processing. We compare the performance of the Fast Fourier Transform using Java interpreters, compilers, and native execution. To characterize the Java language as a platform for signal processing, we have implemented a traditional FFT algorithm in both C and Java and compared their relative performance. Addi...
متن کاملJava signal processing: FFTs with bytecodes
This paper investigates the possibility of using Java as a language for Digital Signal Processing. We compare the performance of the Fast Fourier Transform using Java interpreters, compilers, and native execution. To characterize the Java language as a platform for signal processing, we have implemented a traditional FFT algorithm in both C and Java and compared their relative performance. Addi...
متن کاملDynamically Reconfigurable FFTs for Cognitive Radio on a Multiprocessor Platform
Multiprocessor platforms have been proposed as an enabling technology for Cognitive Radio. In this paper, we explore various FFT implementations on a multiprocessor prototype platform as building components for OFDM based Cognitive Radio. The results show that our FFT implementations on the multiprocessor platform are fast and energy efficient. Moreover, the FFT implementations can be dynamical...
متن کاملLow Power FFT Architectures via Folding Transformation
This paper presents a technique to develop a low power parallel-pipelined architecture for Fast Fourier Transform (FFT). A technique to design FFT architectures via folding transformation and register minimization techniques is proposed. Both complex valued FFT (CFFT) and real valued FFT (RFFT) architectures can be derived using the proposed approach. The proposed architecture for RFFT helps to...
متن کامل