A throughput maximised parallel architecture for 2D fast Discrete Pascal Transform
نویسندگان
چکیده
In this paper, we present a fully pipelined parallel implementation of a two dimensional (2D) Discrete Pascal Transform (DPT). Our approach rst makes use of the properties of the Kronecker product and the vec operation on matrices to form an alternate 2D DPT representation suitable for column parallel computation. Next, we lend ourselves to the results from Skodras' work in 1D DPT to achieve the nal architecture for fast 2D DPT. With a fully pipelined implementation, the architecture possesses an initial latency of 2(N − 1) clock cycles and a maximum throughput of one complete two dimensional transform every clock cycle, given any input matrix of size N×N . To evaluate our work, our results obtained from actual FPGA implementation were benchmarked against results from other previous works.
منابع مشابه
A rescheduling and fast pipeline VLSI architecture for lifting-based discrete wavelet transform
A lifting based 2D DWT with efficient folded architecture and parallel scanning is being proposed. The architecture results in lesser hardware complexity and memory requirement due to multiplexing of 2 stages of lifting architecture. The 2D DWT architecture is realized by cascading two 2D processing elements. The coefficients for the lifting stage were chosen according with 9/7 filter. The 1D p...
متن کاملUnified Architecture for Discrete Fourier and Inverse Cosine Transforms
A unified architecture for radix-2 decimation-in-time fast Fourier transform and inverse discrete cosine transform of type II is presented based on constant geometry formulation of the algorithms. The parallelism of the architecture can be specified to meet various throughput and area requirements. The type and size of the transform in the architecture can be configured in run-time. In addition...
متن کاملA high-throughput, memory efficient architecture for computing the tile-based 2D discrete wavelet transform for the JPEG2000
In this paper, the design and implementation of an optimized hardware architecture in terms of speed and memory requirements for computing the tile-based 2D forward discrete wavelet transform for the JPEG2000 image compression standard, are described. The proposed architecture is based on a well-known architecture template for calculating the 2D forward discrete wavelet transform. This architec...
متن کاملHigh Throughput Reconfigurable Resource Sharing Parallel Architecture for a 2D Lifting Based DWT
Article history: Received 16 April 2015 Accepted 12 June 2015 Available online 1 July 2015
متن کاملDistributed Memory Parallel Architecture Based on Modular Linear Arrays for 2-D Separable Transforms Computation
A framework for mapping systematically 2-dimensional (2-D) separable transforms into a parallel architecture consisting of fully pipelined linear array stages is presented. The resulting model architecture is characterized by its generality, high degree of modularity, high throughput, and the exclusive use of distributed memory and control. There is no central shared memory block to facilitate ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computers & Electrical Engineering
دوره 36 شماره
صفحات -
تاریخ انتشار 2010