Overcoming Bandwidth Limitations in Visual Computing

نویسندگان

  • Robert Strzodka
  • Mohammed Shaheen
  • Dawid Pajak
چکیده

Because visual computations are very data intensive they are often limited by the bandwidth of the system rather than its peak computational performance. The trend towards many-core architectures exacerbates the problem because the parallel cores let the compute capability grow exponentially while the system bandwidth increases only linearly. At the core of the bandwidth problem in visual computing are iterative loops over discrete local operators. A typical representative is a stencil computation with constants weights or a sparse matrix vector product in case of variable weights. This computations are bandwidth limited because of their low arithmetic intensity. We have developed techniques that accelerate these loops beyond the peak bandwidth limit. While different cache friendly approaches have been tried in the past, they have never been so successful. For example our quad-core Xeon X5482 3.2GHz system reaches 22.3 GFLOPS in double precision on a stencil computation in registers. On a large 3D domain of 504 doubles and 100 iterations a handvectorized single-threaded naive stencil implementation achieves 1.6 GFLOPS and there is no improvement in the multi-threaded version because the system memory bandwidth limits the performance. A state-of-art automatic loop transformation framework Pluto [?] achieves 1.9 GFLOPS for this stencil computation with four threads. In comparison, our scheme performs already at 5.3 GFLOPS with a single thread and soars to 13.0 GFLOPS with four threads.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Synchronization and Caching Solution for Cost-Effective E-Learning in Resource and Bandwidth Constrained Environments

Electronic learning (e-learning) content delivery and accessibility have received significant research attention over years in order to ensure reliability, availability and cost-effectiveness through Information and Communication Technologies (ICTs).The evolvement of mobile computing devices especially smartphones bring prospects in overcoming the inherent limitations of the Internet when acces...

متن کامل

Improving Bandwidth-power Efficiency of Homogeneous Wireless Networks Using On-meet Threshold Strategy (RESEARCH NOTE)

Over two decades, a problem of location dependent has been focused for improving the communication Bandwidth-Power Efficiency of homogeneous networks. The efficiencies of communication links are weakened by the Hidden Terminal Problem.  Thus we propose a Fine – Tune Strategy for analyzing the On-Off communication region. We were observed that the proposed technique had been able to track and mo...

متن کامل

Review of Mobile Cloud Computing Framework and Authentication Problems

Now days there are many innovations coming every day in the mobile applications in order to serve end users in best possible ways as well as use of cloud computing is an also increase with mobile applications which is called as mobile cloud computing. The mobile cloud computing is uses the services of cloud into environment of mobile applications for overcoming the many issues such as bandwidth...

متن کامل

Joint Allocation of Computational and Communication Resources to Improve Energy Efficiency in Cellular Networks

Mobile cloud computing (MCC) is a new technology that has been developed to overcome the restrictions of smart mobile devices (e.g. battery, processing power, storage capacity, etc.) to send a part of the program (with complex computing) to the cloud server (CS). In this paper, we study a multi-cell with multi-input and multi-output (MIMO) system in which the cell-interior users request service...

متن کامل

Overcoming the challenges of designing optimized broadband vibroseis sweeps

Our sweep design methodology centers on creating a frequency dependent force-amplitude (drive) function such that the drive-level at each frequency is maximised but does not exceed the mechanical limitations of the vibrator. Bagaini (2008) showed how we can overcome the reduction in energy from reducing the drive-level by increasing the instantaneous sweep rate. This methodology has commonly be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010