Overcoming Bandwidth Limitations in Visual Computing
Abstract
Because visual computations are very data intensive, they are often limited by the bandwidth of the system rather than by its peak computational performance. The trend towards many-core architectures exacerbates the problem, because parallel cores let the compute capability grow exponentially while the system bandwidth increases only linearly. At the core of the bandwidth problem in visual computing are iterative loops over discrete local operators. A typical representative is a stencil computation in the case of constant weights, or a sparse matrix-vector product in the case of variable weights. These computations are bandwidth limited because of their low arithmetic intensity. We have developed techniques that accelerate these loops beyond the peak bandwidth limit. While different cache-friendly approaches have been tried in the past, they have never been so successful. For example, our quad-core Xeon X5482 3.2 GHz system reaches 22.3 GFLOPS in double precision on a stencil computation in registers. On a large 3D domain of 504^3 doubles and 100 iterations, a hand-vectorized, single-threaded naive stencil implementation achieves 1.6 GFLOPS, and the multi-threaded version brings no improvement because the system memory bandwidth limits the performance. Pluto [?], a state-of-the-art automatic loop transformation framework, achieves 1.9 GFLOPS for this stencil computation with four threads. In comparison, our scheme already performs at 5.3 GFLOPS with a single thread and soars to 13.0 GFLOPS with four threads.
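To make the notion of low arithmetic intensity concrete, the following is a minimal sketch of a naive iterative stencil loop. It is not the scheme described in the abstract: it is only an illustrative baseline, reduced to one dimension for brevity. The function names, the weights `(0.25, 0.5, 0.25)`, and the traffic model (one double read and one double written per point per sweep, assuming neighbours are reused from cache) are assumptions made for this example.

```python
def stencil_step(u, w=(0.25, 0.5, 0.25)):
    """One sweep of a 1D 3-point stencil with constant weights.

    Boundary values are kept fixed. The weights are illustrative
    assumptions, not values taken from the paper.
    """
    out = list(u)
    for i in range(1, len(u) - 1):
        out[i] = w[0] * u[i - 1] + w[1] * u[i] + w[2] * u[i + 1]
    return out


def iterate(u, steps):
    """Naive iteration: every sweep streams the whole array once."""
    for _ in range(steps):
        u = stencil_step(u)
    return u


def arithmetic_intensity(flops_per_point, bytes_per_point):
    """Floating-point operations performed per byte of memory traffic."""
    return flops_per_point / bytes_per_point


# 3 multiplies + 2 adds = 5 flops per point per sweep.
# Assumed traffic: 8 bytes read + 8 bytes written per point per sweep.
naive_ai = arithmetic_intensity(5, 16)

if __name__ == "__main__":
    print(stencil_step([0.0, 0.0, 1.0, 0.0, 0.0]))
    print(naive_ai)
```

Under these assumptions the naive loop performs well under one floating-point operation per byte moved, so adding threads does not help once the memory bus is saturated. Raising the intensity requires applying several sweeps to data while it is still in cache or registers, which is the general direction the abstract points to.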