OSCAR API v2.1: Extensions for an Advanced Accelerator Control Scheme to a Low-Power Multicore API

Authors

  • Keiji Kimura
  • Cecilia González-Álvarez
  • Akihiro Hayashi
  • Hiroki Mikami
  • Mamoru Shimaoka
  • Jun Shirako
  • Hironori Kasahara
Abstract

The number of cores in smartphones and tablet PCs is rapidly increasing along with the computational power these devices require. However, almost all applications on those devices do not yet use multiple cores for high-speed, low-power execution, because application development environments that let developers build parallelized applications easily and promptly are not available. In addition to developing parallelized applications, application developers are required to apply low-power techniques and to use accelerators such as GPUs efficiently. To provide a more productive application development environment for multicores, an automatic parallelizing compiler, the OSCAR compiler, has been developed; it parallelizes C and Fortran programs automatically by applying multi-grain parallelization, local-memory and cache optimization, and low-power optimization. Furthermore, the OSCAR API has also been developed as an interface between the OSCAR compiler and various kinds of shared-memory multicores, including homogeneous and heterogeneous manycores with SMP, cc-NUMA and PGAS architectures from various vendors such as ARM, Intel, IBM, AMD, Tilera, Fujitsu and Renesas Electronics. The OSCAR API v1.0 and v2.0 have been released, and their specifications are available from http://www.kasahara.cs.waseda.ac.jp/. The OSCAR API v1.0 supports fundamental thread control, memory control, DMA control, power control and flexible group barriers. The OSCAR API v2.0 supports various kinds of heterogeneous multicores. This paper describes the API extensions in OSCAR API v2.1, which include control schemes for asynchronously executable accelerators and hint directives for low-power optimizations, in addition to a brief review of the OSCAR API v1.0 and v2.0. The newly added accelerator control APIs realize a flexible, low-overhead accelerator control scheme that allows overlapped execution of CPUs, accelerators and DMA controllers.

Similar resources

OSCAR API for Real-Time Low-Power Multicores and Its Performance on Multicores and SMP Servers

OSCAR (Optimally Scheduled Advanced Multiprocessor) API has been designed for real-time embedded low-power multicores to generate parallel programs for various multicores from different vendors by using the OSCAR parallelizing compiler. The OSCAR API has been developed by Waseda University in collaboration with Fujitsu Laboratory, Hitachi, NEC, Panasonic, Renesas Technology, and Toshiba in an M...

Evaluation of Power Consumption at Execution of Multiple Automatically Parallelized and Power Controlled Media Applications on the RP2 Low-Power Multicore

This paper evaluates an automatic power reduction scheme of the OSCAR automatic parallelizing compiler, which has power reduction control capability, when multiple media applications parallelized by the OSCAR compiler are executed simultaneously on RP2, an 8-core multicore processor developed by Renesas Electronics, Hitachi, and Waseda University. OSCAR compiler enables the hierarchical multigrain parall...

Parallelizing Compiler Framework and API for Power Reduction and Software Productivity of Real-Time Heterogeneous Multicores

Heterogeneous multicores have been attracting much attention as a way to attain high performance while keeping power consumption low in a wide range of areas. However, heterogeneous multicores impose very difficult programming on developers. The long application program development period lowers product competitiveness. In order to overcome such a situation, this paper proposes a compilation framework which bridg...

Evaluation of Automatic Power Reduction with OSCAR Compiler on Intel Haswell and ARM Cortex-A9 Multicores

Reducing power dissipation is one of the most important issues that need to be addressed to improve the performance of all computing systems, such as supercomputers, cloud servers, desktop PCs, medical systems, and wearable devices. Exploiting parallelism and decreasing redundant power dissipation by fine grain power control for multicore/manycore systems are promising approaches, which can ens...

Hardware Design, Synthesis, and Verification of a Multicore Communications API

Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent hardware and software architectures are inevitable in future systems. One of the greatest problems in these systems is communication. Providing coherence, consistency, synchronization, and sharing of data in a multicore system requires th...


Publication date: 2013