DXML: A High-performance Scientific Subroutine Library

نویسندگان

  • Chandrika Kamath
  • Roy Ho
  • Dwight P. Manley
چکیده

Mathematical subroutine libraries for science and engineering applications are an important tool in high-performance computing. By identifying and optimizing frequently used, numerically intensive operations, these libraries help in reducing the cost of computation, enhancing portability, and improving productivity. The Digital eXtended Math Library is a set of public domain and Digital proprietary software that has been optimized for high performance on Alpha systems. In this paper, DXML and the issues related to library software technology are described. Specific examples illustrate how algorithms can be optimized to take advantage of the architecture of Alpha systems. Modern algorithms that effectively exploit the memory hierarchy enable DXML routines to provide substantial improvements in performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Portable Subroutine Library for Solving Linear Control Problems on Distributed Memory Computers

This paper describes the design of a software library for solving the basic computational problems that arise in analysis and synthesis of linear control systems. The library is intended for use in high performance computing environments based on parallel distributed memory architectures. The portability of the library is ensured by using the BLACS, PBLAS, and ScaLAPACK as the basic layer of co...

متن کامل

Experiments with Strassen’s Algorithm: from Sequential to Parallel

This paper studies Strassen’s matrix multiplication algorithm by implementing it in a variety of methods: sequential, workflow, and in parallel. All the methods show better performance than the well-known scientific libraries for medium to large size matrices. The sequential recursive program is implemented and compared with ATLAS’s DGEMM subroutine. A workflow program in the NetSolve system an...

متن کامل

Design of the Scienti c Subroutine Library for the Fujitsu VPP 300

A research agreement between the Australian National University and Fujitsu Japan has led to the development of a library of parallel mathematical subrou-tines and the extension of the library of single processor routines for the Fujitsu VPP300. The Fujitsu VPP300 provides a very sophisticated architecture combining vector processors in parallel by a crossbar switch. Very high performance can b...

متن کامل

Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms

We describe the algorithms and architecture approach to produce high-performance codes for numerically intensive computations. In this approach, for a given computation, we design algorithms so that they perform optimally when run on a target machine-in this case, the new POWERS'" machines from the RSl6000 family of RISC processors. The algorithmic features that we emphasize are functional para...

متن کامل

Auto-tuning a Matrix Routine for High Performance

Well-written scientific simulations typically get tremendous performance gains by using highly optimized library routines. Some of the most fundamental of these routines perform matrix-matrix multiplications and related routines, known as BLAS (Basic Linear Algebra Subprograms). Optimizing these library routines for efficiency is therefore of tremendous importance for many scientific simulation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Digital Technical Journal

دوره 6  شماره 

صفحات  -

تاریخ انتشار 1994