Symmetric Pivoting in ScaLAPACK

نویسنده

  • Craig Lucas
چکیده

Recently codes have been developed for computing the Cholesky factorization with complete pivoting of a symmetric positive semidefinite matrix for the serial LAPACK library. In the parallel ScaLAPACK library there are only routines for the unpivoted factorization in the positive definite case and no algorithms use complete pivoting. We aim to assess the feasibility of complete pivoting in ScaLAPACK by implementing a parallel pivoted Cholesky routine. We discuss the steps needed to parallelize the existing serial code, and discuss the specific constraints of the data distribution and communication for ScaLAPACK. We present some experiments, comparing our code and the existing ScaLAPACK code, conducted on both a Cray XD1 and a Cray XT3. We show that on fewer processors our new code scales well and the pivoting overhead is small. However, the pivoting overhead increases with the number of processors, but decreases with problem size.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Cholesky Algorithm for Some Complex Symmetric Systems

Complex symmetric systems do not in general admit a Cholesky factorization without pivoting, as would be the case for hermitian systems. Nevertheless, for some complex symmetric systems, as those coming from the discretization of boundary integral formulations, pivoting can be avoided. We present a Cholesky factorization algorithm for such complex symmetric systems. We propose a LAPACK-style im...

متن کامل

Packed Storage Extension for ScaLAPACK

We describe an extension to ScaLAPACK for computing with symmetric (and hermitian) matrices stored in a packed form. This is similar to the compact storage for symmetric (and hermitian) matrices available in LAPACK [2]. This enables more efficient use of memory by storing only the lower or upper triangular part of a symmetric matrix. The capabilities include Choleksy factorization (PxSPTRF) and...

متن کامل

Department of Computer Science Technical Report CS - 98 - 385 Packed storage extension for ScaLAPACK

We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermi-tian) matrices stored in a packed form. The new code is built upon the ScaLAPACK routines for full dense storage for a high degree of software reuse. The original ScaLAPACK stores a symmetric matrix as a full matrix but accesses only the lower or upper triangular part. The new code enables more efficient use of mem...

متن کامل

Department of Computer Science Technical Report CS - 97 - 347 Packed storage extension for ScaLAPACK

We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermi-tian) matrices stored in a packed form. The new code is built upon the ScaLAPACK routines for full dense storage for a high degree of software reuse. The original ScaLAPACK stores a symmetric matrix as a full matrix but accesses only the lower or upper triangular part. The new code enables more efficient use of mem...

متن کامل

On Data Layout in the Parallel Block-jacobi Svd Algorithm with Pre–processing

An efficient version of the parallel two-sided block-Jacobi algorithm for the singular value decomposition of an m × n matrix A includes the pre-processing step, which consists of the QR factorization of A with column pivoting followed by the optional LQ factorization of the Rfactor. Then the iterative two-sided block-Jacobi algorithm is applied in parallel to the R-factor (or L-factor). Having...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006