Symmetric Pivoting in ScaLAPACK
نویسنده
چکیده
Recently codes have been developed for computing the Cholesky factorization with complete pivoting of a symmetric positive semidefinite matrix for the serial LAPACK library. In the parallel ScaLAPACK library there are only routines for the unpivoted factorization in the positive definite case and no algorithms use complete pivoting. We aim to assess the feasibility of complete pivoting in ScaLAPACK by implementing a parallel pivoted Cholesky routine. We discuss the steps needed to parallelize the existing serial code, and discuss the specific constraints of the data distribution and communication for ScaLAPACK. We present some experiments, comparing our code and the existing ScaLAPACK code, conducted on both a Cray XD1 and a Cray XT3. We show that on fewer processors our new code scales well and the pivoting overhead is small. However, the pivoting overhead increases with the number of processors, but decreases with problem size.
منابع مشابه
A Cholesky Algorithm for Some Complex Symmetric Systems
Complex symmetric systems do not in general admit a Cholesky factorization without pivoting, as would be the case for hermitian systems. Nevertheless, for some complex symmetric systems, as those coming from the discretization of boundary integral formulations, pivoting can be avoided. We present a Cholesky factorization algorithm for such complex symmetric systems. We propose a LAPACK-style im...
متن کاملPacked Storage Extension for ScaLAPACK
We describe an extension to ScaLAPACK for computing with symmetric (and hermitian) matrices stored in a packed form. This is similar to the compact storage for symmetric (and hermitian) matrices available in LAPACK [2]. This enables more efficient use of memory by storing only the lower or upper triangular part of a symmetric matrix. The capabilities include Choleksy factorization (PxSPTRF) and...
متن کاملDepartment of Computer Science Technical Report CS - 98 - 385 Packed storage extension for ScaLAPACK
We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermi-tian) matrices stored in a packed form. The new code is built upon the ScaLAPACK routines for full dense storage for a high degree of software reuse. The original ScaLAPACK stores a symmetric matrix as a full matrix but accesses only the lower or upper triangular part. The new code enables more efficient use of mem...
متن کاملDepartment of Computer Science Technical Report CS - 97 - 347 Packed storage extension for ScaLAPACK
We describe a new extension to ScaLAPACK [2] for computing with symmetric (Hermi-tian) matrices stored in a packed form. The new code is built upon the ScaLAPACK routines for full dense storage for a high degree of software reuse. The original ScaLAPACK stores a symmetric matrix as a full matrix but accesses only the lower or upper triangular part. The new code enables more efficient use of mem...
متن کاملOn Data Layout in the Parallel Block-jacobi Svd Algorithm with Pre–processing
An efficient version of the parallel two-sided block-Jacobi algorithm for the singular value decomposition of an m × n matrix A includes the pre-processing step, which consists of the QR factorization of A with column pivoting followed by the optional LQ factorization of the Rfactor. Then the iterative two-sided block-Jacobi algorithm is applied in parallel to the R-factor (or L-factor). Having...
متن کامل