The Improved Unsymmetric Lanczos Process on Massively Distributed Memory Computers
نویسنده
چکیده
|For the eigenvalues of a large and sparse unsym-metric coeecient matrix, we propose an improved version of the unsymmetric Lanczos process combining elements of numerical stability and parallel algorithm design. Stability is obtained by a coupled two-term recurrences procedure that generates Lanczos vectors scaled to unit length. The algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step are independent and communication time required for inner product can be overlapped eeciently with computation time. Therefore , the cost of global communication on parallel distributed memory computers can be signiicantly reduced. The resulting algorithm maintains the favorable properties of the Lanczos process while not increasing computational costs. In this paper, a simple theoretical model of computation and communications phases is presented to allow us to give a qualitative analysis of the parallel performance with two-dimensional grid topology. The theoretical results in the performance is demonstrated by experimental timing results from nite element models carried out on massively parallel distributed memory computer Parsytec GC/PowerPlus.
منابع مشابه
Efficient Implementation of the Improved Unsymmetric Lanczos Process on Massively Distributed Memory Computers
For the eigenvalues of a large and sparse unsymmetric co-eecient matrix, we have proposed an improved version of the unsymmetric Lanczos process combining elements of numerical stability and parallel algorithm design. The algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step are independent and communication time required for inner produc...
متن کاملThe Improved Quasi - Minimal Residual
For the solutions of linear systems of equations with unsymmetric coeecient matrices, we propose an improved version of the quasi-minimal residual (IQMR) method by using the Lanczos process as a major component combining elements of numerical stability and parallel algorithm design. For Lanczos process, stability is obtained by a coupled two-term procedure that generates Lanczos vectors normali...
متن کاملThe Improved BiCG Method for Large and Sparse Linear Systems on Parallel Distributed Memory Architectures
For the solutions of large and sparse linear systems of equations with unsymmetric coeÆcient matrices, we propose an improved version of the BiConjugate Gradient method (IBiCG) method based on [5, 6] by using the Lanczos process as a major component combining elements of numerical stability and parallel algorithm design. For Lanczos process, stability is obtained by a coupled two-term procedure...
متن کاملEfficient Implementation of the Improved Quasi-Minimal Residual Method on Massively Distributed Memory Computers
For the solutions of linear systems of equations with un-symmetric coeecient matrices, we has proposed an improved version of the quasi-minimal residual (IQMR) method by using the Lanczos process as a major component combining elements of numerical stability and parallel algorithm design. For Lanczos process, stability is obtained by a couple two-term procedure that generates Lanczos vectors sc...
متن کاملA Parallel Version of the Unsymmetric Lanczos Algorithm and its Application to QMR
A new version of the unsymmetric Lanczos algorithm without look-ahead is described combining elements of numerical stability and parallel algorithm design. Firstly, stability is obtained by a coupled two-term procedure that generates Lanczos vectors scaled to unit length. Secondly, the algorithm is derived by making all inner products of a single iteration step independent such that global sync...
متن کامل