A New Organization of Sparse Gauss Elimination for Solving PDEs

نویسندگان

  • Mo Mu
  • John R. Rice
چکیده

A new Gauss elimination algorithm is presented for solving sparse, nonsymmetric linear systems arising from partial differential equation (PDE) problems. It is particularly suitable for use on distributed memory message passing (DMMP) multiprocessor computers and it is presented and analyzed in this context. The objective of the algorithm is to exploit the sparsity (Le., reducing both computational and memory requirements) and sharply reduce the data structure manipulation overhead of standard sparse matrix algorithms. The algorithm is based on the nested dissection approach, which starts with a large set of very sparse, completely independent subsystems and progresses in stages to a single. nearly dense system at the last stage. The computational efforts of each stage are roughly equal (almost exactly equal for model problems), yet the data structures appropriate for the first and last stages are quite different. Thus we use different types of data structures and algorithm components at different stages of the solution. .. Supported by NSF grant CCR-8619817. ..... Supported in part by AFOSR grant 88-0243 and the Strategic Defense Initiative through ARO conlnlCl DAAG03-86-K-OI06. L INTRODUCTION Solving linear PDEs naturally generates large, sparse linear systems of equations to solve. These systems have structures which are not exploited by general purpose sparse matrix algorithms and we present a new organization of sparse Gauss elimination tailored to e;qIloit-these structures. We stan with some general background comments on sparse rnattix melhods for PDE problems. The linear systems are almost always created by the PDE solving system with the equations (manix rows) distributed among the processors. One has the freedom to choose the assignment of rows to processors, but one cannot choose to have columns assigned to processors wiilieut the high expense of performing a matrix transpose (or equivalent) on a DMMP machine. The linear systems are conunonly non-symmetric so that a solver of symmetric systems is applicable to a limited class of PDE problems and/or discretization methods. The lack of symmetry requires two data srructures on a DMMP machine, one each for the row and column sparsities. One can combine these in clever ways, but it is prohibitedly expensive to repeatedly obtain column sparsity information from the row sparsity structure. Note that similar sparsity patterns occur in other imponant applications (e.g., least squares problems [Rice, 1984]) and the considerations studied here for PDE problems are also relevant there. Symbolic factorization is not well suited for non-symmetric systems as. so far, the techniques generate much too large a data structure. Merging the symbolic factorization with the numerical computation is not inherently more expensive than doing these separately and, for non-symmetric systems, it allows the final data structure to be just the required size for the system. This dynamic data structure creation is used in the parallel sparse algorithm of [Mu and Rice, 1990.]. If nested dissection is performed geometrically rather than algebraically (as is natural in PDE problems), then a great deal of matrix structure is known "a priori" from the geometric structure and need not be explicitly expressed in the sparse matrix data structure. This idea is exploited in parallel sparse [Mu and Rice, 1989a] to some extent. There are two general row oriented organizations of Gauss elimination focusing on what happens when one eliminates an unknown from a pivot equation. Theyare/an·in and fan~our schemes. Another imponant organization is the mulrlfrontal scheme which is unknown oriented (using both rows and columns associated with unknowns). The fan-in organization processes the LV factorization row by row as follows.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Organization of Sparse Gauss Elimination for Solving PDEs on Distributed Memory Machines

A computational arrangement of Gauss elimination is presented for solving sparse, nonsymmetric linear systems arising from partial differential equation (PDE) problems. It is particularly targeted for use on distributed memory message passing (DMMP) multiprocessor computers and it is presented and analyzed in this context. The objective of the algorithm is to exploit the sparsity (i.e., reducin...

متن کامل

Solving Linear Systems with Sparse Matrices on Hypercubes

We investigate parallel Gauss elimination for sparse matrices, especially those arising from the discretization of PDEs. We propose an approach which combines minimum degree ordering, nested dissection, domain decomposition and multifront techniques. Neither symbolic factorization nor explicit representation of elimination trees are needed. An effective and economic dynamic data structure is pr...

متن کامل

Dual Face Algorithm Using Gauss-jordan Elimination for Linear Programming

The dual face algorithm uses Cholesky factorization, as would be not very suitable for sparse computations. The purpose of this paper is to present a dual face algorithm using Gauss-Jordan elimination for solving bounded-variable LP problems.

متن کامل

A New Modification of Legendre-Gauss Collocation Method for Solving a Class of Fractional Optimal Control Problems

In this paper, the optimal conditions for fractional optimal control problems (FOCPs) were derived in which the fractional differential operators defined in terms of Caputo sense and reduces this problem to a system of fractional differential equations (FDEs) that is called twopoint boundary value (TPBV) problem. An approximate solution of this problem is constructed by using the Legendre-Gauss...

متن کامل

PARALLEL SPARSE: Data Structure and Organization

PARALLEL SPARSE is an algorithm for the direct solution of general sparse linear systems using Gauss elimination. It is designed for distributed memory machines and has been implemented on the NCUBE-7, a hypercube machine with 128 processors. The algorithm is intended to be particularly efficient for linear systems arising from solving partial differential equations using domain decomposition w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013