High Performance Parallel LOBPCG Method for Large Hamiltonian Derived from Hubbard Model on Multi-GPU Systems
نویسندگان
چکیده
Abstract The physical property of the Hubbard model can be understood by solving eigenvalue problem for Hamiltonian derived from model. Since is a large sparse matrix, an iteration method usually utilized problems. One effectual solvers this LOBPCG (Locally Optimal Block Preconditioned Conjugate Gradient) method. tuning strategies on GPU systems when all vectors are stored in device memory have been proposed. In research, we propose parallel multi-GPU system and some host memory. When used multi eigenpairs (eigenvalues corresponding eigenvectors), number vectors, whose size same as dimension Hamiltonian, proportional to eigenpairs. On other hand, consumption non-zero elements significantly reduced considering regular arrangement elements. Therefore, execute GPUs, transferred between needed. cost data transfer very large, also optimization it. simulation result shows that effective high performance computing.
منابع مشابه
High performance MRI simulations of motion on multi-GPU systems
BACKGROUND MRI physics simulators have been developed in the past for optimizing imaging protocols and for training purposes. However, these simulators have only addressed motion within a limited scope. The purpose of this study was the incorporation of realistic motion, such as cardiac motion, respiratory motion and flow, within MRI simulations in a high performance multi-GPU environment. ME...
متن کاملCommunication Avoiding Neumann Expansion Preconditioner for LOBPCG Method: Convergence Property of Exact Diagonalization Method for Hubbard Model
Application of Eisenstat-SSOR Preconditioner to Realistic Stress Analysis Problem by Parallel Cache-Cache Computing Kuniyoshi Abe1, Seiji Fujino2 1Faculty of Economics and Information, Gifu Shotoku University, Japan; 2Professor Emeritus, Kyushu University, Japan Communication avoiding Neumann expansion preconditioner for LOBPCG method: Convergence property of exact diagonalization method for Hu...
متن کاملA Multi Objective Optimization Model for Redundancy Allocation Problems in Series-Parallel Systems with Repairable Components
The main goal in this paper is to propose an optimization model for determining the structure of a series-parallel system. Regarding the previous studies in series-parallel systems, the main contribution of this study is to expand the redundancy allocation parallel to systems that have repairable components. The considered optimization model has two objectives: maximizing the system mean time t...
متن کاملPerformance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems
The Kernel Polynomial Method (KPM) is a wellestablished scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is possible to decouple the spars...
متن کاملPerformance Prediction for Large Scale Parallel Systems
In both the design of parallel computer systems and the development of applications, it is very important to have good performance prediction tools. This paper describes a new approach -PetaSIM, which is designed for the rapid prototyping stage of machine or application design. Computers, networks and applications are described as objects in a Java IDL (Interface Definition Language) with speci...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-10419-0_1