Petascale turbulence simulation using a highly parallel fast multipole method on GPUs

نویسندگان

  • Rio Yokota
  • Lorena A. Barba
  • Tetsu Narumi
  • Kenji Yasuoka
چکیده

We present a 0.5 Petaflop/s calculation of homogeneous isotropic turbulence in a cube of 2048 particles, using a highly parallel fast multipole method (FMM) using 2048 GPUs on the TSUBAME 2.0 system. We compare this particle-based code with a spectral DNS code under the same calculation condition and the same machine. The results of our particlebased turbulence simulation match quantitatively with that of the spectral method. The calculation time for one time step is approximately 30 seconds for both methods; this result shows that the scalability of the FMM starts to become an advantage over FFT-based methods beyond 2000 GPUs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FMM-based vortex method for simulation of isotropic turbulence on GPUs, compared with a spectral method

The Lagrangian vortex method offers an alternative numerical approach for direct numerical simulation of turbulence. The fact that it uses the fast multipole method (FMM)—a hierarchical algorithm for N-body problems with highly scalable parallel implementations—as numerical engine makes it a potentially good candidate for exascale systems. However, there have been few validation studies of Lagr...

متن کامل

Parallel Algorithms for Constructing Data Structures for Fast Multipole Methods

We present efficient algorithms to build data structures and the lists needed for fast multipole methods. The algorithms are capable of being efficiently implemented on both serial, data parallel GPU and on distributed architectures. With these algorithms it is possible to map the FMM efficiently on to the GPU or distributed heterogeneous CPU-GPU systems. Further, in dynamic problems, as the di...

متن کامل

Biomolecular electrostatics using a fast multipole BEM on up to 512 gpus and a billion unknowns

We present teraflop-scale calculations of biomolecular electrostatics enabled by the combination of algorithmic and hardware acceleration. The algorithmic acceleration is achieved with the fast multipole method (fmm) in conjunction with a boundary element method (bem) formulation of the continuum electrostatic model, as well as the bibee approximation to bem. The hardware acceleration is achiev...

متن کامل

Petascale molecular dynamics simulation using the fast multipole method on K computer

In this paper, we report all-atom simulations of molecular crowding – a result from the full node simulation on the “K computer”, which is a 10-PFLOPS supercomputer in Japan. The capability of this machine enables us to perform simulation of crowded cellular environments, which are more realistic compared to conventional MD simulations where proteins are simulated in isolation. Living cells are...

متن کامل

11b.5 a Highly Parallel Algorithm for Turbulence Simulations in Planetary Boundary Layers: Results with Meshes

Petascale computing (e.g., UCAR/JOSS, 2005) has the potential to alter the landscape of turbulence simulations in planetary boundary layers (PBLs). Increased computer power using O(104−105) or more processors will permit large-eddy simulations (LESs) of turbulent flows over a wide range of scales in realistic outdoor environments, for example, flow over hills, atmosphereland interactions (Patto...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Physics Communications

دوره 184  شماره 

صفحات  -

تاریخ انتشار 2013