Collective I/O Optimizations for Adaptive Mesh Refinement Data Writes on Lustre File System
نویسندگان
چکیده
Adaptive mesh refinement (AMR) applications refine small regions of a physical space. As a result, when AMR data has to be stored in a file, writing data involves storing a large number of small blocks of data. Chombo is an AMR software library for solving partial differential equations over block-structured grids, and is used in large-scale climate and fluid dynamics simulations. Chombo’s current implementation for writing data on an AMR hierarchy uses several independent write operations, causing low I/O performance. In this paper, we investigate collective I/O optimizations for Chombo’s write function. We introduce Aggregated Collective Buffering (ACB) to reduce the number of small writes. We demonstrate that our approach outperforms the current implementation by 2× to 9.1× and the MPI-IO collective buffering by 1.5× to 3.4× on the Edison and Cori platforms at NERSC using the ChomboIO benchmark. Using the Darshan I/O characterization tool, we show that ACB makes larger contiguous writes than collective buffering at the POSIX level, and this difference gives ACB a significant performance benefit over collective buffering.
منابع مشابه
Recent Progress in Tuning Performance of Large-scale I/O with Parallel HDF5
Large-scale scientific simulations running on hundreds of thousands of cores produce massive amounts of data that often needs to be stored in files. Analysis applications run on thousands of cores to access data files in order to extract useful information. Both, simulation and analysis codes, require highlevel I/O libraries that offer superior data access performance for writing and reading da...
متن کاملLIOProf: Exposing Lustre File System Behavior for I/O Middleware
As parallel I/O subsystem in large-scale supercomputers is becoming complex due to multiple levels of software libraries, hardware layers, and various I/O patterns, detecting performance bottlenecks is a critical requirement. While there exist a few tools to characterize application I/O, robust analysis of file system behavior and associating file-system feedback with application I/O patterns a...
متن کاملA Scalable Message Passing Interface Implementation of an Ad-Hoc Parallel I/o system
In this paper we present the novel design, implementation, and evaluation of an ad-hoc parallel I/O system (AHPIOS). AHPIOS is the first scalable parallel I/O system completely implemented in the Message Passing Interface (MPI). The MPI implementation brings the advantages of portability, scalability and high performance. AHPIOS allows MPI applications to dynamically manage and scale distribute...
متن کاملA scalable MPI implementation of an ad-hoc parallel I/O system
This paper presents the novel design, implementation and evaluation of an ad-hoc parallel I/O system (AHPIOS). AHPIOS is the first scalable parallel I/O system completely implemented in Message Passing Interface (MPI). The MPI implementation brings the advantages of portability, scalability and high-performance. AHPIOS allows MPI applications to dynamically manage and scale distributed partitio...
متن کاملTuning HDF5 for Lustre File Systems
HDF5 is a cross-platform parallel I/O library that is used by a wide variety of HPC applications for the flexibility of its hierarchical object-database representation of scientific data. We describe our recent work to optimize the performance of the HDF5 and MPI-IO libraries for the Lustre parallel file system. We selected three different HPC applications to represent the diverse range of I/O ...
متن کامل