Kernel-Level Caching for Optimizing I/O by Exploiting Inter-Application Data Sharing
نویسندگان
چکیده
With applications becoming larger and the increasing load on high performance systems, it is important to tackle the I/O bottleneck problem from several angles. It is not only essential to optimize the I/O accesses of any one application, but also to be able to identify and exploit opportunities resulting from the sharing of datasets across applications. Clusters are rapidly becoming the platform of choice for demanding applications due to their costeffectiveness and widespread deployment. Consequently, this paper attempts to optimize data sharing across applications concurrently executing on the cluster. Specifically, we propose and implement a kernel-level caching module at each node of a Linux cluster that can be used to service several processes of different applications. Using detailed evaluations on an actual Linux cluster, this paper demonstrates the benefits of this module in optimizing intra and inter-application I/O requests.
منابع مشابه
Block Asynchronous I/O: A Flexible Infrastructure for User-Level Filesystems
Block Asynchronous I/O (BAIO) is a mechanism that strives to eliminate the kernel abstraction of a filesystem. In-kernel filesystems try to serve all applications with a uniform generic set of policies, and consequently end up achieving sub-optimal performance on a majority of applications. Application level knowledge of the peculiarities of the data that is managed on disk is totally ignored b...
متن کاملThe Case Against User-Level Networking
Extensive research on system support for enabling I/O-intensive applications to achieve performance close to the limits imposed by the hardware suggests two main approaches: Low overhead I/O protocols and the flexibility to customize I/O policies to the needs of applications. One way to achieve both is by supporting user-level access to I/O devices, enabling user-level implementations of I/O pr...
متن کاملEnwrich: a Compute-processor Write Caching Scheme for Parallel File Systems Apratim Purakayastha 2 Enwrich: a Compute-processor Write Caching Scheme for Parallel File Systems *
Many parallel scientific applications need high-performance I/O. Unfortunately, end-to-end parallel-I/O performance has not been able to keep up with substantial improvements in parallel-I/O hardware because of poor parallel file-system software. Many radical changes, both at the interface level and the implementation level, have recently been proposed. One such proposed interface is collective...
متن کاملSpeckle Reduction in Synthetic Aperture Radar Images in Wavelet Domain Exploiting Intra-scale and Inter-scale Dependencies
Synthetic Aperture Radar (SAR) images are inherently affected by a multiplicative noise-like phenomenon called speckle, which is indeed the nature of all coherent systems. Speckle decreases the performance of almost all the information extraction methods such as classification, segmentation, and change detection, therefore speckle must be suppressed. Despeckling can be applied by the multilooki...
متن کاملNVMeDirect: A User-space I/O Framework for Application-specific Optimization on NVMe SSDs
The performance of storage devices has been increased significantly due to emerging technologies such as Solid State Drives (SSDs) and Non-Volatile Memory Express (NVMe) interface. However, the complex I/O stack of the kernel impedes utilizing the full performance of NVMe SSDs. The application-specific optimization is also difficult on the kernel because the kernel should provide generality and...
متن کامل