DMRlib: Easy-Coding and Efficient Resource Management for Job Malleability
نویسندگان
چکیده
Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with conventional static allocation policy. However, non-negligible additional development effort this solution imposes constrained its adoption by scientific programming community. In work, we present DMRlib, library designed offer advantages of process while providing minimalist MPI-like syntax. The includes series predefined communication patterns that greatly ease malleable applications. addition, deploy several scenarios demonstrate featuring different scalability patterns. Concretely, study two job submission modes (rigid moldable) order identify best-case for using metrics such as rate, completed jobs per second, energy consumption. experiments prove our elastic approach may improve throughput factor higher than 3x traditional workloads non-malleable jobs.
منابع مشابه
Contributions for Resource and Job Management in High Performance Computing Contributions for Resource and Job Management in High Performance Computing
High Performance Computing is characterized by the latest technological evolutions in computing architectures and by the increasing needs of applications for computing power. A particular middleware called Resource and Job Management System (RJMS), is responsible for delivering computing power to applications. The RJMS plays an important role in HPC since it has a strategic place in the whole s...
متن کاملEfficient Resource Management
The design of linear logic programming languages and theorem provers opens a number of new implementation challenges not present in more traditional logic languages such as Horn clauses (Prolog) and hereditary Harrop formulas (λProlog). Among these, the problem of efficiently managing the linear context when solving a goal is of crucial importance for the use of these systems in non-trivial app...
متن کاملEfficient Resource Management for Malleable Applications
In this paper we present a method for managing concurrent parallel applications on large shared-memory machines efficiently and fair. It combines advantages of space-sharing for tight coupled parallel applications and the possibility of immediate job execution in time-sharing environments. An application parallelism manager (APM) assigns system resources to running jobs on the fly. In case of c...
متن کاملHarnessing Virtual Machine Resource Control for Job Management
Virtual machine technology promises important benefits for grid computing and cluster batch job systems, including improved isolation, customizable workspaces, and support for checkpointing and migration. One way to gain these benefits is to “drill holes” in existing batch computing systems; however, we believe these new capabilities warrant a rethinking of the architectures of existing systems...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Computers
سال: 2021
ISSN: ['1557-9956', '2326-3814', '0018-9340']
DOI: https://doi.org/10.1109/tc.2020.3022933