A Survey on Malleability Solutions for High-Performance Distributed Computing
نویسندگان
چکیده
Maintaining a high rate of productivity, in terms completed jobs per unit time, High-Performance Computing (HPC) facilities is cornerstone the next generation exascale supercomputers. Process malleability presented as straightforward mechanism to address that issue. Nowadays, vast majority HPC are intended for distributed-memory applications based on Message Passing (MP) paradigm. For this reason, many efforts Interface (MPI), de facto standard programming model. Malleability aims rescale executions on-the-fly, other words, reconfigure number and layout processes running applications. involves resources reallocation within system, handling application, redistributing data among those resume execution. This manuscript compiles how different frameworks process malleability, their main features, integration resource management systems, they may be used user codes. paper detailed state-of-the-art devised an entry point researchers who interested malleability.
منابع مشابه
A survey on impact of cloud computing security challenges on NFV infrastructure and risks mitigation solutions
Increased broadband data rate for end users and the cost of resource provisioning to an agreed SLA in telecom service providers, are forcing operators in order to adhere to employment Virtual Network Functions (VNF) in an NFV solution. The newly 5G mobile telecom technology is also based on NFV and Software Define Network (SDN) which inherit opportunities and threats of such constructs. Thus a ...
متن کاملCommunication Constructs for High Performance Distributed Computing
This paper is concerned with the impact of languagelevel communication mechanisms on the overall performance of distributed systems. Analytic models are developed of several existing mechanisms, including RPC and several enhancements. Using this modelling, a novel construct, called responsibilities, is synthesized. Analysis of the construct, guiding refinement of the initial design, demonstrate...
متن کاملDISCWorld: A Distributed High Performance Computing Environment
An increasing number of science and engineering applications require distributed and parallel computing resources to satisfy user response-time requirements. Distributed science and engineering applications require a high performance “middleware” which will both allow the embedding of legacy applications as well as enable new distributed programs, and which allows the best use of existing and s...
متن کاملA survey on resource allocation in high performance distributed computing systems
An efficient resource allocation is a fundamental requirement in high performance computing (HPC) systems. Many projects are dedicated to large-scale distributed computing systems that have designed and developed resource allocation mechanisms with a variety of architectures and services. In our study, through analysis, a comprehensive survey for describing resource allocation in various HPCs i...
متن کاملA Java Framework for Distributed High Performance Computing
The past few years have dramatically changed the view of high performance applications and computing. While traditionally such applications have been targeted towards dedicated parallel machines, we see the emerging trend of building \meta-applications" composed of several modules that exploit heterogeneous platforms and employ hybrid forms of parallelism. In particular, Java has been recognize...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2022
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app12105231