Execution replay of parallel programs
Authors
Abstract
Debugging MIMD programs is often a delicate job: they can behave differently in successive executions, so cyclic debugging is not directly applicable. To make it available to parallel programmers, we propose execution replay (full and partial) for our multi-threaded execution model, the Communicating Active Components (CAC). CACs have been defined to implement Parallel Object Oriented Languages. This work is part of the PVC/BOX project, whose goal is a full parallel object-oriented environment. The execution replay mechanism is the basic tool for building usable parallel debugging tools.
Similar resources
LeBlanc and Mellor-Crummey: Debugging Parallel Programs with Instant Replay
The debugging cycle is the most common methodology for finding and correcting errors in sequential programs. Cyclic debugging is effective because sequential programs are usually deterministic. Debugging parallel programs is considerably more difficult because successive executions of the same program often do not produce the same results. In this paper we present a general solution for reprodu...
An Efficient Logical Clock for Replaying Message-Passing Programs
Cyclic debugging is one of the most important and most commonly used activities in program development. During cyclic debugging, the program is repeatedly re-executed to track down errors once a failure has been observed. The cyclic debugging approach often fails for parallel programs because they exhibit nondeterministic behavior due to message race conditions. Execution re...
A Taxonomy of Distributed Debuggers Based on Execution Replay
This paper presents a taxonomy of parallel and distributed debuggers based on execution replay. Programming of distributed and parallel systems is a complex task. Amongst the many factors contributing to this complexity, the nondeterminacy of these systems is an important one. Execution replay is a technique developed to facilitate the debugging of nondeterministic programs. Execution replay h...
Parallel Program Debugging based on Data-Replay
The nondeterministic nature of parallel programs is the major difficulty in debugging them. Order-replay, a technique to solve this problem, is widely used because of its small overhead. It has, however, several serious drawbacks: all processes of the parallel program have to participate in replay even when some of them are clearly not involved with the bug; and the programmer cannot stop the process be...
Visualization, Execution Control and Replay of Massively Parallel Programs within Annai’s Debugging Tool
PDT is the Parallel Debugging Tool of the Annai programming environment developed within the Joint CSCS-ETH/NEC Collaboration in Parallel Processing. Similarly to the other components of the integrated environment, PDT provides support for application developers to debug data-parallel programs written in HPF, and message-passing programs based on the MPI standard. This paper describes how the PDT...