Pact – A Fault Tolerant Parallel Programming Environment
Abstract
Pact is a parallel programming environment that relieves the programmer of those burdens of parallel programming that are not really necessary for writing efficient parallel programs. It does so by providing a simple synchronization model and virtual shared data with user-defined granularity and automatic consistency control. Pact guarantees user-transparent fault tolerance with low overhead by using atomic actions as the basic units of parallel execution. Additionally, the runtime system maps parallel actions to server processes using dynamic load balancing. An included on-line visualization tool helps with tuning and debugging parallel programs.
Related resources
Fault Tolerance Lessons Applied to Parallel Computing
This paper describes an approach to fault-tolerant parallel computing which is based on the experiences with the most successful fault-tolerant software – the transaction processing systems. The algorithms presented here have less runtime overhead and faster recovery than most preceding approaches. In the Pact parallel programming environment fault tolerance is provided fully user transparent i...
Fault-Tolerant Parallel Programming with Atomic Actions
The Pact (parallel actions) parallel programming environment provides an easy-to-use parallel execution and synchronization model based on task parallelization. To give the programmer an abstraction for global data (even on distributed memory machines), the Pact runtime system uses virtual shared memory. Execution efficiency is improved with data-dependent dynamic load balancing and latency-ma...
The Pact Fault-Tolerant Parallel Programming Environment for the Paragon™
Parallel programming is hard enough, nevertheless parallel computers, which often are not far away from prototype status, have low reliability. This makes it difficult or sometimes impossible to successfully execute long-running applications, as they are typical for the ‘Grand-Chal-
Algorithm-based fault-tolerant programming in scientific computation on multiprocessors
Efficient parallel algorithms proposed to solve many fundamental problems in scientific computation are sensitive to processor failures. Because of its low costs, algorithm-based fault tolerance is an interesting concept for introducing fault tolerance into existing multi-processors. To facilitate fault-tolerant programming in scientific computation, we have modified and developed further an ...
Application Recovery in Parallel Programming Environment
In this paper, the fault-tolerance feature of the TOPAS parallel programming environment for distributed systems is presented. TOPAS automatically analyzes data dependences among tasks and synchronizes data, which reduces the time needed for parallel program development. TOPAS also provides support for scheduling, load balancing and fault tolerance. The main topic of this paper is to present the solut...