Monitoring Parallel Programs for Performance Tuning in Cluster Environments
نویسنده
چکیده
This chapter surveys the issues of collecting monitoring data for performance debugging of parallel programs executed in cluster environments. The main categories of monitoring tools are either clock driven or event driven, the latter including timing, counting or tracing. The paper focuses on software tracing, deemed the most general and portable event driven monitoring technique. The lack of global clock and the tracing intrusion are identified as the two major problems hindering the quality of the traced information collected by software tracers. A global time implementation by software and an approach for compensating the software tracing intrusion are then described.
منابع مشابه
Topic 1 - Support Tools and Environments
Nowadays parallel distributed programmers use different tools and environments that facilitate the design, programming, testing, debugging and performance analysis and tuning of their applications. However, they do not satisfy all user requirements, such as broad usability, high effectiveness and proper accuracy. Therefore new propositions are still being developed and their properties tested o...
متن کاملParallel computing using MPI and OpenMP on self-configured platform, UMZHPC.
Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...
متن کاملPerformance Measurement, Visualization and Modeling of Parallel and Distributed Programs using the AIMS Toolkit
Writing large-scale parallel and distributed scientific applications that make optimum use of the multiprocessor is a challenging problem. Typically, computational resources are underused due to performance failures in the application being executed. Performance-tuning tools are essential for exposing these performance failures and for suggesting ways to improve program performance. In this pap...
متن کاملA Run-Time Performance Monitor for Message-Passing Parallel Programs
A fundamental problem with parallel program monitoring tools is the intrusiveness introduced by instrumenting a monitor into the original program for collecting trace data. An intrusive monitor may alter the order of program events and distort the timing information. This may result in a change in the probability of making particular non-deterministic choices, and thus change the program behavi...
متن کاملInteractive Debugging and Performance Analysis of Massively Parallel Applications
In the eld of high performance computing, massively parallel processing systems (MPPs) get more and more important. A rising number of complex applications is parallelized for execution on these machines. Still a signiicant portion of the time needed for parallelization is spent for the process of debugging and performance tuning. A main reason for this fact is the absence of adequate tools sup...
متن کامل