A distributed execution environment for large data visualization
Authors
Abstract
Over the years, homogeneous computer clusters have been the most popular and, in some sense, the only viable platform for parallel visualization. In this work, we designed an execution environment for data-intensive visualization that is suited to handling SciDAC-scale datasets. This environment is based solely on computers distributed across the Internet that are owned and operated by independent institutions and openly shared for free. These Internet computers inherently have heterogeneous hardware configurations and run a variety of operating systems. Using 100 such processors, we obtained the same level of performance as a 64-node cluster of 2.2 GHz P4 processors while processing a 75 GB subset of TSI simulation data. Because it is inherently shared, this execution environment for data-intensive visualization could provide a viable means of collaboration among geographically separated SciDAC scientists.
Similar resources
Visualizing Distributed Data Structures
A new programming style for large-scale parallel programs centered around distributed data structures has emerged. The current parallel program visualization tools were intended for the old style and do not deal with distributed data structures. We show, with several examples of visualizations and animations developed for large scale pC++ programs, that visualizing and animating distributed dat...
The SCIRun Problem Solving Environment: Implementation Within a Distributed Environment
Introduction: Building systems that alter program behavior during execution based on user-specified criteria (computational steering systems) has been a recent research topic, particularly among the high-performance computing community. To enable a computational steering system with powerful visualization capabilities, such as SCIRun, to run in a distributed computational environment, a distributed infr...
Virtue: Performance Visualization of Parallel and Distributed Applications
High-speed, wide-area networks have made it both possible and desirable to interconnect geographically distributed applications that control distributed collections of scientific data, remote scientific instruments, and high-performance computer systems. Such an application might, for example, control a remote radio telescope, transmit raw data from the telescope site to a distribut...
Visualization, Execution Control and Replay of Massively Parallel Programs within Annai’s Debugging Tool
PDT is the Parallel Debugging Tool of the Annai programming environment developed within the Joint CSCS-ETH/NEC Collaboration in Parallel Processing. Like the other components of the integrated environment, PDT provides support for application developers to debug data-parallel programs written in HPF and message-passing programs based on the MPI standard. This paper describes how the PDT...
Visualization Systems and the Internet
The paper discusses how established dataflow visualization systems such as IRIS Explorer can be extended to take advantage of two new opportunities for visualization offered by the Internet. The first is the use of the Web as a distributed computing environment in which visualization services can be provided. The techniques fall roughly into two classes: client-based systems, where execution of...