MAGNET: A Tool for Debugging, Analyzing and Adapting Computing Systems
Abstract
As computing systems grow in complexity, the cluster and grid communities require more sophisticated tools to diagnose, debug and analyze such systems. We have developed a toolkit called MAGNET (Monitoring Apparatus for General kerNel-Event Tracing) that provides a detailed look at operating-system kernel events with very low overhead. Using the fine-grained information that MAGNET exports from kernel space, challenging problems become amenable to identification and correction. In this paper, we first present the design, implementation and evaluation of MAGNET. Then, we show its use as a diagnostic tool, an online-monitoring tool and a tool for building adaptive applications in clusters and grids.
Related resources
Engineering Synthetic Trans-splicing Ribozyme Systems
Natural intron-like self-splicing ribozymes have been re-engineered to trans-splice two arbitrary RNA pieces together. This capability has potential to be tremendously useful for the synthetic biologist. We propose analyzing the suitability of these ribozymes as a tool for engineering biology by adapting trans-splicing ribozymes for use in measuring, debugging, patching, and building biological...
A High-Performance Sensor for Cluster Monitoring and Adaptation
As Beowulf clusters have grown in size and complexity, the task of monitoring the performance, status, and health of such clusters has become increasingly more difficult but also more important. Consequently, tools such as Ganglia and Supermon have emerged in recent years to provide the robust support needed for scalable cluster monitoring. However, the scalability comes at the expense of accur...
Using Complete System Simulation for Temporal Debugging of General Purpose Operating Systems and Workloads
Digital convergence is precipitating the addition of soft real-time applications to mainstream desktop and server operating environments. Most traditional debuggers for mainstream systems lack a notion of temporal correctness, making them unsuitable for real-time system design and analysis. We propose leveraging complete system simulation to build a temporal debugger capable of analyzing mixed ...
Improving the Resilience of Military Hospitals Through Self-Adaptation of Hospital Systems Using Organic Computing
Background and Aim: Among the failures of a disaster, the disruption of the critical infrastructure of the community causes the most damage to society. Therefore, the ability of critical infrastructure such as hospitals to anticipate, absorb, adapt or rapidly recover from a devastating event is essential. The purpose of this study is to design a self-adaptive model for resilient hospital system...