Towards Lightweight Logging and Replay of Embedded, Distributed Systems (Invited Paper)
Abstract
Because of their safety-critical nature, Cyber-Physical Systems such as collaborative cars or smart grids demand thorough testing and evaluation. However, debugging such systems during deployment is challenging because distributed systems are inherently concurrent and any deployed system offers only limited insight into its state. In this paper we introduce MILD, which provides Minimal Intrusive Logging and Deterministic replay. MILD enables the logging of events on deployed Cyber-Physical Systems and their deterministic replay in controlled environments such as system simulators. To illustrate the feasibility and low overhead of our architecture, we evaluate a prototype implementation based on Wireless Sensor Networks (WSN).
Similar resources
Replay Debugging for Distributed Applications
We have developed a new replay debugging tool, liblog, for distributed C/C++ applications. It logs the execution of deployed application processes and replays them deterministically, faithfully reproducing race conditions and non-deterministic failures, enabling careful offline analysis. To our knowledge, liblog is the first replay tool to address the requirements of large distributed systems: ...
Lightweight Message Logging Protocol for Distributed Sensor Networks
Among the many rollback-recovery protocols developed to provide fault tolerance for long-running distributed applications, sender-based message logging with checkpointing is one of the most lightweight techniques applicable in this field, significantly decreasing the high failure-free overhead of synchronous logging by using the message sender's volatile memory as...
LightPlay: Efficient Replay with GPUs
Previous deterministic replay systems reduce the runtime overhead by either relying on hardware support or by relaxing the determinism requirements for replay. We propose LightPlay, which fulfills stricter determinism requirements with low overhead without requiring hardware or OS support. LightPlay guarantees that the memory state after each instruction instance in a replay run is the same as in...
Reduced Overhead Logging for Rollback Recovery in Distributed Shared Memory
Rollback techniques that use message logging and deterministic replay can be used in parallel systems to recover a failed node without involving other nodes. Distributed shared memory (DSM) systems cannot directly apply message-passing logging techniques because they use inherently nondeterministic asynchronous communication. This paper presents new logging schemes that reduce the typically hig...