Simultaneous Logging and Replay for Recording Evidences of System Failures
نویسندگان
چکیده
As embedded systems take more important roles at many places, it is more important for them to be able to show the evidences of system failures. Providing such evidences makes it easier to investigate the root causes of the failures and to prove the responsible parties. This paper proposes simultaneous logging and replaying of a system that enables recording evidences of system failures. The proposed system employs two virtual machines, one for the primary execution and the other for the backup execution. The backup virtual machine maintains the past state of the primary virtual machine along with the log to make the backup the same state as the primary. When a system failure occurs on the primary virtual machine, the VMM saves the backup state and the log. The saved backup state and the log can be used as an evidence. By replaying the backup virtual machine from the saved state following the saved log, the execution path to the failure can be completely analyzed. We developed such a logging and replaying feature in a VMM. It can log and replay the execution of the Linux operating system. The experiment results show the overhead of the primary execution is only fractional.
منابع مشابه
A Methodological Note on Setting-up Logging and Replay Mechanisms in InfoVis Systems
Information Visualization needs longitudinal studies to assess the usefulness, usability, and, more generally, the value of its techniques. However, most of the longitudinal studies conducted so far have involved human resources to collect and analyze evidences. Automatic logging and session replay mechanisms can help answering questions while limiting human collection and analysis of data. In ...
متن کاملLightPlay: Efficient Replay with GPUs
Previous deterministic replay systems reduce the runtime overhead by either relying on hardware support or by relaxing the determinism requirements for replay. We propose LightPlay that fulfills stricter determinism requirements with low overhead without requiring hardware or OS support. LightPlay guarantees that the memory state after each instruction instance in a replay run is the same as in...
متن کاملInstant Recovery with Write-Ahead Logging: Page Repair, System Restart, and Media Restore
Traditional theory and practice of write-ahead logging and of database recovery techniques revolve around three failure classes: transaction failures resolved by rollback; system failures (typically software faults) resolved by restart with log analysis, “redo,” and “undo” phases; and media failures (typically hardware faults) resolved by restore operations that combine multiple types of backup...
متن کاملA Survey in Deterministic Replaying Approaches in Multiprocessors
most multithread executions acts nondeterministic while execute on multiprocessor systems. Recording nondeterministic events in such executions can enable deterministic replay. So some researchers are focused on deterministic replay capability in recording important information during executions. In this survey we introduce Hardware-Base, Software-Base and Software-Hardware approaches that have...
متن کامل“Faithless Replay” for Persistent Logless Mid-Tier Components
A goal for enterprise applications has been to provide “exactly once” execution regardless of system failures. This has classically required “stateless” applications that manage their states explicitly via transactional resource managers. Support for “stateful” applications requires the system to do more to manage state, which has been considered too difficult and costly. The Phoenix/App system...
متن کامل