Search results for: fault recovery

Number of results: 262091

2017
Matthew C. Ruschmann John McGreevy

Fault management is one of the key technologies that enable distributed and disaggregated mission architectures wherein multiple vehicles work cooperatively and autonomously in a cluster or formation, a typical mission concept involving small satellites. In this paper, we describe a software architecture, called Separable Architecture for Fault Isolation and Recovery (SAFIR), which addresses fa...

2010
Gabriella Carrozza Roberto Natella

Roberto Natella, Università degli Studi di Napoli Federico II, Italy. Abstract: This paper proposes an approach to software faults diagnosis in complex fault tolerant systems, encompassing the phases of error detection, fault location, and system recovery. Errors are detected in the first phase, exploiting the operating system support. Faults are identified during the location phase, adopting o...

1997
Arobinda Gupta Sukumar Ghosh Sriram V. Pemmaraju

Self-stabilizing systems can automatically recover from arbitrary transient faults, and changes in the environment of the system, without any external intervention. However, in existing distributed self-stabilizing protocols, the performance of recovery is not linked to the severity of the fault. Recovery from failure at even a single component of the system may take a long time and affect the o...

1999
Srinidhi Varadarajan Tzi-cker Chiueh

EtheReal is a real-time Fast Ethernet switch architecture that provides bandwidth guarantees to distributed multimedia applications without OS and hardware modifications on the host machines. It implements true link-layer multicast, and offers a natural match to support network-layer QoS protocols such as RSVP. Because real-time performance guarantees fundamentally require state to be installed ...

2007
Anurag Dasgupta Sukumar Ghosh Xin Xiao

Research on fine tuning stabilization properties has received attention for nearly a decade. This paper presents a probabilistic algorithm for fault-containment, that confines the effect of any single fault to the immediate neighborhood of the faulty process, with an expected recovery time of O(∆). The most significant aspect of the algorithm is that the fault-gap, defined as the smallest inter...

Journal: IEEE Trans. Computers 2000
Frank Liberato Rami G. Melhem Daniel Mossé

Real-time systems are being increasingly used in several applications which are time-critical in nature. Fault tolerance is an essential requirement of such systems, due to the catastrophic consequences of not tolerating faults. In this paper, we study a scheme that guarantees the timely recovery from multiple faults within hard real-time constraints in uniprocessor systems. Assuming earliestd...

Journal: IJARAS 2011
Gabriella Carrozza Roberto Natella

This paper proposes an approach to software faults diagnosis in complex fault tolerant systems, encompassing the phases of error detection, fault location, and system recovery. Errors are detected in the first phase, exploiting the operating system support. Faults are identified during the location phase, through a machine learning based approach. Then, the best recovery action is triggered onc...

1999
Daniel Gil R. Martínez José V. Busquets-Mataix Juan Carlos Baraza Pedro J. Gil

This work presents a campaign of fault injection to validate the dependability of a fault tolerant microcomputer system. The system is duplex with cold stand-by sparing, parity detection and a watchdog timer. The faults have been injected on a chip-level VHDL model, using an injection tool designed with this purpose. We have carried out a set of injection experiments (with 3000 injections each)...

1989
Richard Harper Charles Stark

In a fault-tolerant parallel computer, a functional programming model can facilitate distributed checkpointing, error recovery, load balancing, and graceful degradation. Such a model has been implemented on the Draper Fault Tolerant Parallel Processor (FTPP). When used in conjunction with the FTPP's fault detection and masking capabilities, this implementation results in a graceful degradation...

2011
Aly Farahat Ali Ebnenasir

Most existing techniques for the design and implementation of fault tolerance use resource redundancy. As such, due to scarcity of resources, it is difficult to directly apply them for adding fault tolerance to sensor nodes in Wireless Sensor Networks (WSNs). Thus, it is desirable to develop techniques that implement fault tolerance under the constraints of memory and processing power of sensor...
