Fault - Containment in Self - Stabilizing Distributed Systems

نویسندگان

  • Arobinda Gupta
  • Sukumar Ghosh
  • Sriram V. Pemmaraju
چکیده

Self-stabilizing systems can automatically recover from arbitrary transient faults, and changes in the environment of the system, without any external intervention. However, in existing distributed self-stabilizing protocols, the performance of recovery is not linked to the severity of the fault. Recovery from failure at even a single component of the system may take a long time and aaect the operation of the entire system. Since at any given time, limited faults in a small number of components are more likely than faults in a large number of components, this limitation restricts the use of self-stabilizing protocols in practice. As a solution, we propose in this thesis the design of fault-containing self-stabilizing protocols, self-stabilizing protocols that provide additional performance guarantees from less severe faults. From limited faults that are expected to occur more frequently in practice, such protocols ensure very fast recovery, and aaects only the processes in a small region around the faults during the recovery. The eeects of limited faults are thus contained eeciently in both time and space. At the same time, self-stabilization ensures automatic recovery from occasional transient faults that may be more widespread in the system. These protocols are a step forward towards making self-stabilization more practical. As a rst step, we focus on fault-containment from single transient faults. However , our deenitions are applicable to containment from other subsets of limited faults as well. The rst part of this thesis presents a framework for deening and evaluating fault-containing self-stabilizing protocols. In particular, we deene important metrics 2 for evaluating the fault-containment properties of a self-stabilizing protocol. The second part of the thesis presents fault-containing self-stabilizing protocols for three important problems: leader election on a ring, construction of the spanning tree of a network, and construction of the breadth-rst-search tree of a network. Finally, we present a general technique to automatically transform a non-reactive self-stabilizing protocol into a fault-containing self-stabilizing protocol. This general technique sig-niicantly simpliies the design of fault-containing self-stabilizing protocols. Abstract approved: Thesis supervisor Title and department Date Thesis supervisor Title and department

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Composition of Self-Stabilizing Protocols Preserving the Fault-Containment Property

A desired property of large distributed systems is self adaptability against the faults that occur more frequently as the size of the distributed system grows. Self-stabilizing protocols provide autonomous recovery from finite number of transient faults. Fault-containing selfstabilizing protocols promise not only self-stabilization but also containment of faults (quick recovery and small effect...

متن کامل

Scalable fault-containing self-stabilization in dynamic networks

Self-stabilizing distributed systems provide a high degree of non-masking fault-tolerance. They recover from transient faults of any scale or nature without human intervention. In general, however, the time needed to recover from small-scale transient faults may not differ significantly from the time needed to recover from large-scale transient faults. Bounding the impact of small-scale faults ...

متن کامل

Self-stabilization with Byzantine tolerance for global tasks

Self-stabilization is a versatile approach to fault-tolerance since it permits a distributed system to recover from any transient fault that arbitrarily corrupts the contents of all memories in the system. Byzantine tolerance is an attractive feature of distributed systems that permits to cope with arbitrary malicious behaviors. Combining these two properties proved difficult: it is impossible ...

متن کامل

2-Edge-Connectivity and 2-Vertex-Connectivity with Fault Containment

Self-stabilization for non-masking fault-tolerant distributed system has received considerable research interest over the last decade. In this paper, we propose a self-stabilizing algorithm for 2-edge-connectivity and 2-vertex-connectivity of an asynchronous distributed computer network. It is based on a self-stabilizing depth-first search, and is not a composite algorithm in the sense that it ...

متن کامل

Preserving the Fault-Containment of Ring Protocols Executed on Trees

Reliable and fault-tolerant distributed systems have been attracting more and more attention (see Autonomic Computing Project by IBM, http://www-03.ibm.com/autonomic/). A self-stabilizing protocol is a fault-tolerant protocol that guarantees autonomous recovery from any number of and any type of faults that can affect the data stored locally at some process(es). If the impact of the faults can ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997