Designing Efficient Algorithms for the Eventually Perfect Failure Detector Class

نویسندگان

  • Mikel Larrea
  • Alberto Lafuente
  • Iratxe Soraluze Arriola
  • Roberto Cortiñas
  • Joachim Wieland
چکیده

This paper focuses on the design of unreliable failure detectors of the Eventually Perfect class (3P) in crash-prone partially synchronous systems. We adopt a monitoring mechanism based on heartbeats over a logical ring arrangement of processes as the common design feature. This provides good communication efficiency, a performance parameter which refers to the number of links that carry messages forever. We follow two different approaches that result in two families of failure detectors: a nearly communication-efficient family, which uses n + C links forever, being C the number of correct processes out of the n processes in the system, and a communicationefficient family, which uses only n links forever. Besides communication efficiency, we evaluate the algorithms in terms of QoS parameters, which include the capability of the failure detector to provide right answers as well as its reaction time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implementing the Weakest Failure Detector for Solving Consensus

The concept of unreliable failure detector was introduced by Chandra and Toueg as a mechanism that provides information about process failures. This mechanism has been used to solve several agreement problems, like Consensus. In this paper, algorithms that implement failure detectors in partially synchronous systems are presented. First two simple algorithms of the weakest class to solve Consen...

متن کامل

Stable Leader Election

We introduce the notion of stable leader election and derive several algorithms for this problem. Roughly speaking, a leader election algorithm is stable if it ensures that once a leader is elected, it remains the leader for as long as it does not crash and its links have been behaving well, irrespective of the behavior of other processes and links. In addition to being stable, our leader elect...

متن کامل

Stabilizing Dining with Failure Locality 1

The dining philosophers problem, or simply dining, is a fundamental distributed resource allocation problem. In large scale and longlived systems, the likelihood of some process failing at some point is high, thus sparking interest in fault-tolerant versions of dining. We propose two algorithms for solving stabilizing dining with failure locality 1 in asynchronous shared-memory systems with reg...

متن کامل

An efficient failure detector for sparsely connected networks

We present an implementation of an eventually perfect failure detector for sparsely connected, partitionable networks, where each process has only a bounded number of neighbors. Processes and links may fail by crashing. Regarding synchrony, our algorithm only needs to know an upper bound on the jitter ε of the communication between direct neighbors. No a-priori knowledge about the number of pro...

متن کامل

Fail-Aware Failure Detectors

In existing asynchronous distributed systems it is impossible to implement failure detectors which are perfect, i.e. they only suspect crashed processes and eventually suspect all crashed processes. Some recent research has however proposed that any “reasonable” failure detector for solving the election problem must be perfect. We address this problem by introducing two new classes of fail-awar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JSW

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2007