A Dynamic Replica Selection Algorithm for Tolerating Timing Faults
Abstract
Server replication is commonly used to improve the fault tolerance and response time of distributed services. An important problem when executing time-critical applications in a replicated environment is that of preventing timing failures by dynamically selecting the replicas that can satisfy a client’s timing requirement, even when the quality of service is degraded due to replica failures and excess load on the server. In this paper, we describe the approach we have used to solve this problem in AQuA, a CORBA-based middleware that transparently replicates objects across a local area network. The approach we use estimates a replica’s response time distribution based on performance measurements regularly broadcast by the replica. An online model uses these measurements to predict the probability with which a replica can prevent a timing failure for a client. A selection algorithm then uses this prediction to choose a subset of replicas that can together meet the client’s timing constraints with at least the probability requested by the client. We conclude with experimental results based on our...
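The selection idea in the abstract can be illustrated with a minimal Python sketch. It is not AQuA's actual algorithm: it assumes an empirical estimate of each replica's on-time probability from a window of recent response times, assumes replicas respond independently, and uses a simple greedy rule. The names `on_time_probability`, `select_replicas`, and the `history` dictionary format are hypothetical, introduced only for illustration.

```python
def on_time_probability(samples, deadline):
    """Empirical estimate of P(response time <= deadline) from recent samples."""
    if not samples:
        return 0.0
    return sum(1 for s in samples if s <= deadline) / len(samples)


def select_replicas(history, deadline, min_probability):
    """Greedily add replicas until P(at least one meets the deadline) is high enough.

    history: dict mapping replica name -> list of recent response times
             (hypothetical format, not taken from the paper).
    Assumes replicas respond independently, which is a simplification.
    Returns (selected_names, achieved_probability). If the target cannot be
    reached, all replicas are selected and the achieved value is below it.
    """
    # Rank replicas from most to least likely to answer before the deadline.
    ranked = sorted(
        ((on_time_probability(s, deadline), name) for name, s in history.items()),
        reverse=True,
    )
    selected = []
    miss = 1.0  # probability that every selected replica misses the deadline
    for p, name in ranked:
        if 1.0 - miss >= min_probability:
            break
        selected.append(name)
        miss *= 1.0 - p
    return selected, 1.0 - miss


if __name__ == "__main__":
    history = {
        "r1": [80, 90, 120, 95],
        "r2": [100, 150, 70, 60],
        "r3": [200, 210, 190, 220],
    }
    print(select_replicas(history, deadline=100, min_probability=0.9))
```

Under the independence assumption, the probability that at least one selected replica responds in time is one minus the product of the individual miss probabilities, which is why the loop tracks `miss` rather than summing estimates.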
Related resources
Performance Evaluation of a Probabilistic Replica Selection Algorithm
When executing time-sensitive distributed applications, a middleware that provides dependability and timeliness is faced with the important problem of preventing timing failures both under normal conditions and when the quality of service is degraded due to replica failures and transient overload on the server. To address this problem, we have designed a probabilistic model-based replica select...
Improve Replica Placement in Content Distribution Networks with Hybrid Technique
The increasing use of the Internet and its accelerated growth reduce the available network bandwidth and server capacity; as a result, the quality of Internet services becomes unacceptable to users, even though efficient and effective delivery of web content plays an important role in improving performance. Content distribution networks were introduced to address this issue. Replicatin...
Mälardalen University
In this paper we present an approach to the design optimization of fault-tolerant embedded systems for safety-critical applications. Processes are statically scheduled and communications are performed using the time-triggered protocol. We use process re-execution and replication for tolerating transient faults. Our design optimization approach decides the mapping of proc...
Dynamic Data Grid Replication Algorithm Based on Weight and Cost of Replica
A Data Grid is composed of a large number of distributed computation and storage resources and facilitates the efficient management of huge, distributed, shared data resources. Dynamic replication can reduce the file storage time and use the grid resources effectively in a Data Grid environment. The Data Grid topology is divided into three layers: Regional level, LAN level, the grid site l...
Adaptive timeliness of consensus in presence of crash and timing faults
The ∆-timed uniform consensus is a stronger variant of the traditional consensus and it satisfies the following additional property: Every correct process terminates its execution within a constant time ∆ (∆-timeliness), and no two processes decide differently (Uniformity). In this paper, we consider the ∆-timed uniform consensus problem in presence of fc crash processes and ft timing-faulty pr...
Publication date: 2001