Architecting Dependable Systems with Proactive Fault Management
Abstract
Managing the ever-growing complexity of computing systems is a perennial challenge for computer system engineers. We argue that predictive technologies are needed to harness this complexity and to turn the vision of proactive system and failure management into reality. We describe proactive fault management, provide an overview and taxonomy of online failure prediction methods, and present a classification of failure-prediction-triggered methods. We present a model to assess the effects of proactive fault management on system reliability and show that overall dependability can be significantly enhanced. Having shown the methods and potential of proactive fault management, we describe a blueprint for how proactive fault management can be incorporated into a dependable system’s
Similar resources
Future study of Description System Architecture Approaches with Emphasis on Strategic Management
Systems Architecture is a generic discipline to handle objects (existing or to be created) called systems, in a way that supports reasoning about the structural properties of these objects. Systems Architecture is a response to the conceptual and practical difficulties of the description and the design of complex systems. Systems Architecture is a generic discipline to handle objects (existin...
Workshop on Architecting Dependable Systems
In comparison with the state of the art in the field of Web Services architectures and their composition, we propose to exploit the concept of CA Actions to enable the dependable composition of Web Services. CA Actions introduce a mechanism for structuring fault tolerant concurrent systems through the generalization of the concepts of atomic actions and transactions, and are adapted to the compo...
Computing Science: Architecting Fault Tolerant Systems
As building trustworthy (dependable) systems is one of the major challenges faced by software developers, dealing with various threats (such as errors, faults and failures) is becoming one of the main foci of software and system research and development. At the core of ensuring system dependability is acceptance of the fact that errors always happen in spite of all the efforts to eliminate faul...
Second workshop on proactive failure avoidance, recovery, and maintenance (PFARM)
Proactive approaches to failure avoidance, recovery and maintenance have recently attracted increased interest among researchers and practitioners from various areas of dependable system design and operation. This first workshop provided a stimulating and fruitful forum to foster collaboration among researchers working on proactive fault management, to discuss ideas, exchange experiences and t...
AA – A Software Architecture Aware Environment for Dependable Systems
Explicitly considering software architectural information at all times is now a recognized means for addressing software system dependability. In this paper we propose the basic ideas for AA, an architecture aware environment to improve software system dependability. It builds on ideas from architecting dependable systems, control engineering, and software product lines. AA supports fault toler...