Fault Tolerance in Cloud Using Reactive and Proactive Techniques

نویسنده

  • V. M. Sivagami
چکیده

Fault tolerance plays a vital role in ensuring high serviceability and reliability in cloud. A lot of research is currently under way to analyze how cloud can provide fault tolerance for an application. The work proposes a reactive fault tolerant technique that uses check pointing to tolerate the fault. The work proposes a VMμ Checkpoint framework to protect both VMs and applications in the VMs against transient errors. The VM-μCheckpoint mechanism is implemented using CoW-PC (Copy on Write – Presave in cache) algorithm. The CoW-PC algorithm presaves all the tasks running on the VM’s in a cache memory. When there is any transient failure happening in VMs, it is noted and it is recovered using last presaved checkpoint from the cache memory. Once the tasks are executed successfully, the presaved checkpoints are deleted automatically from the cache memory. Thus the algorithm uses in place and in memory recovery of checkpoints that reduces the checkpoint overhead and improves the performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Survey on Fault Tolerance Techniques in Cloud Computing

Cloud computing becomes very popular these days and fault tolerance is a major issue in this cloud environment. This paper is dedicated to study different kinds of faults and various techniques for handling them. This survey mainly focuses on why fault tolerance is required in cloud computing and which techniques are used to handle these failures so that services can be delivered without delay ...

متن کامل

Improving the palbimm scheduling algorithm for fault tolerance in cloud computing

Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...

متن کامل

Using Proactive Fault-Tolerance Approach to Enhance Cloud Service Reliability

The large-scale utilization of cloud computing services for hosting industrial/enterprise applications has led to the emergence of cloud service reliability as an important issue for both cloud service providers and users. To enhance cloud service reliability, two types of fault tolerance schemes, reactive and proactive, have been proposed. Existing schemes rarely consider the problem of coordi...

متن کامل

A Replica Distribution Based Fault Tolerance Management For Cloud Computing

Cloud computing now a day's become most popular and reliable computing technique for organizations and individuals. In the cloud environments, data availability and backup replication are critical and complex issues in the an efficient fault tolerance policy is the major. Fault tolerance policy is the strategy in action when a fault occurs in the system. Taking backups is the one of the most us...

متن کامل

Second workshop on proactive failure avoidance, recovery, and maintenance (PFARM)

Proactive approaches to failure avoidance, recovery and maintenance have recently attracted increased interest among researchers and practitioners from various areas of dependable system design and operation. This first workshop provided a stimulating, and fruitful forum to foster collaboration among researchers working on proactive fault management, to discuss ideas, exchange experiences and t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015