
A Survey of Fault Tolerance Technique in Distributed System

EasyChair Preprint 15425

7 pages. Date: November 14, 2024

Abstract

This paper presents a thorough survey of fault tolerance mechanisms in distributed systems. It examines potential causes of failure, the available mechanisms, and their foundations, focusing on mechanisms developed explicitly for distributed systems. It also summarizes how fault tolerance techniques can be combined to provide various dependability characteristics. The primary goal is to serve as a guide to the extensive research and development activity in distributed systems: the paper examines current fault tolerance mechanisms and highlights avenues for future research, helping researchers identify areas for further exploration and innovation, offering a roadmap for their future work, and emphasizing the significance of their contributions to the research community.

Keyphrases: Checkpointing technique, Reactive Fault Tolerance, Replication technique, distributed systems, fault tolerance, proactive fault tolerance

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:15425,
  author    = {Rand Alamleh and Nammer El Emam},
  title     = {A Survey of Fault Tolerance Technique in Distributed System},
  howpublished = {EasyChair Preprint 15425},
  year      = {EasyChair, 2024}}