A Survey of Fault Tolerance Technique in Distributed System

EasyChair Preprint 15425
7 pages • Date: November 14, 2024

Abstract

This paper presents a thorough survey of fault tolerance mechanisms in distributed systems. It examines potential failure factors, available mechanisms, and their foundations, focusing on mechanisms explicitly developed for distributed systems. It also summarizes how fault-tolerance techniques can be combined to provide various dependability characteristics. The primary goal of this paper is to serve as a guide to the extensive research and development activity in the domain of distributed systems. It examines current fault tolerance mechanisms and highlights future avenues for research, with the aim of helping researchers identify areas for further exploration and innovation, providing a roadmap for their future work, and emphasizing the significance of their contributions to the research community.

Keyphrases: Checkpointing technique, Reactive Fault Tolerance, Replication technique, distributed systems, fault tolerance, proactive fault tolerance