Fault-tolerant atomic computations in an object-based distributed system

Mustaque Ahamad, Partha Dasgupta, Richard J. LeBlanc

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

A distributed system can support fault-tolerant applications by replicating data and computation at nodes that have independent failure modes. We present a scheme called parallel execution threads (PET) which can be used to implement fault-tolerant computations in an object-based distributed system. In a system that replicates objects, the PET scheme can be used to replicate a computation by creating a number of parallel threads which execute with different replicas of the invoked objects. A computation can be completed successfully if at least one thread does not encounter any failed nodes and its completion preserves the consistency of the objects. The PET scheme can tolerate failures that occur during the execution of the computation as long as all threads are not affected by the failures. We present the algorithms required to implement the PET scheme and also address some performance issues.

Original languageEnglish (US)
Pages (from-to)69-80
Number of pages12
JournalDistributed Computing
Volume4
Issue number2
DOIs
StatePublished - Jun 1990
Externally publishedYes

Keywords

  • Atomicity
  • Distributed systems and replication
  • Fault-tolerant computing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Fault-tolerant atomic computations in an object-based distributed system'. Together they form a unique fingerprint.

Cite this