Fault-tolerant atomic computations in an object-based distributed system

Mustaque Ahamad, Partha Dasgupta, Richard J. LeBlanc

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

A distributed system can support fault-tolerant applications by replicating data and computation at nodes that have independent failure modes. We present a scheme called parallel execution threads (PET) which can be used to implement fault-tolerant computations in an object-based distributed system. In a system that replicates objects, the PET scheme can be used to replicate a computation by creating a number of parallel threads which execute with different replicas of the invoked objects. A computation can be completed successfully if at least one thread does not encounter any failed nodes and its completion preserves the consistency of the objects. The PET scheme can tolerate failures that occur during the execution of the computation as long as all threads are not affected by the failures. We present the algorithms required to implement the PET scheme and also address some performance issues.

Original languageEnglish (US)
Pages (from-to)69-80
Number of pages12
JournalDistributed Computing
Volume4
Issue number2
DOIs
StatePublished - Jun 1990
Externally publishedYes

Fingerprint

Fault-tolerant
Thread
Distributed Systems
Failure modes
Failure Mode
Vertex of a graph
Object
Replica
Completion

Keywords

  • Atomicity
  • Distributed systems and replication
  • Fault-tolerant computing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computational Theory and Mathematics

Cite this

Fault-tolerant atomic computations in an object-based distributed system. / Ahamad, Mustaque; Dasgupta, Partha; LeBlanc, Richard J.

In: Distributed Computing, Vol. 4, No. 2, 06.1990, p. 69-80.

Research output: Contribution to journalArticle

Ahamad, Mustaque ; Dasgupta, Partha ; LeBlanc, Richard J. / Fault-tolerant atomic computations in an object-based distributed system. In: Distributed Computing. 1990 ; Vol. 4, No. 2. pp. 69-80.
@article{2bd387a0652840ed9b422f677920611e,
title = "Fault-tolerant atomic computations in an object-based distributed system",
abstract = "A distributed system can support fault-tolerant applications by replicating data and computation at nodes that have independent failure modes. We present a scheme called parallel execution threads (PET) which can be used to implement fault-tolerant computations in an object-based distributed system. In a system that replicates objects, the PET scheme can be used to replicate a computation by creating a number of parallel threads which execute with different replicas of the invoked objects. A computation can be completed successfully if at least one thread does not encounter any failed nodes and its completion preserves the consistency of the objects. The PET scheme can tolerate failures that occur during the execution of the computation as long as all threads are not affected by the failures. We present the algorithms required to implement the PET scheme and also address some performance issues.",
keywords = "Atomicity, Distributed systems and replication, Fault-tolerant computing",
author = "Mustaque Ahamad and Partha Dasgupta and LeBlanc, {Richard J.}",
year = "1990",
month = "6",
doi = "10.1007/BF01786632",
language = "English (US)",
volume = "4",
pages = "69--80",
journal = "Distributed Computing",
issn = "0178-2770",
publisher = "Springer Verlag",
number = "2",

}

TY - JOUR

T1 - Fault-tolerant atomic computations in an object-based distributed system

AU - Ahamad, Mustaque

AU - Dasgupta, Partha

AU - LeBlanc, Richard J.

PY - 1990/6

Y1 - 1990/6

N2 - A distributed system can support fault-tolerant applications by replicating data and computation at nodes that have independent failure modes. We present a scheme called parallel execution threads (PET) which can be used to implement fault-tolerant computations in an object-based distributed system. In a system that replicates objects, the PET scheme can be used to replicate a computation by creating a number of parallel threads which execute with different replicas of the invoked objects. A computation can be completed successfully if at least one thread does not encounter any failed nodes and its completion preserves the consistency of the objects. The PET scheme can tolerate failures that occur during the execution of the computation as long as all threads are not affected by the failures. We present the algorithms required to implement the PET scheme and also address some performance issues.

AB - A distributed system can support fault-tolerant applications by replicating data and computation at nodes that have independent failure modes. We present a scheme called parallel execution threads (PET) which can be used to implement fault-tolerant computations in an object-based distributed system. In a system that replicates objects, the PET scheme can be used to replicate a computation by creating a number of parallel threads which execute with different replicas of the invoked objects. A computation can be completed successfully if at least one thread does not encounter any failed nodes and its completion preserves the consistency of the objects. The PET scheme can tolerate failures that occur during the execution of the computation as long as all threads are not affected by the failures. We present the algorithms required to implement the PET scheme and also address some performance issues.

KW - Atomicity

KW - Distributed systems and replication

KW - Fault-tolerant computing

UR - http://www.scopus.com/inward/record.url?scp=0025449420&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025449420&partnerID=8YFLogxK

U2 - 10.1007/BF01786632

DO - 10.1007/BF01786632

M3 - Article

AN - SCOPUS:0025449420

VL - 4

SP - 69

EP - 80

JO - Distributed Computing

JF - Distributed Computing

SN - 0178-2770

IS - 2

ER -