Asynchronous recovery protocols for distributed systems

K. W. Hwang, W. T. Tsai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

The authors address the problem of error recovery in a system of distributed communication processes. They show that if each process can detect its local computation errors while establishing the recovery points, then the amount of process dependencies can be reduced by exploiting the temporal ordering of message communication among the processes. The proposed approach allows processes to proceed independently during normal computation, and can be further improved to accommodate independent rollback without explicit coordination. The authors also discuss the handling of messages that are originated from, or received by, tasks that later abort. Simulation studies indicate that the approach taken achieves a much higher throughput than the asynchronous approach.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE Computer Society's International Computer Software & Applications Conference
EditorsGeorge J. Knafl
PublisherPubl by IEEE
Pages513-520
Number of pages8
ISBN (Print)0818608730
StatePublished - Dec 1 1988

Publication series

NameProceedings - IEEE Computer Society's International Computer Software & Applications Conference

ASJC Scopus subject areas

  • Software
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Asynchronous recovery protocols for distributed systems'. Together they form a unique fingerprint.

Cite this