Rollback propagation detection and performance evaluation of FTMR2M - A fault-tolerant multiprocessor

Yann Hang Lee, Kang G. Shin

Research output: Contribution to journal › Conference article › peer-review

3 Scopus citations

Abstract

In this paper we consider the rollback propagation and the performance of a fault-tolerant multiprocessor with a rollback recovery mechanism (FTMR2M) [1], which was designed to be tolerant of hardware failure with minimum time overhead. Rollback propagation between cooperating processes is usually required to ensure correct recovery from failure. To minimize the waste of processor time and storage overhead required for handling sophisticated rollback propagations, the FTMR2M always keeps one recoverable state. Approaches for evaluating the recovery overhead and analyzing the performance of FTMR2M are presented. Two methods for detecting rollback propagations and multi-step rollbacks between cooperating processes are also proposed.

Original language: English (US)
Pages (from-to): 171-180
Number of pages: 10
Journal: Proceedings - International Symposium on Computer Architecture
State: Published - Apr 26 1982
Externally published: Yes
Event: 9th Annual Symposium on Computer Architecture, ISCA 1982 - Austin, United States
Duration: Apr 26 1982 → Apr 29 1982

ASJC Scopus subject areas

  • Hardware and Architecture
