MEASUREMENT AND APPLICATION OF FAULT LATENCY.

Kang G. Shin; Yann Hang Lee

doi:10.1109/TC.1986.1676773

MEASUREMENT AND APPLICATION OF FAULT LATENCY.

Kang G. Shin, Yann Hang Lee

Research output: Contribution to journal › Article › peer-review

26 Scopus citations

Abstract

The time interval between the occurrence of a fault and the detection of the error caused by the fault is divided by the generation of that error into two parts: fault latency and error latency. Since the moment of error generation is not directly observable, all related works in the literature have dealt with only the sum of fault and error latencies, thereby making the analysis of their separate effects impossible. To remedy this deficiency, the authors 1) present a new methodology for indirectly measuring fault latency, 2) derive the distribution of fault latency from the methodology, and 3) apply the knowledge of fault latency to the analysis of two important examples. The proposed methodology has been implemented for measuring fault latency in the Fault-Tolerant Multiprocessor (FTMP) at the NASA Airlab. The experimental results show wide variations in the mean fault latencies of different function circuits within FTMP. Also, the measured distributions of fault latency are shown to have monotone hazard rates. Consequently, Gamma and Weibull distributions are selected for the least-squares fit as the distribution of fault latency.

Original language	English (US)
Pages (from-to)	370-375
Number of pages	6
Journal	IEEE Transactions on Computers
Volume	C-35
Issue number	4
DOIs	https://doi.org/10.1109/TC.1986.1676773
State	Published - 1986
Externally published	Yes

ASJC Scopus subject areas

Software
Theoretical Computer Science
Hardware and Architecture
Computational Theory and Mathematics

Access to Document

10.1109/TC.1986.1676773

Cite this

@article{7191c5152c434586a599801783f29f4b,

title = "MEASUREMENT AND APPLICATION OF FAULT LATENCY.",

abstract = "The time interval between the occurrence of a fault and the detection of the error caused by the fault is divided by the generation of that error into two parts: fault latency and error latency. Since the moment of error generation is not directly observable, all related works in the literature have dealt with only the sum of fault and error latencies, thereby making the analysis of their separate effects impossible. To remedy this deficiency, the authors 1) present a new methodology for indirectly measuring fault latency, 2) derive the distribution of fault latency from the methodology, and 3) apply the knowledge of fault latency to the analysis of two important examples. The proposed methodology has been implemented for measuring fault latency in the Fault-Tolerant Multiprocessor (FTMP) at the NASA Airlab. The experimental results show wide variations in the mean fault latencies of different function circuits within FTMP. Also, the measured distributions of fault latency are shown to have monotone hazard rates. Consequently, Gamma and Weibull distributions are selected for the least-squares fit as the distribution of fault latency.",

author = "Shin, {Kang G.} and Lee, {Yann Hang}",

year = "1986",

doi = "10.1109/TC.1986.1676773",

language = "English (US)",

volume = "C-35",

pages = "370--375",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "4",

}

TY - JOUR

T1 - MEASUREMENT AND APPLICATION OF FAULT LATENCY.

AU - Shin, Kang G.

AU - Lee, Yann Hang

PY - 1986

Y1 - 1986

N2 - The time interval between the occurrence of a fault and the detection of the error caused by the fault is divided by the generation of that error into two parts: fault latency and error latency. Since the moment of error generation is not directly observable, all related works in the literature have dealt with only the sum of fault and error latencies, thereby making the analysis of their separate effects impossible. To remedy this deficiency, the authors 1) present a new methodology for indirectly measuring fault latency, 2) derive the distribution of fault latency from the methodology, and 3) apply the knowledge of fault latency to the analysis of two important examples. The proposed methodology has been implemented for measuring fault latency in the Fault-Tolerant Multiprocessor (FTMP) at the NASA Airlab. The experimental results show wide variations in the mean fault latencies of different function circuits within FTMP. Also, the measured distributions of fault latency are shown to have monotone hazard rates. Consequently, Gamma and Weibull distributions are selected for the least-squares fit as the distribution of fault latency.

AB - The time interval between the occurrence of a fault and the detection of the error caused by the fault is divided by the generation of that error into two parts: fault latency and error latency. Since the moment of error generation is not directly observable, all related works in the literature have dealt with only the sum of fault and error latencies, thereby making the analysis of their separate effects impossible. To remedy this deficiency, the authors 1) present a new methodology for indirectly measuring fault latency, 2) derive the distribution of fault latency from the methodology, and 3) apply the knowledge of fault latency to the analysis of two important examples. The proposed methodology has been implemented for measuring fault latency in the Fault-Tolerant Multiprocessor (FTMP) at the NASA Airlab. The experimental results show wide variations in the mean fault latencies of different function circuits within FTMP. Also, the measured distributions of fault latency are shown to have monotone hazard rates. Consequently, Gamma and Weibull distributions are selected for the least-squares fit as the distribution of fault latency.

UR - http://www.scopus.com/inward/record.url?scp=0022706725&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0022706725&partnerID=8YFLogxK

U2 - 10.1109/TC.1986.1676773

DO - 10.1109/TC.1986.1676773

M3 - Article

AN - SCOPUS:0022706725

SN - 0018-9340

VL - C-35

SP - 370

EP - 375

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 4

ER -

MEASUREMENT AND APPLICATION OF FAULT LATENCY.

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this