NZDC: A compiler technique for near zero silent data corruption

Moslem Didehban, Aviral Shrivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

23 Citations (Scopus)

Abstract

Exponentially growing rate of soft errors makes reliability a major concern in modern processor design. Since software-oriented approaches offer flexible protection even in off-the-shelf processes, they are attractive solutions in protecting against soft errors. Among such approaches, in-application instruction duplication based approaches have been widely used and are deemed to be the most effective. Such techniques duplicate the program assembly instructions and periodically check the results to identify possible errors. Even though early reports suggest that these achieve close to 100% protection from soft errors, we find several gaps in the protection. Existing techniques are unable to protect several important microarchitectural components, as well as a significant fraction of instructions, resulting in Silent Data Corruptions (SDCs). This paper presents nZDC or near Zero silent Data Corruption - an effective instruction duplication based approach to protect programs from soft errors. Extensive fault injection experiments on almost all the unprotected microarchitectural components in simulated ARM Cortex A53, while executing benchmarks fromMiBench suite, demonstrate that nZDC is extremely effective, without incurring any more performance penalty than the state-of-the-art.

Original languageEnglish (US)
Title of host publicationProceedings of the 53rd Annual Design Automation Conference, DAC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Volume05-09-June-2016
ISBN (Electronic)9781450342360
DOIs
StatePublished - Jun 5 2016
Event53rd Annual ACM IEEE Design Automation Conference, DAC 2016 - Austin, United States
Duration: Jun 5 2016Jun 9 2016

Other

Other53rd Annual ACM IEEE Design Automation Conference, DAC 2016
CountryUnited States
CityAustin
Period6/5/166/9/16

Fingerprint

Soft Error
Compiler
Zero
Duplication
Fault Injection
Program assemblers
Cortex
Penalty
Benchmark
Software
Demonstrate
Experiment
Experiments

ASJC Scopus subject areas

  • Computer Science Applications
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Modeling and Simulation

Cite this

Didehban, M., & Shrivastava, A. (2016). NZDC: A compiler technique for near zero silent data corruption. In Proceedings of the 53rd Annual Design Automation Conference, DAC 2016 (Vol. 05-09-June-2016). [a48] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1145/2897937.2898054

NZDC : A compiler technique for near zero silent data corruption. / Didehban, Moslem; Shrivastava, Aviral.

Proceedings of the 53rd Annual Design Automation Conference, DAC 2016. Vol. 05-09-June-2016 Institute of Electrical and Electronics Engineers Inc., 2016. a48.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Didehban, M & Shrivastava, A 2016, NZDC: A compiler technique for near zero silent data corruption. in Proceedings of the 53rd Annual Design Automation Conference, DAC 2016. vol. 05-09-June-2016, a48, Institute of Electrical and Electronics Engineers Inc., 53rd Annual ACM IEEE Design Automation Conference, DAC 2016, Austin, United States, 6/5/16. https://doi.org/10.1145/2897937.2898054
Didehban M, Shrivastava A. NZDC: A compiler technique for near zero silent data corruption. In Proceedings of the 53rd Annual Design Automation Conference, DAC 2016. Vol. 05-09-June-2016. Institute of Electrical and Electronics Engineers Inc. 2016. a48 https://doi.org/10.1145/2897937.2898054
Didehban, Moslem ; Shrivastava, Aviral. / NZDC : A compiler technique for near zero silent data corruption. Proceedings of the 53rd Annual Design Automation Conference, DAC 2016. Vol. 05-09-June-2016 Institute of Electrical and Electronics Engineers Inc., 2016.
@inproceedings{5c6b236291d548c59469cb71c5e812e0,
title = "NZDC: A compiler technique for near zero silent data corruption",
abstract = "Exponentially growing rate of soft errors makes reliability a major concern in modern processor design. Since software-oriented approaches offer flexible protection even in off-the-shelf processes, they are attractive solutions in protecting against soft errors. Among such approaches, in-application instruction duplication based approaches have been widely used and are deemed to be the most effective. Such techniques duplicate the program assembly instructions and periodically check the results to identify possible errors. Even though early reports suggest that these achieve close to 100{\%} protection from soft errors, we find several gaps in the protection. Existing techniques are unable to protect several important microarchitectural components, as well as a significant fraction of instructions, resulting in Silent Data Corruptions (SDCs). This paper presents nZDC or near Zero silent Data Corruption - an effective instruction duplication based approach to protect programs from soft errors. Extensive fault injection experiments on almost all the unprotected microarchitectural components in simulated ARM Cortex A53, while executing benchmarks fromMiBench suite, demonstrate that nZDC is extremely effective, without incurring any more performance penalty than the state-of-the-art.",
author = "Moslem Didehban and Aviral Shrivastava",
year = "2016",
month = "6",
day = "5",
doi = "10.1145/2897937.2898054",
language = "English (US)",
volume = "05-09-June-2016",
booktitle = "Proceedings of the 53rd Annual Design Automation Conference, DAC 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - NZDC

T2 - A compiler technique for near zero silent data corruption

AU - Didehban, Moslem

AU - Shrivastava, Aviral

PY - 2016/6/5

Y1 - 2016/6/5

N2 - Exponentially growing rate of soft errors makes reliability a major concern in modern processor design. Since software-oriented approaches offer flexible protection even in off-the-shelf processes, they are attractive solutions in protecting against soft errors. Among such approaches, in-application instruction duplication based approaches have been widely used and are deemed to be the most effective. Such techniques duplicate the program assembly instructions and periodically check the results to identify possible errors. Even though early reports suggest that these achieve close to 100% protection from soft errors, we find several gaps in the protection. Existing techniques are unable to protect several important microarchitectural components, as well as a significant fraction of instructions, resulting in Silent Data Corruptions (SDCs). This paper presents nZDC or near Zero silent Data Corruption - an effective instruction duplication based approach to protect programs from soft errors. Extensive fault injection experiments on almost all the unprotected microarchitectural components in simulated ARM Cortex A53, while executing benchmarks fromMiBench suite, demonstrate that nZDC is extremely effective, without incurring any more performance penalty than the state-of-the-art.

AB - Exponentially growing rate of soft errors makes reliability a major concern in modern processor design. Since software-oriented approaches offer flexible protection even in off-the-shelf processes, they are attractive solutions in protecting against soft errors. Among such approaches, in-application instruction duplication based approaches have been widely used and are deemed to be the most effective. Such techniques duplicate the program assembly instructions and periodically check the results to identify possible errors. Even though early reports suggest that these achieve close to 100% protection from soft errors, we find several gaps in the protection. Existing techniques are unable to protect several important microarchitectural components, as well as a significant fraction of instructions, resulting in Silent Data Corruptions (SDCs). This paper presents nZDC or near Zero silent Data Corruption - an effective instruction duplication based approach to protect programs from soft errors. Extensive fault injection experiments on almost all the unprotected microarchitectural components in simulated ARM Cortex A53, while executing benchmarks fromMiBench suite, demonstrate that nZDC is extremely effective, without incurring any more performance penalty than the state-of-the-art.

UR - http://www.scopus.com/inward/record.url?scp=84977104070&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84977104070&partnerID=8YFLogxK

U2 - 10.1145/2897937.2898054

DO - 10.1145/2897937.2898054

M3 - Conference contribution

AN - SCOPUS:84977104070

VL - 05-09-June-2016

BT - Proceedings of the 53rd Annual Design Automation Conference, DAC 2016

PB - Institute of Electrical and Electronics Engineers Inc.

ER -