TY - JOUR
T1 - Design of reliable storage and compute systems with lightweight group testing based non-binary error correction codes
AU - Bu, Lake
AU - Karpovsky, Mark G.
AU - Kinsy, Michel A.
N1 - Publisher Copyright:
© 2019 Institution of Engineering and Technology. All rights reserved.
PY - 2019/5/1
Y1 - 2019/5/1
N2 - In this study, the authors propose a new group testing based (GTB) error control codes (ECCs) approach for improving the reliability of memory structures in computing systems. Compared with conventional single- A nd double-bit error correcting codes, the GTB codes provide higher reliability at the multi-byte error correction granularity. The proposed codes are cost-efficient in their encoding and decoding procedures. Instead of requiring multiplication or inversion over Galois finite field like most multi-byte ECC schemes, the proposed technique only involves bitwise XOR operations, therefore, significantly reducing the computation complexity and latency. For instance, to correct m errors in a Q-ary codeword of length N, where $Q \ge 2$Q≥2, the compute complexity is mere $O\lpar mN\log Q\rpar $O(mNlog. âiQ). The GTB codes trade redundancy for encoding and decoding simplicity, and are able to achieve better code rate than other ECCs of the same trade-off. The proposed GTB codes lend themselves well to designs with high reliability and low computation complexity requirements, such as storage systems with strong fault tolerance, or compute systems with straggler tolerance, and so on.
AB - In this study, the authors propose a new group testing based (GTB) error control codes (ECCs) approach for improving the reliability of memory structures in computing systems. Compared with conventional single- A nd double-bit error correcting codes, the GTB codes provide higher reliability at the multi-byte error correction granularity. The proposed codes are cost-efficient in their encoding and decoding procedures. Instead of requiring multiplication or inversion over Galois finite field like most multi-byte ECC schemes, the proposed technique only involves bitwise XOR operations, therefore, significantly reducing the computation complexity and latency. For instance, to correct m errors in a Q-ary codeword of length N, where $Q \ge 2$Q≥2, the compute complexity is mere $O\lpar mN\log Q\rpar $O(mNlog. âiQ). The GTB codes trade redundancy for encoding and decoding simplicity, and are able to achieve better code rate than other ECCs of the same trade-off. The proposed GTB codes lend themselves well to designs with high reliability and low computation complexity requirements, such as storage systems with strong fault tolerance, or compute systems with straggler tolerance, and so on.
UR - http://www.scopus.com/inward/record.url?scp=85065669939&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85065669939&partnerID=8YFLogxK
U2 - 10.1049/iet-cdt.2018.5008
DO - 10.1049/iet-cdt.2018.5008
M3 - Article
AN - SCOPUS:85065669939
SN - 1751-8601
VL - 13
SP - 218
EP - 223
JO - IET Computers and Digital Techniques
JF - IET Computers and Digital Techniques
IS - 3
ER -