TY - GEN
T1 - Redundant neurons and shared redundant synapses for robust memristor-based DNNs with reduced overhead
AU - Zhang, Baogang
AU - Uysal, Necati
AU - Fan, Deliang
AU - Ewetz, Rickard
N1 - Publisher Copyright:
© 2020 Association for Computing Machinery.
PY - 2020/9/7
Y1 - 2020/9/7
N2 - The dominating computational workload in the inference phase of deep neural networks (DNNs) is matrix-vector multiplication. An emerging solution to accelerate the inference phase is to perform analog matrix-vector multiplication using memristor crossbar arrays (MCAs). A key challenge is that stuck-at-fault defects may degrade the classification accuracy of memristor-based DNNs. A common technique to reduce the negative impact of stuck-at-faults is to utilize redundant synapses, i.e., each row in a weight matrix is realized using two (or r) parallel rows in an MCA. In this paper, we propose to handle stuck-at-faults by inserting redundant neurons and by sharing redundant synapses. The first technique inserts redundant neurons to surgically repair neurons connected to rows and columns in the MCAs with many stuck-at-faults. The second technique shares redundant synapses between different neurons to reduce the hardware overhead, generalizing the (1:r) synapse redundancy of previous studies to (q:r) synapse redundancy. The experimental results demonstrate new trade-offs between robustness and hardware overhead without requiring the neural networks to be retrained. Compared with the state-of-the-art, the power and area overhead for a neural network can be reduced by up to 16% and 25%, respectively.
AB - The dominating computational workload in the inference phase of deep neural networks (DNNs) is matrix-vector multiplication. An emerging solution to accelerate the inference phase is to perform analog matrix-vector multiplication using memristor crossbar arrays (MCAs). A key challenge is that stuck-at-fault defects may degrade the classification accuracy of memristor-based DNNs. A common technique to reduce the negative impact of stuck-at-faults is to utilize redundant synapses, i.e., each row in a weight matrix is realized using two (or r) parallel rows in an MCA. In this paper, we propose to handle stuck-at-faults by inserting redundant neurons and by sharing redundant synapses. The first technique inserts redundant neurons to surgically repair neurons connected to rows and columns in the MCAs with many stuck-at-faults. The second technique shares redundant synapses between different neurons to reduce the hardware overhead, generalizing the (1:r) synapse redundancy of previous studies to (q:r) synapse redundancy. The experimental results demonstrate new trade-offs between robustness and hardware overhead without requiring the neural networks to be retrained. Compared with the state-of-the-art, the power and area overhead for a neural network can be reduced by up to 16% and 25%, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85091304373&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85091304373&partnerID=8YFLogxK
U2 - 10.1145/3386263.3406910
DO - 10.1145/3386263.3406910
M3 - Conference contribution
AN - SCOPUS:85091304373
T3 - Proceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI
SP - 339
EP - 344
BT - GLSVLSI 2020 - Proceedings of the 2020 Great Lakes Symposium on VLSI
PB - Association for Computing Machinery
T2 - 30th Great Lakes Symposium on VLSI, GLSVLSI 2020
Y2 - 7 September 2020 through 9 September 2020
ER -