IMC Architecture for Robust DNN Acceleration

Gokul Krishnan; Zhenyu Wang; Li Yang; Injune Yeo; Jian Meng; Rajiv V. Joshi; Nathaniel C. Cady; Deliang Fan; Jae Sun Seo; Yu Cao

doi:10.1109/ICSICT55466.2022.9963165

IMC Architecture for Robust DNN Acceleration

Gokul Krishnan, Zhenyu Wang, Li Yang, Injune Yeo, Jian Meng, Rajiv V. Joshi, Nathaniel C. Cady, Deliang Fan, Jae Sun Seo, Yu Cao

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit nonidealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. We design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows up to 21.9%, and 6.5% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead for CIFAR-10 and ImageNet datasets, respectively.

Original language	English (US)
Title of host publication	Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022
Editors	Fan Ye, Ting-Ao Tang
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665469067
DOIs	https://doi.org/10.1109/ICSICT55466.2022.9963165
State	Published - 2022
Event	16th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022 - Nanjing, China Duration: Oct 25 2022 → Oct 28 2022

Publication series

Name	Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022

Conference

Conference	16th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022
Country/Territory	China
City	Nanjing
Period	10/25/22 → 10/28/22

ASJC Scopus subject areas

Electrical and Electronic Engineering
Artificial Intelligence
Computer Networks and Communications
Computer Science Applications
Information Systems and Management

Access to Document

10.1109/ICSICT55466.2022.9963165

Cite this

Krishnan, G., Wang, Z., Yang, L., Yeo, I., Meng, J., Joshi, R. V., Cady, N. C., Fan, D., Seo, J. S., & Cao, Y. (2022). IMC Architecture for Robust DNN Acceleration. In F. Ye, & T.-A. Tang (Eds.), Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022 (Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSICT55466.2022.9963165

IMC Architecture for Robust DNN Acceleration. / Krishnan, Gokul; Wang, Zhenyu; Yang, Li et al.
Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022. ed. / Fan Ye; Ting-Ao Tang. Institute of Electrical and Electronics Engineers Inc., 2022. (Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Krishnan, G, Wang, Z, Yang, L, Yeo, I, Meng, J, Joshi, RV, Cady, NC, Fan, D, Seo, JS & Cao, Y 2022, IMC Architecture for Robust DNN Acceleration. in F Ye & T-A Tang (eds), Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022. Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022, Institute of Electrical and Electronics Engineers Inc., 16th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022, Nanjing, China, 10/25/22. https://doi.org/10.1109/ICSICT55466.2022.9963165

Krishnan G, Wang Z, Yang L, Yeo I, Meng J, Joshi RV et al. IMC Architecture for Robust DNN Acceleration. In Ye F, Tang TA, editors, Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022. Institute of Electrical and Electronics Engineers Inc. 2022. (Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022). doi: 10.1109/ICSICT55466.2022.9963165

Krishnan, Gokul ; Wang, Zhenyu ; Yang, Li et al. / IMC Architecture for Robust DNN Acceleration. Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022. editor / Fan Ye ; Ting-Ao Tang. Institute of Electrical and Electronics Engineers Inc., 2022. (Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022).

@inproceedings{dba58caf1f9a42649e35f222244c4ae4,

title = "IMC Architecture for Robust DNN Acceleration",

abstract = "RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit nonidealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. We design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows up to 21.9%, and 6.5% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead for CIFAR-10 and ImageNet datasets, respectively.",

author = "Gokul Krishnan and Zhenyu Wang and Li Yang and Injune Yeo and Jian Meng and Joshi, {Rajiv V.} and Cady, {Nathaniel C.} and Deliang Fan and Seo, {Jae Sun} and Yu Cao",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 16th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022 ; Conference date: 25-10-2022 Through 28-10-2022",

year = "2022",

doi = "10.1109/ICSICT55466.2022.9963165",

language = "English (US)",

series = "Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

editor = "Fan Ye and Ting-Ao Tang",

booktitle = "Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022",

}

TY - GEN

T1 - IMC Architecture for Robust DNN Acceleration

AU - Krishnan, Gokul

AU - Wang, Zhenyu

AU - Yang, Li

AU - Yeo, Injune

AU - Meng, Jian

AU - Joshi, Rajiv V.

AU - Cady, Nathaniel C.

AU - Fan, Deliang

AU - Seo, Jae Sun

AU - Cao, Yu

PY - 2022

Y1 - 2022

N2 - RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit nonidealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. We design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows up to 21.9%, and 6.5% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead for CIFAR-10 and ImageNet datasets, respectively.

AB - RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit nonidealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. We design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows up to 21.9%, and 6.5% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead for CIFAR-10 and ImageNet datasets, respectively.

UR - http://www.scopus.com/inward/record.url?scp=85143970044&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85143970044&partnerID=8YFLogxK

U2 - 10.1109/ICSICT55466.2022.9963165

DO - 10.1109/ICSICT55466.2022.9963165

M3 - Conference contribution

AN - SCOPUS:85143970044

T3 - Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022

BT - Proceedings of 2022 IEEE 16th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022

A2 - Ye, Fan

A2 - Tang, Ting-Ao

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 16th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2022

Y2 - 25 October 2022 through 28 October 2022

ER -

IMC Architecture for Robust DNN Acceleration

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this