TY - GEN
T1 - Modeling and Optimization of SRAM-based In-Memory Computing Hardware Design
AU - Saikia, Jyotishman
AU - Yin, Shihui
AU - Cherupally, Sai Kiran
AU - Zhang, Bo
AU - Meng, Jian
AU - Seok, Mingoo
AU - Seo, Jae Sun
N1 - Funding Information:
This paper presented SRAM based IMC modeling framework and optimization. By properly modeling the bitcell variation ACKNOWLEDGMENT This work was in part supported by Samsung Advanced Institute of Technology, NSF grant 1652866, and C-BRIC, one of six centers in JUMP, a SRC program sponsored by DARPA.
Publisher Copyright:
© 2021 EDAA.
PY - 2021/2/1
Y1 - 2021/2/1
N2 - In-memory computing (IMC) has been demonstrated as a promising technique to significantly improve energy-efficiency for deep neural network (DNN) hardware accelerators. However, designing one involves setting many design variables such as the number of parallel rows to assert, analog-to-digital converter (ADC) at the periphery of memory sub-array, activation/weight precisions of DNNs, etc., which affect energy-efficiency, DNN accuracy, and area. While individual IMC designs have been presented in the literature, they have not investigated this multi-dimensional design optimization. In this paper, to fill this knowledge gap, we present a SRAM-based IMC hardware modeling and optimization framework. A unified systematic study closely models IMC hardware, and investigates how a number of design variables and nonidealities (e.g. device mismatch and ADC quantization) affect the DNN accuracy of IMC design. To maintain high DNN accuracy for the IMC SRAM hardware, it is shown that the number of activated rows, ADC resolution, ADC quantization range, and different sources of variability/noise need to be carefully selected and co-optimized with an underlying DNN algorithm to implement.
AB - In-memory computing (IMC) has been demonstrated as a promising technique to significantly improve energy-efficiency for deep neural network (DNN) hardware accelerators. However, designing one involves setting many design variables such as the number of parallel rows to assert, analog-to-digital converter (ADC) at the periphery of memory sub-array, activation/weight precisions of DNNs, etc., which affect energy-efficiency, DNN accuracy, and area. While individual IMC designs have been presented in the literature, they have not investigated this multi-dimensional design optimization. In this paper, to fill this knowledge gap, we present a SRAM-based IMC hardware modeling and optimization framework. A unified systematic study closely models IMC hardware, and investigates how a number of design variables and nonidealities (e.g. device mismatch and ADC quantization) affect the DNN accuracy of IMC design. To maintain high DNN accuracy for the IMC SRAM hardware, it is shown that the number of activated rows, ADC resolution, ADC quantization range, and different sources of variability/noise need to be carefully selected and co-optimized with an underlying DNN algorithm to implement.
KW - In-memory computing
KW - SRAM
KW - deep neural networks
KW - modeling
KW - optimization
UR - http://www.scopus.com/inward/record.url?scp=85111048819&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85111048819&partnerID=8YFLogxK
U2 - 10.23919/DATE51398.2021.9473973
DO - 10.23919/DATE51398.2021.9473973
M3 - Conference contribution
AN - SCOPUS:85111048819
T3 - Proceedings -Design, Automation and Test in Europe, DATE
SP - 942
EP - 947
BT - Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021
Y2 - 1 February 2021 through 5 February 2021
ER -