TY - GEN
T1 - Benchmark of RRAM based Architectures for Dot-Product Computation
AU - Peng, Xiaochen
AU - Yu, Shimeng
N1 - Funding Information:
ACKNOWLEDGEMENT This work is supported in part by NSF-CCF-1552687, and NSF/SRC E2CDA program with NSF-CCF-1740225 and SRC Contract 2018-NC-2762.
Publisher Copyright:
© 2018 IEEE.
PY - 2019/1/8
Y1 - 2019/1/8
N2 - Memory array architectures based on emerging non-volatile memory devices have been proposed for on-chip acceleration of dot-product computation in neural networks. As recent advances in machine learning have shown that precision reduction is a useful technique for reducing computation and memory storage, it is desirable to evaluate its hardware cost. In this paper, we use a circuit-level macro model, NeuroSim, to benchmark circuit-level performance metrics, such as chip area, latency, and dynamic energy, for the XNOR-RRAM and conventional 8-bit RRAM architectures. Both architectures are implemented to process the dot-product operation of a 512×512 synaptic matrix in sequential row-by-row and parallel read-out fashion separately. Based on RRAM models and a 32 nm CMOS PDK, the simulation results show that the energy efficiency of the parallel XNOR-RRAM architecture can reach 311 TOPS/W, at least ~15× and ~621× better than the parallel and sequential conventional 8-bit RRAM architectures, respectively.
AB - Memory array architectures based on emerging non-volatile memory devices have been proposed for on-chip acceleration of dot-product computation in neural networks. As recent advances in machine learning have shown that precision reduction is a useful technique for reducing computation and memory storage, it is desirable to evaluate its hardware cost. In this paper, we use a circuit-level macro model, NeuroSim, to benchmark circuit-level performance metrics, such as chip area, latency, and dynamic energy, for the XNOR-RRAM and conventional 8-bit RRAM architectures. Both architectures are implemented to process the dot-product operation of a 512×512 synaptic matrix in sequential row-by-row and parallel read-out fashion separately. Based on RRAM models and a 32 nm CMOS PDK, the simulation results show that the energy efficiency of the parallel XNOR-RRAM architecture can reach 311 TOPS/W, at least ~15× and ~621× better than the parallel and sequential conventional 8-bit RRAM architectures, respectively.
KW - hardware accelerator
KW - machine learning
KW - neuromorphic computing
KW - non-volatile memory
UR - http://www.scopus.com/inward/record.url?scp=85062234025&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062234025&partnerID=8YFLogxK
U2 - 10.1109/APCCAS.2018.8605606
DO - 10.1109/APCCAS.2018.8605606
M3 - Conference contribution
AN - SCOPUS:85062234025
T3 - 2018 IEEE Asia Pacific Conference on Circuits and Systems, APCCAS 2018
SP - 378
EP - 381
BT - 2018 IEEE Asia Pacific Conference on Circuits and Systems, APCCAS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th IEEE Asia Pacific Conference on Circuits and Systems, APCCAS 2018
Y2 - 26 October 2018 through 30 October 2018
ER -