TY - GEN
T1 - Accelerating Linear Algebra Kernels on a Massively Parallel Reconfigurable Architecture
AU - Soorishetty, A.
AU - Zhou, J.
AU - Pal, S.
AU - Blaauw, D.
AU - Kim, H.
AU - Mudge, T.
AU - Dreslinski, R.
AU - Chakrabarti, C.
N1 - Funding Information:
Acknowledgment: The material is based on research sponsored by Air Force Research Laboratory (AFRL) and Defense Advanced Research Projects Agency (DARPA) under agreement number FA8650-18-2-7864. The views and conclusions contained herein are those of the authors and do not represent the official policies or endorsements, either expressed or implied, of AFRL and DARPA or the U.S. Government.
Publisher Copyright:
© 2020 IEEE.
PY - 2020/5
Y1 - 2020/5
N2 - Much of the recent work on domain-specific architectures has focused on bridging the gap between performance/efficiency and programmability. We consider one such example architecture, Transformer, consisting of lightweight cores interconnected by caches and crossbars, which supports run-time reconfiguration between shared and private cache modes of operation. We present customized implementations of a select set of linear algebra kernels, namely, triangular matrix solver, LU decomposition, QR decomposition and matrix inversion, on Transformer. The performance of the kernel algorithms is evaluated with respect to execution time and energy efficiency. Our study shows that each kernel achieves high performance for a certain cache mode and that this cache mode can change when the matrix size changes, making a case for run-time reconfiguration.
AB - Much of the recent work on domain-specific architectures has focused on bridging the gap between performance/efficiency and programmability. We consider one such example architecture, Transformer, consisting of lightweight cores interconnected by caches and crossbars, which supports run-time reconfiguration between shared and private cache modes of operation. We present customized implementations of a select set of linear algebra kernels, namely, triangular matrix solver, LU decomposition, QR decomposition and matrix inversion, on Transformer. The performance of the kernel algorithms is evaluated with respect to execution time and energy efficiency. Our study shows that each kernel achieves high performance for a certain cache mode and that this cache mode can change when the matrix size changes, making a case for run-time reconfiguration.
UR - http://www.scopus.com/inward/record.url?scp=85089238825&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089238825&partnerID=8YFLogxK
U2 - 10.1109/ICASSP40776.2020.9054126
DO - 10.1109/ICASSP40776.2020.9054126
M3 - Conference contribution
AN - SCOPUS:85089238825
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 1558
EP - 1562
BT - 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020
Y2 - 4 May 2020 through 8 May 2020
ER -