Characterizing Loop Acceleration in Heterogeneous Computing

Saman Biookaghazadeh; Fengbo Ren; Ming Zhao

doi:10.1109/CLOUD53861.2021.00059

Characterizing Loop Acceleration in Heterogeneous Computing

Saman Biookaghazadeh, Fengbo Ren, Ming Zhao

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Computation intensive applications usually consist of multiple nested or flattened loops. These loops are the main building blocks of the applications and embody a specific type of execution pattern. In order to reduce the running time of the loops, developers need to analyze the loops in the code and try to parallelize them on hardware accelerators, such as GPUs, TPUs, and FPGAs, which are increasingly available in the cloud. Unfortunately, the lack of understanding of loop characteristics and the ability of hardware accelerators in handling these types of loops prevents developers from choosing the right platform to develop their applications in the cloud. Also, developing and optimizing code for a specific accelerator is a time-consuming effort. To address these issues, this paper studies the effectiveness of different processors in accelerating common patterns of loops. It identifies five important types of loops that commonly exist in real-world applications, and presents Loopy, the implementations of these loops optimized for different architectures. Using Loopy, the paper also evaluates different hardware in accelerating the loop patterns. The result reveals the architectural differences among different accelerators with regard to different loop patterns. It also provides insights for the developers to choose the right accelerators for their applications. The current version of Loopy supports both FPGAs and GPUs, which are the most versatile and available accelerators.

Original language	English (US)
Title of host publication	Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021
Editors	Claudio Agostino Ardagna, Carl K. Chang, Ernesto Daminai, Rajiv Ranjan, Zhongjie Wang, Robert Ward, Jia Zhang, Wensheng Zhang
Publisher	IEEE Computer Society
Pages	445-455
Number of pages	11
ISBN (Electronic)	9781665400602
DOIs	https://doi.org/10.1109/CLOUD53861.2021.00059
State	Published - Sep 2021
Event	14th IEEE International Conference on Cloud Computing, CLOUD 2021 - Virtual, Online, United States Duration: Sep 5 2021 → Sep 11 2021

Publication series

Name	IEEE International Conference on Cloud Computing, CLOUD
Volume	2021-September
ISSN (Print)	2159-6182
ISSN (Electronic)	2159-6190

Conference

Conference	14th IEEE International Conference on Cloud Computing, CLOUD 2021
Country/Territory	United States
City	Virtual, Online
Period	9/5/21 → 9/11/21

Keywords

FPGA
GPU
Heterogeneous computing
Loop characterization

ASJC Scopus subject areas

Artificial Intelligence
Information Systems
Software

Access to Document

10.1109/CLOUD53861.2021.00059

Cite this

Biookaghazadeh, S., Ren, F., & Zhao, M. (2021). Characterizing Loop Acceleration in Heterogeneous Computing. In C. A. Ardagna, C. K. Chang, E. Daminai, R. Ranjan, Z. Wang, R. Ward, J. Zhang, & W. Zhang (Eds.), Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021 (pp. 445-455). (IEEE International Conference on Cloud Computing, CLOUD; Vol. 2021-September). IEEE Computer Society. https://doi.org/10.1109/CLOUD53861.2021.00059

Characterizing Loop Acceleration in Heterogeneous Computing. / Biookaghazadeh, Saman; Ren, Fengbo ; Zhao, Ming.
Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021. ed. / Claudio Agostino Ardagna; Carl K. Chang; Ernesto Daminai; Rajiv Ranjan; Zhongjie Wang; Robert Ward; Jia Zhang; Wensheng Zhang. IEEE Computer Society, 2021. p. 445-455 (IEEE International Conference on Cloud Computing, CLOUD; Vol. 2021-September).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Biookaghazadeh, S, Ren, F & Zhao, M 2021, Characterizing Loop Acceleration in Heterogeneous Computing. in CA Ardagna, CK Chang, E Daminai, R Ranjan, Z Wang, R Ward, J Zhang & W Zhang (eds), Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021. IEEE International Conference on Cloud Computing, CLOUD, vol. 2021-September, IEEE Computer Society, pp. 445-455, 14th IEEE International Conference on Cloud Computing, CLOUD 2021, Virtual, Online, United States, 9/5/21. https://doi.org/10.1109/CLOUD53861.2021.00059

Biookaghazadeh S, Ren F , Zhao M. Characterizing Loop Acceleration in Heterogeneous Computing. In Ardagna CA, Chang CK, Daminai E, Ranjan R, Wang Z, Ward R, Zhang J, Zhang W, editors, Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021. IEEE Computer Society. 2021. p. 445-455. (IEEE International Conference on Cloud Computing, CLOUD). doi: 10.1109/CLOUD53861.2021.00059

Biookaghazadeh, Saman ; Ren, Fengbo ; Zhao, Ming. / Characterizing Loop Acceleration in Heterogeneous Computing. Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021. editor / Claudio Agostino Ardagna ; Carl K. Chang ; Ernesto Daminai ; Rajiv Ranjan ; Zhongjie Wang ; Robert Ward ; Jia Zhang ; Wensheng Zhang. IEEE Computer Society, 2021. pp. 445-455 (IEEE International Conference on Cloud Computing, CLOUD).

@inproceedings{22a001a47fc14675b42c53b839240b6c,

title = "Characterizing Loop Acceleration in Heterogeneous Computing",

abstract = "Computation intensive applications usually consist of multiple nested or flattened loops. These loops are the main building blocks of the applications and embody a specific type of execution pattern. In order to reduce the running time of the loops, developers need to analyze the loops in the code and try to parallelize them on hardware accelerators, such as GPUs, TPUs, and FPGAs, which are increasingly available in the cloud. Unfortunately, the lack of understanding of loop characteristics and the ability of hardware accelerators in handling these types of loops prevents developers from choosing the right platform to develop their applications in the cloud. Also, developing and optimizing code for a specific accelerator is a time-consuming effort. To address these issues, this paper studies the effectiveness of different processors in accelerating common patterns of loops. It identifies five important types of loops that commonly exist in real-world applications, and presents Loopy, the implementations of these loops optimized for different architectures. Using Loopy, the paper also evaluates different hardware in accelerating the loop patterns. The result reveals the architectural differences among different accelerators with regard to different loop patterns. It also provides insights for the developers to choose the right accelerators for their applications. The current version of Loopy supports both FPGAs and GPUs, which are the most versatile and available accelerators.",

keywords = "FPGA, GPU, Heterogeneous computing, Loop characterization",

author = "Saman Biookaghazadeh and Fengbo Ren and Ming Zhao",

note = "Funding Information: We thank the anonymous reviewers for their helpful comments. This work is supported by National Science Foundation awards CNS-1955593, CNS-1562837, and CNS-1629888 and Intel{\textquoteright}s donation of the Fog Reference Design units. Publisher Copyright: {\textcopyright} 2021 IEEE.; 14th IEEE International Conference on Cloud Computing, CLOUD 2021 ; Conference date: 05-09-2021 Through 11-09-2021",

year = "2021",

month = sep,

doi = "10.1109/CLOUD53861.2021.00059",

language = "English (US)",

series = "IEEE International Conference on Cloud Computing, CLOUD",

publisher = "IEEE Computer Society",

pages = "445--455",

editor = "Ardagna, {Claudio Agostino} and Chang, {Carl K.} and Ernesto Daminai and Rajiv Ranjan and Zhongjie Wang and Robert Ward and Jia Zhang and Wensheng Zhang",

booktitle = "Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021",

}

TY - GEN

T1 - Characterizing Loop Acceleration in Heterogeneous Computing

AU - Biookaghazadeh, Saman

AU - Ren, Fengbo

AU - Zhao, Ming

N1 - Funding Information: We thank the anonymous reviewers for their helpful comments. This work is supported by National Science Foundation awards CNS-1955593, CNS-1562837, and CNS-1629888 and Intel’s donation of the Fog Reference Design units. Publisher Copyright: © 2021 IEEE.

PY - 2021/9

Y1 - 2021/9

N2 - Computation intensive applications usually consist of multiple nested or flattened loops. These loops are the main building blocks of the applications and embody a specific type of execution pattern. In order to reduce the running time of the loops, developers need to analyze the loops in the code and try to parallelize them on hardware accelerators, such as GPUs, TPUs, and FPGAs, which are increasingly available in the cloud. Unfortunately, the lack of understanding of loop characteristics and the ability of hardware accelerators in handling these types of loops prevents developers from choosing the right platform to develop their applications in the cloud. Also, developing and optimizing code for a specific accelerator is a time-consuming effort. To address these issues, this paper studies the effectiveness of different processors in accelerating common patterns of loops. It identifies five important types of loops that commonly exist in real-world applications, and presents Loopy, the implementations of these loops optimized for different architectures. Using Loopy, the paper also evaluates different hardware in accelerating the loop patterns. The result reveals the architectural differences among different accelerators with regard to different loop patterns. It also provides insights for the developers to choose the right accelerators for their applications. The current version of Loopy supports both FPGAs and GPUs, which are the most versatile and available accelerators.

AB - Computation intensive applications usually consist of multiple nested or flattened loops. These loops are the main building blocks of the applications and embody a specific type of execution pattern. In order to reduce the running time of the loops, developers need to analyze the loops in the code and try to parallelize them on hardware accelerators, such as GPUs, TPUs, and FPGAs, which are increasingly available in the cloud. Unfortunately, the lack of understanding of loop characteristics and the ability of hardware accelerators in handling these types of loops prevents developers from choosing the right platform to develop their applications in the cloud. Also, developing and optimizing code for a specific accelerator is a time-consuming effort. To address these issues, this paper studies the effectiveness of different processors in accelerating common patterns of loops. It identifies five important types of loops that commonly exist in real-world applications, and presents Loopy, the implementations of these loops optimized for different architectures. Using Loopy, the paper also evaluates different hardware in accelerating the loop patterns. The result reveals the architectural differences among different accelerators with regard to different loop patterns. It also provides insights for the developers to choose the right accelerators for their applications. The current version of Loopy supports both FPGAs and GPUs, which are the most versatile and available accelerators.

KW - FPGA

KW - GPU

KW - Heterogeneous computing

KW - Loop characterization

UR - http://www.scopus.com/inward/record.url?scp=85119319749&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85119319749&partnerID=8YFLogxK

U2 - 10.1109/CLOUD53861.2021.00059

DO - 10.1109/CLOUD53861.2021.00059

M3 - Conference contribution

AN - SCOPUS:85119319749

T3 - IEEE International Conference on Cloud Computing, CLOUD

SP - 445

EP - 455

BT - Proceedings - 2021 IEEE 14th International Conference on Cloud Computing, CLOUD 2021

A2 - Ardagna, Claudio Agostino

A2 - Chang, Carl K.

A2 - Daminai, Ernesto

A2 - Ranjan, Rajiv

A2 - Wang, Zhongjie

A2 - Ward, Robert

A2 - Zhang, Jia

A2 - Zhang, Wensheng

PB - IEEE Computer Society

T2 - 14th IEEE International Conference on Cloud Computing, CLOUD 2021

Y2 - 5 September 2021 through 11 September 2021

ER -

Characterizing Loop Acceleration in Heterogeneous Computing

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this