Branch penalty reduction on IBM cell SPUs via software branch hinting

Jing Lu; Yooseong Kim; Aviral Shrivastava; Chuan Huang

doi:10.1145/2039370.2039425

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

As power efficiency becomes a paramount concern in processor design, architectures are emerging that do away with hardware branch prediction entirely and rely solely on software branch hinting. A popular example is the Synergistic Processing Unit (SPU) in the IBM Cell processor. Minimizing the branch penalty with branch hint instructions requires not only estimating branch probabilities (which has been studied before [6, 25, 24]) but also inserting the hints carefully. To this end, we i) construct a branch penalty model for the compiler, ii) formulate the problem of minimizing branch penalty using branch hinting, and iii) propose a heuristic to solve this problem. The heuristic is based on three basic techniques that we introduce in this paper: NOP padding, hint pipelining, and nested loop restructuring. Experimental results on several benchmarks show that our solution can reduce the branch penalty by as much as 35.4% over the previous approach.
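
To illustrate what a compiler-side branch penalty model might look like, the following is a minimal sketch and not the model from the paper. It assumes a two-case cost: an unhinted branch is treated as falling through, so it stalls whenever it is taken, while a branch hinted as taken stalls whenever it is not taken. The `MISS_PENALTY` value, the `Branch` type, and both function names are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass

# Assumed stall cost in cycles for a wrongly predicted branch.
# Illustrative value only; it is not taken from the paper.
MISS_PENALTY = 18


@dataclass
class Branch:
    taken_prob: float  # estimated probability that the branch is taken
    executions: int    # estimated number of times the branch executes


def expected_penalty(branch: Branch, hinted_taken: bool) -> float:
    """Expected stall cycles for one branch under the two-case assumption.

    Unhinted: every taken execution pays MISS_PENALTY.
    Hinted as taken: every not-taken execution pays MISS_PENALTY.
    """
    miss_prob = (1.0 - branch.taken_prob) if hinted_taken else branch.taken_prob
    return branch.executions * miss_prob * MISS_PENALTY


def should_hint(branch: Branch) -> bool:
    """Hint the branch as taken only if that lowers the expected penalty."""
    return expected_penalty(branch, True) < expected_penalty(branch, False)


if __name__ == "__main__":
    loop_back_edge = Branch(taken_prob=0.9, executions=1000)
    print(should_hint(loop_back_edge))
```

A sketch like this covers only the per-branch decision. It ignores hint placement constraints, which is where the techniques named in the abstract (NOP padding, hint pipelining, nested loop restructuring) would come in.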

Original language	English (US)
Title of host publication	Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11
Pages	355-364
Number of pages	10
DOIs	https://doi.org/10.1145/2039370.2039425
State	Published - 2011
Event	Embedded Systems Week 2011, ESWEEK 2011 - 9th IEEE/ACM International Conference on Hardware/Software-Codesign and System Synthesis, CODES+ISSS'11 - Taipei, Taiwan, Province of China
Duration	Oct 9 2011 → Oct 14 2011

Publication series

Name	Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11

Other

Other	Embedded Systems Week 2011, ESWEEK 2011 - 9th IEEE/ACM International Conference on Hardware/Software-Codesign and System Synthesis, CODES+ISSS'11
Country/Territory	Taiwan, Province of China
City	Taipei
Period	10/9/11 → 10/14/11

Keywords

Branch hint
Cell processor
Compiler optimization

ASJC Scopus subject areas

Hardware and Architecture
Software
Control and Systems Engineering

Cite this

Lu, J., Kim, Y., Shrivastava, A., & Huang, C. (2011). Branch penalty reduction on IBM cell SPUs via software branch hinting. In Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11 (pp. 355-364). (Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11). https://doi.org/10.1145/2039370.2039425

@inproceedings{f9d9c62d710a46b5b5a45c1b87fc2b0d,

title = "Branch penalty reduction on IBM cell SPUs via software branch hinting",

keywords = "Branch hint, Cell processor, Compiler optimization",

author = "Jing Lu and Yooseong Kim and Aviral Shrivastava and Chuan Huang",

year = "2011",

doi = "10.1145/2039370.2039425",

language = "English (US)",

isbn = "9781450307154",

series = "Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11",

pages = "355--364",

booktitle = "Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11",

note = "Embedded Systems Week 2011, ESWEEK 2011 - 9th IEEE/ACM International Conference on Hardware/Software-Codesign and System Synthesis, CODES+ISSS'11 ; Conference date: 09-10-2011 Through 14-10-2011",

}

TY - GEN

T1 - Branch penalty reduction on IBM cell SPUs via software branch hinting

AU - Lu, Jing

AU - Kim, Yooseong

AU - Shrivastava, Aviral

AU - Huang, Chuan

PY - 2011

Y1 - 2011

KW - Branch hint

KW - Cell processor

KW - Compiler optimization

UR - http://www.scopus.com/inward/record.url?scp=81355124046&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=81355124046&partnerID=8YFLogxK

U2 - 10.1145/2039370.2039425

DO - 10.1145/2039370.2039425

M3 - Conference contribution

AN - SCOPUS:81355124046

SN - 9781450307154

T3 - Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11

SP - 355

EP - 364

BT - Embedded Systems Week 2011, ESWEEK 2011 - Proceedings of the 9th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS'11

T2 - Embedded Systems Week 2011, ESWEEK 2011 - 9th IEEE/ACM International Conference on Hardware/Software-Codesign and System Synthesis, CODES+ISSS'11

Y2 - 9 October 2011 through 14 October 2011

ER -
