Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming

Weijia Che, Karam Chatha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

12 Citations (Scopus)

Abstract

The prevalence of stream applications in signal processing, multi-media, and network processing domains has resulted in a new trend of programming and architecture design. Several languages and multicore architectures have been developed to support streaming applications. In many of these multicore architectures scratchpad memories (SPM) have substituted caches due to their lower power consumption. Performance optimization on SPM based architectures requires the programmer/compiler to efficiently manage the limited local memory. Our paper addresses the problem of compilation of stream programs onto multicore architectures that incorporate SPMs. We propose a retiming technique that maximizes the throughput under a memory constraint with a user-specified number of software pipeline stages. Trade-offs between double buffering and code overlay are explored intensively in our technique to achieve the best performance. The efficiency of our technique was evaluated by compiling several stream applications for the IBM Cell BE and comparing their results against existing approaches.

Original languageEnglish (US)
Title of host publicationProceedings - Design Automation Conference
Pages122-127
Number of pages6
StatePublished - 2011
Event2011 48th ACM/EDAC/IEEE Design Automation Conference, DAC 2011 - San Diego, CA, United States
Duration: Jun 5 2011Jun 9 2011

Other

Other2011 48th ACM/EDAC/IEEE Design Automation Conference, DAC 2011
CountryUnited States
CitySan Diego, CA
Period6/5/116/9/11

Fingerprint

Multi-core Processor
Compilation
Data storage equipment
Memory architecture
Signal processing
Electric power utilization
Pipelines
Throughput
Performance Optimization
Overlay
Streaming
Compiler
Cache
Power Consumption
Multimedia
Signal Processing
Processing
Programming
Trade-offs
Maximise

Keywords

  • Compiler
  • Embedded
  • Multicore
  • Processors
  • Retiming
  • Scratchpad Memory
  • Stream

ASJC Scopus subject areas

  • Computer Science Applications
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Modeling and Simulation

Cite this

Che, W., & Chatha, K. (2011). Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming. In Proceedings - Design Automation Conference (pp. 122-127). [5981708]

Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming. / Che, Weijia; Chatha, Karam.

Proceedings - Design Automation Conference. 2011. p. 122-127 5981708.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Che, W & Chatha, K 2011, Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming. in Proceedings - Design Automation Conference., 5981708, pp. 122-127, 2011 48th ACM/EDAC/IEEE Design Automation Conference, DAC 2011, San Diego, CA, United States, 6/5/11.
Che W, Chatha K. Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming. In Proceedings - Design Automation Conference. 2011. p. 122-127. 5981708
Che, Weijia ; Chatha, Karam. / Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming. Proceedings - Design Automation Conference. 2011. pp. 122-127
@inproceedings{b2d9109dd36642dabaa12d32c6dfe8c8,
title = "Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming",
abstract = "The prevalence of stream applications in signal processing, multi-media, and network processing domains has resulted in a new trend of programming and architecture design. Several languages and multicore architectures have been developed to support streaming applications. In many of these multicore architectures scratchpad memories (SPM) have substituted caches due to their lower power consumption. Performance optimization on SPM based architectures requires the programmer/compiler to efficiently manage the limited local memory. Our paper addresses the problem of compilation of stream programs onto multicore architectures that incorporate SPMs. We propose a retiming technique that maximizes the throughput under a memory constraint with a user-specified number of software pipeline stages. Trade-offs between double buffering and code overlay are explored intensively in our technique to achieve the best performance. The efficiency of our technique was evaluated by compiling several stream applications for the IBM Cell BE and comparing their results against existing approaches.",
keywords = "Compiler, Embedded, Multicore, Processors, Retiming, Scratchpad Memory, Stream",
author = "Weijia Che and Karam Chatha",
year = "2011",
language = "English (US)",
isbn = "9781450306362",
pages = "122--127",
booktitle = "Proceedings - Design Automation Conference",

}

TY - GEN

T1 - Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming

AU - Che, Weijia

AU - Chatha, Karam

PY - 2011

Y1 - 2011

N2 - The prevalence of stream applications in signal processing, multi-media, and network processing domains has resulted in a new trend of programming and architecture design. Several languages and multicore architectures have been developed to support streaming applications. In many of these multicore architectures scratchpad memories (SPM) have substituted caches due to their lower power consumption. Performance optimization on SPM based architectures requires the programmer/compiler to efficiently manage the limited local memory. Our paper addresses the problem of compilation of stream programs onto multicore architectures that incorporate SPMs. We propose a retiming technique that maximizes the throughput under a memory constraint with a user-specified number of software pipeline stages. Trade-offs between double buffering and code overlay are explored intensively in our technique to achieve the best performance. The efficiency of our technique was evaluated by compiling several stream applications for the IBM Cell BE and comparing their results against existing approaches.

AB - The prevalence of stream applications in signal processing, multi-media, and network processing domains has resulted in a new trend of programming and architecture design. Several languages and multicore architectures have been developed to support streaming applications. In many of these multicore architectures scratchpad memories (SPM) have substituted caches due to their lower power consumption. Performance optimization on SPM based architectures requires the programmer/compiler to efficiently manage the limited local memory. Our paper addresses the problem of compilation of stream programs onto multicore architectures that incorporate SPMs. We propose a retiming technique that maximizes the throughput under a memory constraint with a user-specified number of software pipeline stages. Trade-offs between double buffering and code overlay are explored intensively in our technique to achieve the best performance. The efficiency of our technique was evaluated by compiling several stream applications for the IBM Cell BE and comparing their results against existing approaches.

KW - Compiler

KW - Embedded

KW - Multicore

KW - Processors

KW - Retiming

KW - Scratchpad Memory

KW - Stream

UR - http://www.scopus.com/inward/record.url?scp=80052663504&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80052663504&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781450306362

SP - 122

EP - 127

BT - Proceedings - Design Automation Conference

ER -