ID-cache: Instruction and memory divergence based cache management for GPUs

Akhil Arunkumar, Shin Ying Lee, Carole-Jean Wu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Scopus citations

Abstract

Modern graphic processing units (GPUs) are not only able to perform graphics rendering, but also perform general purpose parallel computations (GPGPUs). It has been shown that the GPU L1 data cache and the on chip interconnect bandwidth are important sources of performance bottlenecks and inefficiencies in GPGPUs. Through this work, we aim to understand the sources of inefficiencies and possible opportunities for more efficient cache and interconnect bandwidth management on the GPUs. We do so by understanding the predictability of reuse behavior and spatial utilization of cache lines using program level information such as the instruction PC, and runtime behavior such as the extent of memory divergence. Through our characterization results, we demonstrate that a) PC, and memory divergence can be used to efficiently bypass zero reuse cache lines from the cache; b) memory divergence information can further be used to dynamically insert cache lines of varying size granularities based on their spatial utilization. Finally, based on the insights derived through our characterization, we design a simple Instruction and memory Divergence cache management method that is able to achieve an average of 71% performance improvement for a wide variety of cache and interconnect sensitive applications.

Original languageEnglish (US)
Title of host publicationProceedings of the 2016 IEEE International Symposium on Workload Characterization, IISWC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages158-167
Number of pages10
ISBN (Electronic)9781509038954
DOIs
StatePublished - Oct 3 2016
Event2016 IEEE International Symposium on Workload Characterization, IISWC 2016 - Providence, United States
Duration: Sep 25 2016Sep 27 2016

Publication series

NameProceedings of the 2016 IEEE International Symposium on Workload Characterization, IISWC 2016

Other

Other2016 IEEE International Symposium on Workload Characterization, IISWC 2016
Country/TerritoryUnited States
CityProvidence
Period9/25/169/27/16

ASJC Scopus subject areas

  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'ID-cache: Instruction and memory divergence based cache management for GPUs'. Together they form a unique fingerprint.

Cite this