An online learning methodology for performance modeling of graphics processors

Ujjwal Gupta; Manoj Babu; Raid Ayoub; Michael Kishinevsky; Francesco Paterna; Suat Gumussoy; Umit Ogras

doi:10.1109/TC.2018.2840710

An online learning methodology for performance modeling of graphics processors

Ujjwal Gupta, Manoj Babu, Raid Ayoub, Michael Kishinevsky, Francesco Paterna, Suat Gumussoy, Umit Ogras

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Approximately 18 percent of the 3.2 million smartphone applications rely on integrated graphics processing units (GPUs) to achieve competitive performance. Graphics performance, typically measured in frames per second, is a strong function of the GPU frequency, which in turn has a significant impact on mobile processor power consumption. Consequently, dynamic power management algorithms have to assess the performance sensitivity to the frequency accurately to choose the operating frequency of the GPU effectively. Since the impact of GPU frequency on performance varies rapidly over time, there is a need for online performance models that can adapt to varying workloads. This paper presents a light-weight adaptive runtime performance model that predicts the frame processing time of graphics workloads at runtime without apriori characterization. We employ this model to estimate the frame time sensitivity to the GPU frequency, i.e., the partial derivative of the frame time with respect to the GPU frequency. The proposed model does not rely on any parameter learned offline. Our experiments on commercial platforms with common GPU benchmarks show that the mean absolute percentage error in frame time and frame time sensitivity prediction are 4.2 and 6.7 percent, respectively.

Original language	English (US)
Article number	8365819
Pages (from-to)	1677-1691
Number of pages	15
Journal	IEEE Transactions on Computers
Volume	67
Issue number	12
DOIs	https://doi.org/10.1109/TC.2018.2840710
State	Published - Dec 1 2018

Keywords

Integrated GPUs
RLS
frequency scaling
online learning
performance modeling
power management

ASJC Scopus subject areas

Software
Theoretical Computer Science
Hardware and Architecture
Computational Theory and Mathematics

Access to Document

10.1109/TC.2018.2840710

Cite this

@article{dca1c7c027fd4eb7908e2f9d078f8741,

title = "An online learning methodology for performance modeling of graphics processors",

abstract = "Approximately 18 percent of the 3.2 million smartphone applications rely on integrated graphics processing units (GPUs) to achieve competitive performance. Graphics performance, typically measured in frames per second, is a strong function of the GPU frequency, which in turn has a significant impact on mobile processor power consumption. Consequently, dynamic power management algorithms have to assess the performance sensitivity to the frequency accurately to choose the operating frequency of the GPU effectively. Since the impact of GPU frequency on performance varies rapidly over time, there is a need for online performance models that can adapt to varying workloads. This paper presents a light-weight adaptive runtime performance model that predicts the frame processing time of graphics workloads at runtime without apriori characterization. We employ this model to estimate the frame time sensitivity to the GPU frequency, i.e., the partial derivative of the frame time with respect to the GPU frequency. The proposed model does not rely on any parameter learned offline. Our experiments on commercial platforms with common GPU benchmarks show that the mean absolute percentage error in frame time and frame time sensitivity prediction are 4.2 and 6.7 percent, respectively.",

keywords = "Integrated GPUs, RLS, frequency scaling, online learning, performance modeling, power management",

author = "Ujjwal Gupta and Manoj Babu and Raid Ayoub and Michael Kishinevsky and Francesco Paterna and Suat Gumussoy and Umit Ogras",

note = "Funding Information: This work was supported partially by Semiconductor Research Corporation (SRC) Task 2721.001 and US National Science Foundation (NSF) grant CNS-1526562. Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",

year = "2018",

month = dec,

day = "1",

doi = "10.1109/TC.2018.2840710",

language = "English (US)",

volume = "67",

pages = "1677--1691",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "12",

}

TY - JOUR

T1 - An online learning methodology for performance modeling of graphics processors

AU - Gupta, Ujjwal

AU - Babu, Manoj

AU - Ayoub, Raid

AU - Kishinevsky, Michael

AU - Paterna, Francesco

AU - Gumussoy, Suat

AU - Ogras, Umit

N1 - Funding Information: This work was supported partially by Semiconductor Research Corporation (SRC) Task 2721.001 and US National Science Foundation (NSF) grant CNS-1526562. Publisher Copyright: © 1968-2012 IEEE.

PY - 2018/12/1

Y1 - 2018/12/1

N2 - Approximately 18 percent of the 3.2 million smartphone applications rely on integrated graphics processing units (GPUs) to achieve competitive performance. Graphics performance, typically measured in frames per second, is a strong function of the GPU frequency, which in turn has a significant impact on mobile processor power consumption. Consequently, dynamic power management algorithms have to assess the performance sensitivity to the frequency accurately to choose the operating frequency of the GPU effectively. Since the impact of GPU frequency on performance varies rapidly over time, there is a need for online performance models that can adapt to varying workloads. This paper presents a light-weight adaptive runtime performance model that predicts the frame processing time of graphics workloads at runtime without apriori characterization. We employ this model to estimate the frame time sensitivity to the GPU frequency, i.e., the partial derivative of the frame time with respect to the GPU frequency. The proposed model does not rely on any parameter learned offline. Our experiments on commercial platforms with common GPU benchmarks show that the mean absolute percentage error in frame time and frame time sensitivity prediction are 4.2 and 6.7 percent, respectively.

AB - Approximately 18 percent of the 3.2 million smartphone applications rely on integrated graphics processing units (GPUs) to achieve competitive performance. Graphics performance, typically measured in frames per second, is a strong function of the GPU frequency, which in turn has a significant impact on mobile processor power consumption. Consequently, dynamic power management algorithms have to assess the performance sensitivity to the frequency accurately to choose the operating frequency of the GPU effectively. Since the impact of GPU frequency on performance varies rapidly over time, there is a need for online performance models that can adapt to varying workloads. This paper presents a light-weight adaptive runtime performance model that predicts the frame processing time of graphics workloads at runtime without apriori characterization. We employ this model to estimate the frame time sensitivity to the GPU frequency, i.e., the partial derivative of the frame time with respect to the GPU frequency. The proposed model does not rely on any parameter learned offline. Our experiments on commercial platforms with common GPU benchmarks show that the mean absolute percentage error in frame time and frame time sensitivity prediction are 4.2 and 6.7 percent, respectively.

KW - Integrated GPUs

KW - RLS

KW - frequency scaling

KW - online learning

KW - performance modeling

KW - power management

UR - http://www.scopus.com/inward/record.url?scp=85047613199&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85047613199&partnerID=8YFLogxK

U2 - 10.1109/TC.2018.2840710

DO - 10.1109/TC.2018.2840710

M3 - Article

AN - SCOPUS:85047613199

SN - 0018-9340

VL - 67

SP - 1677

EP - 1691

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 12

M1 - 8365819

ER -

An online learning methodology for performance modeling of graphics processors

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this