16 Scopus citations


Heterogeneous multiprocessor system-on-chips (SoCs) provide a wide range of architectural parameters that can be managed dynamically. For example, one can control the type (big/little), number and frequency of active cores of the state-of-the-art mobile processors at runtime. These runtime choices result in more than 10x range in execution time, 5x range in power consumption, and 50x range in performance per watt. Therefore, it is crucial to make optimum runtime decisions as a function of dynamically varying workloads. This paper presents a reinforcement learning approach for dynamically controlling the number and frequency of big and little cores in mobile processors. We first demonstrate that commonly used Q-Table approach does not scale to address this problem. Then, we propose an efficient deep Q-learning methodology to optimize the performance per watt. Experiments using Odroid XU3 mobile processor platform show that the performance per watt achieved by the proposed approach is within 1% of the optimal value obtained by an oracle.

Original languageEnglish (US)
JournalIEEE Computer Architecture Letters
StateAccepted/In press - Jan 1 2019


  • Frequency control
  • Instruments
  • Memory management
  • Power demand
  • Power system management
  • Runtime
  • Training

ASJC Scopus subject areas

  • Hardware and Architecture

Fingerprint Dive into the research topics of 'A Deep Q-Learning Approach for Dynamic Management of Heterogeneous Processors'. Together they form a unique fingerprint.

Cite this