Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading

Amarsagar Reddy Ramapuram Matavalam; Kishan Prudhvi Guddanti; Yang Weng; Venkataramana Ajjarapu

doi:10.1109/TPWRS.2022.3213487

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

This paper describes how domain knowledge of power system operators can be integrated into reinforcement learning (RL) frameworks to effectively train agents that control the grid's topology to prevent thermal cascading. Typical RL-based topology controllers fail to perform well due to the large search/optimization space. Here, we propose an actor-critic-based agent to address the problem's combinatorial nature and train the agent using the RL environment developed by RTE, the French TSO. To address the challenge of the large optimization space, a curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using network physics for enhanced agent learning. Further, a parallel training approach on multiple scenarios is employed to avoid biasing the agent to a few scenarios and make it robust to the natural variability in grid operations. Without these modifications to the training procedure, the RL agent failed for most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL learning. The agent was tested by RTE for the 2019 Learning to Run a Power Network (L2RPN) challenge and was awarded 2nd place in accuracy and 1st place in speed. The developed code is open-sourced for public use. Analysis of a simple system proves the enhancement in training RL agents using the curriculum.

Original language	English (US)
Pages (from-to)	4206-4220
Number of pages	15
Journal	IEEE Transactions on Power Systems
Volume	38
Issue number	5
DOIs	https://doi.org/10.1109/TPWRS.2022.3213487
State	Published - Sep 1 2023

Keywords

L2RPN
Reinforcement learning
actor-critic agents
cascading mitigation
open-sourced
parallel computing

ASJC Scopus subject areas

Energy Engineering and Power Technology
Electrical and Electronic Engineering

Access to Document

10.1109/TPWRS.2022.3213487

Cite this

@article{f66bc63a528e4864abd929a337ac2ccd,
title = "Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading",
abstract = "This paper describes how domain knowledge of power system operators can be integrated into reinforcement learning (RL) frameworks to effectively learn agents that control the grid's topology to prevent thermal cascading. Typical RL-based topology controllers fail to perform well due to the large search/optimization space. Here, we propose an actor-critic-based agent to address the problem's combinatorial nature and train the agent using the RL environment developed by RTE, the French TSO. To address the challenge of the large optimization space, a curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using network physics for enhanced agent learning. Further, a parallel training approach on multiple scenarios is employed to avoid biasing the agent to a few scenarios and make it robust to the natural variability in grid operations. Without these modifications to the training procedure, the RL agent failed for most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL learning. The agent was tested by RTE for the 2019 learning to run the power network challenge and was awarded the 2nd place in accuracy and 1st place in speed. The developed code is open-sourced for public use. Analysis of a simple system proves the enhancement in training RL-agents using the curriculum.",

keywords = "L2RPN, Reinforcement learning, actor-critic agents, cascading mitigation, open-sourced, parallel computing",

author = "{Ramapuram Matavalam}, {Amarsagar Reddy} and Guddanti, {Kishan Prudhvi} and Yang Weng and Venkataramana Ajjarapu",

note = "Publisher Copyright: {\textcopyright} 1969-2012 IEEE.",

year = "2023",

month = sep,

day = "1",

doi = "10.1109/TPWRS.2022.3213487",

language = "English (US)",

volume = "38",

pages = "4206--4220",

journal = "IEEE Transactions on Power Systems",

issn = "0885-8950",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR
T1 - Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading
AU - Ramapuram Matavalam, Amarsagar Reddy
AU - Guddanti, Kishan Prudhvi
AU - Weng, Yang
AU - Ajjarapu, Venkataramana
PY - 2023/9/1
Y1 - 2023/9/1
N2 - This paper describes how domain knowledge of power system operators can be integrated into reinforcement learning (RL) frameworks to effectively learn agents that control the grid's topology to prevent thermal cascading. Typical RL-based topology controllers fail to perform well due to the large search/optimization space. Here, we propose an actor-critic-based agent to address the problem's combinatorial nature and train the agent using the RL environment developed by RTE, the French TSO. To address the challenge of the large optimization space, a curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using network physics for enhanced agent learning. Further, a parallel training approach on multiple scenarios is employed to avoid biasing the agent to a few scenarios and make it robust to the natural variability in grid operations. Without these modifications to the training procedure, the RL agent failed for most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL learning. The agent was tested by RTE for the 2019 learning to run the power network challenge and was awarded the 2nd place in accuracy and 1st place in speed. The developed code is open-sourced for public use. Analysis of a simple system proves the enhancement in training RL-agents using the curriculum.

AB - This paper describes how domain knowledge of power system operators can be integrated into reinforcement learning (RL) frameworks to effectively learn agents that control the grid's topology to prevent thermal cascading. Typical RL-based topology controllers fail to perform well due to the large search/optimization space. Here, we propose an actor-critic-based agent to address the problem's combinatorial nature and train the agent using the RL environment developed by RTE, the French TSO. To address the challenge of the large optimization space, a curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using network physics for enhanced agent learning. Further, a parallel training approach on multiple scenarios is employed to avoid biasing the agent to a few scenarios and make it robust to the natural variability in grid operations. Without these modifications to the training procedure, the RL agent failed for most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL learning. The agent was tested by RTE for the 2019 learning to run the power network challenge and was awarded the 2nd place in accuracy and 1st place in speed. The developed code is open-sourced for public use. Analysis of a simple system proves the enhancement in training RL-agents using the curriculum.

KW - L2RPN
KW - Reinforcement learning
KW - actor-critic agents
KW - cascading mitigation
KW - open-sourced
KW - parallel computing
UR - http://www.scopus.com/inward/record.url?scp=85139868768&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139868768&partnerID=8YFLogxK
U2 - 10.1109/TPWRS.2022.3213487
DO - 10.1109/TPWRS.2022.3213487
M3 - Article
AN - SCOPUS:85139868768
SN - 0885-8950
VL - 38
SP - 4206
EP - 4220
JO - IEEE Transactions on Power Systems
JF - IEEE Transactions on Power Systems
IS - 5
ER -
