Worst-case satisfaction of STL specifications using feedforward neural network controllers: A lagrange multipliers approach

Shakiba Yaghoubi; Georgios Fainekos

doi:10.1145/3358239

Worst-case satisfaction of STL specifications using feedforward neural network controllers: A lagrange multipliers approach

Shakiba Yaghoubi, Georgios Fainekos

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

In this paper, a reinforcement learning approach for designing feedback neural network controllers for nonlinear systems is proposed. Given a Signal Temporal Logic (STL) specification which needs to be satisfied by the system over a set of initial conditions, the neural network parameters are tuned in order to maximize the satisfaction of the STL formula. The framework is based on a max-min formulation of the robustness of the STL formula. The maximization is solved through a Lagrange multipliers method, while the minimization corresponds to a falsification problem. We present our results on a vehicle and a quadrotor model and demonstrate that our approach reduces the training time more than 50 percent compared to the baseline approach.

Original language	English (US)
Article number	a107
Journal	ACM Transactions on Embedded Computing Systems
Volume	18
Issue number	5s
DOIs	https://doi.org/10.1145/3358239
State	Published - Oct 2019

Keywords

Neural network controller
Reinforcement learning
Signal temporal logic

ASJC Scopus subject areas

Software
Hardware and Architecture

Access to Document

10.1145/3358239

Cite this

@article{c13708e5f14d4b90906f38a696cdd12f,

title = "Worst-case satisfaction of STL specifications using feedforward neural network controllers: A lagrange multipliers approach",

abstract = "In this paper, a reinforcement learning approach for designing feedback neural network controllers for nonlinear systems is proposed. Given a Signal Temporal Logic (STL) specification which needs to be satisfied by the system over a set of initial conditions, the neural network parameters are tuned in order to maximize the satisfaction of the STL formula. The framework is based on a max-min formulation of the robustness of the STL formula. The maximization is solved through a Lagrange multipliers method, while the minimization corresponds to a falsification problem. We present our results on a vehicle and a quadrotor model and demonstrate that our approach reduces the training time more than 50 percent compared to the baseline approach.",

keywords = "Neural network controller, Reinforcement learning, Signal temporal logic",

author = "Shakiba Yaghoubi and Georgios Fainekos",

note = "Publisher Copyright: {\textcopyright} 2019 Association for Computing Machinery.",

year = "2019",

month = oct,

doi = "10.1145/3358239",

language = "English (US)",

volume = "18",

journal = "ACM Transactions on Embedded Computing Systems",

issn = "1539-9087",

publisher = "Association for Computing Machinery (ACM)",

number = "5s",

}

TY - JOUR

T1 - Worst-case satisfaction of STL specifications using feedforward neural network controllers

T2 - A lagrange multipliers approach

AU - Yaghoubi, Shakiba

AU - Fainekos, Georgios

PY - 2019/10

Y1 - 2019/10

N2 - In this paper, a reinforcement learning approach for designing feedback neural network controllers for nonlinear systems is proposed. Given a Signal Temporal Logic (STL) specification which needs to be satisfied by the system over a set of initial conditions, the neural network parameters are tuned in order to maximize the satisfaction of the STL formula. The framework is based on a max-min formulation of the robustness of the STL formula. The maximization is solved through a Lagrange multipliers method, while the minimization corresponds to a falsification problem. We present our results on a vehicle and a quadrotor model and demonstrate that our approach reduces the training time more than 50 percent compared to the baseline approach.

AB - In this paper, a reinforcement learning approach for designing feedback neural network controllers for nonlinear systems is proposed. Given a Signal Temporal Logic (STL) specification which needs to be satisfied by the system over a set of initial conditions, the neural network parameters are tuned in order to maximize the satisfaction of the STL formula. The framework is based on a max-min formulation of the robustness of the STL formula. The maximization is solved through a Lagrange multipliers method, while the minimization corresponds to a falsification problem. We present our results on a vehicle and a quadrotor model and demonstrate that our approach reduces the training time more than 50 percent compared to the baseline approach.

KW - Neural network controller

KW - Reinforcement learning

KW - Signal temporal logic

UR - http://www.scopus.com/inward/record.url?scp=85073161036&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073161036&partnerID=8YFLogxK

U2 - 10.1145/3358239

DO - 10.1145/3358239

M3 - Article

AN - SCOPUS:85073161036

SN - 1539-9087

VL - 18

JO - ACM Transactions on Embedded Computing Systems

JF - ACM Transactions on Embedded Computing Systems

IS - 5s

M1 - a107

ER -

Worst-case satisfaction of STL specifications using feedforward neural network controllers: A lagrange multipliers approach

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this