Toward Design of Nonlinear ADP Learning Controllers with Performance Assurance

Jennie Si; Lei Yang; Chao Lu; Konstantinos Tsakalis; Armando Rodriguez

doi:10.1002/9781118453988.ch9

Research output: Chapter in Book/Report/Conference proceeding › Chapter

2 Scopus citations

Abstract

This chapter discusses nonlinear control system design using approximate/adaptive dynamic programming (ADP). ADP algorithms based on learning and approximation have shown great promise in reducing the curse of dimensionality suffered by dynamic programming (DP). They have benefited from the design techniques of artificial neural networks and other function approximators, which have developed principled ways for universal function approximation. Direct heuristic dynamic programming (HDP) was introduced as an on-line learning control scheme inspired by adaptive critic designs, a family of ADP algorithms. Applications of direct HDP to large and complex problems have demonstrated the feasibility and scalability of the learning controller design. The results, such as Apache helicopter control and coordination of large power networks for damping low-frequency oscillation, are encouraging and promising as proofs of concept toward scalable ADP designs. However, real controllers demand performance assurances, not merely a statistical learning success rate indicating that the controller works most of the time. With this in mind, this chapter discusses some recent developments in this direction.

Original language	English (US)
Title of host publication	Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
Publisher	John Wiley and Sons
Pages	182-202
Number of pages	21
ISBN (Print)	9781118104200
DOIs	https://doi.org/10.1002/9781118453988.ch9
State	Published - Feb 7 2013

Keywords

ADP, reducing DP curse of dimensionality
Adaptive critic, control output/value
Direct HDP, sensitivity maps in action/critic
Nonlinear ADP, performance assurance
Nonlinear control design using ADP

ASJC Scopus subject areas

General Engineering

Cite this

@inbook{3d6c7ef0b5a64310b00ee8c96de4a313,

title = "Toward Design of Nonlinear ADP Learning Controllers with Performance Assurance",

abstract = "This chapter discusses nonlinear control system design using approximate/adaptive dynamic programming (ADP). ADP algorithms based on learning and approximation have shown great promise in reducing the curse of dimensionality suffered by dynamic programming (DP). They have benefited from the design techniques of artificial neural networks and other function approximators, which have developed principled ways for universal function approximation. Direct heuristic dynamic programming (HDP) was introduced as an on-line learning control scheme inspired by adaptive critic designs, a family of ADP algorithms. Applications of direct HDP to large and complex problems have demonstrated the feasibility and scalability of the learning controller design. The results, such as Apache helicopter control and coordination of large power networks for damping low-frequency oscillation, are encouraging and promising as proofs of concept toward scalable ADP designs. However, real controllers demand performance assurances, not merely a statistical learning success rate indicating that the controller works most of the time. With this in mind, this chapter discusses some recent developments in this direction.",

keywords = "ADP, reducing DP curse of dimensionality, Adaptive critic, control output/value, Direct HDP, sensitivity maps in action/critic, Nonlinear ADP, performance assurance, Nonlinear control design using ADP",

author = "Jennie Si and Lei Yang and Chao Lu and Konstantinos Tsakalis and Armando Rodriguez",

year = "2013",

month = feb,

day = "7",

doi = "10.1002/9781118453988.ch9",

language = "English (US)",

isbn = "9781118104200",

pages = "182--202",

booktitle = "Reinforcement Learning and Approximate Dynamic Programming for Feedback Control",

publisher = "John Wiley and Sons",

}

TY - CHAP

T1 - Toward Design of Nonlinear ADP Learning Controllers with Performance Assurance

AU - Si, Jennie

AU - Yang, Lei

AU - Lu, Chao

AU - Tsakalis, Konstantinos

AU - Rodriguez, Armando

PY - 2013/2/7

Y1 - 2013/2/7

N2 - This chapter discusses nonlinear control system design using approximate/adaptive dynamic programming (ADP). ADP algorithms based on learning and approximation have shown great promise in reducing the curse of dimensionality suffered by dynamic programming (DP). They have benefited from the design techniques of artificial neural networks and other function approximators, which have developed principled ways for universal function approximation. Direct heuristic dynamic programming (HDP) was introduced as an on-line learning control scheme inspired by adaptive critic designs, a family of ADP algorithms. Applications of direct HDP to large and complex problems have demonstrated the feasibility and scalability of the learning controller design. The results, such as Apache helicopter control and coordination of large power networks for damping low-frequency oscillation, are encouraging and promising as proofs of concept toward scalable ADP designs. However, real controllers demand performance assurances, not merely a statistical learning success rate indicating that the controller works most of the time. With this in mind, this chapter discusses some recent developments in this direction.

AB - This chapter discusses nonlinear control system design using approximate/adaptive dynamic programming (ADP). ADP algorithms based on learning and approximation have shown great promise in reducing the curse of dimensionality suffered by dynamic programming (DP). They have benefited from the design techniques of artificial neural networks and other function approximators, which have developed principled ways for universal function approximation. Direct heuristic dynamic programming (HDP) was introduced as an on-line learning control scheme inspired by adaptive critic designs, a family of ADP algorithms. Applications of direct HDP to large and complex problems have demonstrated the feasibility and scalability of the learning controller design. The results, such as Apache helicopter control and coordination of large power networks for damping low-frequency oscillation, are encouraging and promising as proofs of concept toward scalable ADP designs. However, real controllers demand performance assurances, not merely a statistical learning success rate indicating that the controller works most of the time. With this in mind, this chapter discusses some recent developments in this direction.

KW - ADP, reducing DP curse of dimensionality

KW - Adaptive critic, control output/value

KW - Direct HDP, sensitivity maps in action/critic

KW - Nonlinear ADP, performance assurance

KW - Nonlinear control design using ADP

UR - http://www.scopus.com/inward/record.url?scp=84886338359&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84886338359&partnerID=8YFLogxK

U2 - 10.1002/9781118453988.ch9

DO - 10.1002/9781118453988.ch9

M3 - Chapter

AN - SCOPUS:84886338359

SN - 9781118104200

SP - 182

EP - 202

BT - Reinforcement Learning and Approximate Dynamic Programming for Feedback Control

PB - John Wiley and Sons

ER -
