Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method

Jian Sun; Feng Liu; Jennie Si; Shengwei Mei

doi:10.1109/CRIS.2010.5617558

Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method

Jian Sun, Feng Liu, Jennie Si, Shengwei Mei

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

As an online learning algorithm of approximate dynamic programming (ADP), direct heuristic dynamic programming (DHDP) has demonstrated its applicability to large state and control problems. However, there still lacks of a systemic approach to initialize the network weights for DHDP. In this paper, an improved PID-neural network (IPIDNN) configuration is proposed and applied to the critic and action networks of DHDP, which is flexible and easy to expand. Because of incorporating an inherent PID control structure, it is easy to use a well-designed PID controller to guide the initial weighs choosing for the action network. Based on this framework, a novel initializing approach is suggested based on a PID controller, such that the DHDP learning process starts from a good enough initial state. Simulations are carried on a cart-pole system to validate the effectiveness of the IPIDNN-based DHDP and the proposed initializing approach.

Original language	English (US)
Title of host publication	2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings
DOIs	https://doi.org/10.1109/CRIS.2010.5617558
State	Published - 2010
Event	2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Beijing, China Duration: Sep 20 2010 → Sep 22 2010

Publication series

Name	2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings

Other

Other	2010 5th International Conference on Critical Infrastructure, CRIS 2010
Country/Territory	China
City	Beijing
Period	9/20/10 → 9/22/10

Keywords

Approximate dynamic programming (ADP)
Direct heuristic dynamic programming (direct HDP)
Improved PID neural network (IPIDNN)
Initial weighs choosing
PID controller

ASJC Scopus subject areas

Computer Networks and Communications
Hardware and Architecture
Electrical and Electronic Engineering

Access to Document

10.1109/CRIS.2010.5617558

Cite this

Sun, J., Liu, F., Si, J., & Mei, S. (2010). Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method. In 2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings Article 5617558 (2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings). https://doi.org/10.1109/CRIS.2010.5617558

Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method. / Sun, Jian; Liu, Feng; Si, Jennie et al.
2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings. 2010. 5617558 (2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Sun, J, Liu, F, Si, J & Mei, S 2010, Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method. in 2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings., 5617558, 2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings, 2010 5th International Conference on Critical Infrastructure, CRIS 2010, Beijing, China, 9/20/10. https://doi.org/10.1109/CRIS.2010.5617558

@inproceedings{ef9834a171c1455aa1dbb062fae3bc96,

title = "Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method",

abstract = "As an online learning algorithm of approximate dynamic programming (ADP), direct heuristic dynamic programming (DHDP) has demonstrated its applicability to large state and control problems. However, there still lacks of a systemic approach to initialize the network weights for DHDP. In this paper, an improved PID-neural network (IPIDNN) configuration is proposed and applied to the critic and action networks of DHDP, which is flexible and easy to expand. Because of incorporating an inherent PID control structure, it is easy to use a well-designed PID controller to guide the initial weighs choosing for the action network. Based on this framework, a novel initializing approach is suggested based on a PID controller, such that the DHDP learning process starts from a good enough initial state. Simulations are carried on a cart-pole system to validate the effectiveness of the IPIDNN-based DHDP and the proposed initializing approach.",

keywords = "Approximate dynamic programming (ADP), Direct heuristic dynamic programming (direct HDP), Improved PID neural network (IPIDNN), Initial weighs choosing, PID controller",

author = "Jian Sun and Feng Liu and Jennie Si and Shengwei Mei",

year = "2010",

doi = "10.1109/CRIS.2010.5617558",

language = "English (US)",

isbn = "9781424480814",

series = "2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings",

booktitle = "2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings",

note = "2010 5th International Conference on Critical Infrastructure, CRIS 2010 ; Conference date: 20-09-2010 Through 22-09-2010",

}

TY - GEN

T1 - Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method

AU - Sun, Jian

AU - Liu, Feng

AU - Si, Jennie

AU - Mei, Shengwei

PY - 2010

Y1 - 2010

N2 - As an online learning algorithm of approximate dynamic programming (ADP), direct heuristic dynamic programming (DHDP) has demonstrated its applicability to large state and control problems. However, there still lacks of a systemic approach to initialize the network weights for DHDP. In this paper, an improved PID-neural network (IPIDNN) configuration is proposed and applied to the critic and action networks of DHDP, which is flexible and easy to expand. Because of incorporating an inherent PID control structure, it is easy to use a well-designed PID controller to guide the initial weighs choosing for the action network. Based on this framework, a novel initializing approach is suggested based on a PID controller, such that the DHDP learning process starts from a good enough initial state. Simulations are carried on a cart-pole system to validate the effectiveness of the IPIDNN-based DHDP and the proposed initializing approach.

AB - As an online learning algorithm of approximate dynamic programming (ADP), direct heuristic dynamic programming (DHDP) has demonstrated its applicability to large state and control problems. However, there still lacks of a systemic approach to initialize the network weights for DHDP. In this paper, an improved PID-neural network (IPIDNN) configuration is proposed and applied to the critic and action networks of DHDP, which is flexible and easy to expand. Because of incorporating an inherent PID control structure, it is easy to use a well-designed PID controller to guide the initial weighs choosing for the action network. Based on this framework, a novel initializing approach is suggested based on a PID controller, such that the DHDP learning process starts from a good enough initial state. Simulations are carried on a cart-pole system to validate the effectiveness of the IPIDNN-based DHDP and the proposed initializing approach.

KW - Approximate dynamic programming (ADP)

KW - Direct heuristic dynamic programming (direct HDP)

KW - Improved PID neural network (IPIDNN)

KW - Initial weighs choosing

KW - PID controller

UR - http://www.scopus.com/inward/record.url?scp=78649927364&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78649927364&partnerID=8YFLogxK

U2 - 10.1109/CRIS.2010.5617558

DO - 10.1109/CRIS.2010.5617558

M3 - Conference contribution

AN - SCOPUS:78649927364

SN - 9781424480814

T3 - 2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings

BT - 2010 5th International Conference on Critical Infrastructure, CRIS 2010 - Proceedings

T2 - 2010 5th International Conference on Critical Infrastructure, CRIS 2010

Y2 - 20 September 2010 through 22 September 2010

ER -

Direct heuristic dynamic programming based on an improved PID neural network and initial weighs choosing method

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this