TY - GEN
T1 - Learning programs for decision and control
AU - Si, Jennie
AU - Enns, R.
AU - Wang, Yu Tsung
N1 - Funding Information:
Research supported by NSF under grants ECS-9553202 and ECS-0002098, and in part by Motorola. The third author is now with Scientific Monitoring, Inc. in Tempe, Arizona.
Publisher Copyright:
© 2001 IEEE.
PY - 2001
Y1 - 2001
N2 - Introduces learning programs, an approximate dynamic programming (ADP) algorithm, also known as neural dynamic programming (NDP), developed and tested by the authors. We first introduce the basic framework of our learning programs and the associated learning algorithms, and then present extensive case studies to demonstrate the effectiveness of our learning programs. This is probably the first time that NDP-type learning algorithms have been applied to complex, real-life continuous-state problems. Until now, reinforcement learning (another learning approach to approximate dynamic programming) has been mostly successful in discrete-state-space problems. On the other hand, prior NDP-based approaches to controlling continuous-state-space systems have all been limited to small-scale, linearized, or decoupled problems. Therefore the work presented here complements and advances the existing literature in the general area of learning approaches to approximate dynamic programming.
AB - Introduces learning programs, an approximate dynamic programming (ADP) algorithm, also known as neural dynamic programming (NDP), developed and tested by the authors. We first introduce the basic framework of our learning programs and the associated learning algorithms, and then present extensive case studies to demonstrate the effectiveness of our learning programs. This is probably the first time that NDP-type learning algorithms have been applied to complex, real-life continuous-state problems. Until now, reinforcement learning (another learning approach to approximate dynamic programming) has been mostly successful in discrete-state-space problems. On the other hand, prior NDP-based approaches to controlling continuous-state-space systems have all been limited to small-scale, linearized, or decoupled problems. Therefore the work presented here complements and advances the existing literature in the general area of learning approaches to approximate dynamic programming.
UR - http://www.scopus.com/inward/record.url?scp=84964501368&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84964501368&partnerID=8YFLogxK
U2 - 10.1109/ICII.2001.983100
DO - 10.1109/ICII.2001.983100
M3 - Conference contribution
AN - SCOPUS:84964501368
T3 - 2001 International Conferences on Info-Tech and Info-Net: A Key to Better Life, ICII 2001 - Proceedings
SP - 462
EP - 467
BT - 2001 International Conferences on Info-Tech and Info-Net
A2 - Shi, Zhongzhi
A2 - Li, Hui
A2 - Zhong, Y.X.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - International Conferences on Info-Tech and Info-Net, ICII 2001
Y2 - 29 October 2001 through 1 November 2001
ER -