Hierarchical strategy learning with hybrid representations

Sungwook Yoon; Subbarao Kambhampati

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

Good problem-solving knowledge for real-life domains is hard to define in a single representation. In some situations a direct policy is the better choice, while in others a value function is better. Typically, a direct policy representation is better suited to strategic-level plans, while a value function representation is better suited to tactical-level plans. We propose a hybrid hierarchical representation machine (HHRM) in which direct policy representations and value-function-based representations can co-exist in a level-wise fashion. We provide simple learning and planning algorithms for the new representation and discuss their application to the Airspace Deconfliction domain. In our experiments, we provided our system, LSP, with a two-level HHRM for the domain. LSP successfully learned from a limited number of experts' solution traces and showed superior performance compared to the average of human novice learners.

Original language	English (US)
Title of host publication	Acquiring Planning Knowledge via Demonstration - Papers from the 2007 AAAI Workshop, Technical Report
Pages	52-56
Number of pages	5
State	Published - Dec 1 2007
Event	2007 AAAI Workshop - Vancouver, BC, Canada; Duration: Jul 23 2007 → Jul 23 2007

Publication series

Name	AAAI Workshop - Technical Report
Volume	WS-07-02

Other

Other	2007 AAAI Workshop
Country/Territory	Canada
City	Vancouver, BC
Period	7/23/07 → 7/23/07

ASJC Scopus subject areas

General Engineering

Cite this

@inproceedings{26e6194364e44b6f944b52416b78b55e,

title = "Hierarchical strategy learning with hybrid representations",

abstract = "Good problem solving knowledge for real life domains is hard to define in a single representation. In some situations, a direct policy is a better choice while in others, value function is better. Typically, direct policy representation is better suited to strategic level plans, while value function representation is better suited to tactical level plans. We propose a hybrid hierarchical representation machine (HHRM) where direct policy representation and value function based representation can co-exist in a level-wise fashion. We provide simple learning and planning algorithms with our new representation and discuss their application to Airspace Deconfliction domain. In our experiments, we provided our system LSP with two level HHRM for the domain. LSP could successfully learn from limited number of experts' solution traces and show superior performance compared to average of human novice learners.",

author = "Sungwook Yoon and Subbarao Kambhampati",

year = "2007",

month = dec,

day = "1",

language = "English (US)",

isbn = "9781577353294",

series = "AAAI Workshop - Technical Report",

pages = "52--56",

booktitle = "Acquiring Planning Knowledge via Demonstration - Papers from the 2007 AAAI Workshop, Technical Report",

note = "2007 AAAI Workshop ; Conference date: 23-07-2007 Through 23-07-2007",

}

TY - GEN

T1 - Hierarchical strategy learning with hybrid representations

AU - Yoon, Sungwook

AU - Kambhampati, Subbarao

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Good problem solving knowledge for real life domains is hard to define in a single representation. In some situations, a direct policy is a better choice while in others, value function is better. Typically, direct policy representation is better suited to strategic level plans, while value function representation is better suited to tactical level plans. We propose a hybrid hierarchical representation machine (HHRM) where direct policy representation and value function based representation can co-exist in a level-wise fashion. We provide simple learning and planning algorithms with our new representation and discuss their application to Airspace Deconfliction domain. In our experiments, we provided our system LSP with two level HHRM for the domain. LSP could successfully learn from limited number of experts' solution traces and show superior performance compared to average of human novice learners.

AB - Good problem solving knowledge for real life domains is hard to define in a single representation. In some situations, a direct policy is a better choice while in others, value function is better. Typically, direct policy representation is better suited to strategic level plans, while value function representation is better suited to tactical level plans. We propose a hybrid hierarchical representation machine (HHRM) where direct policy representation and value function based representation can co-exist in a level-wise fashion. We provide simple learning and planning algorithms with our new representation and discuss their application to Airspace Deconfliction domain. In our experiments, we provided our system LSP with two level HHRM for the domain. LSP could successfully learn from limited number of experts' solution traces and show superior performance compared to average of human novice learners.

UR - http://www.scopus.com/inward/record.url?scp=51849088509&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=51849088509&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:51849088509

SN - 9781577353294

T3 - AAAI Workshop - Technical Report

SP - 52

EP - 56

BT - Acquiring Planning Knowledge via Demonstration - Papers from the 2007 AAAI Workshop, Technical Report

T2 - 2007 AAAI Workshop

Y2 - 23 July 2007 through 23 July 2007

ER -
