TY - GEN
T1 - Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
AU - Ye, Xin
AU - Yang, Yezhou
N1 - Funding Information:
This work is partially supported by the NSF grant #1750082, and Samsung Research.
Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks - a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.
AB - We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks - a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.
UR - http://www.scopus.com/inward/record.url?scp=85104096498&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85104096498&partnerID=8YFLogxK
U2 - 10.1109/CVPR46437.2021.01388
DO - 10.1109/CVPR46437.2021.01388
M3 - Conference contribution
AN - SCOPUS:85104096498
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 14096
EP - 14105
BT - Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
PB - IEEE Computer Society
T2 - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
Y2 - 19 June 2021 through 25 June 2021
ER -