TY - GEN
T1 - Directing policy search with interactively taught via-points
AU - Schroecker, Yannick
AU - Ben Amor, Heni
AU - Thomaz, Andrea
N1 - Funding Information:
This work was conducted as a part of the OpenLabs project 1436618 sponsored by PSA Peugeot and partially funded under ONR grant number N000141410003.
Publisher Copyright:
Copyright © 2016, International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.
PY - 2016
Y1 - 2016
N2 - Policy search has been successfully applied to robot motor learning problems. However, moderately complex tasks still require good heuristics or a good initialization. One method that alleviates this problem is to use demonstrations obtained from a human teacher as a starting point for policy search in the space of trajectories. In this paper, we describe an alternative way of giving demonstrations as soft via-points and show how they can be used for initialization as well as for active corrections during the learning process. With this approach, we restrict the search space to trajectories that stay close to the taught via-points at the taught times, thereby significantly reducing the number of samples necessary to learn a good policy. Using a simulated robot arm, we show that our method can efficiently learn to insert an object into a hole from just a minimal demonstration, and we further evaluate our method on a synthetic letter-reproduction task.
AB - Policy search has been successfully applied to robot motor learning problems. However, moderately complex tasks still require good heuristics or a good initialization. One method that alleviates this problem is to use demonstrations obtained from a human teacher as a starting point for policy search in the space of trajectories. In this paper, we describe an alternative way of giving demonstrations as soft via-points and show how they can be used for initialization as well as for active corrections during the learning process. With this approach, we restrict the search space to trajectories that stay close to the taught via-points at the taught times, thereby significantly reducing the number of samples necessary to learn a good policy. Using a simulated robot arm, we show that our method can efficiently learn to insert an object into a hole from just a minimal demonstration, and we further evaluate our method on a synthetic letter-reproduction task.
KW - Dynamic movement primitives
KW - Keyframe demonstrations
KW - Learning from demonstration
KW - Reinforcement learning
KW - Reinforcement learning for motor skills
UR - http://www.scopus.com/inward/record.url?scp=85014153645&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85014153645&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85014153645
T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
SP - 1052
EP - 1059
BT - AAMAS 2016 - Proceedings of the 2016 International Conference on Autonomous Agents and Multiagent Systems
PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
T2 - 15th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2016
Y2 - 9 May 2016 through 13 May 2016
ER -