When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions

Sunny Amatya; Mukesh Ghimire; Yi Ren; Zhe Xu; Wenlong Zhang

doi:10.23919/ACC53348.2022.9867155

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper addresses incomplete-information dynamic games, where reward parameters of agents are private. Previous studies have shown that online belief update is necessary for deriving equilibrial policies of such games, especially for high-risk games such as vehicle interactions. However, updating beliefs in real time is computationally expensive as it requires continuous computation of Nash equilibria of the sub-games starting from the current states. In this paper, we consider the triggering mechanism of belief update as a policy defined on the agents' physical and belief states, and propose learning this policy through reinforcement learning (RL). Using a two-vehicle uncontrolled intersection case, we show that intermittent belief update via RL is sufficient for safe interactions, reducing the computation cost of updates by 59% when agents have full observations of physical states. Simulation results also show that the belief update frequency will increase as noise becomes more significant in measurements of the vehicle positions.
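The idea of treating the belief-update trigger as a policy over physical and belief states can be illustrated with a minimal sketch. It assumes a discrete set of intent types and swaps the learned RL policy for a hand-written threshold rule, so it is a stand-in and not the authors' method. All names and thresholds (`bayes_update`, `should_update`, `distance_threshold`, `entropy_threshold`) are hypothetical, and the per-type likelihoods are supplied directly rather than derived from sub-game Nash equilibria.

```python
import math


def bayes_update(belief, likelihoods):
    """Posterior over discrete intent types, given the likelihood of the
    observed action under each type (assumed to be supplied by the caller)."""
    unnormalized = [b * l for b, l in zip(belief, likelihoods)]
    total = sum(unnormalized)
    if total == 0.0:
        # Observation is impossible under every type: keep the prior.
        return list(belief)
    return [u / total for u in unnormalized]


def entropy(belief):
    """Shannon entropy (in nats) of a discrete belief."""
    return -sum(p * math.log(p) for p in belief if p > 0.0)


def should_update(distance, belief,
                  distance_threshold=10.0, entropy_threshold=0.3):
    """Hypothetical hand-written trigger standing in for the learned policy.

    It fires only when the vehicles are close (physical state) and the
    belief is still uncertain (belief state)."""
    return distance < distance_threshold and entropy(belief) > entropy_threshold


def run_episode(distances, likelihood_seq, prior):
    """Apply the expensive update only at steps where the trigger fires."""
    belief = list(prior)
    updates = 0
    for distance, likelihoods in zip(distances, likelihood_seq):
        if should_update(distance, belief):
            belief = bayes_update(belief, likelihoods)
            updates += 1
    return belief, updates
```

Calling `run_episode` with a sequence of inter-vehicle distances and per-step likelihoods returns the final belief and the number of updates performed; comparing that count with the number of steps gives a rough analogue of the computation saving the abstract reports.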

Original language	English (US)
Title of host publication	2022 American Control Conference, ACC 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	586-592
Number of pages	7
ISBN (Electronic)	9781665451963
DOIs	https://doi.org/10.23919/ACC53348.2022.9867155
State	Published - 2022
Event	2022 American Control Conference, ACC 2022 - Atlanta, United States; Duration: Jun 8 2022 → Jun 10 2022

Publication series

Name	Proceedings of the American Control Conference
Volume	2022-June
ISSN (Print)	0743-1619

Conference

Conference	2022 American Control Conference, ACC 2022
Country/Territory	United States
City	Atlanta
Period	6/8/22 → 6/10/22

ASJC Scopus subject areas

Electrical and Electronic Engineering

Access to Document

10.23919/ACC53348.2022.9867155

Cite this

Amatya, S., Ghimire, M., Ren, Y., Xu, Z., & Zhang, W. (2022). When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions. In 2022 American Control Conference, ACC 2022 (pp. 586-592). (Proceedings of the American Control Conference; Vol. 2022-June). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.23919/ACC53348.2022.9867155

When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions. / Amatya, Sunny; Ghimire, Mukesh; Ren, Yi et al.
2022 American Control Conference, ACC 2022. Institute of Electrical and Electronics Engineers Inc., 2022. p. 586-592 (Proceedings of the American Control Conference; Vol. 2022-June).

Amatya, S, Ghimire, M, Ren, Y, Xu, Z & Zhang, W 2022, When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions. in 2022 American Control Conference, ACC 2022. Proceedings of the American Control Conference, vol. 2022-June, Institute of Electrical and Electronics Engineers Inc., pp. 586-592, 2022 American Control Conference, ACC 2022, Atlanta, United States, 6/8/22. https://doi.org/10.23919/ACC53348.2022.9867155

@inproceedings{f8994490aaff4faeb7be367268ad237d,

title = "When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions",

abstract = "This paper addresses incomplete-information dynamic games, where reward parameters of agents are private. Previous studies have shown that online belief update is necessary for deriving equilibrial policies of such games, especially for high-risk games such as vehicle interactions. However, updating beliefs in real time is computationally expensive as it requires continuous computation of Nash equilibria of the sub-games starting from the current states. In this paper, we consider the triggering mechanism of belief update as a policy defined on the agents' physical and belief states, and propose learning this policy through reinforcement learning (RL). Using a two-vehicle uncontrolled intersection case, we show that intermittent belief update via RL is sufficient for safe interactions, reducing the computation cost of updates by 59% when agents have full observations of physical states. Simulation results also show that the belief update frequency will increase as noise becomes more significant in measurements of the vehicle positions.",

author = "Sunny Amatya and Mukesh Ghimire and Yi Ren and Zhe Xu and Wenlong Zhang",

note = "Publisher Copyright: {\textcopyright} 2022 American Automatic Control Council.; 2022 American Control Conference, ACC 2022 ; Conference date: 08-06-2022 Through 10-06-2022",

year = "2022",

doi = "10.23919/ACC53348.2022.9867155",

language = "English (US)",

series = "Proceedings of the American Control Conference",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "586--592",

booktitle = "2022 American Control Conference, ACC 2022",

}

TY - GEN

T1 - When Shall I Estimate Your Intent? Costs and Benefits of Intent Inference in Multi-Agent Interactions

AU - Amatya, Sunny

AU - Ghimire, Mukesh

AU - Ren, Yi

AU - Xu, Zhe

AU - Zhang, Wenlong

PY - 2022

Y1 - 2022

N2 - This paper addresses incomplete-information dynamic games, where reward parameters of agents are private. Previous studies have shown that online belief update is necessary for deriving equilibrial policies of such games, especially for high-risk games such as vehicle interactions. However, updating beliefs in real time is computationally expensive as it requires continuous computation of Nash equilibria of the sub-games starting from the current states. In this paper, we consider the triggering mechanism of belief update as a policy defined on the agents' physical and belief states, and propose learning this policy through reinforcement learning (RL). Using a two-vehicle uncontrolled intersection case, we show that intermittent belief update via RL is sufficient for safe interactions, reducing the computation cost of updates by 59% when agents have full observations of physical states. Simulation results also show that the belief update frequency will increase as noise becomes more significant in measurements of the vehicle positions.

AB - This paper addresses incomplete-information dynamic games, where reward parameters of agents are private. Previous studies have shown that online belief update is necessary for deriving equilibrial policies of such games, especially for high-risk games such as vehicle interactions. However, updating beliefs in real time is computationally expensive as it requires continuous computation of Nash equilibria of the sub-games starting from the current states. In this paper, we consider the triggering mechanism of belief update as a policy defined on the agents' physical and belief states, and propose learning this policy through reinforcement learning (RL). Using a two-vehicle uncontrolled intersection case, we show that intermittent belief update via RL is sufficient for safe interactions, reducing the computation cost of updates by 59% when agents have full observations of physical states. Simulation results also show that the belief update frequency will increase as noise becomes more significant in measurements of the vehicle positions.

UR - http://www.scopus.com/inward/record.url?scp=85138495923&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85138495923&partnerID=8YFLogxK

U2 - 10.23919/ACC53348.2022.9867155

DO - 10.23919/ACC53348.2022.9867155

M3 - Conference contribution

AN - SCOPUS:85138495923

T3 - Proceedings of the American Control Conference

SP - 586

EP - 592

BT - 2022 American Control Conference, ACC 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2022 American Control Conference, ACC 2022

Y2 - 8 June 2022 through 10 June 2022

ER -
