Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

Aditi Mishra; Utkarsh Soni; Jinbin Huang; Chris Bryan

doi:10.1109/PacificVis53943.2022.00020

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

Aditi Mishra, Utkarsh Soni, Jinbin Huang, Chris Bryan

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those are who not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-critical domains. To help address this challenge, we introduce PolicyExplainer, a visual analytics interface which lets the user directly query an autonomous agent. PolicyExplainer visualizes the states, policy, and expected future rewards for an agent, and supports asking and answering questions such as: 'Why take this action? Why not take this other action? When is this action taken?' PolicyExplainer is designed based upon a domain analysis with RL researchers, and is evaluated via qualitative and quantitative assessments on a trio of domains: taxi navigation, a stack bot domain, and drug recommendation for HIV patients. We find that PolicyExplainer's visual approach promotes trust and understanding of agent decisions better than a state-of-the-art text-based explanation approach. Interviews with domain practitioners provide further validation for PolicyExplainer as applied to safety-critical domains. Our results help demonstrate how visualization-based approaches can be leveraged to decode the behavior of autonomous RL agents, particularly for RL non-experts.

Original language	English (US)
Title of host publication	Proceedings - 2022 IEEE 15th Pacific Visualization Symposium, PacificVis 2022
Publisher	IEEE Computer Society
Pages	111-120
Number of pages	10
ISBN (Electronic)	9781665423359
DOIs	https://doi.org/10.1109/PacificVis53943.2022.00020
State	Published - 2022
Event	15th IEEE Pacific Visualization Symposium, PacificVis 2022 - Virtual, Online, Japan Duration: Apr 11 2022 → Apr 14 2022

Publication series

Name	IEEE Pacific Visualization Symposium
Volume	2022-April
ISSN (Print)	2165-8765
ISSN (Electronic)	2165-8773

Conference

Conference	15th IEEE Pacific Visualization Symposium, PacificVis 2022
Country/Territory	Japan
City	Virtual, Online
Period	4/11/22 → 4/14/22

Keywords

Human-centered computing-Visualization-Visualization design and evaluation methods
Human-centered computing-Visualization-Visualization techniques-Treemaps

ASJC Scopus subject areas

Computer Graphics and Computer-Aided Design
Computer Vision and Pattern Recognition
Hardware and Architecture
Software

Access to Document

10.1109/PacificVis53943.2022.00020

Cite this

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning. / Mishra, Aditi; Soni, Utkarsh; Huang, Jinbin et al.
Proceedings - 2022 IEEE 15th Pacific Visualization Symposium, PacificVis 2022. IEEE Computer Society, 2022. p. 111-120 (IEEE Pacific Visualization Symposium; Vol. 2022-April).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Mishra, A, Soni, U, Huang, J & Bryan, C 2022, Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning. in Proceedings - 2022 IEEE 15th Pacific Visualization Symposium, PacificVis 2022. IEEE Pacific Visualization Symposium, vol. 2022-April, IEEE Computer Society, pp. 111-120, 15th IEEE Pacific Visualization Symposium, PacificVis 2022, Virtual, Online, Japan, 4/11/22. https://doi.org/10.1109/PacificVis53943.2022.00020

@inproceedings{c336209daa464b0ab85f7f36eed1b088,

title = "Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning",

abstract = "Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those are who not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-critical domains. To help address this challenge, we introduce PolicyExplainer, a visual analytics interface which lets the user directly query an autonomous agent. PolicyExplainer visualizes the states, policy, and expected future rewards for an agent, and supports asking and answering questions such as: 'Why take this action? Why not take this other action? When is this action taken?' PolicyExplainer is designed based upon a domain analysis with RL researchers, and is evaluated via qualitative and quantitative assessments on a trio of domains: taxi navigation, a stack bot domain, and drug recommendation for HIV patients. We find that PolicyExplainer's visual approach promotes trust and understanding of agent decisions better than a state-of-the-art text-based explanation approach. Interviews with domain practitioners provide further validation for PolicyExplainer as applied to safety-critical domains. Our results help demonstrate how visualization-based approaches can be leveraged to decode the behavior of autonomous RL agents, particularly for RL non-experts.",

keywords = "Human-centered computing-Visualization-Visualization design and evaluation methods, Human-centered computing-Visualization-Visualization techniques-Treemaps",

author = "Aditi Mishra and Utkarsh Soni and Jinbin Huang and Chris Bryan",

note = "Funding Information: This research was supported by the U.S. National Science Foundation through grant OAC-1934766. Publisher Copyright: {\textcopyright} 2022 IEEE.; 15th IEEE Pacific Visualization Symposium, PacificVis 2022 ; Conference date: 11-04-2022 Through 14-04-2022",

year = "2022",

doi = "10.1109/PacificVis53943.2022.00020",

language = "English (US)",

series = "IEEE Pacific Visualization Symposium",

publisher = "IEEE Computer Society",

pages = "111--120",

booktitle = "Proceedings - 2022 IEEE 15th Pacific Visualization Symposium, PacificVis 2022",

}

TY - GEN

T1 - Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

AU - Mishra, Aditi

AU - Soni, Utkarsh

AU - Huang, Jinbin

AU - Bryan, Chris

PY - 2022

Y1 - 2022

N2 - Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those are who not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-critical domains. To help address this challenge, we introduce PolicyExplainer, a visual analytics interface which lets the user directly query an autonomous agent. PolicyExplainer visualizes the states, policy, and expected future rewards for an agent, and supports asking and answering questions such as: 'Why take this action? Why not take this other action? When is this action taken?' PolicyExplainer is designed based upon a domain analysis with RL researchers, and is evaluated via qualitative and quantitative assessments on a trio of domains: taxi navigation, a stack bot domain, and drug recommendation for HIV patients. We find that PolicyExplainer's visual approach promotes trust and understanding of agent decisions better than a state-of-the-art text-based explanation approach. Interviews with domain practitioners provide further validation for PolicyExplainer as applied to safety-critical domains. Our results help demonstrate how visualization-based approaches can be leveraged to decode the behavior of autonomous RL agents, particularly for RL non-experts.

AB - Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those are who not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-critical domains. To help address this challenge, we introduce PolicyExplainer, a visual analytics interface which lets the user directly query an autonomous agent. PolicyExplainer visualizes the states, policy, and expected future rewards for an agent, and supports asking and answering questions such as: 'Why take this action? Why not take this other action? When is this action taken?' PolicyExplainer is designed based upon a domain analysis with RL researchers, and is evaluated via qualitative and quantitative assessments on a trio of domains: taxi navigation, a stack bot domain, and drug recommendation for HIV patients. We find that PolicyExplainer's visual approach promotes trust and understanding of agent decisions better than a state-of-the-art text-based explanation approach. Interviews with domain practitioners provide further validation for PolicyExplainer as applied to safety-critical domains. Our results help demonstrate how visualization-based approaches can be leveraged to decode the behavior of autonomous RL agents, particularly for RL non-experts.

KW - Human-centered computing-Visualization-Visualization design and evaluation methods

KW - Human-centered computing-Visualization-Visualization techniques-Treemaps

UR - http://www.scopus.com/inward/record.url?scp=85132446785&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85132446785&partnerID=8YFLogxK

U2 - 10.1109/PacificVis53943.2022.00020

DO - 10.1109/PacificVis53943.2022.00020

M3 - Conference contribution

AN - SCOPUS:85132446785

T3 - IEEE Pacific Visualization Symposium

SP - 111

EP - 120

BT - Proceedings - 2022 IEEE 15th Pacific Visualization Symposium, PacificVis 2022

PB - IEEE Computer Society

T2 - 15th IEEE Pacific Visualization Symposium, PacificVis 2022

Y2 - 11 April 2022 through 14 April 2022

ER -

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this