Joint inference of reward machines and policies for reinforcement learning

Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu

Research output: Contribution to journal › Conference article › peer-review

39 Scopus citations

Fingerprint

Business & Economics

Engineering & Materials Science