Finite-time performance bounds and adaptive learning rate selection for two time-scale reinforcement learning

Harsh Gupta, R. Srikant, Lei Ying

Research output: Contribution to journalConference articlepeer-review

46 Scopus citations

Fingerprint

Dive into the research topics of 'Finite-time performance bounds and adaptive learning rate selection for two time-scale reinforcement learning'. Together they form a unique fingerprint.

Engineering & Materials Science