TD-regularized actor-critic methods

Parisi, S; Tangkaratt, V; Peters, J; Khan, ME

Parisi, S (reprint author), Tech Universitat Darmstadt, Hsch Str 10, D-64289 Darmstadt, Germany.

MACHINE LEARNING, 2019; 108 (8-9): 1467

Abstract

Actor-critic methods can achieve incredible performance on difficult reinforcement learning problems, but they are also prone to instability. This is ...
