Demonstration actor critic

Liu, GQ; Zhao, L; Zhang, PS; Bian, J; Qin, T; Yu, NH; Liu, TY

Liu, GQ (corresponding author), Univ Sci & Technol China, Hefei, Anhui, Peoples R China.

NEUROCOMPUTING, 2021; 434 (): 194

Abstract

We study the problem of Reinforcement Learning from Demonstrations (RLfD), where the agent has access to not only reward signals from the environment,......

Full Text Link