Decentralized TD(0) With Gradient Tracking

Lin, QF; Ling, Q

Lin, QF (corresponding author), Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China.; Lin, QF (corresponding author), Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510006, Peoples R China.

IEEE SIGNAL PROCESSING LETTERS, 2021; 28 (): 723

Abstract

In this letter, we consider the policy evaluation problem with linear function approximation in the context of decentralized multi-agent reinforcement......

Full Text Link