Abstract
Because of a powerful temporal-difference (TD) with lambda [TD( lambda )] learning method, this paper presents a novel n-step adaptive dynamic program......
小提示:本篇文献需要登录阅读全文,点击跳转登录