DBA: Dynamic Multi-Armed Bandit Algorithm

Nobari, S

Nobari, S (reprint author), Rakuten Inc, Tokyo, Japan.

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH, 2019; (): 9869

Abstract

We introduce the Dynamic Bandit Algorithm (DBA), a practical solution that addresses the shortcoming of the pervasively employed reinforcement learning algori......
