Discrete action on-policy learning with action-value critic

Published in International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Recommended citation: Y. Yue, Y. Tang, M. Yin, and M. Zhou . "Discrete action on-policy learning with action-value critic" AISTATS(2020). https://arxiv.org/abs/2002.03534