UCHIBE, E. Competitive-cooperative-concurrent reinforcement learning with importance sampling. The 8th International Conference on the Simulation of Adaptive Behavior, 2004. 2004, 287-296
BAGNELL, D. Policy search by dynamic programming. Proceedings of Neural Information Processing Systems, 2004. 2004
RONSENSTEIN, M. T. Supervised actor-critic reinforcement learning. 2004, 359-380