BRADTKE, S. J. Reinforcement Learning Method for Continuous-Time Markov Decision Problems. Advances in Neural Information Processing Systems. 1994, 7, 393-400
DOYA, K. Efficient Nonlinear Control with Actor-Tutor Architecture. Advances in Neural Information Processing Systems. 1996, 9, 1012-1018