A. S. Polydoros and L. Nalpantidis: Survey of model-based reinforcement learning: applications on robotics. J. Intell. Robotics Syst., 86-2, 153/173 (2017)
R. Sutton and A. Barto: Reinforcement learning: an introduction, MIT Press (1998)
J. Morimoto and K. Doya: Robust reinforcement learning. Neural Computation, 17-2, 335/359 (2005)
D. P. Bertsekas and S. E. Shreve: Stochastic optimal control: the discrete-time case, Athena Scientific (1996)
M. Duff: Design for an optimal probe, Proc. of the 20th Intl. Conf. on Machine Learning (ICML), 131/138 (2003)