1) G. Tesauro: Practical issues in temporal difference learning, Machine Learning, 8-3/4, 257/277 (1992)
2) S. P. Singh and D. Bertsekas: Reinforcement learning for dynamic channel allocation in cellular telephone system, Advances in Neural Information Processing Systems 9, eds. M. C. Mozer, M. I. Jordan, and T. Petsche, Cambridge MA., MIT Press, 974/980 (1997)