GREFENSTETTE, J. J. Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms. Machine Learning. 1988, 3, 225-245
RUMMERY, G. A. On-line Q-learning Using Connectionist Systems. Technical Report. 1994
SUTTON, R. S. Integrated architecture for learning, planning, and reacting based on approximating dynamic programing. Proc. of 7th International Conference on Machine Learning, 1990. 1990, 216-224
SUTTON, R. S. Learning to Predict by Method of Temporal Differences. Machine Learning. 1988, 4, 9-44
WATKINS, C. J. C. H. Technical Note : Q-Learning. Machine Learning. 1992, 8, 279-292