The Exploitation Reinforcement Learning Method on POMDPs

UEMURA WATARU; UENO ATSUSHI; TATSUMI SHOJI

J-GLOBAL ID：200902280699010655 Reference number：04A0654021

POMDPs環境下での経験強化型強化学習法

Author (3)： UEMURA WATARU, UENO ATSUSHI, TATSUMI SHOJI
Material：
Volume： 104 Issue： 233(AI2004 12-18) Page： 1-5 Publication year： Jul. 29, 2004
JST Material Number： S0532B ISSN： 0913-5685 Document type： Proceedings
Article type： Original paper Country of issue： Japan (JPN) Language： JAPANESE (JA)

Artificial intelligence

Reference (10)：

GREFENSTETTE, J. J. Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms. Machine Learning. 1988, 3, 225-245
RUMMERY, G. A. On-line Q-learning Using Connectionist Systems. Technical Report. 1994
SUTTON, R. S. Integrated architecture for learning, planning, and reacting based on approximating dynamic programming. Proc. of 7th International Conference on Machine Learning, 1990. 1990, 216-224
SUTTON, R. S. Learning to Predict by the Methods of Temporal Differences. Machine Learning. 1988, 3, 9-44
WATKINS, C. J. C. H. Technical Note : Q-Learning. Machine Learning. 1992, 8, 279-292
