学習理論って何?強化学習の基礎理論と応用

吉本潤一郎; 銅谷賢治; 石井信

文献

J-GLOBAL ID：200902299204895454 整理番号：05A0470060

学習理論って何?強化学習の基礎理論と応用

出版者サイト {{ this.onShowPLink() }} 複写サービスで全文入手 {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=05A0470060&COPY=1") }}
高度な検索・分析はJDreamⅢで {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=05A0470060&from=J-GLOBAL&jstjournalNo=F0131A") }}

著者 (3件)： , ,
資料名：
巻： 44 号： 5 ページ： 313-318 発行年： 2005年05月10日
JST資料番号： F0131A ISSN： 0453-4662 CODEN： KESEA 資料種別：逐次刊行物 (A)
記事区分：原著論文発行国：日本 (JPN) 言語：日本語 (JA)

本稿では,研究分野に係わらずこれから問題解決に強化学習を役立...

,...
,...

続きはJDreamIII（有料）にて {{ this.onShowAbsJLink("http://jdream3.com/lp/jglobal/index.html?docNo=05A0470060&from=J-GLOBAL&jstjournalNo=F0131A") }}

人工知能

引用文献 (31件)：

1) G. Tesauro: Practical issues in temporal difference learning, Machine Learning, 8-3/4, 257/277 (1992)
2) S. P. Singh and D. Bertsekas: Reinforcement learning for dynamic channel allocation in cellular telephone system, Advances in Neural Information Processing Systems 9, eds. M. C. Mozer, M. I. Jordan, and T. Petsche, Cambridge MA., MIT Press, 974/980 (1997)
3) J. Morimoto and K. Doya: Acquisition of stand-up Behavior by a real robot using hierarchical reinforcement leaning, Robotics and Autonomous Systems, 36-1, 37/51 (2001)
4) C. J. C. H. Watkins and P. Dayan: Q-learning, Machine Learning, 8-3/4, 279/292 (1992)
5) R. E. Bellman: Dynamic Programming, Princeton University Press, Princeton, NJ. (1957)

, , , ,

前のページに戻る