部分観測マルコフ決定過程下での強化学習: 確率的傾斜法による接近

文献

J-GLOBAL ID：200902168136770888 整理番号：96A0772764

Reinforcement Learning in Partially Observable Markov Decision Processes: A Stochastic Gradient Method.

出版者サイト複写サービスで全文入手 {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=96A0772764&COPY=1") }}
高度な検索・分析はJDreamⅢで {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=96A0772764&from=J-GLOBAL&jstjournalNo=X0330A") }}

著者 (3件)： , ,
資料名：
巻： 11 号： 5 ページ： 761-768 発行年： 1996年09月
JST資料番号： X0330A ISSN： 0912-8085 資料種別：逐次刊行物 (A)
記事区分：原著論文発行国：日本 (JPN) 言語：日本語 (JA)

環境の状態についてエージェントの知覚能力が不十分な場合,不完...

,...

続きはJDreamIII（有料）にて {{ this.onShowAbsJLink("http://jdream3.com/lp/jglobal/index.html?docNo=96A0772764&from=J-GLOBAL&jstjournalNo=X0330A") }}

人工知能 , システム・制御理論一般

引用文献 (13件)：

CHRISMAN, L. Reinforcement learning with Perceptual aliasing : The perceptual Distinctions Approach. Proc. 10th Nat. Conf. on Artificial Intelligence. 1992, 183-188
JAAKKOLA, T. Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems. Advances in Neural Information Processing Systems. 1994, 345-352
KIMURA, H. Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward. Proc. 12th Int. Conf. on Machine Learning. 1995, 295-303
LITTMAN, M. L. Learning policies for partially observable environments : Scaling up. Proc. 12th Int. Conf. on Machine Learning. 1995, 362-370
McCALLUM, R. A. Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State. Proc. 12th Int. Conf. on Machine Learning. 1995, 387-395

, , , ,

前のページに戻る