Analysis of Time Series Data Accompanied with Rewards and Actions using Reinforcement Learning

ASO HIDEKI; SHIRO MASANORI; KAMISHIMA TOSHIHIRO; AKAHO SHOTARO; KORO TAKAHIDE

Art

J-GLOBAL ID：201202274916773803 Reference number：12A0422030

Analysis of Time Series Data Accompanied with Rewards and Actions using Reinforcement Learning

報酬と行動決定を伴う時系列データの強化学習を用いたオフライン分析

Publisher site Copy service {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=12A0422030&COPY=1") }}
Access JDreamⅢ for advanced search and analysis. {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=12A0422030&from=J-GLOBAL&jstjournalNo=S0532B") }}

Author (5)： , , , ,
Material：
Volume： 111 Issue： 419(NC2011 97-120) Page： 107-112 Publication year： Jan. 19, 2012
JST Material Number： S0532B ISSN： 0913-5685 Document type： Proceedings
Article type：短報 Country of issue： Japan (JPN) Language： JAPANESE (JA)

, , , , , , , , , , ,
, , , , , ,

Artificial intelligence

Reference (15)：

ENGEL, Y. Bayes meets Bellman : the Gaussian process approach to temporal difference learning. Proceedings of ICML-2003. 2003
HAUSKRECHT, M. Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine. 2000, 18, 221-244
LEVIN, E. Using Markov decision processes for learning dialogue strategies. IEEE Transactions on Speech and Audio Processing. 1998, 8, 11-23
NG, A. Y. Algorithms for inverse reinforcement learning. Proceedings of 17th International Conference on Machine Learning, 2000. 2000, 663-670
PAEK, T. Reinforcement learning for spoken dialogue systems : comparing strength and weaknesses for practical deployment. 2006

ｍore...

, , , , ,

Return to Previous Page