Art
J-GLOBAL ID:201202274916773803   Reference number:12A0422030

Analysis of Time Series Data Accompanied with Rewards and Actions using Reinforcement Learning

報酬と行動決定を伴う時系列データの強化学習を用いたオフライン分析
Author (5):
Material:
Volume: 111  Issue: 419(NC2011 97-120)  Page: 107-112  Publication year: Jan. 19, 2012 
JST Material Number: S0532B  ISSN: 0913-5685  Document type: Proceedings
Article type: 短報  Country of issue: Japan (JPN)  Language: JAPANESE (JA)
Thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

Semi thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

JST classification (1):
JST classification
Category name(code) classified by JST.
Artificial intelligence 
Reference (15):
  • ENGEL, Y. Bayes meets Bellman : the Gaussian process approach to temporal difference learning. Proceedings of ICML-2003. 2003
  • HAUSKRECHT, M. Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine. 2000, 18, 221-244
  • LEVIN, E. Using Markov decision processes for learning dialogue strategies. IEEE Transactions on Speech and Audio Processing. 1998, 8, 11-23
  • NG, A. Y. Algorithms for inverse reinforcement learning. Proceedings of 17th International Conference on Machine Learning, 2000. 2000, 663-670
  • PAEK, T. Reinforcement learning for spoken dialogue systems : comparing strength and weaknesses for practical deployment. 2006
more...

Return to Previous Page