Art
J-GLOBAL ID:200902280699010655   Reference number:04A0654021

The Exploitation Reinforcement Learning Method on POMDPs

POMDPs環境下での経験強化型強化学習法
Author (3):
Material:
Volume: 104  Issue: 233(AI2004 12-18)  Page: 1-5  Publication year: Jul. 29, 2004 
JST Material Number: S0532B  ISSN: 0913-5685  Document type: Proceedings
Article type: 原著論文  Country of issue: Japan (JPN)  Language: JAPANESE (JA)
Thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.
,...
Semi thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

   To see more with JDream III (charged).   {{ this.onShowAbsJLink("http://jdream3.com/lp/jglobal/index.html?docNo=04A0654021&from=J-GLOBAL&jstjournalNo=S0532B") }}
JST classification (1):
JST classification
Category name(code) classified by JST.
Artificial intelligence 
Reference (10):
  • GREFENSTETTE, J. J. Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms. Machine Learning. 1988, 3, 225-245
  • RUMMERY, G. A. On-line Q-learning Using Connectionist Systems. Technical Report. 1994
  • SUTTON, R. S. Integrated architecture for learning, planning, and reacting based on approximating dynamic programing. Proc. of 7th International Conference on Machine Learning, 1990. 1990, 216-224
  • SUTTON, R. S. Learning to Predict by Method of Temporal Differences. Machine Learning. 1988, 4, 9-44
  • WATKINS, C. J. C. H. Technical Note : Q-Learning. Machine Learning. 1992, 8, 279-292
more...
Terms in the title (3):
Terms in the title
Keywords automatically extracted from the title.

Return to Previous Page