A Study on Model-based Deep Reinforcement Learning with Intrinsic Subgoal Reward

MARUYAMA MOTOKI; ENDO SATOSHI; YAMADA KOJI

J-GLOBAL ID：202002283965993954 Reference number：20A2181707

サブゴールによる内発的報酬を用いたモデルベース深層強化学習の考察

Author (3)： MARUYAMA MOTOKI, ENDO SATOSHI, YAMADA KOJI
Material：
Volume： 19th Issue： Part 2 Page： 47-50 Publication year： Aug. 18, 2020
JST Material Number： L4664A Document type： Proceedings
Article type： Original paper Country of issue： Japan (JPN) Language： JAPANESE (JA)

Artificial intelligence

Reference (9)：

Steven Kapturowski, Georg Ostrovski, John Quan, Rémi Munos, and Will Dabney. Recurrent experience replay in distributed reinforcement learning. In 7th International Conference on Learning Representations, ICLR 2019, 2019.
Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado Van Hasselt, and David Silver. Distributed prioritized experience replay. In 6th International Conference on Learning Representations, ICLR 2018 - Conference Track Proceedings, 2018.
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy P. Lillicrap, and David Silver. Mastering Atari, Go, chess and shogi by planning with a learned model. arXiv, abs/1911.08265, 2019.
Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, and Trevor Darrell. Curiosity-driven exploration by self-supervised prediction. In 34th International Conference on Machine Learning, ICML 2017, 2017.
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Z. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 27, pages 2672-2680. Curran Associates, Inc., 2014.
