Parmas Paavo

パラマスパーヴォ | Parmas Paavo

所属機関・部署：
職名：特定助教

研究分野 (1件)：知能情報学

研究キーワード (1件)：機械学習、強化学習

競争的資金等の研究課題 (1件)：

論文 (5件)：

Paavo Parmas, Takuma Seno. Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms. NeurIPS. 2022
Paavo Parmas, Masashi Sugiyama. A unified view of likelihood ratio and reparameterization gradients. The 24th International Conference on Artificial Intelligence and Statistics(AISTATS). 2021. 4078-4086
Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, et al. Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems(AAMAS). 2020. 492-501
Paavo Parmas. Total stochastic gradient algorithms and applications in reinforcement learning. Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018(NeurIPS). 2018. 10225-10235
Paavo Parmas, Carl Edward Rasmussen, Jan Peters, Kenji Doya. PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos. Proceedings of the 35th International Conference on Machine Learning(ICML). 2018. 4062-4071

学歴 (2件)：

学位 (2件)：

経歴 (4件)：

受賞 (3件)：

2020 - 沖縄科学技術大学院大学ピーター・グルース最優秀博士論文賞 Total stochastic gradient algorithms and applications to model-based reinforcement learning
2014 - University of Cambridge 最優秀発表賞情報工学卒業研究発表会
2013 - University of Cambridge, Churchill College Bill Brown Prize, チャーチルカレッジの工学部の学生の中で最も優れた成績

※　J-GLOBALの研究者情報は、researchmapの登録情報に基づき表示しています。登録・更新については、こちらをご覧ください。