Rchr
J-GLOBAL ID:202301012314546276   Update date: Jul. 01, 2023

Parmas Paavo

パラマス パーヴォ | Parmas Paavo
Affiliation and department:
Job title: Program-Specific Assistant Professor
Research field  (1): Intelligent informatics
Research keywords  (1): Machine Learning, Reinforcement Learning
Research theme for competitive and other funds  (1):
  • 2022 - 2027 Generative adversarial brain: a comprehensive study of multi-agent learning by natural and artificial intelligence
Papers (5):
  • Paavo Parmas, Takuma Seno. Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms. NeurIPS. 2022
  • Paavo Parmas, Masashi Sugiyama. A unified view of likelihood ratio and reparameterization gradients. The 24th International Conference on Artificial Intelligence and Statistics(AISTATS). 2021. 4078-4086
  • Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, et al. Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems(AAMAS). 2020. 492-501
  • Paavo Parmas. Total stochastic gradient algorithms and applications in reinforcement learning. Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018(NeurIPS). 2018. 10225-10235
  • Paavo Parmas, Carl Edward Rasmussen, Jan Peters, Kenji Doya. PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos. Proceedings of the 35th International Conference on Machine Learning(ICML). 2018. 4062-4071
Education (2):
  • 2014 - 2020 Okinawa Institute of Science and Technology Graduate University 5 Year Graduate Course
  • 2010 - 2014 University of Cambridge Department of Engineering 4 Year Course
Professional career (2):
  • PhD (Okinawa Institute of Science and Technology Graduate University)
  • MEng, BA (University of Cambridge)
Work history (4):
  • 2020/11 - 現在 Kyoto University Graduate School of Informatics Program-Specific Assistant Professor
  • 2020/02 - 2020/10 Okinawa Institute of Science and Technology Graduate University Junior Research Fellow
  • 2019/07 - 2019/10 Google DeepMind Paris Research Intern
  • 2019/04 - 2019/07 RIKEN-AIP Research Intern
Awards (3):
  • 2020 - Okinawa Institute of Science and Technology Graduate University Peter Gruss Doctoral Dissertation Excellence Award Total stochastic gradient algorithms and applications to model-based reinforcement learning
  • 2014 - University of Cambridge Best final presentation award in the Information Engineering group
  • 2013 - University of Cambridge, Churchill College Bill Brown Prize, Best academic results in Engineering at Churchill College
※ Researcher’s information displayed in J-GLOBAL is based on the information registered in researchmap. For details, see here.

Return to Previous Page