Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert. Automatic Parallelization and OpenMP Offloading of Fortran Array Notation. 2024
Hayato Fujita, Akihiro Nomura, Toshio Endo, Masakazu Sekijima. Enhancing the Performance of AlphaFold Through Modified Storage Method and Optimization of HHblits on TSUBAME3.0 Supercomputer. 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE). 2023
Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka. PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications. In proceedings of ACM International Conference on Supercomputing (ICS 2023), Orlando, June 2023. 2023
Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka. Revisiting Temporal Blocking Stencil Optimizations. In proceedings of ACM International Conference on Supercomputing (ICS 2023), Orlando, June 2023. 2023
Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka. Exploiting Scratchpad Memory for Deep Temporal Blocking. Proceedings of the 15th Workshop on General Purpose Processing Using GPU. 2023