Rchr
J-GLOBAL ID:202101005745062070   Update date: Aug. 05, 2024

SAKTI SAKRIANI

サクティ サクリアニ | SAKTI SAKRIANI
Affiliation and department:
Job title: Professor
Research field  (1): Perceptual information processing
Research keywords  (3): zero-resourced speech technology, ,  statistical pattern recognition ,  spoken language processing
Research theme for competitive and other funds  (11):
  • 2021 - 2026 A Study on Multi-modal Automatic Simultaneous Interpretation System and Evaluation Method
  • 2021 - 2026 Developing Low-Resource Multilingual Machine Speech Chain for Breaking Language Barriers
  • 2017 - 2022 Next generation speech translation research
  • 2017 - 2020 Research for unsupervised acoustic pattern discovery with zero resources
  • 2015 - 2019 Development of silent speech telecommunication techniques robust against external noise
Show all
Papers (243):
  • Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Yuka Ko, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura. NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023. Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023). 2023
  • Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura. Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input. IEEE Access. 2023. 11. 22355-22363
  • Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, Satoshi Nakamura. Tackling multiple object tracking with complicated motions - Re-designing the integration of motion and appearance. Image and Vision Computing. 2022. 124. 104514-104514
  • 柳田 智也, サクティ サクリアニ, 中村 哲. 日本語逐次音声合成における合成単位. 情報処理学会論文誌. 2022. 63. 4. 1149-1158
  • Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura. A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2022. 1-16
more...
MISC (46):
  • Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura. Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. 2020
  • Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura. Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. 2020
  • Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu. ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation. 2020
  • 品川政太朗, 品川政太朗, 吉野幸一郎, 吉野幸一郎, SAKTI Sakriani, SAKTI Sakriani, 鈴木優, 中村哲, 中村哲. 自然言語の指示による画像操作システム. 電子情報通信学会論文誌 D(Web). 2019. J102-D. 8
  • NGUYEN Tung The, YOSHINO Koichiro, SAKTI Sakriani, NAKAMURA Satoshi. Utilizing deception information for dialog management of doctor-patient conversations. Proc. of JSAI. 2018. 2018. 0. 2M201-2M201
more...
Patents (3):
  • Speech chain apparatus, computer program and DNN speech recognition and synthesis mutual learning method
  • A Recording Medium that Stores a Statistical Pronunciation Variation Model, Automatic Speech Recognition and Computer Program”
  • An Apparatus of Rescoring a Hypothesis in a Speech Recognizing System
Books (4):
  • State of the art of indigenous languages in research: a collection of selected research papers
    UNESCO Open Access Repository 2022 ISBN:9789231005213
  • Multimodal Agents for Ageing and Multicultural Societies
    Springer 2021 ISBN:9789811634758
  • 音声言語の自動翻訳 : コンピュータによる自動翻訳を目指して
    コロナ社 2018 ISBN:9784339013382
  • Incorporating knowledge sources into statistical speech recognition
    Springer 2009 ISBN:9781441946768
Lectures and oral presentations  (17):
  • Semi-supervised Learning for Low-resource Multilingual and Multimodal Speech Processing with Machine Speech Chain
    (HiTZ Language Technology Webinar 2022)
  • Self-Adaptive Machine Speech Chain in Noisy Environment
    (The AAAI workshop on Self-supervised Learning for Audio and Speech Processing 2022)
  • Machine Speech Chain: A Deep Learning Approach for Modeling Human Speech Perception and Production with Auditory Feedback Mechanism
    (The ITB Seminar 2021)
  • Machine Speech Chain: A Deep Learning Approach for Training and Inference through Feedback Loop
    (IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, Colombia 2021)
  • Listening while Speaking and Visualizing: A Semi-supervised Approach with Multimodal Machine Speech Chain
    (The SoCS International Seminar 2021)
more...
Education (2):
  • 2005 - 2008 University of Ulm, Germany Faculty of Engineering & Computer Science Dr.-Ing.
  • 2000 - 2002 University of Ulm, Germany Faculty of Engineering & Computer Science MSc
Professional career (1):
  • Dr.-Ing. (University of Ulm, Germany)
Work history (14):
  • 2024/04 - 現在 Nara Institute of Science and Technology Division of Information Science Professor
  • 2024/04 - 現在 Japan Advanced Institute of Science and Technology School of Information Science Adjunct Professor
  • 2021/10 - 現在 RIKEN Center for Advanced Intelligence Project (AIP) Tourism Information Analytics Team Visiting Research Scientist
  • 2021/07 - 現在 University of Indonesia (UI) Computer Science Department Adjunct Professor
  • 2021/10 - 2024/03 Japan Advanced Institute of Science and Technology School of Information Science Associate Professor
Show all
Committee career (6):
  • 2022/06 - 現在 Frontiers in Language Sciences Review editor
  • 2021/01 - 現在 ISCA/ELRA Special Interest Group for Under-resourced Language (SIGUL) Chair
  • 2020/11 - 現在 IEEE Signal Processing Society Speech and Language Technical Committee (SLTC)
  • 2020/10 - 現在 IEEE/ACM Transactions on Speech, Audio, and Language Processing Associate editor
  • 2016/05 - 現在 Spoken Language Technology for Under-resourced Language (SLTU) Board member
Show all
Awards (14):
  • 2021/11 - The Oriental COCOSDA Oriental COCOSDA Best Paper Award Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation
  • 2020/11 - The Oriental COCOSDA Oriental COCOSDA Best Paper Award Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation
  • 2020/06 - The CVPR 2020 Workshop on Autonomous Driving (WAD) 1st place Award of BDD100K MOT Challenge at WAD 2020 ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
  • 2020/06 - The 5th BMTT MOT Challenge Workshop of CVPR 2020 1st place Award on MOTS Challenge 2020
  • 2019/03 - Nara Institute of Science and Technology Best Teaching Award
Show all
Association Membership(s) (9):
The Association for Natural Language Processing ,  Information Processing Society of Japan (IPSJ) ,  The Institute of Electronics, Information and Communication Engineers (IEICE) ,  Society of Neuroscience (SFN) ,  Japan Neuroscience Society (JNS) ,  Institute of Electrical and Electronics Engineers (IEEE) Computer Society ,  Association for Computational Linguistics (ACL) ,  Acoustical Society of Japan (ASJ) ,  International Speech Communication Association (ISCA)
※ Researcher’s information displayed in J-GLOBAL is based on the information registered in researchmap. For details, see here.

Return to Previous Page