2021 - 2024 Self-supervised graph-based representation for language and speaker detection
Papers (35):
X. Lu, P. Shen, Y. Tsao, H. Kawai. Cross-modal alignment with optimal transport for CTC-based ASR. IEEE ASRU. 2023
P. Shen, X. Lu, H. Kawai. Generative linguistic representation for spoken language identification. IEEE ASRU. 2023
P. Shen, X. Lu, H. Kawai. Transducer-based language embedding for spoken language identification. ISCA Interspeech. 2022
P. Shen, X. Lu, H. Kawai. Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition. IEEE SLT. 2022
X. Lu, P. Shen, Y. Tsao, H. Kawai. Unsupervised neural adaptation model based on optimal transport for spoken language identification. IEEE ICASSP. 2022
P. Shen, X. Lu, H. Kawai. Investigation on Multi-task Universal Speech Models. Autumn Meeting of Acoustical Society of Japan. 2023
P. Shen, X. Lu, H. Kawai. Investigation on sub-character tokenization for RNN-Transducer. Autumn Meeting of Acoustical Society of Japan. 2022
T. Yoshimoto, P. Shen, X. Lu, R. Takashima, T. Takiguchi, H. Kawai. Unsupervised Feature Learning based on wav2vec for Cross-channel Spoken Language Identification. Spring Meeting of Acoustical Society of Japan. 2021
T. Ogura, M. Fujimoto, P. Shen, X. Lu, H. Kawai. A Study on Language Modeling with BERT-based Word Embedding. Acoustical Society of Japan. 2021