Machine Learning and Speech Generation: Progress of Speech Waveform Modeling

TODA TOMOKI

J-GLOBAL ID：202002228505669547 Reference number：20A0460162

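Original title: 《第9回》機械学習と音声生成:音声波形モデリングの進展 (Part 9: Machine Learning and Speech Generation: Progress of Speech Waveform Modeling)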

Author (1)： TODA TOMOKI
Material：
Volume： 58 Issue： 12 Page： 951-954(J-STAGE) Publication year： 2019
JST Material Number： F0131A ISSN： 0453-4662 CODEN： KESEA Document type： Article
Article type： Commentary Country of issue： Japan (JPN) Language： JAPANESE (JA)

Speech processing, Artificial intelligence

Reference (20)：

1) H. Dudley: Remaking Speech, J. Acoust. Soc. Am., 11-2, 169/177 (1939)
2) F. Itakura and S. Saito: Analysis Synthesis Telephony Based Upon the Maximum Likelihood Method, Proc. ICA, C-5-5, C17/20 (1968)
3) A. K. Syrdal, C. W. Wightman, A. Conkie, Y. Stylianou, M. Beutnagel, J. Schroeter, V. Strom, K.-S. Lee, and M. J. Makashay: Corpus-Based Techniques in the AT&T NextGen Synthesis System, Proc. INTERSPEECH, 3, 410/415 (2000)
4) A. van den Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner, A. W. Senior, and K. Kavukcuoglu: WaveNet: a Generative Model for Raw Audio, arXiv preprint, arXiv:1609.03499, 15 pages (2016)
5) H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné: Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F₀ Extraction: Possible Role of a Repetitive Structure in Sounds, Speech Communication, 27-3-4, 187/207 (1999)
