Art
J-GLOBAL ID:202002228505669547   Reference number:20A0460162

Machine Learning and Speech Generation: Progress of Speech Waveform Modeling

《第9回》機械学習と音声生成:音声波形モデリングの進展
Author (1):
Material:
Volume: 58  Issue: 12  Page: 951-954(J-STAGE)  Publication year: 2019 
JST Material Number: F0131A  ISSN: 0453-4662  CODEN: KESEA  Document type: Article
Article type: 解説  Country of issue: Japan (JPN)  Language: JAPANESE (JA)
Thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

Semi thesaurus term:
Thesaurus term/Semi thesaurus term
Keywords indexed to the article.
All keywords is available on JDreamIII(charged).
On J-GLOBAL, this item will be available after more than half a year after the record posted. In addtion, medical articles require to login to MyJ-GLOBAL.

JST classification (2):
JST classification
Category name(code) classified by JST.
Speach processing  ,  Artificial intelligence 
Reference (20):
  • 1) H. Dudley: Remaking Speech, J. Acoust. Soc. Am., 11-2, 169/177 (1939)
  • 2) F. Itakura and S. Saito: Analysis Synthesis Telephony Based Upon the Maximum Likelihood Method, Proc. ICA, C-5-5, C17/20 (1968)
  • 3) A. K. Syrdal, C. W. Wightman, A. Conkie, Y. Stylianou, M. Beutnagel, J. Schroeter, V. Strom, K.-S. Lee, and M. J. Makashay: Corpus-Based Techniques in the AT&T NextGen Synthesis System, <i>Proc. INTERSPEECH</i>, <b>3</b>, 410/415 (2000)
  • 4) A. van den Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner, A. W. Senior, and K. Kavukcuoglu: WaveNet: a Generative Model for Raw Audio, arXiv preprint, arXiv:1609.03499, 15 pages (2016)
  • 5) H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné: Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds, Speech Communication, 27-3-4, 187/207 (1999)
more...
Terms in the title (5):
Terms in the title
Keywords automatically extracted from the title.

Return to Previous Page