2018 - 2021 Video Archiving of Dialect Speech and Establishment of Information Processing Technology for Understanding Dialect Speech
2015 - 2018 Research on construction and application of high discriminative speech feature space using heterogeneous speech units and multiple languages
2015 - 2018 Spoken term detection system with high retrieval accuracy, high speed and small resources using Deep Neural Network
2012 - 2015 Bi-directional retrieval of speech and image by indexing both speech and image data.
2010 - 2012 Development of Continuous Voice Morphing Using Separated Vocal TractArea Functions, Glottal Source Waves, and Prosodic Features
2008 - 2010 Research and development of a methodology for on-demand broadcast of local FM radio and a retrieval technology for spoken broadcast data sets
2005 - 2007 A research for automatic structure extraction and information retrieval from video data using speech and voice
2003 - 2005 Universal-Phonetic-Segment-Based Speech Coding and Its Applications to Speech Processing
Show all
Papers (34):
Daisuke Kaneko, Ryota Konno, Kazunori Kojima, Kazuyo Tanaka, Shi-Wook Lee, Yoshiaki Itoh. Constructing acoustic distances between subwords and states obtained from a deep neural network for spoken term detection. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. 2017-. 2879-2883
Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh. Generating complementary acoustic model spaces in DNN-based sequence-to-frame DTW scheme for out-of-vocabulary spoken term detection. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5. 2016. 755-759
Masato Obara, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh. Rescoring by Combination of Posteriorgram Score and Subword-Matching Score for Use in Query-by-Example. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5. 2016. 1918-1922