J-GLOBAL ID:200901008595485931
Update date: Feb. 01, 2024
Yamamoto Kazumasa
Affiliation and department:
Job title:
Professor
Research field (3):
Intelligent informatics
, Intelligent robotics
, Perceptual information processing
Research keywords (40):
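privacy protection
, speaking rate variation
, syllable chain model
, syllable model
, speaker recognition
, inter-distribution distance
, voice conversion
, BN model
, correction utterance
, spontaneous speech
, number of mixtures
, sound source separation
, segment statistics
, diagonal covariance matrix
, ROVER method
, spectral mapping
, multi-path model
, minimum description length (MDL) criterion
, state
, speaking rate estimation
, microphone array
, distant-talking speech
, distant-talking speech recognition
, full covariance matrix
, noise
, privacy protection (プライバシ保護)
, speaking rate
, real-environment speech recognition
, pronunciation variation
, spoken language
, HMM
, hands-free speech recognition
, spontaneous utterance
, reverberation
, distant talking
, acoustic model
, hidden Markov model
, privacy
, spontaneous speech recognition
, speech recognition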
Research theme for competitive and other funds (20):
- 2023 - 2027 Building core technologies for a multimodal dialogue system for elderly users that can be used on a lasting basis
- 2022 - 2025 Development of end-to-end speech recognition techniques for super-elderly that can deal with the cause of recognition errors
- 2019 - 2023 A spoken dialogue system based on real-time control of dialogue tempo for smooth conversation
- 2019 - 2022 Automatic acquisition of optimized acoustic model unit for automatic speech recognition using deep learning
- 2019 - 2022 Building core technologies for speech recognition and dialogue systems for elderly users
- 2018 - 2022 Automatic generation of lecture's materials with Japanese caption based on English lecture's speech translation and speech summarization
- 2017 - 2021 A Technology Transfer System based on Creation of Work Records and Procedures using Speech and Language Processing Technologies
- 2015 - 2018 Accurate speech recognition system with deep neural network introducing human auditory characteristic in real environments
- 2013 - 2018 A study of automatic English caption generation for Japanese lecture speech
- 2012 - 2012 Improvement of speech recognition performance by using phase information with long analysis window
- 2010 - 2012 High accuracy transcription, cleaning and fast term detection for spoken documents
- 2010 - 2012 Study on privacy protection in spoken language
- 2009 - 2011 Research of speech signal processing for privacy protection
- 2007 - 2009 Study on Speech Enhancement Based on Distorted Speech Corpora in the Real-world
- 2007 - 2009 Research on speech recognition, speaker recognition, and indexing of distant-talking speech in real-world environments
- 2006 - 2008 Development of speech recognition system which can recognize slow speaking utterance
- 2004 - 2005 Development of acoustic models robust to speaking rate variation for spontaneous speech recognition
- 2003 - 2004 Hands free speech recognition method based on auditory characteristics
- 2001 - 2002 Research on hidden Markov models that make effective use of dynamic features for speech recognition
- 2000 - 2001 Development of robust acoustic model for hands-free speech recognition
Papers (93):
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Koharu Horii, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka. A new speech corpus of super-elderly Japanese for acoustic modeling. Computer Speech & Language. 2023. 77. 101424-101424
- Meiko Fukuda, Masakazu Sugiyama, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka. A Corpus-Based Analysis Of Age-Related Changes In The Acoustic Features Of Elderly To Super Elderly Speech. 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA). 2022
- Maki Nanahara (Kato), Kazumasa Yamamoto, Seiichi Nakagawa. Correlation of acoustic features of pitch/rhythm/power and perceptual impressions after singing training for people with dysarthria. Acoustical Science and Technology. 2022. 43. 1. 22-31
- Hiroshi SEKI, Kazumasa YAMAMOTO, Tomoyosi AKIBA, Seiichi NAKAGAWA. Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation. IEICE Transactions on Information and Systems. 2019. E102.D. 2. 364-374
- Hiroshi Seki, Kazumasa Yamamoto, Tomoyosi Akiba, Seiichi Nakagawa. Rapid Speaker Adaptation of Neural Network Based Filterbank Layer for Automatic Speech Recognition. 2018 IEEE Spoken Language Technology Workshop (SLT). 2018
MISC (237):
- 早川友瑛, 西崎博光, 山本一公, 小林彰夫, 宇津呂武仁. Investigation of Various Multi-task Learning for End-to-End Multi-Language Speech Recognition Model. Proceedings of the Acoustical Society of Japan Meeting (CD-ROM). 2020. 2020
- Utilization of Parallel Corpus with Speech Misrecognition for English to Japanese Lecture Speech Translation. 2016. 116. 378. 75-81
- Lecture Speech Translation based on Preprocessing and 2-Step Translation. IPSJ SIG Notes. 2015. 2015. 8. 1-6
- SEKI HIROSHI, YAMAMOTO KAZUMASA, NAKAGAWA SEIICHI. Consideration on Age- and Gender-independent Speech Recognition using DNN-HMM. IEICE technical report. Speech. 2014. 114. 365. 159-164
- 川井 大陸, 山本 一公, 中川 聖一. A study on improving lyrics recognition performance using feature transformation between read speech and singing voice and speaker adaptation (Speech Recognition, 16th Spoken Language Symposium). IEICE technical report. SP, Speech. 2014. 114. 365. 7-12
Books (2):
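- Spoken Language Processing and Natural Language Processing (音声言語処理と自然言語処理)
Corona Publishing 2018 ISBN:9784339028881
- Spoken Language Processing and Natural Language Processing (音声言語処理と自然言語処理)
Corona Publishing 2013 ISBN:9784339024692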
Lectures and oral presentations (40):
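- Evaluation of DNN-based filterbank learning on the large-scale CSJ database and comparison of filter functions
(Proceedings of the Acoustical Society of Japan Meeting (CD-ROM) 2017)
- A study of a chat-oriented spoken dialogue system with inter-domain transitions
(Proceedings of the Acoustical Society of Japan Meeting (CD-ROM) 2017)
- A study of speaker-class adaptation by retraining a DNN-based filterbank
(Proceedings of the Acoustical Society of Japan Meeting (CD-ROM) 2017)
- A study of a method for automatically estimating which text, figures, and tables in lecture slides are being explained
(Proceedings of the Acoustical Society of Japan Meeting (CD-ROM) 2017)
- A study of labeling and recognition methods for speech emotion that take context information into account
(Proceedings of the Acoustical Society of Japan Meeting (CD-ROM) 2017)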
Professional career (1):
- Doctor of Engineering (Toyohashi University of Technology)
Work history (8):
- 2021/04 - present Chubu University Department of Computer Science Professor
- 2017/04 - 2021/03 Chubu University Department of Computer Science Associate Professor
- 2013/03 - 2017/03 Toyohashi University of Technology Department of Computer Science and Engineering Associate Professor
- 2013/04 - 2014/03 Toyota National College of Technology Department of Information and Computer Engineering Associate Professor
- 2010/04 - 2013/02 Toyohashi University of Technology Department of Computer Science and Engineering Assistant Professor
- 2012/02 - 2012/09 Carnegie Mellon University Department of Electrical and Computer Engineering Visiting Researcher
- 2007/04 - 2010/03 Toyohashi University of Technology Department of Information and Computer Sciences Assistant Professor
- 2000/04 - 2007/03 Shinshu University Department of Electrical and Electronic Engineering Research Associate
Awards (1):
- 2013/05 - The Institute of Electronics, Information and Communication Engineers Best Paper Award Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition
Association Membership(s) (5):
Asia Pacific Signal and Information Processing Association
, International Speech Communication Association
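, Information Processing Society of Japan
, The Institute of Electronics, Information and Communication Engineers
, Acoustical Society of Japan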