Speaker-Adaptive Speech Recognition Based on Surface Electromyography

Cited by: 0
Authors
Wand, Michael [1]
Schultz, Tanja [1]
Affiliation
[1] Univ Karlsruhe TH, Karlsruhe, Germany
Keywords
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We present our recent advances in silent speech interfaces using electromyographic signals that capture the movements of the human articulatory muscles at the skin surface for recognizing continuously spoken speech. Previous systems were limited to speaker- and session-dependent recognition tasks on small amounts of training and test data. In this article we present speaker-independent and speaker-adaptive training methods which allow us to use a large corpus of data from many speakers to train acoustic models more reliably. We use the speaker-dependent system as a baseline, carefully tuning the data preprocessing and acoustic modeling. Then, on our corpus, we compare the performance of speaker-dependent and speaker-independent acoustic models and carry out model adaptation experiments.
Pages: 271-285
Page count: 15