Tackling Speaking Mode Varieties in EMG-Based Speech Recognition

被引:43
|
作者
Wand, Michael [1 ]
Janke, Matthias [1 ]
Schultz, Tanja [1 ]
机构
[1] Karlsruhe Inst Technol, Cognit Syst Lab, D-76227 Karlsruhe, Germany
关键词
Electromyography (EMG); EMG-based speech recognition; silent speech interfaces (SSI); AUDITORY-FEEDBACK; DECOMPOSITION; MUSCLE;
D O I
10.1109/TBME.2014.2319000
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
An electromyographic (EMG) silent speech recognizer is a system that recognizes speech by capturing the electric potentials of the human articulatory muscles, thus enabling the user to communicate silently. After having established a baseline EMG-based continuous speech recognizer, in this paper, we investigate speaking mode variations, i.e., discrepancies between audible and silent speech that deteriorate recognition accuracy. We introduce multimode systems that allow seamless switching between audible and silent speech, investigate different measures which quantify speaking mode differences, and present the spectral mapping algorithm, which improves the word error rate (WER) on silent speech by up to 14.3% relative. Our best average silent speech WER is 34.7%, and our best WER on audibly spoken speech is 16.8%.
引用
收藏
页码:2515 / 2526
页数:12
相关论文
共 50 条
  • [1] Investigations on Speaking Mode Discrepancies in EMG-based Speech Recognition
    Wand, Michael
    Janke, Matthias
    Schultz, Tanja
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 608 - 611
  • [2] Impact of Different Speaking Modes on EMG-based Speech Recognition
    Wand, Michael
    Jou, Szu-Chen Stan
    Toth, Arthur R.
    Schultz, Tanja
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 640 - +
  • [3] SESSION-INDEPENDENT EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    [J]. BIOSIGNALS 2011, 2011, : 295 - 300
  • [4] ANALYSIS OF PHONE CONFUSION IN EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 757 - 760
  • [5] Modeling coarticulation in EMG-based continuous speech recognition
    Schultz, Tanja
    Wand, Michael
    [J]. SPEECH COMMUNICATION, 2010, 52 (04) : 341 - 353
  • [6] Impact of Different Feedback Mechanisms in EMG-based Speech Recognition
    Herff, Christian
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2224 - 2227
  • [7] A Spectral Mapping Method for EMG-based Recognition of Silent Speech
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    [J]. BIO-INSPIRED HUMAN- MACHINE INTERFACES AND HEALTHCARE APPLICATIONS, 2010, : 22 - 31
  • [8] Multi-stream HMM for EMG-based speech recognition
    Manabe, H
    Zhang, Z
    [J]. PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 4389 - 4392
  • [9] EMG-based speech recognition using dimensionality reduction methods
    Anat Ratnovsky
    Sarit Malayev
    Shahar Ratnovsky
    Sara Naftali
    Neta Rabin
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 597 - 607
  • [10] EMG-based speech recognition using dimensionality reduction methods
    Ratnovsky, Anat
    Malayev, Sarit
    Ratnovsky, Shahar
    Naftali, Sara
    Rabin, Neta
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (1) : 597 - 607