Towards EMG-to-Speech with a Necklace Form Factor

被引:0
|
作者
Wu, Peter [1 ]
Kaveh, Ryan [1 ]
Nautiyal, Raghav [1 ]
Zhang, Christine [1 ]
Guo, Albert [1 ]
Kachinthayal, Anvitha [1 ]
Mishra, Tavish [1 ]
Yu, Bohan [1 ]
Black, Alan W. [1 ]
Krishna, Rikky Gopala [1 ]
Anumanchipalli, K. [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
关键词
electromyography; EMG; EMG-to-speech; DRY; WIRELESS; SIGNALS; DEVICE;
D O I
10.21437/Interspeech.2024-1568
中图分类号
学科分类号
摘要
Electrodes for decoding speech from electromyography (EMG) are typically placed on the face, requiring adhesives that are inconvenient and skin-irritating if used regularly. We explore a different device form factor, where dry electrodes are placed around the neck instead. 11-word, multi-speaker voiced EMG classifiers trained on data recorded with this device achieve 92.7% accuracy. Ablation studies reveal the importance of having more than two electrodes on the neck, and phonological analyses reveal similar classification confusions between neck-only and neck-and-face form factors. Finally, speech-EMG correlation experiments demonstrate a linear relationship between many EMG spectrogram frequency bins and self-supervised speech representation dimensions.
引用
收藏
页码:402 / 406
页数:5
相关论文
共 50 条
  • [31] Progress towards speech models that model speech
    Russell, M
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 115 - 123
  • [32] COMPLIMENT AS A FORM OF SPEECH ETIQUETTE
    Titova, Zh. N.
    Orlova, E. V.
    Zarubina, D. N.
    JOURNAL OF MINING INSTITUTE, 2011, 193 : 290 - 292
  • [33] The scientific model as a form of speech
    Sutton, C
    RESEARCH IN SCIENCE EDUCATION IN EUROPE: CURRENT ISSUES AND THEMES, 1996, : 143 - 152
  • [34] EVASIVE SPEECH AS A FORM OF RESISTANCE
    Evans, William N.
    PSYCHOANALYTIC QUARTERLY, 1953, 22 (04): : 548 - 560
  • [35] POETRY BETWEEN FORM AND SPEECH
    STAMAC, A
    NEOHELICON, 1982, 9 (01) : 103 - 114
  • [36] SUBVOCAL REHERSAL AS A FORM OF SPEECH
    LOCKE, JL
    JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1970, 9 (05): : 495 - &
  • [37] Feasibility of facial EMG in gender classification during speech production
    Godiyal, Anoop Kant
    Sharma, Richa
    Joshi, Deepak
    Bhatia, Dinesh
    Journal of Medical Engineering and Technology, 2013, 37 (02): : 86 - 90
  • [38] A Fusion of EMG and IMU for an Augmentative Speech Detection and Recognition System
    Shafiq, Uzma
    Waris, Asim
    Iqbal, Javaid
    Gilani, Syed Omer
    IEEE ACCESS, 2024, 12 : 14027 - 14039
  • [39] ANALYSIS OF PHONE CONFUSION IN EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 757 - 760
  • [40] SESSION-INDEPENDENT EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    BIOSIGNALS 2011, 2011, : 295 - 300