Towards EMG-to-Speech with a Necklace Form Factor

被引:0
|
作者
Wu, Peter [1 ]
Kaveh, Ryan [1 ]
Nautiyal, Raghav [1 ]
Zhang, Christine [1 ]
Guo, Albert [1 ]
Kachinthayal, Anvitha [1 ]
Mishra, Tavish [1 ]
Yu, Bohan [1 ]
Black, Alan W. [1 ]
Krishna, Rikky Gopala [1 ]
Anumanchipalli, K. [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
关键词
electromyography; EMG; EMG-to-speech; DRY; WIRELESS; SIGNALS; DEVICE;
D O I
10.21437/Interspeech.2024-1568
中图分类号
学科分类号
摘要
Electrodes for decoding speech from electromyography (EMG) are typically placed on the face, requiring adhesives that are inconvenient and skin-irritating if used regularly. We explore a different device form factor, where dry electrodes are placed around the neck instead. 11-word, multi-speaker voiced EMG classifiers trained on data recorded with this device achieve 92.7% accuracy. Ablation studies reveal the importance of having more than two electrodes on the neck, and phonological analyses reveal similar classification confusions between neck-only and neck-and-face form factors. Finally, speech-EMG correlation experiments demonstrate a linear relationship between many EMG spectrogram frequency bins and self-supervised speech representation dimensions.
引用
收藏
页码:402 / 406
页数:5
相关论文
共 50 条
  • [1] FURTHER INVESTIGATIONS ON EMG-TO-SPEECH CONVERSION
    Janke, Matthias
    Wand, Michael
    Nakamura, Keigo
    Schultz, Tanja
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 365 - 368
  • [2] CSL-EMG Array: An Open Access Corpus for EMG-to-Speech Conversion
    Diener, Lorenz
    Vishkasougheh, Mehrdad Roustay
    Schultz, Tanja
    INTERSPEECH 2020, 2020, : 3745 - 3749
  • [3] EMG-to-Speech: Direct Generation of Speech From Facial Electromyographic Signals
    Janke, Matthias
    Diener, Lorenz
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2375 - 2385
  • [4] Codebook Clustering for Unit Selection based EMG-to-Speech Conversion
    Diener, Lorenz
    Janke, Matthias
    Schultz, Tanja
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2420 - 2424
  • [5] Multiaccent EMG-to-Speech Optimized Transduction With PerFL and MAML Adaptations
    Ullah, Shan
    Kim, Deok-Hwan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [6] Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion
    Diener, Lorenz
    Schultz, Tanja
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3162 - 3166
  • [7] IMPROVING FUNDAMENTAL FREQUENCY GENERATION IN EMG-TO-SPEECH CONVERSION USING A QUANTIZATION APPROACH
    Diener, Lorenz
    Umesh, Tejas
    Schultz, Tanja
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 682 - 689
  • [8] Towards a Measurement of the ωπ Transition Form Factor
    Khan, Farha Anjum
    MESON 2012 - 12TH INTERNATIONAL WORKSHOP ON PRODUCTION, PROPERTIES AND INTERACTION OF MESONS, 2012, 37
  • [9] An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions
    Ullah, Shan
    Kim, Deok-Hwan
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 215 - 218
  • [10] SpeeChin: A Smart Necklace for Silent Speech Recognition
    Zhang, Ruidong
    Chen, Mingyang
    Steeper, Benjamin
    Li, Yaxuan
    Yan, Zihan
    Chen, Yizhuo
    Tao, Songyun
    Chen, Tuochao
    Lim, Hyunchul
    Zhang, Cheng
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (04):