Towards EMG-to-Speech with a Necklace Form Factor

Authors:

Wu, Peter [1]

Kaveh, Ryan [1]

Nautiyal, Raghav [1]

Zhang, Christine [1]

Guo, Albert [1]

Kachinthayal, Anvitha [1]

Mishra, Tavish [1]

Yu, Bohan [1]

Black, Alan W. [1]

Muller, Rikky [1]

Anumanchipalli, Gopala Krishna [1]

Affiliation:

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

Source:

INTERSPEECH 2024 | 2024

Keywords:

electromyography; EMG; EMG-to-speech; DRY; WIRELESS; SIGNALS; DEVICE

DOI:

10.21437/Interspeech.2024-1568

Abstract:

Electrodes for decoding speech from electromyography (EMG) are typically placed on the face, requiring adhesives that are inconvenient and skin-irritating if used regularly. We explore a different device form factor, where dry electrodes are placed around the neck instead. 11-word, multi-speaker voiced EMG classifiers trained on data recorded with this device achieve 92.7% accuracy. Ablation studies reveal the importance of having more than two electrodes on the neck, and phonological analyses reveal similar classification confusions between neck-only and neck-and-face form factors. Finally, speech-EMG correlation experiments demonstrate a linear relationship between many EMG spectrogram frequency bins and self-supervised speech representation dimensions.
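The speech-EMG correlation analysis mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state how the correlations were computed, so the following assumes a plain Pearson correlation between each EMG spectrogram frequency bin and each speech representation dimension over time-aligned frames. The function name `bin_dim_correlation`, the array shapes, and the synthetic data are all hypothetical and stand in for real recordings and real self-supervised features.

```python
import numpy as np


def bin_dim_correlation(emg_spec, speech_feats):
    """Pearson correlation between every EMG bin and every speech dimension.

    emg_spec:     array of shape (T, F), T frames by F spectrogram bins.
    speech_feats: array of shape (T, D), T frames by D representation dims.
    Returns an (F, D) matrix of correlation coefficients.
    Assumes the two inputs are already aligned frame by frame.
    """
    # Center each column so the dot products below are covariances (up to scale).
    x = emg_spec - emg_spec.mean(axis=0, keepdims=True)
    y = speech_feats - speech_feats.mean(axis=0, keepdims=True)
    # Normalize by the per-column norms to obtain correlation coefficients.
    x_norm = np.linalg.norm(x, axis=0)
    y_norm = np.linalg.norm(y, axis=0)
    return (x.T @ y) / np.outer(x_norm, y_norm)


# Synthetic stand-in data (an assumption, not data from the paper).
rng = np.random.default_rng(0)
T, F, D = 500, 8, 4
emg_spec = rng.standard_normal((T, F))
speech_feats = rng.standard_normal((T, D))
# Make speech dimension 0 a noisy linear function of EMG bin 3.
speech_feats[:, 0] = 2.0 * emg_spec[:, 3] + 0.1 * rng.standard_normal(T)

corr = bin_dim_correlation(emg_spec, speech_feats)
```

In this toy setup, the entry `corr[3, 0]` is close to 1 because that pair was constructed to be linearly related, while unrelated pairs stay near 0. A coefficient near 1 or -1 for a given bin and dimension is what a linear relationship of the kind the abstract describes would look like under this assumed measure.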

Pages: 402-406

Page count: 5
