FURTHER INVESTIGATIONS ON EMG-TO-SPEECH CONVERSION

被引:0
|
作者
Janke, Matthias [1 ]
Wand, Michael [1 ]
Nakamura, Keigo [1 ]
Schultz, Tanja [1 ]
机构
[1] Karlsruhe Inst Technol KIT, Cognit Syst Lab, Karlsruhe, Germany
关键词
Silent Speech; Electromyography; Speech Synthesis; Voice Conversion;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Our study deals with a Silent Speech Interface based on mapping surface electromyographic (EMG) signals to speech waveforms. Electromyographic signals recorded from the facial muscles capture the activity of the human articulatory apparatus and therefore allow to retrace speech, even when no audible signal is produced. The mapping of EMG signals to speech is done via a Gaussian mixture model (GMM)-based conversion technique. In this paper, we follow the lead of EMG-based speech-to-text systems and apply two major recent technological advances to our system, namely, we consider session-independent systems, which are robust against electrode repositioning, and we show that mapping the EMG signal to whispered speech creates a better speech signal than a mapping to normally spoken speech. We objectively evaluate the performance of our systems using a spectral distortion measure.
引用
收藏
页码:365 / 368
页数:4
相关论文
共 50 条
  • [1] CSL-EMG Array: An Open Access Corpus for EMG-to-Speech Conversion
    Diener, Lorenz
    Vishkasougheh, Mehrdad Roustay
    Schultz, Tanja
    INTERSPEECH 2020, 2020, : 3745 - 3749
  • [2] Codebook Clustering for Unit Selection based EMG-to-Speech Conversion
    Diener, Lorenz
    Janke, Matthias
    Schultz, Tanja
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2420 - 2424
  • [3] Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion
    Diener, Lorenz
    Schultz, Tanja
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3162 - 3166
  • [4] IMPROVING FUNDAMENTAL FREQUENCY GENERATION IN EMG-TO-SPEECH CONVERSION USING A QUANTIZATION APPROACH
    Diener, Lorenz
    Umesh, Tejas
    Schultz, Tanja
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 682 - 689
  • [5] Towards EMG-to-Speech with a Necklace Form Factor
    Wu, Peter
    Kaveh, Ryan
    Nautiyal, Raghav
    Zhang, Christine
    Guo, Albert
    Kachinthayal, Anvitha
    Mishra, Tavish
    Yu, Bohan
    Black, Alan W.
    Krishna, Rikky Gopala
    Anumanchipalli, K.
    INTERSPEECH 2024, 2024, : 402 - 406
  • [6] EMG-to-Speech: Direct Generation of Speech From Facial Electromyographic Signals
    Janke, Matthias
    Diener, Lorenz
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2375 - 2385
  • [7] Multiaccent EMG-to-Speech Optimized Transduction With PerFL and MAML Adaptations
    Ullah, Shan
    Kim, Deok-Hwan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [8] Investigations on Speaking Mode Discrepancies in EMG-based Speech Recognition
    Wand, Michael
    Janke, Matthias
    Schultz, Tanja
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 608 - 611
  • [9] MOTOR-MOTOR ADAPTATION TO SPEECH - FURTHER INVESTIGATIONS
    SHUSTER, LI
    PERCEPTUAL AND MOTOR SKILLS, 1990, 71 (01) : 275 - 280
  • [10] An Optimized EMG Encoder to minimize soft speech loss for speech to EMG conversions
    Ullah, Shan
    Kim, Deok-Hwan
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 215 - 218