FURTHER INVESTIGATIONS ON EMG-TO-SPEECH CONVERSION

Citations: 0
Authors
Janke, Matthias [1]
Wand, Michael [1]
Nakamura, Keigo [1]
Schultz, Tanja [1]
Affiliation
[1] Karlsruhe Institute of Technology (KIT), Cognitive Systems Lab, Karlsruhe, Germany
Keywords
Silent Speech; Electromyography; Speech Synthesis; Voice Conversion
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Our study deals with a Silent Speech Interface based on mapping surface electromyographic (EMG) signals to speech waveforms. EMG signals recorded from the facial muscles capture the activity of the human articulatory apparatus and therefore allow speech to be retraced even when no audible signal is produced. The mapping of EMG signals to speech is done via a Gaussian mixture model (GMM)-based conversion technique. In this paper, we follow the lead of EMG-based speech-to-text systems and apply two major recent technological advances to our system: we consider session-independent systems, which are robust against electrode repositioning, and we show that mapping the EMG signal to whispered speech yields a better speech signal than mapping it to normally spoken speech. We objectively evaluate the performance of our systems using a spectral distortion measure.
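The abstract does not name the spectral distortion measure it uses. As an illustration only, the sketch below computes mel-cepstral distortion (MCD), a measure often used in voice conversion work; treating it as the paper's measure is an assumption. The function name and the input layout (time-aligned lists of mel-cepstral frames, with the 0th energy coefficient skipped) are likewise hypothetical.

```python
import math


def mel_cepstral_distortion(ref_frames, conv_frames):
    """Mean mel-cepstral distortion in dB between two time-aligned sequences.

    Each argument is a list of frames; each frame is a list of mel-cepstral
    coefficients. The 0th coefficient (energy) is excluded, a common
    convention that is assumed here rather than taken from the paper.
    """
    if not ref_frames or len(ref_frames) != len(conv_frames):
        raise ValueError("sequences must be non-empty and time-aligned")

    scale = 10.0 / math.log(10.0)  # converts natural-log units to dB
    total = 0.0
    for ref, conv in zip(ref_frames, conv_frames):
        if len(ref) != len(conv):
            raise ValueError("frames must have the same dimensionality")
        # Squared Euclidean distance over coefficients 1..D
        squared = sum((r - c) ** 2 for r, c in zip(ref[1:], conv[1:]))
        total += scale * math.sqrt(2.0 * squared)
    return total / len(ref_frames)
```

Lower values indicate that the converted speech is spectrally closer to the reference. In practice the two sequences would first have to be time-aligned, for example by dynamic time warping, which this sketch leaves out.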
Pages: 365-368
Number of pages: 4