Speaker-Adaptive Speech Recognition Based on Surface Electromyography

被引：0

作者：

Wand, Michael ^{[1
]}

Schultz, Tanja ^{[1
]}

机构：

[1] Univ Karlsruhe TH, Karlsruhe, Germany

来源：

BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES | 2010年 / 52卷

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We present our recent advances in silent speech interfaces using electromyographic signals that capture the movements of the human articulatory muscles at the skin surface for recognizing continuously spoken speech. Previous systems were limited to speaker- and session-dependent recognition tasks on small amounts of training and test data. In this article we present speaker-independent and speaker-adaptive training methods which allow us to use a large corpus of data from many speakers to train acoustic models more reliably. We use the speaker-dependent system as baseline, carefully tuning the data preprocessing and acoustic modeling. Then on our corpus we compare the performance of speaker-dependent and speaker-independent acoustic models and carry out model adaptation experiments.

引用

页码：271 / 285

页数：15

共 50 条

[31] Speaker Adaptive Classification Procedure for Speech Recognition.
Katterfeldt, Harald
Thon, Werner
1974, 27 (06): : 230 - 232
[32] Adaptive systems for unsupervised speaker tracking and speech recognition
Herbig, Tobias
Gerl, Franz
Minker, Wolfgang
Haeb-Umbach, Reinhold
EVOLVING SYSTEMS, 2011, 2 (03) : 199 - 214
[33] Speaker-Adaptive Multimodal Prediction Model for Listener Responses
de Kok, Iwan
Heylen, Dirk
Morency, Louis-Philippe
ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 51 - 58
[34] Towards Continuous Speech Recognition Using Surface Electromyography
Jou, Szu-Chen
Schultz, Tanja
Walliczek, Matthias
Kraft, Florian
Waibel, Alex
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 573 - +
[35] Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription
Silovsky, Jan
Cerva, Petr
Zdansky, Jindrich
Nouza, Jan
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 478 - 481
[36] Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech
Vojtech, Jennifer M.
Chan, Michael D.
Shiwani, Bhawna
Roy, Serge H.
Heaton, James T.
Meltzner, Geoffrey S.
Contessa, Paola
De Luca, Gianluca
Patel, Rupal
Kline, Joshua C.
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2021, 64 (06): : 2134 - 2153
[37] Deep learning-based speaker-adaptive postfiltering with limited adaptation data for embedded text-to-speech synthesis systems
Eren, Eray
Demiroglu, Cenk
COMPUTER SPEECH AND LANGUAGE, 2023, 81
[38] Speaker Recognition and Speech Emotion Recognition Based on GMM
Xu, Shupeng
Liu, Yan
Liu, Xiping
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
[39] Speaker-Adaptive Lip Reading with User-Dependent Padding
Kim, Minsu
Kim, Hyunjun
Ro, Yong Man
COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 576 - 593
[40] Speaker Adaptive Model for Hindi Speech using Kaldi Speech Recognition toolkit
Upadhyaya, Prashant
Mittal, Sanjeev Kumar
Varshney, Yash Vardhan
Farooq, Omar
Abidi, Musiur Raza
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 222 - 226

← 1 2 3 4 5 →