Combining standard and throat microphones for robust speech recognition

被引：66

作者：

Graciarena, M

Franco, H

Sonmez, K

Bratt, H

机构：

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina

来源：

IEEE SIGNAL PROCESSING LETTERS | 2003年 / 10卷 / 03期

关键词：

noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;

D O I：

10.1109/LSP.2003.808549

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

引用

页码：72 / 74

页数：3

共 50 条

[1] Feature Vector Normalization with Combined Standard and Throat Microphones for Robust ASR
Buera, Luis
Miguel, Antonio
Saz, Oscar
Ortega, Alfonso
Lleida, Eduardo
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1289 - 1292
[2] Multi-sensory microphones for robust speech detection, enhancement and recognition
Zhang, ZY
Liu, ZC
Sinclair, M
Acero, A
Deng, L
Droppo, J
Huang, XD
Zheng, YL
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 781 - 784
[3] Speech intelligibility in noise using throat and acoustic microphones
Acker-Mills, BE
Houtsma, AJM
Ahroon, WA
AVIATION SPACE AND ENVIRONMENTAL MEDICINE, 2006, 77 (01): : 26 - 31
[4] Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech
Sahidullah, Md
Hautamaki, Rosa Gonzalez
Thomsen, Dennis Alexander Lehmann
Kinntinenl, Tomi
Tang, Zheng-Hua
Hautamaki, Ville
Parts, Robert
Pitkanen, Martti
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1720 - 1724
[5] Combining speech enhancement and auditory feature extraction for robust speech recognition
Kleinschmidt, M
Tchorz, J
Kollmeier, B
SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
[6] Combining Binaural and Cortical Features for Robust Speech Recognition
Spille, Constantin
Kollmeier, Birger
Meyer, Bernd T.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 756 - 767
[7] Robust Speech Recognition Combining Cepstral and Articulatory Features
Zha, Zhuan-ling
Hu, Jin
Zhan, Qing-ran
Shan, Ya-hui
Xie, Xiang
Wang, Jing
Cheng, Hao-bo
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
[8] Voice Analysis Using Acoustic and Throat Microphones for Speech Therapy
Mathew, Lani Rachel
Gopakumar, K.
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 173 - 174
[9] Privacy Preserving Continuous Speech Recording using Throat Microphones
Schneegans, Tim
Simmon, Leon
Beigl, Michael
PROCEEDINGS OF THE 2022 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, ISWC 2022, 2022, : 106 - 108
[10] Combining speech enhancement with feature post-processing for robust speech recognition
Lei, Jianjun
Guo, Jun
Liu, Gang
Wang, Jian
Nie, Xiangfei
Yang, Zhen
INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 773 - 778

← 1 2 3 4 5 →