Combining standard and throat microphones for robust speech recognition

被引:66
|
作者
Graciarena, M
Franco, H
Sonmez, K
Bratt, H
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina
关键词
noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;
D O I
10.1109/LSP.2003.808549
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
引用
收藏
页码:72 / 74
页数:3
相关论文
共 50 条
  • [21] Throat Microphone Speech Recognition using MFCC
    Vijayan, Amritha
    Mathai, Bipil Mary
    Valsalan, Karthik
    Johnson, Riyanka Raji
    Mathew, Lani Rachel
    Gopakumar, K.
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 392 - 395
  • [22] Combining MMSE enhancement with LA model adaptation for robust automatic speech recognition
    Ding, P
    Cao, ZG
    ELECTRONICS LETTERS, 2001, 37 (08) : 539 - 540
  • [23] COMBINING MISSING-DATA RECONSTRUCTION AND UNCERTAINTY DECODING FOR ROBUST SPEECH RECOGNITION
    Gonzalez, Jose A.
    Peinado, Antonio M.
    Gomez, Angel M.
    Ma, Ning
    Barker, Jon
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4693 - 4696
  • [24] ON COMBINING DNN AND GMM WITH UNSUPERVISED SPEAKER ADAPTATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Liu, Shilin
    Sim, Khe Chai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Combining the modified CTRANC and posterior union model for robust distant speech recognition
    Lin, Jie
    Li, Jianping
    Ming, Ji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1068 - +
  • [26] Speech enhancement for robust speech recognition in car environments using grifriths-jim ANC based on two-paired microphones
    Cho, YS
    Ko, HS
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, PROCEEDINGS, 2004, : 123 - 127
  • [27] Noise robust speech recognition by combining speech enhancement in the wavelet domain and Lin-log RASTA
    Yang Jie
    Wang Zhenli
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 415 - +
  • [28] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [29] An analysis of the effect of combining standard and alternate sensor signals on recognition of syllabic units for multimodal speech recognition
    Radha, N.
    Shahina, A.
    Prabha, P.
    Sri, Preethi B. T.
    Khan, Nayeemulla A.
    PATTERN RECOGNITION LETTERS, 2018, 115 : 39 - 49
  • [30] Combined Use of Standard and Throat Microphones for Measurement of Acoustic Voice Parameters and Voice Categorization
    Uloza, Virgilijus
    Padervinskis, Evaldas
    Uloziene, Ingrida
    Saferis, Viktoras
    Verikas, Antanas
    JOURNAL OF VOICE, 2015, 29 (05) : 552 - 559