Combining standard and throat microphones for robust speech recognition

被引:66
|
作者
Graciarena, M
Franco, H
Sonmez, K
Bratt, H
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina
关键词
noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;
D O I
10.1109/LSP.2003.808549
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
引用
收藏
页码:72 / 74
页数:3
相关论文
共 50 条
  • [1] Feature Vector Normalization with Combined Standard and Throat Microphones for Robust ASR
    Buera, Luis
    Miguel, Antonio
    Saz, Oscar
    Ortega, Alfonso
    Lleida, Eduardo
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1289 - 1292
  • [2] Multi-sensory microphones for robust speech detection, enhancement and recognition
    Zhang, ZY
    Liu, ZC
    Sinclair, M
    Acero, A
    Deng, L
    Droppo, J
    Huang, XD
    Zheng, YL
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 781 - 784
  • [3] Speech intelligibility in noise using throat and acoustic microphones
    Acker-Mills, BE
    Houtsma, AJM
    Ahroon, WA
    [J]. AVIATION SPACE AND ENVIRONMENTAL MEDICINE, 2006, 77 (01): : 26 - 31
  • [4] Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech
    Sahidullah, Md
    Hautamaki, Rosa Gonzalez
    Thomsen, Dennis Alexander Lehmann
    Kinntinenl, Tomi
    Tang, Zheng-Hua
    Hautamaki, Ville
    Parts, Robert
    Pitkanen, Martti
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1720 - 1724
  • [5] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
  • [6] Combining Binaural and Cortical Features for Robust Speech Recognition
    Spille, Constantin
    Kollmeier, Birger
    Meyer, Bernd T.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 756 - 767
  • [7] Robust Speech Recognition Combining Cepstral and Articulatory Features
    Zha, Zhuan-ling
    Hu, Jin
    Zhan, Qing-ran
    Shan, Ya-hui
    Xie, Xiang
    Wang, Jing
    Cheng, Hao-bo
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
  • [8] Voice Analysis Using Acoustic and Throat Microphones for Speech Therapy
    Mathew, Lani Rachel
    Gopakumar, K.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 173 - 174
  • [9] Privacy Preserving Continuous Speech Recording using Throat Microphones
    Schneegans, Tim
    Simmon, Leon
    Beigl, Michael
    [J]. PROCEEDINGS OF THE 2022 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, ISWC 2022, 2022, : 106 - 108
  • [10] Combining speech enhancement with feature post-processing for robust speech recognition
    Lei, Jianjun
    Guo, Jun
    Liu, Gang
    Wang, Jian
    Nie, Xiangfei
    Yang, Zhen
    [J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 773 - 778