A PITCH BASED NOISE ESTIMATION TECHNIQUE FOR ROBUST SPEECH RECOGNITION WITH MISSING DATA

Cited by: 0
Authors
Morales-Cordovilla, Juan A. [1]
Ma, Ning [2]
Sanchez, Victoria [1]
Carmona, Jose L. [1]
Peinado, Antonio M. [1]
Barker, Jon [2]
Affiliations
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
[2] Univ Sheffield, Dept Comp Sci, Sheffield, South Yorkshire, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
Robust speech recognition; missing data; noise estimation; VAD; harmonic tunnelling;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents a noise estimation technique for robust speech recognition that relies on pitch information. In the first stage, the noise is estimated by extrapolating it from frames where speech is believed to be absent. These frames are detected with a proposed pitch-based VAD (Voice Activity Detector). In the second stage, the noise estimate is revised in voiced frames using the harmonic tunnelling technique. At high SNRs, the tunnelling noise estimate is used as an upper bound on the noise rather than as an estimate in its own right. A spectrogram MD (Missing Data) recognition system is chosen to evaluate the proposed noise estimation. The proposed system is compared on Aurora-2 with other similar techniques such as cepstral SS (Spectral Subtraction).
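The two stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the representation of pitch in FFT-bin units, the rule that any frame with a detected pitch counts as speech, linear interpolation between speech-free frames, and the element-wise minimum used to apply the tunnelling estimate as an upper bound are all assumptions made for illustration.

```python
def hold_interp(xs, ys, x):
    """Linearly interpolate at x from sorted points xs; hold edge values outside."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    for i in range(1, len(xs)):
        if x <= xs[i]:
            w = (x - xs[i - 1]) / (xs[i] - xs[i - 1])
            return (1 - w) * ys[i - 1] + w * ys[i]


def stage1_noise(spec, is_speech):
    """Stage 1 (assumed form): fill every frame from the speech-free frames."""
    noise_idx = [t for t, s in enumerate(is_speech) if not s]
    if not noise_idx:
        raise ValueError("need at least one speech-free frame")
    n_bins = len(spec[0])
    est = [[0.0] * n_bins for _ in spec]
    for b in range(n_bins):
        ys = [spec[t][b] for t in noise_idx]
        for t in range(len(spec)):
            est[t][b] = hold_interp(noise_idx, ys, t)
    return est


def tunnel_noise(frame, f0_bin):
    """Stage 2 (assumed form): sample the spectrum midway between pitch
    harmonics (the 'tunnels') and interpolate across frequency."""
    n_bins = len(frame)
    pos = set()
    k = 0
    while True:
        p = int(round((k + 0.5) * f0_bin))
        if p >= n_bins:
            break
        pos.add(p)
        k += 1
    if not pos:
        return list(frame)
    xs = sorted(pos)
    ys = [frame[p] for p in xs]
    return [hold_interp(xs, ys, b) for b in range(n_bins)]


def estimate_noise(spec, f0_bins):
    """Combine both stages. f0_bins[t] is the pitch of frame t in FFT-bin
    units (hypothetical convention), or None when no pitch was found."""
    is_speech = [f0 is not None for f0 in f0_bins]
    first = stage1_noise(spec, is_speech)
    out = []
    for frame, n1, f0 in zip(spec, first, f0_bins):
        if f0 is None:
            out.append(n1)
        else:
            # Assumption: the tunnelling estimate caps the stage-1 estimate.
            n2 = tunnel_noise(frame, f0)
            out.append([min(a, b) for a, b in zip(n1, n2)])
    return out
```

For example, `estimate_noise(spec, [None, 4, None])` treats the first and last frames as noise only, interpolates between them for the middle frame, and then caps that result with the values read between the harmonics of a pitch of 4 bins. A real system would also need to handle unvoiced speech, which this pitch-only speech decision ignores.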
Pages: 4808 - 4811
Number of pages: 4
Related papers
50 in total
  • [1] Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition
    Gemmeke, Jort Florent
    Van Hamme, Hugo
    Cranen, Bert
    Boves, Lou
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (02) : 272 - 287
  • [2] Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition
    Badiezadegan, Shirin
    Rose, Richard C.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2062 - 2065
  • [3] Missing data techniques for robust speech recognition
    Cooke, M
    Morris, A
    Green, P
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 863 - 866
  • [4] Robust speech recognition using missing feature theory and target speech enhancement based on degenerate unmixing and estimation technique
    Kim, Minook
    Kim, Ji-Seon
    Park, Hyung-Min
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, NEURAL NETWORKS, BIOSYSTEMS, AND NANOENGINEERING IX, 2011, 8058
  • [5] SPECTRAL ESTIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    ERELL, A
    WEINTRAUB, M
    SPEECH AND NATURAL LANGUAGE, 1989, : 319 - 324
  • [6] Feature compensation based on independent noise estimation for robust speech recognition
    Lu, Yong
    Lin, Han
    Wu, Pingping
    Chen, Yitao
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [7] Feature compensation based on independent noise estimation for robust speech recognition
    Yong Lü
    Han Lin
    Pingping Wu
    Yitao Chen
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [8] Sequential MAP estimation based speech feature enhancement for noise robust speech recognition
    Jia, C
    Ding, P
    Xu, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 412 - 415
  • [9] Mask estimation based on sound localisation for missing data speech recognition
    Harding, S
    Barker, J
    Brown, GJ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 537 - 540
  • [10] A robust pitch estimation approach for colored noise-corrupted speech
    Shahnaz, C
    Zhu, WP
    Ahmad, MO
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 3143 - 3146