Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise

被引:4
|
作者
Cabanas-Molero, Pablo [1 ]
Martinez-Munoz, Damian [1 ]
Vera-Candeas, Pedro [1 ]
Ruiz-Reyes, Nicolas [1 ]
Jose Rodriguez-Serrano, Francisco [1 ]
机构
[1] Univ Jaen, Polytech Sch, Dept Telecommun Engn, Jaen 23700, Spain
关键词
hearing aids; speech enhancement; signal-to-noise ratios; voicing classifier; speech sentences database; fluctuating noise; signal-adaptive decision; nonstationary noise; adaptive aperiodicity thresholding; voicing detection; FUNDAMENTAL-FREQUENCY ESTIMATION; SPECTRAL SUBTRACTION; ENVIRONMENTS; ESTIMATOR;
D O I
10.1049/iet-spr.2012.0224
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, the authors present a novel voicing detection algorithm which employs the well-known aperiodicity measure to detect voiced speech in signals contaminated with non-stationary noise. The method computes a signal-adaptive decision threshold which takes into account the current noise level, enabling voicing detection by direct comparison with the extracted aperiodicity. This adaptive threshold is updated at each frame by making a simple estimate of the current noise power, and thus is adapted to fluctuating noise conditions. Once the aperiodicity is computed, the method only requires a small number of operations, and enables its implementation in challenging devices (such as hearing aids) if an efficient approximation of the difference function is employed to extract the aperiodicity. Evaluation over a database of speech sentences degraded by several types of noise reveals that the proposed voicing classifier is robust against different noises and signal-to-noise ratios. In addition, to evaluate the applicability of the method for speech enhancement, a simple F-0-based speech enhancement algorithm integrating the proposed classifier is implemented. The system is shown to achieve competitive results, in terms of objective measures, when compared with other well-known speech enhancement approaches.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [21] Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments
    Duan, Zhiyao
    Mysore, Gautham J.
    Smaragdis, Paris
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 594 - 597
  • [22] Non-stationary noise estimation with adaptive filters
    Bennis, RJM
    Chu, QP
    Mulder, JA
    AIAA GUIDANCE, NAVIGATION, AND CONTROL CONFERENCE, VOLS 1-3: A COLLECTION OF TECHNICAL PAPERS, 1999, : 1769 - 1782
  • [23] An Algorithm of Single-Microphone Telephone Speech Enhancement in Non-Stationary Noise Environment
    Yao, Yuan
    Wang, Xia
    Xue, Tao
    2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2012,
  • [24] A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
    Lun, Daniel P. K.
    Shen, Tak-Wai
    Ho, K. C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 335 - 346
  • [25] A novel expectation-maximization framework for speech enhancement in non-stationary noise environments
    Lun, Daniel P. K.
    Shen, Tak-Wai
    Ho, K.C.
    IEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (02): : 335 - 346
  • [26] MODEL-BASED NOISE PSD ESTIMATION FROM SPEECH IN NON-STATIONARY NOISE
    Nielsen, Jesper Kjaer
    Kavalekalam, Mathew Shaji
    Christensen, Mads Graesboll
    Boldt, Jesper
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5424 - 5428
  • [27] Dynamic adjustment of the forgetting factor in adaptive filters for non-stationary noise cancellation in speech
    Martinez, R
    Gomez, P
    Alvarez, A
    Nieto, V
    Rodellar, V
    Rubio, M
    Perez, M
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1009 - 1012
  • [28] Wavelet-domain soft-thresholding for non-stationary noise
    Lo, Wan Yee
    Selesnick, Ivan W.
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1441 - +
  • [29] A Non-iterative Kalman Filter for Single Channel Speech Enhancement in Non-stationary Noise Condition
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [30] Voice activity detection in non-stationary noise
    Li Ye
    Wang Tong
    Cui Huijuan
    Tang Kun
    2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1573 - +