Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise

Cited by: 4
Authors
Cabanas-Molero, Pablo [1]
Martinez-Munoz, Damian [1]
Vera-Candeas, Pedro [1]
Ruiz-Reyes, Nicolas [1]
Rodriguez-Serrano, Francisco Jose [1]
Affiliation
[1] Univ Jaen, Polytech Sch, Dept Telecommun Engn, Jaen 23700, Spain
Keywords
hearing aids; speech enhancement; signal-to-noise ratios; voicing classifier; speech sentences database; fluctuating noise; signal-adaptive decision; nonstationary noise; adaptive aperiodicity thresholding; voicing detection; fundamental-frequency estimation; spectral subtraction; environments; estimator
DOI
10.1049/iet-spr.2012.0224
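Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809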
Abstract
In this study, the authors present a novel voicing detection algorithm that employs the well-known aperiodicity measure to detect voiced speech in signals contaminated with non-stationary noise. The method computes a signal-adaptive decision threshold that takes the current noise level into account, so that voicing is detected by direct comparison with the extracted aperiodicity. This adaptive threshold is updated at each frame from a simple estimate of the current noise power, and thus adapts to fluctuating noise conditions. Once the aperiodicity is computed, the method requires only a small number of operations, which enables its implementation in challenging devices (such as hearing aids) if an efficient approximation of the difference function is employed to extract the aperiodicity. Evaluation over a database of speech sentences degraded by several types of noise reveals that the proposed voicing classifier is robust against different noises and signal-to-noise ratios. In addition, to evaluate the applicability of the method for speech enhancement, a simple F0-based speech enhancement algorithm integrating the proposed classifier is implemented. The system is shown to achieve competitive results, in terms of objective measures, when compared with other well-known speech enhancement approaches.
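The abstract does not give the threshold formula, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes the aperiodicity is the minimum of a YIN-style cumulative-mean-normalized difference function, and it uses a hypothetical threshold rule (a base value plus the ratio of estimated noise power to frame power, capped) together with a hypothetical recursive noise-power update on frames judged unvoiced. The names `adaptive_threshold`, `detect_voicing`, `base`, `cap`, and `smoothing` are all assumptions made for this example.

```python
import numpy as np

def aperiodicity(frame, min_lag, max_lag):
    """Minimum of a YIN-style cumulative-mean-normalized difference function."""
    n = len(frame) - max_lag
    # Difference function d(tau) = sum_j (x[j] - x[j + tau])^2, tau = 1..max_lag
    d = np.array([np.sum((frame[:n] - frame[tau:tau + n]) ** 2)
                  for tau in range(1, max_lag + 1)])
    # Normalize each d(tau) by the running mean of d(1..tau)
    running_mean = np.cumsum(d) / np.arange(1, max_lag + 1)
    cmnd = d / np.maximum(running_mean, 1e-12)
    # Search only lags in the plausible pitch range
    return float(np.min(cmnd[min_lag - 1:]))

def adaptive_threshold(noise_power, frame_power, base=0.1, cap=0.5):
    """Hypothetical rule: the threshold grows with the noise-to-frame power ratio."""
    ratio = noise_power / max(frame_power, 1e-12)
    return min(cap, base + ratio)

def detect_voicing(frames, min_lag, max_lag, smoothing=0.9):
    """Per-frame voiced/unvoiced decisions with a running noise-power estimate."""
    # Assumption: the first frame contains noise only
    noise_power = float(np.mean(frames[0] ** 2))
    decisions = []
    for frame in frames:
        frame_power = float(np.mean(frame ** 2))
        threshold = adaptive_threshold(noise_power, frame_power)
        voiced = aperiodicity(frame, min_lag, max_lag) < threshold
        if not voiced:
            # Hypothetical noise tracker: recursive average over unvoiced frames
            noise_power = smoothing * noise_power + (1 - smoothing) * frame_power
        decisions.append(bool(voiced))
    return decisions
```

With 50 ms frames at 8 kHz, `min_lag=20` and `max_lag=100` would cover a pitch range of roughly 80 to 400 Hz. The threshold rule is motivated by the observation that, for a periodic signal in additive white noise, the normalized difference at the true period is roughly the ratio of noise power to total power.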
Pages: 119-130
Page count: 12