SNR-Progressive Model With Harmonic Compensation for Low-SNR Speech Enhancement

Cited by: 0
Authors
Hou, Zhongshu [1 ,2 ]
Lei, Tong [1 ,2 ]
Hu, Qinwen [1 ,2 ]
Cao, Zhanzhong [3 ]
Lu, Jing [1 ,2 ]
Affiliations
[1] Nanjing Univ, Key Lab Modern Acoust, Nanjing 210008, Peoples R China
[2] Horizon Robot, NJU Horizon Intelligent Audio Lab, Beijing 100094, Peoples R China
[3] Nanjing Inst Informat Technol, Nanjing 210036, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Harmonic analysis; Signal to noise ratio; Speech enhancement; Power harmonic filters; Estimation; Spectrogram; Noise measurement; Filtering; Training; Artificial neural networks; Low-SNR speech enhancement; neural network; pitch estimation; SNR-progressive learning;
DOI
10.1109/LSP.2024.3484288
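Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code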
0808 ; 0809 ;
Abstract
Despite significant progress made in the last decade, deep neural network (DNN) based speech enhancement (SE) still faces the challenge of notable degradation in the quality of recovered speech under low signal-to-noise ratio (SNR) conditions. In this letter, we propose an SNR-progressive speech enhancement model with harmonic compensation for low-SNR SE. Reliable pitch estimation is obtained from the intermediate output, which has the benefit of retaining more speech components than the coarse estimate while possessing a significantly higher SNR than the input noisy speech. An effective harmonic compensation mechanism is introduced for better harmonic recovery. Extensive experiments demonstrate the advantage of our proposed model.
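The abstract does not say how the intermediate (higher-SNR) target is constructed. As a minimal sketch of the general SNR-progressive idea only, the snippet below mixes clean speech with noise at the input SNR and again at a raised SNR to form an intermediate target. The function names and the 10 dB step (`snr_gain_db`) are illustrative assumptions, not values taken from the letter.

```python
import numpy as np


def snr_db(speech, noise):
    """SNR in dB, defined as the ratio of mean speech power to mean noise power."""
    return 10.0 * np.log10(np.mean(speech ** 2) / np.mean(noise ** 2))


def scale_noise_to_snr(speech, noise, target_snr_db):
    """Rescale `noise` so that the speech-to-noise power ratio equals target_snr_db."""
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    target_noise_power = speech_power / (10.0 ** (target_snr_db / 10.0))
    return noise * np.sqrt(target_noise_power / noise_power)


def progressive_targets(speech, noise, input_snr_db, snr_gain_db=10.0):
    """Build (noisy input, intermediate target, clean target).

    The intermediate target keeps the same noise realization but at an SNR
    that is snr_gain_db higher than the input (hypothetical step size).
    """
    noisy_input = speech + scale_noise_to_snr(speech, noise, input_snr_db)
    intermediate = speech + scale_noise_to_snr(
        speech, noise, input_snr_db + snr_gain_db
    )
    return noisy_input, intermediate, speech
```

Under these assumptions, a first stage would be trained toward `intermediate` and a later stage toward the clean signal; where pitch estimation and harmonic compensation plug in is described in the letter itself, not in this sketch.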
Pages: 476 - 480
Page count: 5
Related papers
50 in total
  • [1] Low-SNR Speech Enhancement in Driving Environment
    Yang, Hongxiao
    Wei, Jie
    Zhong, Xiaofeng
    2016 16TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2016, : 151 - 155
  • [2] Low-SNR Speech Enhancement and Separation in Driving Environment
    Wei, Jie
    Li, Lingling
    Zhong, Xiaofeng
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [3] CST: Complex Sparse Transformer for Low-SNR Speech Enhancement
    Tan, Kaijun
    Mao, Wenyu
    Guo, Xiaozhou
    Lu, Huaxiang
    Zhang, Chi
    Cao, Zhanzhong
    Wang, Xingang
    SENSORS, 2023, 23 (05)
  • [4] Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions
    Breithaupt, Colin
    Martin, Rainer
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 277 - 289
  • [5] A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement
    Tu, Yan-Hui
    Du, Jun
    Gao, Tian
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1608 - 1619
  • [6] Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs
    Boucheron, Laura E.
    De Leon, Phillip L.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 574 - 577
  • [7] A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition
    Zhou, Nan
    Du, Jun
    Tu, Yan-Hui
    Gao, Tian
    Lee, Chin-Hui
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 873 - 877
  • [8] Intelligibility for Binaural Speech with Discarded Low-SNR Speech Components
    Schoenmaker, Esther
    van de Par, Steven
    PHYSIOLOGY, PSYCHOACOUSTICS AND COGNITION IN NORMAL AND IMPAIRED HEARING, 2016, 894 : 73 - 81
  • [9] A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement
    Zhang, Xiao-Qi
    Du, Jun
    Chai, Li
    Lee, Chin-Hui
    INTERSPEECH 2021, 2021, : 2701 - 2705
  • [10] Channel coherence in the low-SNR regime
    Zheng, Lizhong
    Tse, David N. C.
    Medard, Muriel
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2007, 53 (03) : 976 - 997