SNR-Progressive Model With Harmonic Compensation for Low-SNR Speech Enhancement

被引：0

作者：

Hou, Zhongshu ^{[1
,2
]}

Lei, Tong ^{[1
,2
]}

Hu, Qinwen ^{[1
,2
]}

Cao, Zhanzhong ^{[3
]}

Lu, Jing ^{[1
,2
]}

机构：

[1] Nanjing Univ, Key Lab Modern Acoust, Nanjing 210008, Peoples R China

[2] Horizon Robot, NJU Horizon Intelligent Audio Lab, Beijing 100094, Peoples R China

[3] Nanjing Inst Informat Technol, Nanjing 210036, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2025年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Harmonic analysis; Signal to noise ratio; Speech enhancement; Power harmonic filters; Estimation; Spectrogram; Noise measurement; Filtering; Training; Artificial neural networks; Low-SNR speech enhancement; neural network; pitch estimation; SNR-progressive learning;

D O I：

10.1109/LSP.2024.3484288

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Despite significant progress made in the last decade, deep neural network (DNN) based speech enhancement (SE) still faces the challenge of notable degradation in the quality of recovered speech under low signal-to-noise ratio (SNR) conditions. In this letter, we propose an SNR-progressive speech enhancement model with harmonic compensation for low-SNR SE. Reliable pitch estimation is obtained from the intermediate output, which has the benefit of retaining more speech components than the coarse estimate while possessing a significantly higher SNR than the input noisy speech. An effective harmonic compensation mechanism is introduced for better harmonic recovery. Extensive experiments demonstrate the advantage of our proposed model.

引用

页码：476 / 480

页数：5

共 50 条

[1] Low-SNR Speech Enhancement in Driving Environment
Yang Hongxiao
Wei, Jie
Zhong, Xiaofeng
2016 16TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2016, : 151 - 155
[2] Low-SNR Speech Enhancement and Separation in Driving Environment
Wei, Jie
Li, Lingling
Zhong, Xiaofeng
2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
[3] CST: Complex Sparse Transformer for Low-SNR Speech Enhancement
Tan, Kaijun
Mao, Wenyu
Guo, Xiaozhou
Lu, Huaxiang
Zhang, Chi
Cao, Zhanzhong
Wang, Xingang
SENSORS, 2023, 23 (05)
[4] Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions
Breithaupt, Colin
Martin, Rainer
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 277 - 289
[5] A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement
Tu, Yan-Hui
Du, Jun
Gao, Tian
Lee, Chin-Hui
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1608 - 1619
[6] Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs
Boucheron, Laura E.
De Leon, Phillip L.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 574 - 577
[7] A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition
Zhou, Nan
Du, Jun
Tu, Yan-Hui
Gao, Tian
Lee, Chin-Hui
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 873 - 877
[8] Intelligibility for Binaural Speech with Discarded Low-SNR Speech Components
Schoenmaker, Esther
van de Par, Steven
PHYSIOLOGY, PSYCHOACOUSTICS AND COGNITION IN NORMAL AND IMPAIRED HEARING, 2016, 894 : 73 - 81
[9] A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement
Zhang, Xiao-Qi
Du, Jun
Chai, Li
Lee, Chin-Hui
INTERSPEECH 2021, 2021, : 2701 - 2705
[10] Channel coherence in the low-SNR regime
Zheng, Lizhong
Tse, David N. C.
Medard, Muriel
IEEE TRANSACTIONS ON INFORMATION THEORY, 2007, 53 (03) : 976 - 997

← 1 2 3 4 5 →