On phase recovery and preserving early reflections for deep-learning speech dereverberation

被引:1
|
作者
Luo, Xiaoxue [1 ,2 ]
Ke, Yuxuan [1 ,2 ]
Li, Xiaodong [1 ,2 ]
Zheng, Chengshi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
SEPARATION; NETWORKS; INTELLIGIBILITY; REVERBERATION; ALGORITHM; MASKING;
D O I
10.1121/10.0024348
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In indoor environments, reverberation often distorts clean speech. Although deep learning-based speech dereverberation approaches have shown much better performance than traditional ones, the inferior speech quality of the dereverberated speech caused by magnitude distortion and limited phase recovery is still a serious problem for practical applications. This paper improves the performance of deep learning-based speech dereverberation from the perspectives of both network design and mapping target optimization. Specifically, on the one hand, a bifurcated-and-fusion network and its guidance loss functions were designed to help reduce the magnitude distortion while enhancing the phase recovery. On the other hand, the time boundary between the early and late reflections in the mapped speech was investigated, so as to make a balance between the reverberation tailing effect and the difficulty of magnitude/phase recovery. Mathematical derivations were provided to show the rationality of the specially designed loss functions. Geometric illustrations were given to explain the importance of preserving early reflections in reducing the difficulty of phase recovery. Ablation study results confirmed the validity of the proposed network topology and the importance of preserving 20 ms early reflections in the mapped speech. Objective and subjective test results showed that the proposed system outperformed other baselines in the speech dereverberation task.
引用
收藏
页码:436 / 451
页数:16
相关论文
共 50 条
  • [31] Multi-channel photonic sampled ADC with hybrid deep-learning for distortion recovery
    Zhang, Tianhang
    Hu, Shanshan
    Zhang, Lijuan
    Yang, Changqi
    OPTICS COMMUNICATIONS, 2025, 574
  • [32] Evaluation of a deep-learning segmentation software in thoracic organs at risk: an early analysis
    Boschetti, A.
    Votta, C.
    Re, A.
    Piro, D.
    Marras, M.
    D'Aviero, A.
    Catucci, F.
    Cusumano, D.
    Di Dio, C.
    Menna, S.
    Iezzi, M.
    Quaranta, F. V.
    Flore, C.
    Sanna, E. G.
    Piccari, D.
    Mattiucci, G. C.
    Valentini, V.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S299 - S300
  • [33] A speech denoising demonstration system using multi-model deep-learning neural networks
    Lu, Ching-Ta
    Shen, Jun-Hong
    Castiglione, Aniello
    Chung, Cheng-Han
    Lu, Yen-Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
  • [34] A Smart Binaural Hearing Aid Architecture Leveraging a Smartphone APP With Deep-Learning Speech Enhancement
    Li, Yingdan
    Chen, Fei
    Sun, Zhuoyi
    Ji, Junyu
    Jia, Wen
    Wang, Zhihua
    IEEE ACCESS, 2020, 8 : 56798 - 56810
  • [35] Effects of base-frequency and spectral envelope on deep-learning speech separation and recognition models
    Hui, J.
    Wei, Y.
    Chen, S. T.
    So, R. H. Y.
    INTERSPEECH 2019, 2019, : 634 - 638
  • [36] Deep-learning based flat-fielding quantitative phase contrast microscopy
    Wang, Wenjian
    Zhuo, Kequn
    Liu, Xin
    Feng, Wenjing
    Xiong, Zihan
    Liu, Ruihua
    Ali, Nauman
    Ma, Ying
    Zheng, Juanjuan
    An, Sha
    Gao, Peng
    OPTICS EXPRESS, 2024, 32 (07) : 12462 - 12475
  • [37] Development of a deep-learning model for detecting positive tubules during sperm recovery for nonobstructive azoospermia
    Takeshima, Teppei
    Karibe, Jurii
    Kuroda, Shinnosuke
    Yumura, Yasushi
    REPRODUCTION, 2024, 168 (04)
  • [38] Early event detection in a deep-learning driven quality prediction model for ultrasonic welding
    Wang, Baicun
    Li, Yang
    Luo, Ying
    Li, Xingyu
    Freiheit, Theodor
    JOURNAL OF MANUFACTURING SYSTEMS, 2021, 60 : 325 - 336
  • [39] Early Detection of Drought Stress in Plants Using Hyperspectral Imaging and Deep-Learning Method
    Kim, Hangi
    Areif, Muhammad Akbar Andi
    Kim, Taehyun
    Suh, Hyun-Kwon
    Cho, Byoung-Kwan
    JOURNAL OF THE KOREAN SOCIETY FOR NONDESTRUCTIVE TESTING, 2022, 42 (06) : 503 - 513
  • [40] COVID-19 Detection Systems Using Deep-Learning Algorithms Based on Speech and Image Data
    Nassif, Ali Bou
    Shahin, Ismail
    Bader, Mohamed
    Hassan, Abdelfatah
    Werghi, Naoufel
    MATHEMATICS, 2022, 10 (04)