On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement

被引:0
|
作者
Mars, Rohith [1 ]
Das, Rohan Kumar [1 ]
机构
[1] Fortemedia Singapore, Singapore, Singapore
关键词
speech enhancement; deep neural networks; absolute threshold of hearing; NOISE; SUPPRESSION;
D O I
10.1109/ISCSLP57327.2022.10038050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the use of a perceptually motivated loss function for training single-channel full-band speech enhancement models. Specifically, we modify the conventional squared error loss function by incorporating the use of a frequency-importance based weighting scheme utilizing absolute threshold of hearing (ATH). We placed more emphasis on the perceptually relevant frequency bins of the speech spectrogram by applying larger weights to train the speech enhancement model targeting for a higher perceptual quality. We compare the models trained using both the conventional loss and the loss utilizing the proposed ATH-based weighting scheme on the VCTK and 4th DNS challenge datasets. The results demonstrate that the proposed loss using ATH-based weighting scheme achieves better performance than the conventional loss in terms of multiple objective speech quality metrics.
引用
收藏
页码:458 / 462
页数:5
相关论文
共 50 条
  • [21] Objective Quality Assessment of Echo-Impaired Full-Band Speech Signals
    Avila, Flavio R.
    Nunes, Leonardo O.
    Biscainho, Luiz W. P.
    Tygel, Alan F.
    Lee, Bowon
    [J]. 2014 INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM (ITS), 2014,
  • [22] DETECTING ADHD FROM SPEECH USING FULL-BAND AND SUB-BAND CONVOLUTION FUSION NETWORK
    Li, Shuanglin
    Nair, Rajesh
    Naqvi, Syed Mohsen
    [J]. 2023 IEEE SENSORS, 2023,
  • [23] Research on Full-band Arc Simulation Based on Logic Judgment
    Liang, Yifeng
    Shen, Yang
    Zhang, Donghui
    Zhu, Yong
    Chen, Fan
    Wang, Chuansheng
    [J]. PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, ELECTRONICS AND ELECTRICAL ENGINEERING (AUTEEE), 2018, : 49 - 53
  • [24] Auditory enhancement at the absolute threshold of hearing and its relationship to the Zwicker tone
    Wiegrebe, L
    Kossl, M
    Schmidt, S
    [J]. HEARING RESEARCH, 1996, 100 (1-2) : 171 - 180
  • [25] MULTI-CHANNEL NARROW-BAND DEEP SPEECH SEPARATION WITH FULL-BAND PERMUTATION INVARIANT TRAINING
    Quan, Changsheng
    Li, Xiaofei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 541 - 545
  • [26] Sub-band adaptive speech enhancement for hearing aids
    Campbell, DR
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 180 - 183
  • [27] TS-CGANet: A Two-Stage Complex and Real Dual-Path Sub-Band Fusion Network for Full-Band Speech Enhancement
    Chen, Haozhe
    Zhang, Xiaojuan
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [28] An Investigation of Implantable Capacitive Coupling Intra-body Power Transfer based on Full-band Loss Compensation
    Han, Cheng
    Yu, Shan
    Zhang, Zhiwei
    Mao, Jingna
    [J]. IEEE TRANSACTIONS ON POWER ELECTRONICS, 2024, 39 (07) : 8904 - 8915
  • [29] Speech Enhancement for Listeners With Hearing Loss Based on a Model for Vowel Coding in the Auditory Midbrain
    Rao, Akshay
    Carney, Laurel H.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2014, 61 (07) : 2081 - 2091
  • [30] OBJECTIVE COMPARISON OF SPEECH ENHANCEMENT ALGORITHMS WITH HEARING LOSS SIMULATION
    Zhang, Zhuohuang
    Shen, Yi
    Williamson, Donald S.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6845 - 6849