Auditory filterbank denoising neural network for speech enhancement in wearable auditory device

被引:0
|
作者
Kim, Seon Man [1 ]
机构
[1] Korea Photon Technol Inst, Spatial Opt Informat Res Ctr, Gwangju, South Korea
基金
新加坡国家研究基金会;
关键词
acoustic devices; acoustic signal processing; hearing aids; signal denoising; speech enhancement;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, a speech enhancing neural network (NN) is proposed, which is designed for monaural auditory devices, specifically designed for use in hearing aids. Herein, a 32-channel auditory filterbank (FB) is first implemented with an algorithm processing delay of 8 ms, which is tailored to meet the requirements of auditory devices. The proposed method primarily aims to integrate a denoising NN within the analysis phase of a uniform polyphase discrete Fourier transform (DFT) FB, aimed at enhancing speech within each band. For the denoising model, complex-valued convolutional NNs have been applied, specifically targeting the restoration of speech phase information based on the spectral components of the DFT. A multi-loss method is introduced, which is designed to further account for the loss of analysed speech signals within the split bands during the training process, leveraging the DFT FB strategy. To evaluate the efficacy of the proposed method, objective assessments of speech intelligibility and quality scores are conducted under various noise conditions. The results demonstrate that the proposed method can outperform the existing method across all types of noise. The proposed auditory filterbank denoising neural network aims at enhancing speech within each band by integrating a denoising neural network within the analysis phase of a uniform polyphase discrete Fourier transform filterbank for auditory devices such as hearing aids. All components of the proposed architecture, that is, analysis filterbank, synthesis filterbank and speech denoising model, are integrated into a single neural network architecture, and used for inference and training. image
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Speech denoising based on an auditory filterbank
    Lin, L
    Ambikairajah, E
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 552 - 555
  • [2] Auditory-based wavelet packet filterbank for speech recognition using neural network
    Gandhiraj, R.
    Sathidevi, P. S.
    [J]. ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 666 - +
  • [3] Improved Speech Enhancement Method Based on Auditory Filterbank and Fast Noise Estimation
    Kianyfar, Ali
    Abutalebi, Hamid Reza
    [J]. 2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 441 - 445
  • [4] Speech Denoising with Auditory Models
    Saddler, Mark R.
    Francl, Andrew
    Feather, Jenelle
    Qian, Kaizhi
    Zhang, Yang
    McDermott, Josh H.
    [J]. INTERSPEECH 2021, 2021, : 2681 - 2685
  • [5] Plastic multi-resolution auditory model based neural network for speech enhancement
    Lai, Chen-Yen
    Lo, Yu-Wen
    Shen, Yih-Liang
    Chi, Tai-Shih
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 605 - 609
  • [6] Speech enhancement system based on auditory system and time-delay neural network
    Choi, Jae-Seung
    Park, Seung-Jin
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 2, 2007, 4432 : 153 - +
  • [7] Auditory simulation for speech enhancement
    Lu, Shengli
    Shi, Longxing
    Yu, Chongzhi
    Wei, Rongjue
    [J]. Shengxue Xuebao/Acta Acustica, 1996, 21 (06): : 879 - 883
  • [8] Neural network-based adaptive noise cancellation for enhancement of speech auditory brainstem responses
    Shiva Gholami-Boroujeny
    Anwar Fallatah
    Brian P. Heffernan
    Hilmi R. Dajani
    [J]. Signal, Image and Video Processing, 2016, 10 : 389 - 395
  • [9] An auditory-based adaptive speech enhancement system by neural network according to noise intensity
    Choi, J
    Okamoto, J
    Nakajima, S
    Suzuki, Y
    Hosokawa, S
    [J]. 42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 993 - 996
  • [10] Neural network-based adaptive noise cancellation for enhancement of speech auditory brainstem responses
    Gholami-Boroujeny, Shiva
    Fallatah, Anwar
    Heffernan, Brian P.
    Dajani, Hilmi R.
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (02) : 389 - 395