Auditory filterbank denoising neural network for speech enhancement in wearable auditory device

被引：0

作者：

Kim, Seon Man ^{[1
]}

机构：

[1] Korea Photon Technol Inst, Spatial Opt Informat Res Ctr, Gwangju, South Korea

来源：

ELECTRONICS LETTERS | 2024年 / 60卷 / 10期

基金：

新加坡国家研究基金会;

关键词：

acoustic devices; acoustic signal processing; hearing aids; signal denoising; speech enhancement;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this study, a speech enhancing neural network (NN) is proposed, which is designed for monaural auditory devices, specifically designed for use in hearing aids. Herein, a 32-channel auditory filterbank (FB) is first implemented with an algorithm processing delay of 8 ms, which is tailored to meet the requirements of auditory devices. The proposed method primarily aims to integrate a denoising NN within the analysis phase of a uniform polyphase discrete Fourier transform (DFT) FB, aimed at enhancing speech within each band. For the denoising model, complex-valued convolutional NNs have been applied, specifically targeting the restoration of speech phase information based on the spectral components of the DFT. A multi-loss method is introduced, which is designed to further account for the loss of analysed speech signals within the split bands during the training process, leveraging the DFT FB strategy. To evaluate the efficacy of the proposed method, objective assessments of speech intelligibility and quality scores are conducted under various noise conditions. The results demonstrate that the proposed method can outperform the existing method across all types of noise. The proposed auditory filterbank denoising neural network aims at enhancing speech within each band by integrating a denoising neural network within the analysis phase of a uniform polyphase discrete Fourier transform filterbank for auditory devices such as hearing aids. All components of the proposed architecture, that is, analysis filterbank, synthesis filterbank and speech denoising model, are integrated into a single neural network architecture, and used for inference and training. image

引用

页数：3

共 50 条

[1] Speech denoising based on an auditory filterbank
Lin, L
Ambikairajah, E
[J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 552 - 555
[2] Auditory-based wavelet packet filterbank for speech recognition using neural network
Gandhiraj, R.
Sathidevi, P. S.
[J]. ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 666 - +
[3] Improved Speech Enhancement Method Based on Auditory Filterbank and Fast Noise Estimation
Kianyfar, Ali
Abutalebi, Hamid Reza
[J]. 2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 441 - 445
[4] Speech Denoising with Auditory Models
Saddler, Mark R.
Francl, Andrew
Feather, Jenelle
Qian, Kaizhi
Zhang, Yang
McDermott, Josh H.
[J]. INTERSPEECH 2021, 2021, : 2681 - 2685
[5] Plastic multi-resolution auditory model based neural network for speech enhancement
Lai, Chen-Yen
Lo, Yu-Wen
Shen, Yih-Liang
Chi, Tai-Shih
[J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 605 - 609
[6] Speech enhancement system based on auditory system and time-delay neural network
Choi, Jae-Seung
Park, Seung-Jin
[J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 2, 2007, 4432 : 153 - +
[7] Auditory simulation for speech enhancement
Lu, Shengli
Shi, Longxing
Yu, Chongzhi
Wei, Rongjue
[J]. Shengxue Xuebao/Acta Acustica, 1996, 21 (06): : 879 - 883
[8] Neural network-based adaptive noise cancellation for enhancement of speech auditory brainstem responses
Shiva Gholami-Boroujeny
Anwar Fallatah
Brian P. Heffernan
Hilmi R. Dajani
[J]. Signal, Image and Video Processing, 2016, 10 : 389 - 395
[9] An auditory-based adaptive speech enhancement system by neural network according to noise intensity
Choi, J
Okamoto, J
Nakajima, S
Suzuki, Y
Hosokawa, S
[J]. 42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 993 - 996
[10] Neural network-based adaptive noise cancellation for enhancement of speech auditory brainstem responses
Gholami-Boroujeny, Shiva
Fallatah, Anwar
Heffernan, Brian P.
Dajani, Hilmi R.
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (02) : 389 - 395

← 1 2 3 4 5 →