A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor

被引:11
|
作者
Zhou, Yi [1 ]
Chen, Yufan [1 ]
Ma, Yongbao [2 ]
Liu, Hongqing [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Suresense Technol, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
array signal processing; bone conduction; beamforming; speech enhancement; deep learning; real time; ARRAY POST-FILTER; NOISE ESTIMATION; RECOGNITION; SEPARATION; GSC;
D O I
10.3390/s20185050
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The quality and intelligibility of the speech are usually impaired by the interference of background noise when using internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a simple recurrent unit (SRU)-based neural network postfilter for real-time speech enhancement. Assisted by the BC sensor, which is insensitive to the environmental noise compared to the regular air-conduction (AC) microphone, the accurate voice activity detection (VAD) can be obtained from the BC signal and incorporated into the adaptive noise canceller (ANC) and adaptive block matrix (ABM). The SRU-based postfilter consists of a recurrent neural network with a small number of parameters, which improves the computational efficiency. The sub-band signal processing is designed to compress the input features of the neural network, and the scale-invariant signal-to-distortion ratio (SI-SDR) is developed as the loss function to minimize the distortion of the desired speech signal. Experimental results demonstrate that the proposed real-time speech enhancement system provides significant speech sound quality and intelligibility improvements for all noise types and levels when compared with the AC-only beamformer with a postfiltering algorithm.
引用
收藏
页码:1 / 17
页数:18
相关论文
共 50 条
  • [1] A Robust Dual-Microphone Generalized Sidelobe Canceller Using a Bone-Conduction Sensor for Speech Enhancement
    Zhou, Yi
    Wang, Haiping
    Chu, Yijing
    Liu, Hongqing
    [J]. SENSORS, 2021, 21 (05) : 1 - 16
  • [2] Deep Learning Based Real-Time Speech Enhancement for Dual-Microphone Mobile Phones
    Tan, Ke
    Zhang, Xueliang
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1853 - 1863
  • [3] Real-time dual-microphone speech enhancement using field programmable gate arrays
    Halupka, D
    Rabi, SA
    Aarabi, P
    Sheikholeslami, A
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 149 - 152
  • [4] Sound Localization and Speech Enhancement Algorithm Based on Dual-Microphone
    Tao, Tao
    Zheng, Hong
    Yang, Jianfeng
    Guo, Zhongyuan
    Zhang, Yiyang
    Ao, Jiahui
    Chen, Yuao
    Lin, Weiting
    Tan, Xiao
    [J]. SENSORS, 2022, 22 (03)
  • [5] A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
    Yousefian, Nima
    Loizou, Philipos C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 599 - 609
  • [6] Real-time DSP implementation of a subband beamforming algorithm for dual microphone speech enhancement
    Yermeche, Zohra
    Sallberg, Benny
    Grbic, Nedelko
    Claesson, Ingvar
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 353 - 356
  • [7] Phase-based dual-microphone robust speech enhancement
    Aarabi, P
    Shi, G
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (04): : 1763 - 1773
  • [8] FEATURE ENHANCEMENT FOR ROBUST SPEECH RECOGNITION ON SMARTPHONES WITH DUAL-MICROPHONE
    Lopez-Espejo, Ivan
    Gomez, Angel M.
    Gonzalez, Jose A.
    Peinado, Antonio M.
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 21 - 25
  • [9] A New Dual-Microphone Speech Enhancement Method for Oriented Noises
    Abutalebi, H. R.
    Pourahmadi, M.
    Aghabozorgi, M. R.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2150 - 2153
  • [10] REAL-TIME SPEECH ENHANCEMENT USING AN EFFICIENT CONVOLUTIONAL RECURRENT NETWORK FOR DUAL-MICROPHONE MOBILE PHONES IN CLOSE-TALK SCENARIOS
    Tan, Ke
    Zhang, Xueliang
    Wang, DeLiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5751 - 5755