Binaural Deep Neural Network for Robust Speech Enhancement

Cited by: 0
Authors
Jiang, Yi [1 ]
Liu, Runsheng [1 ]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Keywords
Deep neural network; computational auditory scene analysis (CASA); speech enhancement; binaural features; LOCALIZATION; ALGORITHM; NOISE;
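DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract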
Robust speech enhancement is a challenging task, especially in noisy environments. Deep neural networks have shown good performance on binaural speech enhancement with various speakers at the same distance. Because binaural cues depend on the locations of the sound sources, this paper analyzes the performance of a binaural deep neural network at different distances. The theoretical derivation and the experiments show that the binaural deep neural network speech enhancement system, which is based on computational auditory scene analysis (CASA), performs robustly across various sound source locations.
Pages: 692-695
Number of pages: 4