Two-Branch Network with Selective Kernel Convolution for Time-Domain Speech Enhancement

被引:0
|
作者
Li, Hui [1 ]
Huang, Zhihua [1 ]
Guo, Chuangjian [1 ]
机构
[1] Xinjiang Univ, Xinjiang Key Lab Signal Detect & Proc, Sch Informat Sci & Engn, Urumqi, Peoples R China
关键词
speech enhancement; U-Net; two-branch; selective kernel convolution;
D O I
10.1109/ISCSLP57327.2022.10038143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech enhancement methods based on deep neural networks usually take noisy speech as input and clean speech as output. However, there is still a correlation between predicted speech and removed noise, which is not fully utilized. And the common convolutional codec has mostly the same receptive field at each layer, which may be difficult to represent the multi-scale features of speech. In this paper, we propose a two-branch interactive time-domain speech enhancement model combining selective kernel units (TBSK-Net). We introduce Selective Kernel Convolution into each layer of the encoder to adaptively encode the information of noisy speech at multiple scales. The stacked selective kernel units are designed to construct two branches to learn speech and noise features where insert exchange information modules. The results show that the proposed model has obvious advantages in dealing with unknown noise than the baseline and other references. Introducing the Selective Kernel mechanism can effectively improve the enhancement performance. Ablation experiments are also performed to demonstrate the effectiveness of different components in the structure.
引用
收藏
页码:478 / 482
页数:5
相关论文
共 50 条
  • [1] Hybrid Dilated and Recursive Recurrent Convolution Network for Time-Domain Speech Enhancement
    Song, Zhendong
    Ma, Yupeng
    Tan, Fang
    Feng, Xiaoyi
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [2] A Two-Branch Convolution Residual Network for Image Compressive Sensing
    Gan, Chenquan
    Yan, Xiaoqin
    Wu, Yunfeng
    Zhang, Zufan
    [J]. IEEE ACCESS, 2020, 8 : 1705 - 1714
  • [3] Two-branch 3D convolution neural network for gait recognition
    Hui Huang
    Yuanyu Zhang
    Yuhang Si
    Jin Wang
    Dongzhi He
    [J]. Signal, Image and Video Processing, 2023, 17 : 3495 - 3504
  • [4] TBENet:A two-branch boundary enhancement Network for cerebrovascular segmentation
    Yan, Songlin
    Xu, Weijing
    Liu, Wentao
    Yang, Huihua
    Wang, Lemeng
    Deng, Yiming
    Gao, Feng
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [5] Two-branch 3D convolution neural network for gait recognition
    Huang, Hui
    Zhang, Yuanyu
    Si, Yuhang
    Wang, Jin
    He, Dongzhi
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3495 - 3504
  • [6] Generalizing Face Forgery Detection by Suppressed Texture Network With Two-Branch Convolution
    Zhang, Dengyong
    Li, Daijie
    Sangaiah, Arun Kumar
    Li, Feng
    Deng, Zelin
    Wu, Chengcheng
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [7] Visually Assisted Time-Domain Speech Enhancement
    Ideli, Elham
    Sharpe, Bruce
    Bajic, Ivan, V
    Vaughan, Rodney G.
    [J]. 2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [8] Two-Stage Learning and Fusion Network With Noise Aware for Time-Domain Monaural Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1754 - 1758
  • [9] A Time-domain Monaural Speech Enhancement with Feedback Learning
    Li, Andong
    Zheng, Chengshi
    Cheng, Linjuan
    Peng, Renhua
    Li, Xiaodong
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 769 - 774
  • [10] Two-Branch Attention Adversarial Domain Adaptation Network for Hyperspectral Image Classification
    Huang, Yi
    Peng, Jiangtao
    Sun, Weiwei
    Chen, Na
    Du, Qian
    Ning, Yujie
    Su, Han
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60