Multi-stage music separation network with dual-branch attention and hybrid convolution

Cited by: 1
Authors
Chen, Yadong [1, 2]
Hu, Ying [1, 2]
He, Liang [1, 3]
Huang, Hao [1, 4]
Affiliations
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Key Lab Signal Detect & Proc, Urumqi 830046, Xinjiang, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[4] Key Lab Multilingual Informat Technol, Urumqi 830046, Xinjiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Vocal separation; Multi-stage separation network; Dual-Branch attention; Hybrid convolution;
DOI
10.1007/s10844-022-00711-x
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a lightweight multi-stage network for monaural vocal and accompaniment separation. We design a dual-branch attention (DBA) module whose two branches capture, respectively, the correlation between each pair of positions and the correlation among the channels of the feature maps. A square CNN (i.e. one whose filter size is k × k) shares its weights across every square region of the feature maps, which limits its feature-extraction ability. To address this, we propose a hybrid convolution (HC) block, based on a hybrid convolutional mechanism, that replaces the square CNN and captures the dependencies along the time dimension and along the frequency dimension, respectively. Ablation experiments demonstrate that the DBA module and the HC block both help improve separation performance. Experimental results show that the proposed network achieves outstanding performance on the MIR-1K dataset with fewer parameters, and performance competitive with state-of-the-art methods on the DSD100 and MUSDB18 datasets.
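The contrast the abstract draws between a square k × k filter and filters aligned with the time and frequency axes can be illustrated with a small sketch. This is not the paper's HC block: the abstract does not specify its layout, so the function names, the single-channel setting, the zero padding, and the element-wise sum used to merge the two directional responses are all assumptions made for illustration.

```python
# Hypothetical sketch of direction-wise filtering on a spectrogram.
# All names and the merge-by-sum step are assumptions, not the paper's HC block.

def conv_along_time(spec, kernel):
    """Slide a 1 x k filter along the time axis (columns), zero-padded, odd k."""
    k = len(kernel)
    pad = k // 2
    out = []
    for row in spec:  # one row per frequency bin
        padded = [0.0] * pad + list(row) + [0.0] * pad
        out.append([
            sum(kernel[j] * padded[t + j] for j in range(k))
            for t in range(len(row))
        ])
    return out


def transpose(matrix):
    """Swap the frequency and time axes of a nested-list matrix."""
    return [list(col) for col in zip(*matrix)]


def conv_along_freq(spec, kernel):
    """Slide a k x 1 filter along the frequency axis (rows)."""
    return transpose(conv_along_time(transpose(spec), kernel))


def hybrid_conv(spec, time_kernel, freq_kernel):
    """Sum the time-wise and frequency-wise responses (assumed merge rule)."""
    t_out = conv_along_time(spec, time_kernel)
    f_out = conv_along_freq(spec, freq_kernel)
    return [[a + b for a, b in zip(rt, rf)] for rt, rf in zip(t_out, f_out)]


def weight_counts(k):
    """Weights per single-channel filter, ignoring bias terms."""
    return {"square": k * k, "hybrid": 2 * k}


# Toy magnitude spectrogram: 3 frequency bins x 4 time frames.
spec = [
    [1.0, 2.0, 3.0, 4.0],
    [0.0, 1.0, 0.0, 1.0],
    [2.0, 2.0, 2.0, 2.0],
]
ones3 = [1.0, 1.0, 1.0]
result = hybrid_conv(spec, ones3, ones3)
counts = weight_counts(3)
```

Under these assumptions, a 1 × k filter paired with a k × 1 filter uses 2k weights where a square filter uses k², which is one way such a design could stay lightweight. Whether the published block merges its branches by summation, concatenation, or some other rule is not stated in the abstract.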
Pages: 635 - 656
Page count: 22
Related papers
  • [1] Multi-stage music separation network with dual-branch attention and hybrid convolution
    Yadong Chen
    Ying Hu
    Liang He
    Hao Huang
    Journal of Intelligent Information Systems, 2022, 59 : 635 - 656
  • [2] Multi-stage Image Fusion Method Based on Differential Dual-Branch Encoder
    Hong, Yulu
    Wu, Xiaojun
    Xu, Tianyang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (07): 661 - 670
  • [3] A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation
    Zhang, Chong
    Wang, Lingtong
    Wei, Guohui
    Kong, Zhiyong
    Qiu, Min
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [4] Dual-branch multi-information aggregation network with transformer and convolution for polyp segmentation
    Zhang, Wenyu
    Lu, Fuxiang
    Su, Hongjing
    Hu, Yawen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [5] A Medical Image Segmentation Network with Multi-Scale and Dual-Branch Attention
    Zhu, Cancan
    Cheng, Ke
    Hua, Xuecheng
    APPLIED SCIENCES-BASEL, 2024, 14 (14)
  • [6] Transmission Line State Recognition Method Based on Dual-Branch Convolution Neural Network Structure and Multi-Attention Mechanism
    Shang, Qiufeng
    Fan, Xiaokai
    Gu, Yuanyu
    Wang, Jianjian
    Yao, Guozhen
    ACTA OPTICA SINICA, 2024, 44 (22)
  • [7] Residual cosine similar attention and bidirectional convolution in dual-branch network for skin lesion image classification
    Li, Aolun
    Zhang, Dezhi
    Yu, Long
    Kang, Xiaojing
    Tian, Shengwei
    Wu, Weidong
    You, Hongfeng
    Huo, Xiangzuo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [8] Dual-Branch Convolution Network With Efficient Channel Attention for EEG-Based Motor Imagery Classification
    Zhou, Kai
    Haimudula, Aierken
    Tang, Wanying
    IEEE ACCESS, 2024, 12 : 74930 - 74943
  • [9] Dual-Branch Deep Convolution Neural Network for Polarimetric SAR Image Classification
    Gao, Fei
    Huang, Teng
    Wang, Jun
    Sun, Jinping
    Hussain, Amir
    Yang, Erfu
    APPLIED SCIENCES-BASEL, 2017, 7 (05)
  • [10] Context Dual-Branch Attention Network for Depth Completion of Transparent Object
    Hu, Yutao
    Wang, Zheng
    Chen, Jiacheng
    Wang, Wanliang
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT IV, 2022, 13458 : 604 - 614