Multi-stage music separation network with dual-branch attention and hybrid convolution

被引:1
|
作者
Chen, Yadong [1 ,2 ]
Hu, Ying [1 ,2 ]
He, Liang [1 ,3 ]
Huang, Hao [1 ,4 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Key Lab Signal Detect & Proc, Urumqi 830046, Xinjiang, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[4] Key Lab Multilingual Informat Technol, Urumqi 830046, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Vocal separation; Multi-stage separation network; Dual-Branch attention; Hybrid convolution;
D O I
10.1007/s10844-022-00711-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a lightweight multi-stage network for monaural vocal and accompaniment separation. We design a dual-branch attention (DBA) module to obtain the correlation of each position pair and that among the channels of feature maps, respectively. The square CNN (i.e. the size of the filter is kx k) shares the weights of each of the square areas in feature maps that which makes its ability of feature extraction limited. In order to address it, we propose a hybrid convolution (HC) block based on hybrid convolutional mechanism instead of square CNN to capture the dependencies along with the time dimension and the frequency dimension respectively. The ablation experiments demonstrate that the DBA module and HC block can assist in improving the separation performance. Experimental results show that our proposed network achieves outstanding performance on the MIR-1K dataset only with fewer parameters, and competitive performance compared with state-of-the-arts on DSD100 and MUSDB18 datasets.
引用
收藏
页码:635 / 656
页数:22
相关论文
共 50 条
  • [31] Dual-Branch Multi-Level Feature Aggregation Network for Pansharpening
    Cheng, Gui
    Shao, Zhenfeng
    Wang, Jiaming
    Huang, Xiao
    Dang, Chaoya
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (11) : 2023 - 2026
  • [32] Joint Segmentation and Quantification of Main Coronary Vessels Using Dual-Branch Multi-scale Attention Network
    Zhang, Hongwei
    Zhang, Dong
    Gao, Zhifan
    Zhang, Heye
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 369 - 378
  • [33] Dual-Branch Multi-Level Feature Aggregation Network for Pansharpening
    Gui Cheng
    Zhenfeng Shao
    Jiaming Wang
    Xiao Huang
    Chaoya Dang
    IEEE/CAAJournalofAutomaticaSinica, 2022, 9 (11) : 2023 - 2026
  • [34] MHAN: Multi-Stage Hybrid Attention Network for MRI reconstruction and super-resolution
    Wang, Wanliang
    Shen, Haoxin
    Chen, Jiacheng
    Xing, Fangsen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [35] Multi-stage attention network for monaural speech enhancement
    Wang, Kunpeng
    Lu, Wenjing
    Liu, Peng
    Yao, Juan
    Li, Huafeng
    IET SIGNAL PROCESSING, 2023, 17 (03)
  • [36] Dual-Branch Structured De-Striping Convolution Network Using Parametric Noise Model
    Lee, Jongho
    Ro, Yong Man
    IEEE ACCESS, 2020, 8 (08): : 155519 - 155528
  • [37] DBDAN: Dual-Branch Dynamic Attention Network for Semantic Segmentation of Remote Sensing Images
    Che, Rui
    Ma, Xiaowen
    Hong, Tingfeng
    Wang, Xinyu
    Feng, Tian
    Zhang, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 306 - 317
  • [38] A Nonintrusive Load Identification Method Based on Dual-Branch Attention GRU Fusion Network
    Yuan, Jie
    Jin, Ran
    Wang, Lidong
    Wang, Ting
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [39] A dual-branch network for ultrasound image segmentation
    Zhu, Zhiqin
    Zhang, Zimeng
    Qi, Guanqiu
    Li, Yuanyuan
    Li, Yuzhen
    Mu, Lan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 103
  • [40] A New Dual-Branch Embedded Multivariate Attention Network for Hyperspectral Remote Sensing Classification
    Chen, Yuyi
    Wang, Xiaopeng
    Zhang, Jiahua
    Shang, Xiaodi
    Hu, Yabin
    Zhang, Shichao
    Wang, Jiajie
    REMOTE SENSING, 2024, 16 (11)