Feature aggregation with transformer for RGB-T salient object detection

被引:6
|
作者
Zhang, Ping [1 ]
Xu, Mengnan [1 ]
Zhang, Ziyan [1 ]
Gao, Pan [1 ]
Zhang, Jing [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Salient object detection; Transformer; Dual stream network; Cross-modal interaction;
D O I
10.1016/j.neucom.2023.126329
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main purpose of RGB-T salient object detection (SOD) is to fully integrate and exploit the information from the complementary fusion of modalities to address the underperformance of RGB SOD in some challenging scenes. In this paper, we propose a novel feature aggregation network that can fully mine multi-scale and multi-modal information for complete and accurate RGB-T SOD. Subsequently, a cross-attention fusion module is proposed to adaptively integrate high-level features by using the attention mechanism in the Transformer. Then we design a simple yet effective fast feature aggregation module to fuse low-level features. Through the combined work of the above modules, our network can perform well in some complex scenes by effectively fusing features from RGB and thermal modalities. Finally, sev-eral experiments on publicly available datasets such as VT821, VT1000, and VT5000 demonstrate that our method outperforms state-of-the-art methods. And our code has been released at:https://github.com/ ELOESZHANG/FANet.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] FEATURE ENHANCEMENT AND FUSION FOR RGB-T SALIENT OBJECT DETECTION
    Sun, Fengming
    Zhang, Kang
    Yuan, Xia
    Zhao, Chunxia
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1300 - 1304
  • [2] Revisiting Feature Fusion for RGB-T Salient Object Detection
    Zhang, Qiang
    Xiao, Tonglin
    Huang, Nianchang
    Zhang, Dingwen
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1804 - 1818
  • [3] Adaptive interactive network for RGB-T salient object detection with double mapping transformer
    Dong, Feng
    Wang, Yuxuan
    Zhu, Jinchao
    Li, Yuehua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 59169 - 59193
  • [4] ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection
    Zhou, Wujie
    Guo, Qinling
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1224 - 1235
  • [5] Frequency-aware feature aggregation network with dual-task consistency for RGB-T salient object detection
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Li, Chengyang
    Xie, Yongqiang
    Li, Zhongbo
    [J]. PATTERN RECOGNITION, 2024, 146
  • [6] Enabling modality interactions for RGB-T salient object detection
    Zhang, Qiang
    Xi, Ruida
    Xiao, Tonglin
    Huang, Nianchang
    Luo, Yongjiang
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [7] Scribble-Supervised RGB-T Salient Object Detection
    Liu, Zhengyi
    Huang, Xiaoshen
    Zhang, Guanghui
    Fang, Xianyong
    Wang, Linbo
    Tang, Bin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2369 - 2374
  • [8] Saliency Prototype for RGB-D and RGB-T Salient Object Detection
    Zhang, Zihao
    Wang, Jie
    Han, Yahong
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3696 - 3705
  • [9] EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection
    Haiyang He
    Jing Wang
    Xiaolin Li
    Minglin Hong
    Shiguo Huang
    Tao Zhou
    [J]. Machine Vision and Applications, 2022, 33
  • [10] RGB-T salient object detection via CNN feature and result saliency map fusion
    Chang Xu
    Qingwu Li
    Mingyu Zhou
    Qingkai Zhou
    Yaqin Zhou
    Yunpeng Ma
    [J]. Applied Intelligence, 2022, 52 : 11343 - 11362