Frequency-aware feature aggregation network with dual-task consistency for RGB-T salient object detection

Cited by: 9
Authors
Zhou, Heng [1 ,2 ]
Tian, Chunna [1 ]
Zhang, Zhenxi [1 ]
Li, Chengyang [2 ,3 ]
Xie, Yongqiang [2 ]
Li, Zhongbo [2 ]
Affiliations
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] AMS, Inst Syst Engn, Beijing 100141, Peoples R China
[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
RGB-thermal; Salient object detection; Frequency feature aggregation; Dual-task consistency;
DOI
10.1016/j.patcog.2023.110043
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
RGB-Thermal (RGB-T) salient object detection (SOD) aims to merge two spectral images to segment visually appealing objects. Current methods primarily extract salient object information from the pixel perspective. However, biological and psychological research indicates that the human visual system (HVS) is notably sensitive to frequency. The high-frequency (HF) and low-frequency (LF) information in images is processed by different neural channels, a fact that has been overlooked in SOD. In this study, we argue that the objective of RGB-T SOD is not only to enhance pixel-level feature representation but also to emulate human visual mechanisms. To the best of our knowledge, this is the first work to explore RGB-T SOD from the frequency perspective. Specifically, we first present a frequency-aware multi-spectral feature aggregation module (FMFA) that exploits the separability and complementarity of frequency-aware features, generating and making full use of LF and HF cues. FMFA improves the RGB-T feature representation from the frequency perspective and provides stronger frequency cues for boundary auxiliary tasks. Then, we develop an HF-guided signed distance map prediction module (HF-SDM) with dual-task consistency to alleviate the coarse masks caused by blurred boundaries. HF-SDM exploits the geometric relationship of objects, which boosts the interaction between salient regions and boundaries. As a result, the model obtains sharper boundaries for salient objects. Finally, we propose a frequency-aware feature aggregation network (FFANet) that incorporates dual-task learning. Extensive experiments on RGB-T SOD datasets demonstrate that the proposed method outperforms other state-of-the-art methods. Ablation studies and visualizations further verify the effectiveness and interpretability of our method.
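The abstract mentions a signed distance map that ties salient regions to their boundaries. The following is a minimal sketch of that general idea only, not the paper's HF-SDM module: it computes a brute-force signed distance map from a binary mask and checks that thresholding the map at zero recovers the mask, which is the kind of region/boundary agreement a dual-task consistency term could enforce. The function names, the toy mask, and the sign convention (negative inside the object, positive outside) are assumptions made for illustration.

```python
import math


def signed_distance_map(mask):
    """Brute-force signed distance map of a binary mask (list of 0/1 rows).

    Assumed convention: negative inside the object, positive outside.
    The magnitude is the Euclidean distance to the nearest pixel of the
    opposite class.
    """
    h, w = len(mask), len(mask[0])
    fg = [(y, x) for y in range(h) for x in range(w) if mask[y][x]]
    bg = [(y, x) for y in range(h) for x in range(w) if not mask[y][x]]
    sdm = [[0.0] * w for _ in range(h)]
    if not fg or not bg:
        # Degenerate case: no boundary exists, so return an all-zero map.
        return sdm
    for y in range(h):
        for x in range(w):
            others = bg if mask[y][x] else fg
            d = min(math.hypot(y - oy, x - ox) for oy, ox in others)
            sdm[y][x] = -d if mask[y][x] else d
    return sdm


def mask_from_sdm(sdm):
    """Recover a binary mask by thresholding the signed distance at zero."""
    return [[1 if v < 0 else 0 for v in row] for row in sdm]


# Hypothetical 5x5 toy mask with a 3x3 salient square in the middle.
mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
sdm = signed_distance_map(mask)
recovered = mask_from_sdm(sdm)
```

In a learned setting, the predicted map and the predicted mask would come from separate heads, and a penalty on their disagreement would play the role that the equality `recovered == mask` plays here.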
Pages: 14
Related papers
50 in total
  • [1] Feature aggregation with transformer for RGB-T salient object detection
    Zhang, Ping
    Xu, Mengnan
    Zhang, Ziyan
    Gao, Pan
    Zhang, Jing
    [J]. NEUROCOMPUTING, 2023, 546
  • [2] Interactive context-aware network for RGB-T salient object detection
    Wang, Yuxuan
    Dong, Feng
    Zhu, Jinchao
    Chen, Jianren
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 72153 - 72174
  • [3] FEATURE ENHANCEMENT AND FUSION FOR RGB-T SALIENT OBJECT DETECTION
    Sun, Fengming
    Zhang, Kang
    Yuan, Xia
    Zhao, Chunxia
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1300 - 1304
  • [4] Revisiting Feature Fusion for RGB-T Salient Object Detection
    Zhang, Qiang
    Xiao, Tonglin
    Huang, Nianchang
    Zhang, Dingwen
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1804 - 1818
  • [5] Edge-guided feature fusion network for RGB-T salient object detection
    Chen, Yuanlin
    Sun, Zengbao
    Yan, Cheng
    Zhao, Ming
    [J]. Frontiers in Neurorobotics, 2024, 18
  • [6] ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection
    Zhou, Wujie
    Guo, Qinling
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1224 - 1235
  • [7] EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection
    Haiyang He
    Jing Wang
    Xiaolin Li
    Minglin Hong
    Shiguo Huang
    Tao Zhou
    [J]. Machine Vision and Applications, 2022, 33
  • [8] Feature differences reduction and specific features preserving network for RGB-T salient object detection
    Xu, Qiqi
    Di, Zhenguang
    Dong, Haoyu
    Yang, Gang
    [J]. Image and Vision Computing, 2024, 152
  • [9] PSNet: Parallel symmetric network for RGB-T salient object detection
    Bi, Hongbo
    Wu, Ranwan
    Liu, Ziqi
    Zhang, Jiayuan
    Zhang, Cong
    Xiang, Tian-Zhu
    Wang, Xiufang
    [J]. NEUROCOMPUTING, 2022, 511 : 410 - 425
  • [10] Modal complementary fusion network for RGB-T salient object detection
    Ma, Shuai
    Song, Kechen
    Dong, Hongwen
    Tian, Hongkun
    Yan, Yunhui
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 9038 - 9055