Position-Aware Relation Learning for RGB-Thermal Salient Object Detection

被引:13
|
作者
Zhou, Heng [1 ,2 ]
Tian, Chunna [3 ]
Zhang, Zhenxi [3 ]
Li, Chengyang [2 ,4 ]
Ding, Yuxuan [3 ]
Xie, Yongqiang [2 ]
Li, Zhongbo [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] AMS, Inst Syst Engn, Beijing 100141, Peoples R China
[3] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[4] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformers; Feature extraction; Decoding; Object detection; Merging; Task analysis; Level set; Salient object detection; RGB-thermal images; swin transformer; position-aware relation learning; REFINEMENT NETWORK; SEGMENTATION; ATTENTION; MODEL;
D O I
10.1109/TIP.2023.3270801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) is an important task in computer vision that aims to identify visually conspicuous regions in images. RGB-Thermal SOD combines two spectra to achieve better segmentation results. However, most existing methods for RGB-T SOD use boundary maps to learn sharp boundaries, which lead to sub-optimal performance as they ignore the interactions between isolated boundary pixels and other confident pixels. To address this issue, we propose a novel position-aware relation learning network (PRLNet) for RGB-T SOD. PRLNet explores the distance and direction relationships between pixels by designing an auxiliary task and optimizing the feature structure to strengthen intra-class compactness and inter-class separation. Our method consists of two main components: A signed distance map auxiliary module (SDMAM), and a feature refinement approach with direction field (FRDF). SDMAM improves the encoder feature representation by considering the distance relationship between foreground-background pixels and boundaries, which increases the inter-class separation between foreground and background features. FRDF rectifies the features of boundary neighborhoods by exploiting the features inside salient objects. It utilizes the direction relationship of object pixels to enhance the intra-class compactness of salient features. In addition, we constitute a transformer-based decoder to decode multispectral feature representation. Experimental results on three public RGB-T SOD datasets demonstrate that our proposed method not only outperforms the state-of-the-art methods, but also can be integrated with different backbone networks in a plug-and-play manner. Ablation study and visualizations further prove the validity and interpretability of our method.
引用
收藏
页码:2593 / 2607
页数:15
相关论文
共 50 条
  • [1] Mirror complementary transformer network for RGB-thermal salient object detection
    Jiang, Xiurong
    Hou, Yifan
    Tian, Hui
    Zhu, Lin
    [J]. IET COMPUTER VISION, 2024, 18 (01) : 15 - 32
  • [2] Multi-Interactive Dual-Decoder for RGB-Thermal Salient Object Detection
    Tu, Zhengzheng
    Li, Zhun
    Li, Chenglong
    Lang, Yang
    Tang, Jin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5678 - 5691
  • [3] Three-stream interaction decoder network for RGB-thermal salient object detection
    Huo, Fushuo
    Zhu, Xuegui
    Li, Bingheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [4] PaIaNet: position-aware and identification-aware network for low-light salient object detection
    Yue, Huihui
    Guo, Jichang
    Yin, Xiangjun
    Zhang, Yi
    Zheng, Sida
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (03) : 1137 - 1151
  • [5] PaIaNet: position-aware and identification-aware network for low-light salient object detection
    Huihui Yue
    Jichang Guo
    Xiangjun Yin
    Yi Zhang
    Sida Zheng
    [J]. International Journal of Machine Learning and Cybernetics, 2024, 15 : 1137 - 1151
  • [6] Cross-Collaborative Fusion-Encoder Network for Robust RGB-Thermal Salient Object Detection
    Liao, Guibiao
    Gao, Wei
    Li, Ge
    Wang, Junle
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7646 - 7661
  • [7] Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection
    Huo, Fushuo
    Zhu, Xuegui
    Zhang, Qian
    Liu, Ziming
    Yu, Wenchao
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [8] PARN: Position-Aware Relation Networks for Few-Shot Learning
    Wu, Ziyang
    Li, Yuwei
    Guo, Lihua
    Jia, Kui
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6658 - 6666
  • [9] Federated Learning with Position-Aware Neurons
    Li, Xin-Chun
    Xu, Yi-Chu
    Song, Shaoming
    Li, Bingshuai
    Li, Yinchuan
    Shao, Yunfeng
    Zhan, De-Chuan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10072 - 10081
  • [10] Context-aware network for RGB-D salient object detection
    Liang, Fangfang
    Duan, Lijuan
    Ma, Wei
    Qiao, Yuanhua
    Miao, Jun
    Ye, Qixiang
    [J]. PATTERN RECOGNITION, 2021, 111