GroupTransNet: Group transformer network for RGB-D salient object detection

被引:1
|
作者
Fang, Xian [1 ,2 ]
Jiang, Mingfeng [1 ]
Zhu, Jinchao [3 ]
Shao, Xiuli [2 ]
Wang, Hongpeng [3 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China
[3] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
RGB-D saliency detection; Convolutional neural networks; Transformer; Group transformer network; Clustering rule; FUSION NETWORK;
D O I
10.1016/j.neucom.2024.127865
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an active topic in computer vision, RGB-D salient object detection has witnessed substantial progress. Although the existing methods have achieved appreciable performance, there are still some challenges. The locality of convolutional neural networks requires that the model has a sufficiently deep global receptive field, while the local characteristic represented by transformer with strong globality is always not enough. Besides, the shared information of contextual features tends to be usually overlooked. To address these bottlenecks, we propose a novel group transformer network (GroupTransNet), which is good at learning the long-range dependencies of cross layer features to promote more perfect feature expression between high-level and lowlevel features. Importantly, we soft group the features of the middle and latter three levels to absorb the semantic information of slightly former level features. Firstly, the input features are adaptively purified by the element-wise operation and sequential attention mechanism. Afterwards, the intermediate features are uniformly fused at different layers, and then processed by several transformers in multiple groups. Finally, the output features are clustered within different classifications and combined with underlying features. Extensive experiments demonstrate the proposed GroupTransNet outperforms the competitors and achieves new state -of -the -art performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network
    Liu, Zhengyi
    Wang, Yuan
    Tu, Zhengzheng
    Xiao, Yun
    Tang, Bin
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4481 - 4490
  • [2] CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection
    Sun, Fuming
    Ren, Peng
    Yin, Bowen
    Wang, Fasheng
    Li, Haojie
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2249 - 2262
  • [3] TANet: Transformer-based asymmetric network for RGB-D salient object detection
    Liu, Chang
    Yang, Gang
    Wang, Shuo
    Wang, Hangxu
    Zhang, Yunhua
    Wang, Yutao
    [J]. IET COMPUTER VISION, 2023, 17 (04) : 415 - 430
  • [4] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [5] MULTI-MODAL TRANSFORMER FOR RGB-D SALIENT OBJECT DETECTION
    Song, Peipei
    Zhang, Jing
    Koniusz, Piotr
    Barnes, Nick
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2466 - 2470
  • [6] AirSOD: A Lightweight Network for RGB-D Salient Object Detection
    Zeng, Zhihong
    Liu, Haijun
    Chen, Fenglei
    Tan, Xiaoheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1656 - 1669
  • [7] Circular Complement Network for RGB-D Salient Object Detection
    Bai, Zhen
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    [J]. NEUROCOMPUTING, 2021, 451 : 95 - 106
  • [8] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [9] Dynamic Selective Network for RGB-D Salient Object Detection
    Wen, Hongfa
    Yan, Chenggang
    Zhou, Xiaofei
    Cong, Runmin
    Sun, Yaoqi
    Zheng, Bolun
    Zhang, Jiyong
    Bao, Yongjun
    Ding, Guiguang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9179 - 9192
  • [10] DYNAMIC SELECTION NETWORK FOR RGB-D SALIENT OBJECT DETECTION
    Zhou, Jinlin
    Luo, Zhiming
    Li, Shaozi
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 776 - 780