GroupTransNet: Group transformer network for RGB-D salient object detection

被引:1
|
作者
Fang, Xian [1 ,2 ]
Jiang, Mingfeng [1 ]
Zhu, Jinchao [3 ]
Shao, Xiuli [2 ]
Wang, Hongpeng [3 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China
[3] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
RGB-D saliency detection; Convolutional neural networks; Transformer; Group transformer network; Clustering rule; FUSION NETWORK;
D O I
10.1016/j.neucom.2024.127865
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an active topic in computer vision, RGB-D salient object detection has witnessed substantial progress. Although the existing methods have achieved appreciable performance, there are still some challenges. The locality of convolutional neural networks requires that the model has a sufficiently deep global receptive field, while the local characteristic represented by transformer with strong globality is always not enough. Besides, the shared information of contextual features tends to be usually overlooked. To address these bottlenecks, we propose a novel group transformer network (GroupTransNet), which is good at learning the long-range dependencies of cross layer features to promote more perfect feature expression between high-level and lowlevel features. Importantly, we soft group the features of the middle and latter three levels to absorb the semantic information of slightly former level features. Firstly, the input features are adaptively purified by the element-wise operation and sequential attention mechanism. Afterwards, the intermediate features are uniformly fused at different layers, and then processed by several transformers in multiple groups. Finally, the output features are clustered within different classifications and combined with underlying features. Extensive experiments demonstrate the proposed GroupTransNet outperforms the competitors and achieves new state -of -the -art performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] WGI-Net: A weighted group integration network for RGB-D salient object detection
    Ge, Yanliang
    Zhang, Cong
    Wang, Kang
    Liu, Ziqi
    Bi, Hongbo
    [J]. COMPUTATIONAL VISUAL MEDIA, 2021, 7 (01) : 115 - 125
  • [42] DVSOD: RGB-D Video Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Wang, Size
    Li, Wenbo
    Cheng, Li
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] Disentangled Cross-Modal Transformer for RGB-D Salient Object Detection and Beyond
    Chen, Hao
    Shen, Feihong
    Ding, Ding
    Deng, Yongjian
    Li, Chao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1699 - 1709
  • [44] TSVT: Token Sparsification Vision Transformer for robust RGB-D salient object detection
    Gao, Lina
    Liu, Bing
    Fu, Ping
    Xu, Mingzhu
    [J]. PATTERN RECOGNITION, 2024, 148
  • [45] Advancing in RGB-D Salient Object Detection: A Survey
    Chen, Ai
    Li, Xin
    He, Tianxiang
    Zhou, Junlin
    Chen, Duanbing
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [46] Adaptive Fusion for RGB-D Salient Object Detection
    Wang, Ningning
    Gong, Xiaojin
    [J]. IEEE ACCESS, 2019, 7 : 55277 - 55284
  • [47] SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection
    Lee, Minhyeok
    Park, Chaewon
    Cho, Suhwan
    Lee, Sangyoun
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 630 - 647
  • [48] AFLNet: Adversarial focal loss network for RGB-D salient object detection
    Zhao, Xiaoli
    Chen, Zheng
    Hwang, Jenq-Neng
    Shang, Xiwu
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 94
  • [49] Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection
    Gao, Haorao
    Su, Yiming
    Wang, Fasheng
    Li, Haojie
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [50] Perceptual localization and focus refinement network for RGB-D salient object detection
    Han, Jinyu
    Wang, Mengyin
    Wu, Weiyi
    Jia, Xu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259