TANet: Transformer-based asymmetric network for RGB-D salient object detection

Cited by: 6
Authors
Liu, Chang [1]
Yang, Gang [1,3]
Wang, Shuo [1]
Wang, Hangxu [1,2]
Zhang, Yunhua [1]
Wang, Yutao [1]
Affiliations
[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China
[2] DUT Artificial Intelligence Inst, Dalian, Peoples R China
[3] Northeastern Univ, Wenhua Rd, Shenyang 110000, Liaoning, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
computer vision; image segmentation; object detection; REGION;
DOI
10.1049/cvi2.12177
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Existing RGB-D salient object detection methods mainly rely on a symmetric two-stream network based on Convolutional Neural Networks (CNNs) to extract RGB and depth features separately. However, this conventional symmetric structure has two problems: first, CNNs have limited ability to learn global context; second, the symmetric two-stream structure ignores the inherent differences between the modalities. In this study, a Transformer-based asymmetric network (TANet) is proposed to tackle both issues. The authors use a Transformer to extract global semantic information from RGB data and design a lightweight CNN backbone, used without pre-training, to extract spatial structure information from depth data. The asymmetric hybrid encoder reduces the number of model parameters and increases speed without sacrificing performance. Then, a cross-modal feature fusion module is designed in which RGB and depth features enhance each other and are fused. Finally, the authors add edge prediction as an auxiliary task and propose an edge enhancement module to generate sharper contours. Extensive experiments demonstrate that the proposed method achieves superior performance over 14 state-of-the-art RGB-D methods on six public datasets. The code of the authors will be released at .
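The abstract does not specify how the cross-modal feature fusion module works internally, so the following is only a minimal sketch of one common way two modalities can "enhance and fuse with each other": each stream is re-weighted by channel statistics of the other stream, kept alongside its original values, and the two results are summed. The function names (`global_avg_pool`, `gate`, `add`, `fuse`) and the gating scheme are assumptions made for illustration and are not taken from the paper.

```python
import math


def global_avg_pool(feat):
    """Reduce a C x H x W nested list to one mean value per channel."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gate(feat, weights):
    """Scale every channel of `feat` by the matching per-channel weight."""
    return [[[v * w for v in row] for row in ch] for ch, w in zip(feat, weights)]


def add(a, b):
    """Element-wise sum of two C x H x W nested lists of equal shape."""
    return [
        [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(ca, cb)]
        for ca, cb in zip(a, b)
    ]


def fuse(rgb, depth):
    """Hypothetical mutual enhancement (an assumption, not the paper's module).

    Each modality is gated by sigmoid channel weights derived from the
    OTHER modality, added back to itself as a residual, and the two
    enhanced streams are then summed into one fused feature map.
    """
    w_rgb = [sigmoid(m) for m in global_avg_pool(rgb)]
    w_depth = [sigmoid(m) for m in global_avg_pool(depth)]
    rgb_enhanced = add(rgb, gate(rgb, w_depth))
    depth_enhanced = add(depth, gate(depth, w_rgb))
    return add(rgb_enhanced, depth_enhanced)


# Toy input: one channel, 2 x 2 spatial size.
rgb = [[[0.0, 0.0], [0.0, 0.0]]]
depth = [[[1.0, 1.0], [1.0, 1.0]]]
fused = fuse(rgb, depth)
```

In a real network the two streams would be tensors from the Transformer and CNN encoders, and the gating weights would typically come from learned layers rather than a bare sigmoid of the channel mean.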
Pages: 415-430
Page count: 16