Improving RGB-D Salient Object Detection via Modality-Aware Decoder

Authors
Song, Mengke [1, 2]
Song, Wenfeng [3]
Yang, Guowei [4]
Chen, Chenglizhao [1, 2]
Affiliations
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] China Univ Petr East China, Qingdao Inst Software, Qingdao 266580, Peoples R China
[3] Beijing Informat Sci & Technol Univ, Comp Sch, Beijing 100192, Peoples R China
[4] Qingdao Univ, Sch Elect Informat, Qingdao 266071, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Decoding; Object detection; Training; Task analysis; Saliency detection; Image segmentation; Feature extraction; RGB-D salient object detection; modality-aware fusion; deep learning; GRAPH CONVOLUTION NETWORK; IMAGE; ATTENTION; FUSION
DOI
10.1109/TIP.2022.3205747
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most existing RGB-D salient object detection (SOD) methods focus primarily on cross-modal and cross-level saliency fusion, which has proven efficient and effective. However, these methods share a critical limitation: their fusion patterns, typically combinations of selective characteristics and their variations, depend too heavily on the network's non-linear adaptability. In such methods, the balance between RGB and D (depth) is formulated individually over intermediate feature slices, so the relation at the modality level may not be learned properly. The optimal RGB-D combination differs across RGB-D scenarios, and the exact complementary status is frequently determined by multiple modality-level factors, such as D quality, the complexity of the RGB scene, and the degree of harmony between them. Existing approaches may therefore struggle to achieve further performance breakthroughs, because their methodologies are not sufficiently modality-sensitive. To address this problem, this paper presents the Modality-aware Decoder (MaD). Its key technical innovations include a series of feature embedding, modality reasoning, and feature back-projecting and collecting strategies, all of which upgrade the widely used multi-scale and multi-level decoding process to be modality-aware. MaD achieves competitive performance against other state-of-the-art (SOTA) models without using any fancy tricks in the decoder's design. Code and results will be publicly available at https://github.com/MengkeSong/MaD.
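The contrast the abstract draws between slice-level and modality-level balancing can be illustrated with a toy sketch. This is not the authors' MaD and does not reproduce anything from their repository: the function names, the scalar `rgb_score` and `depth_score` inputs (standing in for modality-level factors such as depth quality), and the magnitude-based gating are all assumptions made purely for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a short list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def slice_level_fusion(rgb_feat, depth_feat):
    """Hypothetical per-element gating: every position chooses its own
    RGB/depth balance from local magnitudes only."""
    fused = []
    for r, d in zip(rgb_feat, depth_feat):
        w_r, w_d = softmax([abs(r), abs(d)])
        fused.append(w_r * r + w_d * d)
    return fused


def modality_level_fusion(rgb_feat, depth_feat, rgb_score, depth_score):
    """Hypothetical modality-level weighting: ONE weight per modality,
    derived from scene-level scores (assumed inputs, e.g. depth quality),
    is shared by all positions."""
    w_rgb, w_depth = softmax([rgb_score, depth_score])
    fused = [w_rgb * r + w_depth * d for r, d in zip(rgb_feat, depth_feat)]
    return fused, (w_rgb, w_depth)


if __name__ == "__main__":
    rgb = [1.0, 3.0]
    depth = [3.0, 1.0]
    # Equal modality scores give an even blend of the two feature vectors.
    print(modality_level_fusion(rgb, depth, 0.0, 0.0))
    # A low depth score (e.g. a noisy depth map) shifts weight toward RGB.
    print(modality_level_fusion(rgb, depth, 2.0, -2.0))
    print(slice_level_fusion(rgb, depth))
```

In the slice-level variant a noisy depth map can still dominate wherever its local responses are large, whereas the modality-level variant lets a single scene-level judgment down-weight depth everywhere. That is the kind of sensitivity the abstract argues is missing from existing fusion patterns.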
Pages: 6124-6138
Page count: 15