Salient object detection for RGB-D images by generative adversarial network

被引:0
|
作者
Zhengyi Liu
Jiting Tang
Qian Xiang
Peng Zhao
机构
[1] Anhui University,Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Computer Science and Technology
来源
关键词
Generative adversarial network; Salient object detection; RGB-D image; Self-attention; Double stream network;
D O I
暂无
中图分类号
学科分类号
摘要
Salient object detection for RGB-D image aims to automatically detect the objects of human interest by color and depth information. In the paper generative adversarial network is adopted to improve its performance by adversarial learning. Generator network takes RGB-D images as inputs and outputs synthetic saliency maps. It adopts double stream network to extract color and depth feature individually and then fuses them from deep to shallow progressively. Discriminator network takes RGB image and synthetic saliency maps (RGBS), RGB image and ground truth saliency map (RGBY) as inputs, and outputs their labels indicating whether input is synthetics or ground truth. It consists of three convolution blocks and three fully connected layers. In order to pursuit long-range dependency of feature, self-attention layer is inserted in both generator and discriminator network. Supervised by real labels and ground truth saliency map, discriminator network and generator network are adversarial trained to make generator network cheat discriminator network successfully and discriminator network distinguish synthetics or ground truth correctly. Experiments demonstrate adversarial learning enhances the ability of generator network, RGBS and RGBY input in discriminator network and self-attention layer play an important role in improving the performance. Meanwhile our method outperforms state-of-the-art methods.
引用
收藏
页码:25403 / 25425
页数:22
相关论文
共 50 条
  • [1] Salient object detection for RGB-D images by generative adversarial network
    Liu, Zhengyi
    Tang, Jiting
    Xiang, Qian
    Zhao, Peng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25403 - 25425
  • [2] AFLNet: Adversarial focal loss network for RGB-D salient object detection
    Zhao, Xiaoli
    Chen, Zheng
    Hwang, Jenq-Neng
    Shang, Xiwu
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 94
  • [3] A cross-modal adaptive gated fusion generative adversarial network for RGB-D salient object detection
    Liu, Zhengyi
    Zhang, Wei
    Zhao, Peng
    [J]. NEUROCOMPUTING, 2020, 387 : 210 - 220
  • [4] cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks
    Jiang, Bo
    Zhou, Zitai
    Wang, Xiao
    Tang, Jin
    Luo, Bin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1343 - 1353
  • [5] A salient object detection algorithm based on RGB-D images
    Song, Can
    Wu, Jin
    Deng, Huiping
    Zhu, Lei
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1692 - 1697
  • [6] AirSOD: A Lightweight Network for RGB-D Salient Object Detection
    Zeng, Zhihong
    Liu, Haijun
    Chen, Fenglei
    Tan, Xiaoheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1656 - 1669
  • [7] Circular Complement Network for RGB-D Salient Object Detection
    Bai, Zhen
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    [J]. NEUROCOMPUTING, 2021, 451 : 95 - 106
  • [8] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 1949 - 1961
  • [9] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [10] Dynamic Selective Network for RGB-D Salient Object Detection
    Wen, Hongfa
    Yan, Chenggang
    Zhou, Xiaofei
    Cong, Runmin
    Sun, Yaoqi
    Zheng, Bolun
    Zhang, Jiyong
    Bao, Yongjun
    Ding, Guiguang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9179 - 9192