Deep Dual Learning for Semantic Image Segmentation

被引:45
|
作者
Luo, Ping [2 ]
Wang, Guangrun [1 ,2 ]
Lin, Liang [1 ,3 ]
Wang, Xiaogang [2 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[3] SenseTime Grp Ltd, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2017.296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have advanced many computer vision tasks, because of their compelling capacities to learn from large amount of labeled data. However, their performances are not fully exploited in semantic image segmentation as the scale of training set is limited, where per-pixel labelmaps are expensive to obtain. To reduce labeling efforts, a natural solution is to collect additional images from Internet that are associated with image-level tags. Unlike existing works that treated labelmaps and tags as independent supervisions, we present a novel learning setting, namely dual image segmentation (DIS), which consists of two complementary learning problems that are jointly solved. One predicts labelmaps and tags from images, and the other reconstructs the images using the predicted labelmaps. DIS has three appealing properties. 1) Given an image with tags only, its labelmap can be inferred by leveraging the images and tags as constraints. The estimated labelmaps that capture accurate object classes and boundaries are used as ground truths in training to boost performance. 2) DIS is able to clean tags that have noises. 3) DIS significantly reduces the number of per-pixel annotations in training, while still achieves state-of-the-art performance. Extensive experiments demonstrate the effectiveness of DIS, which outperforms an existing best-performing baseline by 12.6% on Pascal VOC 2012 test set, without any post-processing such as CRF/MRF smoothing.
引用
收藏
页码:2737 / 2745
页数:9
相关论文
共 50 条
  • [1] Image Classification and Semantic Segmentation with Deep Learning
    Quazi, Saiman
    Musa, Sarhan M.
    [J]. 6TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2021,
  • [2] Multimodal Deep Learning in Semantic Image Segmentation: A Review
    Raman, Vishal
    Kumari, Madhu
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2018), 2018, : 7 - 11
  • [3] Medical image semantic segmentation based on deep learning
    Jiang, Feng
    Grigorev, Aleksei
    Rho, Seungmin
    Tian, Zhihong
    Fu, YunSheng
    Jifara, Worku
    Adil, Khan
    Liu, Shaohui
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05): : 1257 - 1265
  • [4] Semantic image segmentation network based on deep learning
    Chen, Bo
    Zhang, Jiahao
    Zhou, Jianbang
    Chen, Zhong
    Yang, Tian
    Zhang, Yanna
    [J]. MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
  • [5] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    [J]. APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
  • [6] A Survey on Image Semantic Segmentation Using Deep Learning Techniques
    Cheng, Jieren
    Li, Hua
    Li, Dengbo
    Hua, Shuai
    Sheng, Victor S.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1941 - 1957
  • [7] Research of animals image semantic segmentation based on deep learning
    Liu, Shouqiang
    Li, Miao
    Li, Min
    Xu, Qingzhen
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (01):
  • [8] Deep CRF-Graph Learning for Semantic Image Segmentation
    Ding, Fuguang
    Wang, Zhenhua
    Guo, Dongyan
    Chen, Shengyong
    Zhang, Jianhua
    Shao, Zhanpeng
    [J]. PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 360 - 368
  • [9] Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation
    Nimma, Divya
    Uddagiri, Arjun
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 1172 - 1185
  • [10] Semantic Image Segmentation with Deep Learning for Vine Leaf Phenotyping
    Tamvakis, Petros N.
    Kiourt, Chairi
    Solomou, Alexandra D.
    Ioannakis, George
    Tsirliganis, Nestoras C.
    [J]. IFAC PAPERSONLINE, 2022, 55 (32): : 83 - 88