Semantic Amodal Segmentation

Cited by: 85
Authors
Zhu, Yan [1, 2]
Tian, Yuandong [1]
Metaxas, Dimitris [2]
Dollar, Piotr [1]
Affiliations
[1] Facebook AI Res, Menlo Pk, CA 94029 USA
[2] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ USA
DOI
10.1109/CVPR.2017.320
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Common visual recognition tasks such as classification, object detection, and semantic segmentation are rapidly reaching maturity, and given the recent rate of progress, it is not unreasonable to conjecture that techniques for many of these problems will approach human levels of performance in the next few years. In this paper we look to the future: what is the next frontier in visual recognition? We offer one possible answer to this question. We propose a detailed image annotation that captures information beyond the visible pixels and requires complex reasoning about full scene structure. Specifically, we create an amodal segmentation of each image: the full extent of each region is marked, not just the visible pixels. Annotators outline and name all salient regions in the image and specify a partial depth order. The result is a rich scene structure, including visible and occluded portions of each region, figure-ground edge information, semantic labels, and object overlap. We create two datasets for semantic amodal segmentation. First, we label 500 images in the BSDS dataset with multiple annotators per image, allowing us to study the statistics of human annotations. We show that the proposed full scene annotation is surprisingly consistent between annotators, including for regions and edges. Second, we annotate 5000 images from COCO. This larger dataset allows us to explore a number of algorithmic ideas for amodal segmentation and depth ordering. We introduce novel metrics for these tasks, and along with our strong baselines, define concrete new challenges for the community.
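The relationship between amodal regions, depth order, and visible pixels described in the abstract can be illustrated with a minimal sketch. This is not the authors' code or annotation format: the function names `visible_regions` and `occlusion_rate`, the representation of a region as a set of `(row, col)` pixels, and the simplification to a total front-to-back order (the abstract specifies only a partial depth order) are all assumptions made for illustration.

```python
def visible_regions(amodal, depth_order):
    """Derive visible pixels from amodal regions and a depth order.

    amodal: dict mapping region name -> set of (row, col) pixels covering
            the FULL extent of the region, including occluded parts.
    depth_order: list of region names, nearest region first
                 (assumed total order; hypothetical simplification).
    """
    covered = set()   # pixels already claimed by nearer regions
    visible = {}
    for name in depth_order:
        visible[name] = amodal[name] - covered
        covered |= amodal[name]
    return visible


def occlusion_rate(amodal_pixels, visible_pixels):
    """Fraction of a region's amodal area hidden by nearer regions."""
    if not amodal_pixels:
        return 0.0
    return 1.0 - len(visible_pixels) / len(amodal_pixels)


# Toy scene: a person standing in front of a bench.
amodal = {
    "person": {(r, c) for r in range(0, 2) for c in range(1, 3)},  # 4 pixels
    "bench": {(r, c) for r in range(1, 3) for c in range(0, 4)},   # 8 pixels
}
visible = visible_regions(amodal, ["person", "bench"])
bench_occlusion = occlusion_rate(amodal["bench"], visible["bench"])
```

In this toy scene the person hides two of the bench's eight pixels, so the bench keeps six visible pixels while its amodal region still records all eight.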
Pages: 3001-3009
Page count: 9
Related papers
50 in total
  • [31] A Survey on Semantic Segmentation
    Li, Biao
    Shi, Yong
    Qi, Zhiquan
    Chen, Zhensong
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1233 - 1240
  • [32] Progressive Semantic Segmentation
    Chuong Huynh
    Anh Tuan Tran
    Khoa Luu
    Minh Hoai
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16750 - 16759
  • [33] Generative Semantic Segmentation
    Chen, Jiaqi
    Lu, Jiachen
    Zhu, Xiatian
    Zhang, Li
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7111 - 7120
  • [34] Introspective Semantic Segmentation
    Singh, Gautam
    Kosecka, Jana
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 714 - 720
  • [35] Semantic Relocation Parallel Network for Semantic Segmentation
    Chen S.
    Xu L.
    Zou B.
    Chen J.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (03): 373 - 381
  • [36] AMODAL SEGMENTATION CONSIDERING VISIBLE AND NON-VISIBLE ELEMENTS OF URBAN SURFACES
    Ferreira de Carvalho, Osmar Luiz
    de Albuquerque, Anesmar Olino
    de Carvalho Junior, Osmar Abilio
    Mou, Lichao
    Guerreiro e Silva, Daniel
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5676 - 5679
  • [37] Class semantic enhancement network for semantic segmentation
    Fu, Siming
    Wang, Hualiang
    Hu, Haoji
    He, Xiaoxuan
    Long, Yongwen
    Bai, Jianhong
    Ou, Yangtao
    Huang, Yuanjia
    Zhou, Mengqiu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 96
  • [38] Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation
    Follmann, Patrick
    Koenig, Rebecca
    Haertinger, Philipp
    Klostermann, Michael
    Boettger, Tobias
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1328 - 1336
  • [39] Amodal Instance Segmentation for Mealworm Growth Monitoring Using Synthetic Training Images
    Dolata, Przemyslaw
    Majewski, Pawel
    Lampa, Piotr
    Zieba, Maciej
    Reiner, Jacek
    IEEE ACCESS, 2025, 13 : 52157 - 52175
  • [40] BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion
    Liu, Zhaochen
    Li, Zhixuan
    Jiang, Tingting
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3846 - 3854