Semantic Amodal Segmentation

Cited by: 85
Authors
Zhu, Yan [1, 2]
Tian, Yuandong [1 ]
Metaxas, Dimitris [2 ]
Dollar, Piotr [1 ]
Affiliations
[1] Facebook AI Res, Menlo Pk, CA 94029 USA
[2] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ USA
DOI
10.1109/CVPR.2017.320
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Common visual recognition tasks such as classification, object detection, and semantic segmentation are rapidly reaching maturity, and given the recent rate of progress, it is not unreasonable to conjecture that techniques for many of these problems will approach human levels of performance in the next few years. In this paper we look to the future: what is the next frontier in visual recognition? We offer one possible answer to this question. We propose a detailed image annotation that captures information beyond the visible pixels and requires complex reasoning about full scene structure. Specifically, we create an amodal segmentation of each image: the full extent of each region is marked, not just the visible pixels. Annotators outline and name all salient regions in the image and specify a partial depth order. The result is a rich scene structure, including visible and occluded portions of each region, figure-ground edge information, semantic labels, and object overlap. We create two datasets for semantic amodal segmentation. First, we label 500 images in the BSDS dataset with multiple annotators per image, allowing us to study the statistics of human annotations. We show that the proposed full scene annotation is surprisingly consistent between annotators, including for regions and edges. Second, we annotate 5000 images from COCO. This larger dataset allows us to explore a number of algorithmic ideas for amodal segmentation and depth ordering. We introduce novel metrics for these tasks, and along with our strong baselines, define concrete new challenges for the community.
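As an illustration of the annotation described in the abstract, the sketch below shows one way amodal masks and a depth order could be combined to recover the visible portion of each region and how much of it is occluded. It is not the authors' code or data format: the names `visible_masks` and `occlusion_rate`, the representation of masks as sets of pixel coordinates, and the use of a total front-to-back order (the paper specifies only a partial depth order) are all assumptions made for this example.

```python
from typing import Dict, List, Set, Tuple

Pixel = Tuple[int, int]  # (row, col); hypothetical mask representation


def visible_masks(
    amodal: Dict[str, Set[Pixel]], front_to_back: List[str]
) -> Dict[str, Set[Pixel]]:
    """Derive visible pixels per region from amodal masks.

    Assumes a total depth order (nearest region first), which is a
    simplification of the partial order used in the annotation.
    """
    covered: Set[Pixel] = set()  # pixels claimed by nearer regions
    visible: Dict[str, Set[Pixel]] = {}
    for name in front_to_back:
        visible[name] = amodal[name] - covered
        covered |= amodal[name]
    return visible


def occlusion_rate(amodal_mask: Set[Pixel], visible_mask: Set[Pixel]) -> float:
    """Fraction of the amodal region that is hidden by nearer regions."""
    if not amodal_mask:
        return 0.0
    return 1.0 - len(visible_mask) / len(amodal_mask)


# Toy scene: a cup (4 pixels) standing in front of a table (8 pixels).
amodal = {
    "cup": {(r, c) for r in (0, 1) for c in (1, 2)},
    "table": {(r, c) for r in (1, 2) for c in range(4)},
}
visible = visible_masks(amodal, front_to_back=["cup", "table"])
table_occlusion = occlusion_rate(amodal["table"], visible["table"])
print(sorted(visible["table"]), table_occlusion)
```

In this toy scene the cup overlaps two of the table's pixels, so the table's visible mask is smaller than its amodal mask while the cup's two masks coincide.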
Pages: 3001-3009
Page count: 9
Related papers
50 in total
  • [1] Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge Baseline
    Breitenstein, Jasmin
    Fingscheidt, Tim
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1018 - 1025
  • [2] Amodal Instance Segmentation
    Li, Ke
    Malik, Jitendra
    COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 677 - 693
  • [3] Amodal Panoptic Segmentation
    Mohan, Rohit
    Valada, Abhinav
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20991 - 21000
  • [4] BEYOND THE VISIBLE PIXELS USING SEMANTIC AMODAL SEGMENTATION IN REMOTE SENSING IMAGES
    de Carvalho, Osmar L. F.
    de Carvalho Junior, Osmar A.
    de Albuquerque, Anesmar O.
    Luiz, Argelica S.
    Santana, Nickolas C.
    Borges, Dibio L.
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 310 - 313
  • [5] Amodal Instance Segmentation with KINS Dataset
    Qi, Lu
    Jiang, Li
    Liu, Shu
    Shen, Xiaoyong
    Jia, Jiaya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3009 - 3018
  • [6] Application of amodal segmentation on cucumber segmentation and occlusion recovery
    Kim, Sungjay
    Hong, Suk-Ju
    Ryu, Jiwon
    Kim, Eungchan
    Lee, Chang-Hyup
    Kim, Ghiseok
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 210
  • [7] Layered Embeddings for Amodal Instance Segmentation
    Liu, Yanfeng
    Psota, Eric T.
    Perez, Lance C.
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 102 - 111
  • [8] SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines
    Hu, Yuan-Ting X.
    Chen, Hong-Shuo
    Hui, Kexin
    Huang, Jia-Bin
    Schwing, Alexander
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3100 - 3110
  • [9] Amodal Segmentation Based on Visible Region Segmentation and Shape Prior
    Xiao, Yuting
    Xu, Yanyu
    Zhong, Ziming
    Luo, Weixin
    Li, Jiawei
    Gao, Shenghua
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2995 - 3003
  • [10] Amodal Segmentation Just Like Doing a Jigsaw
    Zeng, Xunli
    Liu, Xiaoli
    Yin, Jianqin
    APPLIED SCIENCES-BASEL, 2022, 12 (08):