Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation

Cited by: 59
Authors
Follmann, Patrick [1]
Koenig, Rebecca [1]
Haertinger, Philipp [1]
Klostermann, Michael [1]
Boettger, Tobias [1]
Affiliations
[1] MVTec Software GmbH, Munich, Germany
DOI
10.1109/WACV.2019.00146
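Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809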
Abstract
Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for semantic amodal segmentation that predicts the amodal instance masks as well as their visible and invisible part in a single forward pass. In a detailed analysis, we provide experiments to show which architecture choices are beneficial for an all-in-one amodal segmentation model. On the COCO amodal dataset, our model outperforms the current baseline for amodal segmentation by a large margin. To further evaluate our model, we provide two new datasets with ground truth for semantic amodal segmentation, D2S amodal and COCOA cls. For both datasets, our model provides a strong baseline performance. Using special data augmentation techniques, we show that amodal segmentation on D2S amodal is possible with reasonable performance, even without providing amodal training data.
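The relationship between the three masks named in the abstract (amodal, visible, invisible) can be illustrated with a small sketch. It assumes the usual definition that the invisible part is the set of amodal pixels not covered by the visible mask. The function name `invisible_mask` and the toy masks are hypothetical and are not taken from the paper or its model.

```python
def invisible_mask(amodal, visible):
    """Return the occluded part of an instance.

    Assumption: masks are equal-sized 2D lists of 0/1, and the invisible
    part is every pixel that is in the amodal mask but not in the visible one.
    """
    same_rows = len(amodal) == len(visible)
    if not same_rows or any(len(a) != len(v) for a, v in zip(amodal, visible)):
        raise ValueError("masks must have the same shape")
    return [
        [int(bool(a) and not v) for a, v in zip(row_a, row_v)]
        for row_a, row_v in zip(amodal, visible)
    ]


# Toy data (hypothetical): a 2x3 object whose right column is occluded.
amodal = [
    [0, 1, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]
visible = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

invisible = invisible_mask(amodal, visible)

# Fraction of the full (amodal) object that is hidden.
occlusion_rate = sum(map(sum, invisible)) / sum(map(sum, amodal))
```

In this toy case two of the six amodal pixels are hidden, so the occlusion rate is one third. The model described in the abstract predicts these masks directly in a single forward pass; the sketch only shows how the three quantities relate.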
Pages: 1328-1336
Number of pages: 9