Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Cited by: 69
Authors
Wang, Xudong [1, 2]
Girdhar, Rohit [1]
Yu, Stella X. [2, 3]
Misra, Ishan [1]
Affiliations
[1] Meta AI, FAIR, New York, NY 94720 USA
[2] Univ Calif Berkeley, ICSI, Berkeley, CA 94720 USA
[3] Univ Michigan, Ann Arbor, MI USA
DOI
10.1109/CVPR52729.2023.00305
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose Cut-and-LEaRn (CutLER), a simple approach for training unsupervised object detection and segmentation models. We leverage the property of self-supervised models to 'discover' objects without supervision and amplify it to train a state-of-the-art localization model without any human labels. CutLER first uses our proposed MaskCut approach to generate coarse masks for multiple objects in an image, and then learns a detector on these masks using our robust loss function. We further improve performance by self-training the model on its predictions. Compared to prior work, CutLER is simpler, compatible with different detection architectures, and detects multiple objects. CutLER is also a zero-shot unsupervised detector and improves detection performance AP50 by over 2.7x on 11 benchmarks across domains like video frames, paintings, sketches, etc. With finetuning, CutLER serves as a low-shot detector surpassing MoCo-v2 by 7.3% APbox and 6.6% APmask on COCO when training with 5% labels.
Pages: 3124-3134
Number of pages: 11
Related papers
50 in total
  • [31] Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation
    Zheng, Zhaohui
    Wang, Ping
    Ren, Dongwei
    Liu, Wei
    Ye, Rongguang
    Hu, Qinghua
    Zuo, Wangmeng
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8574 - 8586
  • [32] The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks
    Tundia, Chintan
    Kumar, Rajiv
    Damani, Om
    Sivakumar, G.
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 323 - 330
  • [33] Interactive Deep Annotation as DARos: Object Detection Supervision for Efficient Instance Segmentation
    Wang, Lihao
    Benmokhtar, Rachid
    Perrotton, Xavier
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 528 - 540
  • [34] CDANet: Common-and-Differential Attention Network for Object Detection and Instance Segmentation
    Wang, Yan
    Li, Yang
    Guo, Xiaohui
    Jiao, Licheng
    Liu, Xu
    PATTERN RECOGNITION LETTERS, 2022, 158 : 48 - 54
  • [35] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [36] Bimodal-based Object Detection and Instance Segmentation Models for Substation Equipments
    Yan, Nannan
    Zhou, Taiji
    Gu, Chunjie
    Jiang, Anfeng
    Lu, Wenlian
    IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 428 - 434
  • [37] A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation
    Chen, Wuyang
    Du, Xianzhi
    Yang, Fan
    Beyer, Lucas
    Zhai, Xiaohua
    Lin, Tsung-Yi
    Chen, Huizhong
    Li, Jing
    Song, Xiaodan
    Wang, Zhangyang
    Zhou, Denny
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 711 - 727
  • [38] An Object Detection Method Using Probability Maps for Instance Segmentation to Mask Background
    Uchinoura, Shinji
    Miyao, Junichi
    Kurita, Takio
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (05) : 886 - 895
  • [39] Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
    Gao, Bin-Bin
    Chen, Xiaochen
    Huang, Zhongyi
    Nie, Congchong
    Liu, Jun
    Lai, Jinxiang
    Jiang, Guannan
    Wang, Xi
    Wang, Chengjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [40] Beyond document object detection: instance-level segmentation of complex layouts
    Sanket Biswas
    Pau Riba
    Josep Lladós
    Umapada Pal
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 269 - 281