Unsupervised Object Detection Pretraining with Joint Object Priors Generation and Detector Learning

被引:0
|
作者
Wang, Yizhou [1 ,3 ]
Chen, Meilin [1 ,3 ]
Tang, Shixiang [2 ]
Zhu, Feng [3 ]
Yang, Haiyang [5 ]
Bai, Lei [4 ]
Zhao, Rui [3 ,6 ]
Yan, Yunfeng [1 ]
Qi, Donglian [1 ]
Ouyang, Wanli [2 ,4 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Univ Sydney, Sydney, NSW, Australia
[3] SenseTime Res, Hong Kong, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
[5] Nanjing Univ, Nanjing, Peoples R China
[6] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised pretraining methods for object detection aim to learn object discrimination and localization ability from large amounts of images. Typically, recent works design pretext tasks that supervise the detector to predict the defined object priors. They normally leverage heuristic methods to produce object priors, e.g., selective search, which separates the prior generation and detector learning and leads to sub-optimal solutions. In this work, we propose a novel object detection pretraining framework that could generate object priors and learn detectors jointly by generating accurate object priors from the model itself. Specifically, region priors are extracted by attention maps from the encoder, which highlights foregrounds. Instance priors are the selected high-quality output bounding boxes of the detection decoder. By assuming objects as instances in the foreground, we can generate object priors with both region and instance priors. Moreover, our object priors are jointly refined along with the detector optimization. With better object priors as supervision, the model could achieve better detection capability, which in turn promotes the object priors generation. Our method improves the competitive approaches by +1.3 AP, +1.7 AP in 1% and 10% COCO low-data regimes object detection.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Joint Attention Mechanism for Unsupervised Video Object Segmentation
    Yao, Rui
    Xu, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Fang, Liang
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 154 - 165
  • [42] MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
    Wang, Luequan
    Xu, Hongbin
    Kang, Wenxiong
    MACHINE INTELLIGENCE RESEARCH, 2023, 20 (06) : 872 - 883
  • [43] MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
    Luequan Wang
    Hongbin Xu
    Wenxiong Kang
    Machine Intelligence Research, 2023, 20 : 872 - 883
  • [44] Unsupervised learning of object detectors for everyday scenes
    1600, Science and Engineering Research Support Society (07):
  • [45] Unsupervised learning of probabilistic object models (POMs) for object classification, segmentation and recognition
    Chen, Yuanhao
    Zhu, Long
    Yuille, Alan
    Zhang, Hongjiang
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 31 - +
  • [46] Optimal, unsupervised learning in invariant object recognition
    Wallis, G
    Baddeley, R
    NEURAL COMPUTATION, 1997, 9 (04) : 883 - 894
  • [47] Unsupervised Object Learning via Common Fate
    Tangemann, Matthias
    Schneider, Steffen
    von Kuegelgen, Julius
    Locatello, Francesco
    Gehler, Peter
    Brox, Thomas
    Kuemmerer, Matthias
    Bethge, Matthias
    Schoelkopf, Bernhard
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 213, 2023, 213 : 281 - 327
  • [48] Unsupervised Learning of Object Keypoints for Perception and Control
    Kulkarni, Tejas
    Gupta, Ankush
    Ionescu, Catalin
    Borgeaud, Sebastian
    Reynolds, Malcolm
    Zisserman, Andrew
    Mnih, Volodymyr
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [49] LOCUS: Learning object classes with unsupervised segmentation
    Winn, J
    Jojic, N
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 756 - 763
  • [50] Unsupervised Selective Transfer Learning for Object Recognition
    Zheng, Wei-Shi
    Gong, Shaogang
    Xiang, Tao
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 527 - +