Mixed Supervision for Instance Learning in Object Detection with Few-shot Annotation

被引:0
|
作者
Zhong, Yi [1 ]
Wang, Chengyao [1 ]
Li, Shiyong [2 ]
Zhou, Zhu [2 ]
Wang, Yaowei [3 ]
Zheng, Wei-Shi [1 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
[2] Huawei, AI Applicat Res Ctr, Shenzhen, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
关键词
object detection; mixed supervision; few shot; instance learning;
D O I
10.1145/3503161.3548242
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Mixed supervision for object detection (MSOD) that utilizes imagelevel annotations and a small amount of instance-level annotations has emerged as an efficient tool by alleviating the requirement for a large amount of costly instance-level annotations and providing effective instance supervision on previous methods that only use image-level annotations. In this work, we introduce the mixed supervision instance learning (MSIL), as a novel MSOD framework to leverage a handful of instance-level annotations to provide more explicit and implicit supervision. Rather than just adding instance-level annotations directly on loss functions for detection, we aim to dig out more effective explicit and implicit relations between these two different level annotations. In particular, we firstly propose the Instance-Annotation Guided Image Classification strategy to provide explicit guidance from instance-level annotations by using positional relation to force the image classifier to focus on the proposals which contain the correct object. And then, in order to exploit more implicit interaction between the mixed annotations, an instance reproduction strategy guided by the extra instance-level annotations is developed for generating more accurate pseudo ground truth, achieving a more discriminative detector. Finally, a false target instance mining strategy is used to refine the above processing by enriching the number and diversity of training instances with the position and score information. Our experiments show that the proposed MSIL framework outperforms recent state-of-the-art mixed supervised detectors with a large margin on both the Pascal VOC2007 and the MS-COCO dataset.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Few-Shot Learning for Road Object Detection
    Majee, Anay
    Agrawal, Kshitij
    Subramanian, Anbumani
    AAAI WORKSHOP ON META-LEARNING AND METADL CHALLENGE, VOL 140, 2021, 140 : 115 - 126
  • [2] Dynamic relevance learning for few-shot object detection
    Weijie Liu
    Xiaojie Cai
    Chong Wang
    Haohe Li
    Shenghao Yu
    Signal, Image and Video Processing, 2025, 19 (4)
  • [3] Few-shot object detection via baby learning
    Vu, Anh-Khoa Nguyen
    Nguyen, Nhat-Duy
    Nguyen, Khanh-Duy
    Nguyen, Vinh-Tiep
    Ngo, Thanh Duc
    Do, Thanh-Toan
    Nguyen, Tam V.
    IMAGE AND VISION COMPUTING, 2022, 120
  • [4] Few-shot Object Detection with Refined Contrastive Learning
    Shangguan, Zeyu
    Huai, Lian
    Liu, Tong
    Jiang, Xingqun
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 991 - 996
  • [5] Fast Hierarchical Learning for Few-Shot Object Detection
    She, Yihang
    Bhat, Goutam
    Danelljan, Martin
    Yu, Fisher
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 1993 - 2000
  • [6] Few-Shot Object Detection via Metric Learning
    Zhu Min
    Zhang Chongyang
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [7] Object detection based on few-shot learning via instance-level feature correlation and aggregation
    Wang, Meng
    Ning, Hongwei
    Liu, Haipeng
    APPLIED INTELLIGENCE, 2023, 53 (01) : 351 - 368
  • [8] Object detection based on few-shot learning via instance-level feature correlation and aggregation
    Meng Wang
    Hongwei Ning
    Haipeng Liu
    Applied Intelligence, 2023, 53 : 351 - 368
  • [9] Few-Shot Object Detection: A Survey
    Antonelli, Simone
    Avola, Danilo
    Cinque, Luigi
    Crisostomi, Donato
    Foresti, Gian Luca
    Galasso, Fabio
    Marini, Marco Raoul
    Mecca, Alessio
    Pannone, Daniele
    ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [10] Few-Shot Video Object Detection
    Fan, Qi
    Tang, Chi-Keung
    Tai, Yu-Wing
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 76 - 98