Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection

被引:4
|
作者
Han, Mingfei [1 ]
Wang, Yali [2 ,3 ]
Li, Mingjie [4 ]
Chang, Xiaojun [1 ]
Yang, Yi [5 ]
Qiao, Yu [2 ,3 ]
机构
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, ReLER Lab, Ultimo, NSW 2007, Australia
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 202150, Peoples R China
[4] Stanford Univ, Dept Radiat Oncol, Stanford, CA 94305 USA
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China
基金
澳大利亚研究理事会;
关键词
Proposals; Object detection; Detectors; Annotations; Task analysis; Training; Benchmark testing; Video object detection; weakly supervised learning; holistic-view refinement;
D O I
10.1109/TIP.2024.3364536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the weakly supervised video object detection problem, where each training video is only tagged with object labels, without any bounding box annotations of objects. To effectively train object detectors from such weakly-annotated videos, we propose a Progressive Frame-Proposal Mining (PFPM) framework by exploiting discriminative proposals in a coarse-to-fine manner. First, we design a flexible Multi-Level Selection (MLS) scheme, with explicit guidance of video tags. By selecting object-relevant frames and mining important proposals from these frames, the proposed MLS can effectively reduce frame redundancy as well as improve proposal effectiveness to boost weakly-supervised detectors. Moreover, we develop a novel Holistic-View Refinement (HVR) scheme, which can globally evaluate importance of proposals among frames, and thus correctly refine pseudo ground truth boxes for training video detectors in a self-supervised manner. Finally, we evaluate the proposed PFPM on a large-scale benchmark for video object detection, on ImageNet VID, under the setting of weak annotations. The experimental results demonstrate that our PFPM significantly outperforms the state-of-the-art weakly-supervised detectors.
引用
收藏
页码:1560 / 1573
页数:14
相关论文
共 50 条
  • [21] Weakly Supervised Object Detection in Artworks
    Gonthier, Nicolas
    Gousseau, Yann
    Ladjal, Said
    Bonfait, Olivier
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 692 - 709
  • [22] Misclassification in Weakly Supervised Object Detection
    Wu, Zhihao
    Xu, Yong
    Yang, Jian
    Li, Xuelong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3413 - 3427
  • [23] Training Weakly Supervised Video Frame Interpolation with Events
    Yu, Zhiyang
    Zhang, Yu
    Liu, Deyuan
    Zou, Dongqing
    Chen, Xijun
    Liu, Yebin
    Ren, Jimmy
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14569 - 14578
  • [24] Progressive Representation Adaptation for Weakly Supervised Object Localization
    Li, Dong
    Huang, Jia-Bin
    Li, Yali
    Wang, Shengjin
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (06) : 1424 - 1438
  • [25] Weakly Supervised Object Localization with Progressive Domain Adaptation
    Su, Shuochen
    Heide, Felix
    Swanson, Robin
    Klein, Jonathan
    Callenberg, Clara
    Hullin, Matthias
    Heidrich, Wolfgang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : CP40 - CP40
  • [26] Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships
    Zhang, Dingwen
    Zeng, Wenyuan
    Yao, Jieru
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3349 - 3363
  • [27] A Robust Context-Aware Proposal Refinement Method for Weakly Supervised Object Detection
    Awan, Mehwish
    Shin, Jitae
    IEEE ACCESS, 2020, 8 (08): : 199768 - 199780
  • [28] Weakly-supervised salient object detection with the multi-scale progressive network
    Liu X.
    Guo J.
    Zheng S.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2023, 50 (01): : 48 - 57
  • [29] Progressive Contextual Instance Refinement for Weakly Supervised Object Detection in Remote Sensing Images
    Feng, Xiaoxu
    Han, Junwei
    Yao, Xiwen
    Cheng, Gong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (11): : 8002 - 8012
  • [30] Weakly-Supervised RGBD Video Object Segmentation
    Yang, Jinyu
    Gao, Mingqi
    Zheng, Feng
    Zhen, Xiantong
    Ji, Rongrong
    Shao, Ling
    Leonardis, Ales
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2158 - 2170