Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection

Cited by: 4
Authors
Han, Mingfei [1]
Wang, Yali [2, 3]
Li, Mingjie [4]
Chang, Xiaojun [1]
Yang, Yi [5]
Qiao, Yu [2, 3]
Affiliations
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, ReLER Lab, Ultimo, NSW 2007, Australia
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 202150, Peoples R China
[4] Stanford Univ, Dept Radiat Oncol, Stanford, CA 94305 USA
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China
Funding
Australian Research Council
Keywords
Proposals; Object detection; Detectors; Annotations; Task analysis; Training; Benchmark testing; Video object detection; weakly supervised learning; holistic-view refinement;
DOI
10.1109/TIP.2024.3364536
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we focus on the weakly supervised video object detection problem, where each training video is tagged only with object labels, without any bounding-box annotations. To effectively train object detectors from such weakly annotated videos, we propose a Progressive Frame-Proposal Mining (PFPM) framework that exploits discriminative proposals in a coarse-to-fine manner. First, we design a flexible Multi-Level Selection (MLS) scheme with explicit guidance from video tags. By selecting object-relevant frames and mining important proposals from these frames, MLS reduces frame redundancy and improves proposal effectiveness, which boosts weakly supervised detectors. Moreover, we develop a novel Holistic-View Refinement (HVR) scheme, which globally evaluates the importance of proposals across frames and thus correctly refines pseudo ground-truth boxes for training video detectors in a self-supervised manner. Finally, we evaluate the proposed PFPM on ImageNet VID, a large-scale benchmark for video object detection, under the weak-annotation setting. The experimental results demonstrate that PFPM significantly outperforms state-of-the-art weakly supervised detectors.
Pages: 1560-1573
Page count: 14