Weakly Supervised Gaussian Networks for Action Detection

被引:0
|
作者
Fernando, Basura [1 ]
Chet, Cheston Tan Yin [2 ]
Bilen, Hakan [3 ]
机构
[1] ASTAR, A AI, Singapore, Singapore
[2] ASTAR, I2R, Singapore, Singapore
[3] Univ Edinburgh, VICO, Edinburgh, Midlothian, Scotland
基金
新加坡国家研究基金会;
关键词
RECOGNITION;
D O I
10.1109/wacv45572.2020.9093263
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision including frame-level labels. This expensive annotation process limits deploying action detectors to a limited number of categories. We propose a novel method, called WSGN, that learns to detect actions from weak supervision, using only video-level labels. WSGN learns to exploit both video-specific and dataset-wide statistics to predict relevance of each frame to an action category. This strategy leads to significant gains in action detection for two standard benchmarks THUMOS14 and Charades. Our method obtains excellent results compared to state-of-the-art methods that uses similar features and loss functions on THUMOS14 dataset. Similarly, our weakly supervised method is only 0.3% mAP behind a state-of-the-art supervised method on challenging Charades dataset for action localization.
引用
收藏
页码:526 / 535
页数:10
相关论文
共 50 条
  • [1] Weakly Supervised Action Detection
    Siva, Parthipan
    Xiang, Tao
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [2] Weakly Supervised Deep Detection Networks
    Bilen, Hakan
    Vedaldi, Andrea
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2846 - 2854
  • [3] Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection
    Xu, Yunlu
    Zhang, Chengwei
    Cheng, Zhanzhan
    Xie, Jianwen
    Niu, Yi
    Pu, Shiliang
    Wu, Fei
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9070 - 9078
  • [4] UntrimmedNets for Weakly Supervised Action Recognition and Detection
    Wang, Limin
    Xiong, Yuanjun
    Lin, Dahua
    Van Gool, Luc
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6402 - 6411
  • [5] Spatial division networks for weakly supervised detection
    Yongsheng Liu
    Wenyu Chen
    Hong Qu
    S. M. Hasan Mahmud
    Kebin Miao
    Neural Computing and Applications, 2021, 33 : 4965 - 4978
  • [6] Spatial division networks for weakly supervised detection
    Liu, Yongsheng
    Chen, Wenyu
    Qu, Hong
    Mahmud, S. M. Hasan
    Miao, Kebin
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 4965 - 4978
  • [7] Temporal Structure Mining for Weakly Supervised Action Detection
    Yu, Tan
    Ren, Zhou
    Li, Yuncheng
    Yan, Enxu
    Xu, Ning
    Yuan, Junsong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5521 - 5530
  • [8] Discriminative action tubelet detector for weakly-supervised action detection
    Lee, Jiyoung
    Kim, Seungryong
    Kim, Sunok
    Sohn, Kwanghoon
    PATTERN RECOGNITION, 2024, 155
  • [9] Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks
    Rashid, Maheen
    Kjellstrom, Hedvig
    Lee, Yong Jae
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 604 - 613
  • [10] Weakly-Supervised Action Detection Guided by Audio Narration
    Ye, Keren
    Kovashka, Adriana
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1527 - 1537