OpenTAL: Towards Open Set Temporal Action Localization

被引:9
|
作者
Bao, Wentao [1 ]
Yu, Qi [1 ]
Kong, Yu [1 ]
机构
[1] Rochester Inst Technol, Rochester, NY 14623 USA
关键词
D O I
10.1109/CVPR52688.2022.00299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal Action Localization (TAL) has experienced remarkable success under the supervised learning paradigm. However, existing TAL methods are rooted in the closed set assumption, which cannot handle the inevitable unknown actions in open-world scenarios. In this paper, we, for the first time, step toward the Open Set TAL (OSTAL) problem and propose a general framework Open TAL based on Evidential Deep Learning (EDL). Specifically, the OpenTAL consists of uncertainty-aware action classification, actionness prediction, and temporal location regression. With the proposed importance-balanced EDL method, classification uncertainty is learned by collecting categorical evidence majorly from important samples. To distinguish the unknown actions from background video frames, the actionness is learned by the positive-unlabeled learning. The classification uncertainty is further calibrated by leveraging the guidance from the temporal localization quality. The OpenTAL is general to enable existing TAL models for open set scenarios, and experimental results on THUMOS14 and ActivityNet1.3 benchmarks show the effectiveness of our method. The code and pre-trained models are released at https://www.rit.edu/actionlab/opental.
引用
收藏
页码:2969 / 2979
页数:11
相关论文
共 50 条
  • [31] Weakly supervised temporal action localization: a survey
    Li, Ronglu
    Zhang, Tianyi
    Zhang, Rubo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 78361 - 78386
  • [32] Graph Convolutional Networks for Temporal Action Localization
    Zeng, Runhao
    Huang, Wenbing
    Tan, Mingkui
    Rong, Yu
    Zhao, Peilin
    Huang, Junzhou
    Gan, Chuang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7093 - 7102
  • [33] DANet: Temporal Action Localization with Double Attention
    Sun, Jianing
    Wu, Xuan
    Xiao, Yubin
    Wu, Chunguo
    Liang, Yanchun
    Liang, Yi
    Wang, Liupu
    Zhou, You
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [34] Temporal Action Localization by Structured Maximal Sums
    Yuan, Zehuan
    Stroud, Jonathan C.
    Lu, Tong
    Deng, Jia
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3215 - 3223
  • [35] Action recognition and localization with spatial and temporal contexts
    Xu, Wanru
    Miao, Zhenjiang
    Yu, Jian
    Ji, Qiang
    NEUROCOMPUTING, 2019, 333 : 351 - 363
  • [36] Probabilistic Temporal Modeling for Unintentional Action Localization
    Xu, Jinglin
    Chen, Guangyi
    Zhou, Nuoxing
    Zheng, Wei-Shi
    Lu, Jiwen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3081 - 3094
  • [37] Gaussian Temporal Awareness Networks for Action Localization
    Long, Fuchen
    Yao, Ting
    Qiu, Zhaofan
    Tian, Xinmei
    Luo, Jiebo
    Mei, Tao
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 344 - 353
  • [38] Action Shuffling for Weakly Supervised Temporal Localization
    Zhang, Xiao-Yu
    Shi, Haichao
    Li, Changsheng
    Shi, Xinchu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4447 - 4457
  • [39] Dual relation network for temporal action localization
    Xia, Kun
    Wang, Le
    Zhou, Sanping
    Hua, Gang
    Tang, Wei
    PATTERN RECOGNITION, 2022, 129
  • [40] Temporal Dropout for Weakly Supervised Action Localization
    Xie, Chi
    Zhuang, Zikun
    Zhao, Shengjie
    Liang, Shuang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)