OpenTAL: Towards Open Set Temporal Action Localization

被引:9
|
作者
Bao, Wentao [1 ]
Yu, Qi [1 ]
Kong, Yu [1 ]
机构
[1] Rochester Inst Technol, Rochester, NY 14623 USA
关键词
D O I
10.1109/CVPR52688.2022.00299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal Action Localization (TAL) has experienced remarkable success under the supervised learning paradigm. However, existing TAL methods are rooted in the closed set assumption, which cannot handle the inevitable unknown actions in open-world scenarios. In this paper, we, for the first time, step toward the Open Set TAL (OSTAL) problem and propose a general framework Open TAL based on Evidential Deep Learning (EDL). Specifically, the OpenTAL consists of uncertainty-aware action classification, actionness prediction, and temporal location regression. With the proposed importance-balanced EDL method, classification uncertainty is learned by collecting categorical evidence majorly from important samples. To distinguish the unknown actions from background video frames, the actionness is learned by the positive-unlabeled learning. The classification uncertainty is further calibrated by leveraging the guidance from the temporal localization quality. The OpenTAL is general to enable existing TAL models for open set scenarios, and experimental results on THUMOS14 and ActivityNet1.3 benchmarks show the effectiveness of our method. The code and pre-trained models are released at https://www.rit.edu/actionlab/opental.
引用
收藏
页码:2969 / 2979
页数:11
相关论文
共 50 条
  • [41] Frame Segmentation Networks for Temporal Action Localization
    Yang, Ke
    Qiao, Peng
    Wang, Qiang
    Li, Shijie
    Niu, Xin
    Li, Dongsheng
    Dou, Yong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 242 - 252
  • [42] Temporal Superpixels based Human Action Localization
    Ullah, Sami
    Hassan, Najmul
    Bhatti, Naeem
    2018 14TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2018,
  • [43] TVNet: Temporal Voting Network for Action Localization
    Wang, Hanyuan
    Damen, Dima
    Mirmehdi, Majid
    Perrett, Toby
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 550 - 558
  • [44] Revisiting Anchor Mechanisms for Temporal Action Localization
    Yang, Le
    Peng, Houwen
    Zhang, Dingwen
    Fu, Jianlong
    Han, Junwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8535 - 8548
  • [45] A Temporal-Aware Relation and Attention Network for Temporal Action Localization
    Zhao, Yibo
    Zhang, Hua
    Gao, Zan
    Guan, Weili
    Nie, Jie
    Liu, Anan
    Wang, Meng
    Chen, Shengyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4746 - 4760
  • [46] Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Zeng, Ying
    Wang, Zhuoqun
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [47] Temporal RPN Learning for Weakly-Supervised Temporal Action Localization
    Huang, Jing
    Kong, Ming
    Chen, Luyuan
    Liang, Tian
    Zhu, Qiang
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [48] TDP: Temporal Dynamic Pooling - A New Method for Temporal Action Localization
    Li, Lei
    Ma, LiHong
    Tian, Jing
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2517 - 2522
  • [49] TEST: Temporal-spatial separated transformer for temporal action localization
    Wan, Herun
    Luo, Minnan
    Li, Zhihui
    Wang, Yang
    NEUROCOMPUTING, 2025, 614
  • [50] Action Coherence Network for Weakly-Supervised Temporal Action Localization
    Zhai, Yuanhao
    Wang, Le
    Tang, Wei
    Zhang, Qilin
    Zheng, Nanning
    Hua, Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1857 - 1870