OpenTAL: Towards Open Set Temporal Action Localization

被引:9
|
作者
Bao, Wentao [1 ]
Yu, Qi [1 ]
Kong, Yu [1 ]
机构
[1] Rochester Inst Technol, Rochester, NY 14623 USA
关键词
D O I
10.1109/CVPR52688.2022.00299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal Action Localization (TAL) has experienced remarkable success under the supervised learning paradigm. However, existing TAL methods are rooted in the closed set assumption, which cannot handle the inevitable unknown actions in open-world scenarios. In this paper, we, for the first time, step toward the Open Set TAL (OSTAL) problem and propose a general framework Open TAL based on Evidential Deep Learning (EDL). Specifically, the OpenTAL consists of uncertainty-aware action classification, actionness prediction, and temporal location regression. With the proposed importance-balanced EDL method, classification uncertainty is learned by collecting categorical evidence majorly from important samples. To distinguish the unknown actions from background video frames, the actionness is learned by the positive-unlabeled learning. The classification uncertainty is further calibrated by leveraging the guidance from the temporal localization quality. The OpenTAL is general to enable existing TAL models for open set scenarios, and experimental results on THUMOS14 and ActivityNet1.3 benchmarks show the effectiveness of our method. The code and pre-trained models are released at https://www.rit.edu/actionlab/opental.
引用
收藏
页码:2969 / 2979
页数:11
相关论文
共 50 条
  • [1] Learning Generalized Representations for Open-Set Temporal Action Localization
    Hu, Junshan
    Zhuang, Liansheng
    Dong, Weisong
    Ge, Shiming
    Wang, Shafei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1987 - 1996
  • [2] Temporal Action Unit Perception Based Open Set Action Recognition
    Yang K.
    Gao J.
    Feng Y.
    Xu C.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (09): : 806 - 817
  • [3] DeTAL: Open-Vocabulary Temporal Action Localization With Decoupled Networks
    Li, Zhiheng
    Zhong, Yujie
    Song, Ran
    Li, Tianjiao
    Ma, Lin
    Zhang, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 7728 - 7741
  • [4] 2PESNet: Towards online processing of temporal action localization
    Kim, Young Hwi
    Nam, Seonghyeon
    Kim, Seon Joo
    PATTERN RECOGNITION, 2022, 131
  • [5] Spatial-Temporal Exclusive Capsule Network for Open Set Action Recognition
    Feng, Yangbo
    Gao, Junyu
    Yang, Shicai
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9464 - 9478
  • [6] Towards better utilization of pseudo labels for weakly supervised temporal action localization
    Tang, Yiping
    Ge, Junyao
    Guo, Kaitai
    Zheng, Yang
    Hu, Haihong
    Liang, Jimin
    INFORMATION SCIENCES, 2023, 623 : 693 - 708
  • [7] A Survey on Temporal Action Localization
    Xia, Huifen
    Zhan, Yongzhao
    IEEE ACCESS, 2020, 8 : 70477 - 70487
  • [8] Exploring Action Centers for Temporal Action Localization
    Xia, Kun
    Wang, Le
    Shen, Yichao
    Zhou, Sanpin
    Hua, Gang
    Tang, Wei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9425 - 9436
  • [9] Action Sensitivity Learning for Temporal Action Localization
    Shao, Jiayi
    Wang, Xiaohan
    Quan, Ruijie
    Zheng, Junjun
    Yang, Jiang
    Yang, Yi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13411 - 13423
  • [10] Action matching network: open-set action recognition using spatio-temporal representation matching
    Yu, Jongmin
    Kim, Du Yong
    Yoon, Yongsang
    Jeon, Moongu
    VISUAL COMPUTER, 2020, 36 (07): : 1457 - 1471