ZSTAD: Zero-Shot Temporal Activity Detection

被引:19
|
作者
Zhang, Lingling [1 ,2 ]
Chang, Xiaojun [3 ]
Liu, Jun [1 ,4 ]
Luo, Minnan [1 ,4 ]
Wang, Sen [5 ]
Ge, Zongyuan [3 ]
Hauptmann, Alexander [6 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Minist Educ, Key Lab Intelligent Networks & Network Secur, Xian, Peoples R China
[3] Monash Univ, Fac Informat Technol, Melbourne, Vic, Australia
[4] Xi An Jiao Tong Univ, Natl Engn Lab Big Data Analyt, Xian, Peoples R China
[5] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
[6] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR42600.2020.00096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An integral part of video analysis and surveillance is temporal activity detection, which means to simultaneously recognize and localize activities in long untrimmed videos. Currently, the most effective methods of temporal activity detection are based on deep learning, and they typically perform very well with large scale annotated videos for training. However, these methods are limited in real applications due to the unavailable videos about certain activity classes and the time-consuming data annotation. To solve this challenging problem, we propose a novel task setting called zero-shot temporal activity detection (ZSTAD), where activities that have never been seen in training can still be detected. We design an end-to-end deep network based on R-C3D as the architecture for this solution. The proposed network is optimized with an innovative loss function that considers the embeddings of activity labels and their superclasses while learning the common semantics of seen and unseen activities. Experiments on both the THUMOS'14 and the Charades datasets show promising performance in terms of detecting unseen activities.
引用
收藏
页码:876 / 885
页数:10
相关论文
共 50 条
  • [31] Zero-Shot Food Image Detection Based on Transformer
    Song, Jingru
    Min, Weiqing
    Zhou, Pengfei
    Rao, Quanrui
    Sheng, Guorui
    Yang, Yancun
    Wang, Lili
    Jiang, Shuqiang
    Science and Technology of Food Industry, 2024, 45 (22) : 18 - 26
  • [32] Zero-Shot Learning for Raw Network Traffic Detection
    Rani, Pooja
    Bastian, Nathaniel D.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [33] HARGPT: Are LLMs Zero-Shot Human Activity Recognizers?
    Ji, Sijie
    Zheng, Xinzhe
    Wu, Chenshu
    PROCEEDINGS 2024 IEEE INTERNATIONAL WORKSHOP ON FOUNDATION MODELS FOR CYBER-PHYSICAL SYSTEMS & INTERNET OF THINGS, FMSYS 2024, 2024, : 38 - 43
  • [34] Zero-shot versus Many-shot: Unsupervised Texture Anomaly Detection
    Aota, Toshimichi
    Tong, Lloyd Teh Tzer
    Okatani, Takayuki
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5553 - 5561
  • [35] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
    Yang, Shuai
    Zhou, Yifan
    Liu, Ziwei
    Loy, Chen Change
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8703 - 8712
  • [36] MULTI-LABEL ZERO-SHOT AUDIO CLASSIFICATION WITH TEMPORAL ATTENTION
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 250 - 254
  • [37] From zero-shot machine learning to zero-day attack detection
    Sarhan, Mohanad
    Layeghy, Siamak
    Gallagher, Marcus
    Portmann, Marius
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 22 (04) : 947 - 959
  • [38] From zero-shot machine learning to zero-day attack detection
    Mohanad Sarhan
    Siamak Layeghy
    Marcus Gallagher
    Marius Portmann
    International Journal of Information Security, 2023, 22 : 947 - 959
  • [39] Zero-Shot Hyperspectral Sharpening
    Dian, Renwei
    Guo, Anjing
    Li, Shutao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12650 - 12666
  • [40] Zero-shot Adversarial Quantization
    Liu, Yuang
    Zhang, Wei
    Wang, Jun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1512 - 1521