Generalized zero-shot learning for action recognition with web-scale video data

被引:30
|
作者
Liu, Kun [1 ]
Liu, Wu [1 ]
Ma, Huadong [1 ]
Huang, Wenbing [2 ]
Dong, Xiongxiong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Tencent AI Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Generalized zero-shot learning; Surveillance video; Transfer learning; Web-scale video data; FUSION;
D O I
10.1007/s11280-018-0642-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Action recognition in surveillance video makes our life safer by detecting the criminal events or predicting violent emergencies. However, efficient action recognition is not free of difficulty. First, there are so many action classes in daily life that we cannot pre-define all possible action classes beforehand. Moreover, it is very hard to collect real-word videos for certain particular actions such as steal and street fight due to legal restrictions and privacy protection. These challenges make existing data-driven recognition methods insufficient to attain desired performance. Zero-shot learning is potential to be applied to solve these issues since it can perform classification without positive example. Nevertheless, current zero-shot learning algorithms have been studied under the unreasonable setting where seen classes are absent during the testing phase. Motivated by this, we study the task of action recognition in surveillance video under a more realistic generalized zero-shot setting, where testing data contains both seen and unseen classes. To our best knowledge, this is one of the first works to study video action recognition under the generalized zero-shot setting. We firstly perform extensive empirical studies on several existing zero-shot leaning approaches under this new setting on a web-scale video data. Our experimental results demonstrate that, under the generalize setting, typical zero-shot learning methods are no longer effective for the dataset we applied. Then, we propose to deploy generalized zero-shot learning which transfers the knowledge of Web video to detect the anomalous actions in surveillance videos. To verify the effectiveness of methods, we further construct a new surveillance video dataset consisting of nine action classes related to the public safety situation.
引用
收藏
页码:807 / 824
页数:18
相关论文
共 50 条
  • [41] VDARN: Video Disentangling Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
    Su, Yong
    Xing, Meng
    An, Simin
    Peng, Weilong
    Feng, Zhiyong
    AD HOC NETWORKS, 2021, 113
  • [42] Attention-Based Video Disentangling and Matching Network for Zero-Shot Action Recognition
    Su, Yong
    Zhu, Shuang
    Xing, Meng
    Xu, Hengpeng
    Li, Zhengtao
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 368 - 375
  • [43] Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions
    Qian, Yijun
    Yu, Lijun
    Liu, Wenhe
    Hauptmann, Alexander G.
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 104 - 120
  • [44] Fine-grained Human Action Recognition Based on Zero-Shot Learning
    Zhao, Yahui
    Shi, Ping
    You, Jian
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 294 - 297
  • [45] A ZERO-SHOT ARCHITECTURE FOR ACTION RECOGNITION IN STILL IMAGES
    Safaei, Marjaneh
    Foroosh, Hassan
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 460 - 464
  • [46] Deconfounding Causal Inference for Zero-Shot Action Recognition
    Wang, Junyan
    Jiang, Yiqi
    Long, Yang
    Sun, Xiuyu
    Pagnucco, Maurice
    Song, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3976 - 3986
  • [47] Global Semantic Descriptors for Zero-Shot Action Recognition
    Estevam, Valter
    Laroca, Rayson
    Pedrini, Helio
    Menotti, David
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1843 - 1847
  • [48] Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation
    Xu, Xun
    Hospedales, Timothy M.
    Gong, Shaogang
    COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 343 - 359
  • [49] SEMANTIC EMBEDDING SPACE FOR ZERO-SHOT ACTION RECOGNITION
    Xu, Xun
    Hospedales, Timothy
    Gong, Shaogang
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 63 - 67
  • [50] EXPLORING SYNONYMS AS CONTEXT IN ZERO-SHOT ACTION RECOGNITION
    Alexiou, Ioannis
    Xiang, Tao
    Gong, Shaogang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4190 - 4194