Generalized Zero-Shot Video Classification via Generative Adversarial Networks

被引:8
|
作者
Hong, Mingyao [1 ]
Li, Guorong [1 ,2 ]
Zhang, Xinfeng [1 ,2 ]
Huang, Qingming [1 ,2 ,3 ]
机构
[1] UCAS, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] UCAS, Key Lab Big Data Min & Knowledge Management, Beijing, Peoples R China
[3] Chinese Acad Sci, ICT, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
zero-shot learning; datasets; video classification; text description;
D O I
10.1145/3394171.3413517
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) is to classify images according to detailed attribute annotations into new categories that are unseen during the training stage. Generalized zero-shot learning (GZSL) adds seen categories to the test samples. Since the learned classifier has inherent bias against seen categories, GZSL is more challenging than traditional ZSL. However, at present, there is no detailed attribute description dataset for video classification. Therefore, the current zero-shot video classification problem is based on the synthesis of generative adversarial networks trained on seen-class features into unseen-class features for ZSL classification. In order to solve this problem, we propose a description text dataset based on the UCF101 action recognition dataset. To the best of our knowledge, this is the first work to add description of the classes to zero-shot video classification. We propose a new loss function that combines visual features with textual features. We extract text features from the proposed text data set, and constrain the process of generating synthetic features based on the principle that videos with similar text types should be similar. Our method reapplies the traditional zero-shot learning idea to video classification. From the experimental point of view, our proposed dataset and method have a positive impact on the generalized zero-shot video classification.
引用
收藏
页码:2419 / 2426
页数:8
相关论文
共 50 条
  • [1] Bias alleviating generative adversarial network for generalized zero-shot classification
    Li, Xiao
    Fang, Min
    Li, Haikun
    [J]. IMAGE AND VISION COMPUTING, 2021, 105
  • [2] Generative Adversarial Networks for Zero-Shot Remote Sensing Scene Classification
    Li, Zihao
    Zhang, Daobing
    Wang, Yang
    Lin, Daoyu
    Zhang, Jinghua
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [3] Zero-Shot Learning with Joint Generative Adversarial Networks
    Zhang, Minwan
    Wang, Xiaohua
    Shi, Yueting
    Ren, Shiwei
    Wang, Weijiang
    [J]. ELECTRONICS, 2023, 12 (10)
  • [4] Generative Dual Adversarial Network for Generalized Zero-shot Learning
    Huang, He
    Wang, Changhu
    Yu, Philip S.
    Wang, Chang-Dong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 801 - 810
  • [5] Zero-shot image classification based on generative adversarial network
    Wei, Hongxi
    Zhang, Yue
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (12): : 2345 - 2350
  • [6] Triple discriminator generative adversarial network for zero-shot image classification
    Ji, Zhong
    Yan, Jiangtao
    Wang, Qiang
    Pang, Yanwei
    Li, Xuelong
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (02)
  • [7] Triple discriminator generative adversarial network for zero-shot image classification
    Zhong Ji
    Jiangtao Yan
    Qiang Wang
    Yanwei Pang
    Xuelong Li
    [J]. Science China Information Sciences, 2021, 64
  • [8] Triple discriminator generative adversarial network for zero-shot image classification
    Zhong JI
    Jiangtao YAN
    Qiang WANG
    Yanwei PANG
    Xuelong LI
    [J]. Science China(Information Sciences), 2021, 64 (02) : 5 - 18
  • [9] Cooperative Coupled Generative Networks for Generalized Zero-Shot Learning
    Sun, Liang
    Song, Junjie
    Wang, Ye
    Li, Baoyu
    [J]. IEEE ACCESS, 2020, 8 : 119287 - 119299
  • [10] Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation
    Wang, Jinghua
    Jiang, Jianmin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3374 - 3383