Transductive Learning With Prior Knowledge for Generalized Zero-Shot Action Recognition

被引:1
|
作者
Su, Taiyi [1 ,2 ]
Wang, Hanli [1 ,2 ]
Qi, Qiuping [1 ,2 ]
Wang, Lei [3 ]
He, Bin [4 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Minist Educ, Shanghai 200092, Peoples R China
[2] Tongji Univ, Key Lab Embedded Syst & Serv Comp, Minist Educ, Shanghai 200092, Peoples R China
[3] DeepBlue Technol Shanghai Co Ltd, Shanghai 200336, Peoples R China
[4] Frontiers Sci Ctr Intelligent Autonomous Syst, Shanghai 200120, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; generalized zero-shot learning; transductive learning; transfer learning; semantic embedding; NETWORK;
D O I
10.1109/TCSVT.2023.3284977
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
It is challenging to achieve generalized zero-shot action recognition. Different from the conventional zero-shot tasks which assume that the instances of the source classes are absent in the test set, the generalized zero-shot task studies the case that the test set contains both the source and the target classes. Due to the gap between visual feature and semantic embedding as well as the inherent bias of the learned classifier towards the source classes, the existing generalized zero-shot action recognition approaches are still far less effective than traditional zero-shot action recognition approaches. Facing these challenges, a novel transductive learning with prior knowledge (TLPK) model is proposed for generalized zero-shot action recognition. First, TLPK learns the prior knowledge which assists in bridging the gap between visual features and semantic embeddings, and preliminarily reduces the bias caused by the visual-semantic gap. Then, a transductive learning method that employs unlabeled target data is designed to overcome the bias problem in an effective manner. To achieve this, a target semantic-available approach and a target semantic-free approach are devised to utilize the target semantics in two different ways, where the target semantic-free approach exploits prior knowledge to produce well-performed semantic embeddings. By exploring the usage of the aforementioned prior-knowledge learning and transductive learning strategies, TLPK significantly bridges the visual-semantic gap and alleviates the bias between the source and the target classes. The experiments on the benchmark datasets of HMDB51 and UCF101 demonstrate the effectiveness of the proposed model compared to the state-of-the-art methods.
引用
收藏
页码:260 / 273
页数:14
相关论文
共 50 条
  • [1] Transductive semantic knowledge graph propagation for zero-shot learning
    Zhang, Hai-gang
    Que, Hao-yi
    Ren, Jin
    Wu, Zheng-guang
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17): : 13108 - 13125
  • [2] Transductive Zero-Shot Action Recognition by Word-Vector Embedding
    Xu, Xun
    Hospedales, Timothy
    Gong, Shaogang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (03) : 309 - 333
  • [3] Coupling Adversarial Graph Embedding for transductive zero-shot action recognition
    Tian, Yi
    Huang, Yaping
    Xu, Wanru
    Kong, Yu
    [J]. NEUROCOMPUTING, 2021, 452 : 239 - 252
  • [4] Transductive Zero-Shot Action Recognition by Word-Vector Embedding
    Xun Xu
    Timothy Hospedales
    Shaogang Gong
    [J]. International Journal of Computer Vision, 2017, 123 : 309 - 333
  • [5] Transductive Zero-Shot Recognition via Shared Model Space Learning
    Guo, Yuchen
    Ding, Guiguang
    Jin, Xiaoming
    Wang, Jianmin
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3494 - 3500
  • [6] Adversarial strategy for transductive zero-shot learning
    Liu, Youfa
    Du, Bo
    Ni, Fuchuan
    [J]. INFORMATION SCIENCES, 2021, 578 : 750 - 761
  • [7] Bidirectional generative transductive zero-shot learning
    Xinpeng Li
    Dan Zhang
    Mao Ye
    Xue Li
    Qiang Dou
    Qiao Lv
    [J]. Neural Computing and Applications, 2021, 33 : 5313 - 5326
  • [8] Transductive Learning for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6081 - 6090
  • [9] Bidirectional generative transductive zero-shot learning
    Li, Xinpeng
    Zhang, Dan
    Ye, Mao
    Li, Xue
    Dou, Qiang
    Lv, Qiao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 5313 - 5326
  • [10] Holistically Associated Transductive Zero-Shot Learning
    Xu, Yangyang
    Xu, Xuemiao
    Han, Guoqiang
    He, Shengfeng
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 437 - 447