HyRSM++: Hybrid relation guided temporal set matching for few-shot action recognition

Cited by: 10
Authors
Wang, Xiang [1]
Zhang, Shiwei [2]
Qing, Zhiwu [1]
Zuo, Zhengrong [1]
Gao, Changxin [1]
Jin, Rong [3]
Sang, Nong [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automation, Key Lab, Minist Educ Image Proc & Intelligent Control, Wuhan, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Meta AI, Medford, MA USA
Funding
National Natural Science Foundation of China;
Keywords
Few-shot action recognition; Set matching; Semi-supervised few-shot action recognition; Unsupervised few-shot action recognition;
DOI
10.1016/j.patcog.2023.110110
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Few-shot action recognition is a challenging but practical problem that aims to learn a model that can be easily adapted to identify new action categories with only a few labeled samples. However, existing attempts still suffer from two drawbacks: (i) learning individual features without considering the entire task may result in limited representation capability, and (ii) existing alignment strategies are sensitive to noise and misaligned instances. To address these two limitations, we propose a novel Hybrid Relation guided temporal Set Matching (HyRSM++) approach for few-shot action recognition. The core idea of HyRSM++ is to integrate all videos within the task to learn discriminative representations and to incorporate a robust matching technique. Specifically, HyRSM++ consists of two key components: a hybrid relation module and a temporal set matching metric. Given the basic representations from the feature extractor, the hybrid relation module is introduced to fully exploit associated relations within and across videos in an episodic task and can thus learn task-specific embeddings. Subsequently, in the temporal set matching metric, we measure the distance between query and support videos from a set matching perspective and design a bidirectional Mean Hausdorff Metric to improve the resilience to misaligned instances. Furthermore, we extend the proposed HyRSM++ to deal with the more challenging semi-supervised few-shot action recognition and unsupervised few-shot action recognition tasks. Experimental results on multiple benchmarks demonstrate that our method consistently outperforms existing methods and achieves state-of-the-art performance under various few-shot settings. The source code is available at https://github.com/alibaba-mmai-research/HyRSMPlusPlus.
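The set matching idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact formula, so the code below assumes the generic mean (average) Hausdorff form: each frame feature is matched to its nearest frame in the other video, the nearest distances are averaged, and the two directions are summed. The cosine distance, the function names, and the nearest-class rule in `classify` are illustrative assumptions, not the authors' implementation.

```python
import math

def cosine_distance(u, v):
    # Assumed frame-level distance; inputs must be non-zero vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def directed_mean_min(source, target):
    # For every frame in `source`, take the distance to its closest
    # frame in `target`, then average over the frames of `source`.
    return sum(
        min(cosine_distance(s, t) for t in target) for s in source
    ) / len(source)

def bidirectional_mean_hausdorff(query, support):
    # Sum of both directions, so unmatched frames on either side count.
    return directed_mean_min(query, support) + directed_mean_min(support, query)

def classify(query, support_sets):
    # Hypothetical helper: pick the label whose support video is closest.
    return min(
        support_sets,
        key=lambda label: bidirectional_mean_hausdorff(query, support_sets[label]),
    )

# Toy example: each video is a list of 2-D frame features.
query = [[0.0, 1.0], [1.0, 0.0]]
support_sets = {
    "a": [[1.0, 0.0], [0.0, 1.0]],    # same frames, different order
    "b": [[-1.0, 0.0], [0.0, -1.0]],  # opposite directions
}
print(classify(query, support_sets))
```

Because every frame is matched to its nearest counterpart regardless of position, reordering the frames of a video leaves the distance unchanged, which is the sense in which a set-based measure tolerates temporal misalignment.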
Pages: 13