Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning

被引:15
|
作者
Zheng, Sipeng [1 ]
Chen, Shizhe [2 ]
Jin, Qin [1 ]
机构
[1] Renmin Univ China, Beijing, Peoples R China
[2] INRIA, Paris, France
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Few-shot learning; Action recognition; Contrastive learning;
D O I
10.1007/978-3-031-19772-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot action recognition aims to recognize actions in test videos based on limited annotated data of target action classes. The dominant approaches project videos into a metric space and classify videos via nearest neighboring. They mainly measure video similarities using global or temporal alignment alone, while an optimum matching should be multi-level. However, the complexity of learning coarse-to-fine matching quickly rises as we focus on finer-grained visual cues, and the lack of detailed local supervision is another challenge. In this work, we propose a hierarchical matching model to support comprehensive similarity measure at global, temporal and spatial levels via a zoom-in matching module. We further propose a mixed-supervised hierarchical contrastive learning (HCL), which not only employs supervised contrastive learning to differentiate videos at different levels, but also utilizes cycle consistency as weak supervision to align discriminative temporal clips or spatial patches. Our model achieves state-of-the-art performance on four benchmarks especially under the most challenging 1-shot recognition setting.
引用
收藏
页码:297 / 313
页数:17
相关论文
共 50 条
  • [1] VISUAL TEMPO CONTRASTIVE LEARNING FOR FEW-SHOT ACTION RECOGNITION
    Wang, Guangge
    Ye, Weirong
    Wang, Xiao
    Jin, Rongrong
    Wang, Hanzi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1096 - 1100
  • [2] Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition
    Wang, Xiao
    Yan, Yan
    Hu, Hai-Miao
    Li, Bo
    Wang, Hanzi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1257 - 1271
  • [3] Supervised Contrastive Learning for Few-Shot Action Classification
    Han, Hongfeng
    Fei, Nanyi
    Lu, Zhiwu
    Wen, Ji-Rong
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 512 - 528
  • [4] Compound Prototype Matching for Few-Shot Action Recognition
    Huang, Yifei
    Yang, Lijin
    Sato, Yoichi
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 351 - 368
  • [5] Matching Compound Prototypes for Few-Shot Action Recognition
    Huang, Yifei
    Yang, Lijin
    Chen, Guo
    Zhang, Hongjie
    Lu, Feng
    Sato, Yoichi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 3977 - 4002
  • [6] Hierarchical compositional representations for few-shot action recognition
    Li, Changzhen
    Zhang, Jie
    Wu, Shuzhe
    Jin, Xin
    Shan, Shiguang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [7] Hierarchical Reasoning Network with Contrastive Learning for Few-Shot Human-Object Interaction Recognition
    Yu, Jiale
    Zhang, Baopeng
    Li, Qirui
    Chen, Haoyang
    Teng, Zhu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4260 - 4268
  • [8] Few-Shot Classification with Contrastive Learning
    Yang, Zhanyuan
    Wang, Jinghua
    Zhu, Yingying
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 293 - 309
  • [9] Anomalous Action Recognition Research for Few-shot Learning
    Qi, Yufei
    Liu, Ting
    Fu, Yuzhuo
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1306 - 1310
  • [10] CONTAINER: Few-Shot Named Entity Recognition via Contrastive Learning
    Das, Sarkar Snigdha Sarathi
    Katiyar, Arzoo
    Passonneau, Rebecca J.
    Zhang, Rui
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6338 - 6353