MULTI-SCALE TEMPORAL FEATURE FUSION FOR FEW-SHOT ACTION RECOGNITION

被引:0
|
作者
Lee, Jun-Tae [1 ]
Yun, Sungrack [1 ]
机构
[1] Qualcomm AI Res, Initiat Qualcomm Technol Inc, Qualcomm Korea YH, Seoul, South Korea
关键词
Few-shot learning; Few-shot action; video representation; temporal fusion; cross-attention;
D O I
10.1109/ICIP49359.2023.10223132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of this paper is to recognize actions of interest that are given by a few support videos in testing (query) videos. The focus of our approach is to develop a novel temporal enrichment module where the features describing local temporal contexts in videos are enhanced by collaboratively merging important information in frame-level (no temporal context) features. We call this module a multi-scale temporal feature fusion (MSTFF) module. Utilizing multiple MSTFF modules varying the scope of local temporal context extraction, we can obtain discriminative video representation which is crucial in the few-shot tasks where support videos are not sufficient to describe an action class. For stable learning of a model with MSTFF and the performance boost, we also learn a local temporal context-level auxiliary classifier in parallel with the main classifier. We analyze the proposed components to demonstrate their importance. We achieve state-of-the-art on three few-shot action recognition benchmarks: Something-Something V2 (SSv2), HMDB51, and Kinetics.
引用
收藏
页码:1785 / 1789
页数:5
相关论文
共 50 条
  • [31] Adaptive multi-scale transductive information propagation for few-shot learning
    Fu, Sichao
    Liu, Baodi
    Liu, Weifeng
    Zou, Bin
    You, Xinhua
    Peng, Qinmu
    Jing, Xiao-Yuan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [32] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771
  • [33] Multi-level alignment for few-shot temporal action localization
    Keisham, Kanchan
    Jalali, Amin
    Kim, Jonghong
    Lee, Minho
    [J]. INFORMATION SCIENCES, 2023, 650
  • [34] Learning Similarity: Feature-Aligning Network for Few-shot Action Recognition
    Tan, Shaoqing
    Yang, Ruoyu
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [35] Few-shot for person re-identification based on multi-scale mixed attention and metric fusion
    Chen, Gui-Zhen
    Zou, Guo-Feng
    Liu, Yue
    Fu, Gui-Xia
    Gao, Ming-Liang
    [J]. Kongzhi yu Juece/Control and Decision, 2024, 39 (05): : 1441 - 1449
  • [36] A statistical framework for few-shot action recognition
    Mark Haddad
    Vahid K. Ghassab
    Fatma Najar
    Nizar Bouguila
    [J]. Multimedia Tools and Applications, 2021, 80 : 24303 - 24318
  • [37] A statistical framework for few-shot action recognition
    Haddad, Mark
    Ghassab, Vahid K.
    Najar, Fatma
    Bouguila, Nizar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 24303 - 24318
  • [38] A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
    Gomes, Jaco C.
    Borges, Lurdineide de A. B.
    Borges, Dibio L.
    [J]. SENSORS, 2023, 23 (15)
  • [39] Few-shot action recognition with implicit temporal alignment and pair similarity optimization
    Cao, Congqi
    Li, Yajuan
    Lv, Qinyi
    Wang, Peng
    Zhang, Yanning
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
  • [40] Searching for Better Spatio-temporal Alignment in Few-Shot Action Recognition
    Cao, Yichao
    Su, Xiu
    Tang, Qingfei
    You, Shan
    Lu, Xiaobo
    Xu, Chang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,