MULTI-SCALE TEMPORAL FEATURE FUSION FOR FEW-SHOT ACTION RECOGNITION

Authors
Lee, Jun-Tae [1]
Yun, Sungrack [1]
Affiliations
[1] Qualcomm AI Res, Initiat Qualcomm Technol Inc, Qualcomm Korea YH, Seoul, South Korea
Keywords
Few-shot learning; Few-shot action; video representation; temporal fusion; cross-attention
DOI
10.1109/ICIP49359.2023.10223132
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The aim of this paper is to recognize, in testing (query) videos, actions of interest that are specified by a few support videos. Our approach centers on a novel temporal enrichment module in which the features describing local temporal contexts in videos are enhanced by collaboratively merging important information from frame-level (no temporal context) features. We call this module a multi-scale temporal feature fusion (MSTFF) module. By using multiple MSTFF modules that vary the scope of local temporal context extraction, we obtain a discriminative video representation, which is crucial in few-shot tasks where the support videos are not sufficient to describe an action class. For stable learning of a model with MSTFF and a further performance boost, we also learn a local temporal context-level auxiliary classifier in parallel with the main classifier. We analyze the proposed components to demonstrate their importance. We achieve state-of-the-art results on three few-shot action recognition benchmarks: Something-Something V2 (SSv2), HMDB51, and Kinetics.
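The abstract does not give the MSTFF equations, so the following is only a toy sketch of the general idea it describes, not the authors' implementation. It assumes local temporal context features are mean-pooled windows of frame features, that fusion is a single unparameterized cross-attention step (context features as queries, frame-level features as keys and values) with a residual connection, and that scales are combined by concatenation. All function names and the window sizes are hypothetical.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def local_context(frames, window):
    # Assumption: a local temporal context feature is the mean of
    # `window` consecutive frame features (stride 1).
    dim = len(frames[0])
    out = []
    for start in range(len(frames) - window + 1):
        chunk = frames[start:start + window]
        out.append([sum(f[d] for f in chunk) / window for d in range(dim)])
    return out

def cross_attention_fuse(contexts, frames):
    # Each context feature (query) attends over all frame-level features
    # (keys and values); the attended vector is added back as a residual.
    # Assumption: no learned projections, scaled dot-product scores.
    dim = len(frames[0])
    scale = math.sqrt(dim)
    fused = []
    for q in contexts:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in frames]
        weights = softmax(scores)
        attended = [sum(w * f[d] for w, f in zip(weights, frames))
                    for d in range(dim)]
        fused.append([a + b for a, b in zip(q, attended)])
    return fused

def multi_scale_representation(frames, windows=(2, 3)):
    # One fusion pass per temporal scope; each scale is averaged over time
    # and the per-scale vectors are concatenated into one video descriptor.
    dim = len(frames[0])
    pooled = []
    for w in windows:
        fused = cross_attention_fuse(local_context(frames, w), frames)
        pooled.append([sum(v[d] for v in fused) / len(fused)
                       for d in range(dim)])
    return [x for vec in pooled for x in vec]

# Four frames with 2-dimensional features (made-up values).
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
video_repr = multi_scale_representation(frames)
print(len(video_repr))
```

In a few-shot setting, a descriptor of this kind would be computed for each support and query video and compared with a distance or similarity measure; the auxiliary classifier mentioned in the abstract is not modeled here.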
Pages: 1785-1789
Page count: 5