KS-FuseNet: An Efficient Action Recognition Method Based on Keyframe Selection and Feature Fusion

Cited by: 0
Authors
Mao, Keming [1]
Xiao, Yilong [1]
Jing, Xin [1]
Hu, Zepeng [1]
Ping, Yi [1]
Affiliations
[1] Northeastern University, Software College, Shenyang, People's Republic of China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII | 2025, Vol. 15037
Keywords
Action recognition; Spatial-temporal; Feature fusion; Keyframe selection; Context
DOI
10.1007/978-981-97-8511-7_38
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
To address the challenge of effectively capturing features in contemporary video tasks, we propose an action recognition approach based on keyframe selection and feature fusion. Our method comprises two core modules. The keyframe selection module uses an attention mechanism to split the input sequence of deep feature maps into two distinct tensors, which reduces redundant spatial computation and improves the capture of key features. The spatio-temporal and action feature module has two branches with different structures, which perform spatio-temporal and action feature extraction on the two tensors produced by the first module. Through these closely linked modules, our approach identifies and extracts meaningful video features for the subsequent classification task. We build an end-to-end deep learning model using established frameworks, train and validate it on a generic video dataset, and confirm its efficacy through comparison and ablation experiments. Experiments on this dataset show that our model surpasses the majority of prior works.
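The abstract does not specify how the attention mechanism divides the feature sequence, so the following is only a minimal sketch of the general idea of splitting a sequence of per-frame feature maps into two tensors by attention weight. The function name `split_by_attention`, the parameter `num_key`, the (T, C, H, W) layout, and the pooled-softmax scoring are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def split_by_attention(features, num_key):
    """Split a (T, C, H, W) feature sequence into key and non-key frames.

    Hypothetical scoring: each frame is scored by a softmax over the mean
    activation of its feature map. This is an illustrative stand-in, not
    the attention mechanism described in the paper.
    """
    t = features.shape[0]
    if not 0 < num_key < t:
        raise ValueError("num_key must be between 1 and T - 1")
    # Global average pooling over channels and space: one scalar per frame.
    pooled = features.mean(axis=(1, 2, 3))
    # Numerically stable softmax turns the scalars into attention weights.
    exp = np.exp(pooled - pooled.max())
    weights = exp / exp.sum()
    # Indices of the num_key highest-weighted frames, back in temporal order.
    key_idx = np.sort(np.argsort(weights)[-num_key:])
    rest_idx = np.setdiff1d(np.arange(t), key_idx)
    return features[key_idx], features[rest_idx], weights


# Usage on random data: 8 frames, 16 channels, 7x7 feature maps.
rng = np.random.default_rng(0)
feats = rng.standard_normal((8, 16, 7, 7))
key, rest, w = split_by_attention(feats, num_key=3)
print(key.shape, rest.shape)  # (3, 16, 7, 7) (5, 16, 7, 7)
```

In a two-branch design of the kind the abstract describes, the two returned tensors would then be passed to separate feature extractors; which branch receives which tensor is not stated in the abstract and is left open here.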
Pages: 540-553
Page count: 14