KS-FuseNet: An Efficient Action Recognition Method Based on Keyframe Selection and Feature Fusion

Cited by: 0

Authors:

Mao, Keming [1]

Xiao, Yilong [1]

Jing, Xin [1]

Hu, Zepeng [1]

Ping, Yi [1]

Affiliations:

[1] Northeastern Univ, Software Coll, Shenyang, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII | 2025 / Vol. 15037

Keywords:

Action recognition; Spatial-temporal; Feature fusion; Keyframe selection; CONTEXT

DOI:

10.1007/978-981-97-8511-7_38

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

To address the difficulty of capturing features effectively in video tasks, we propose an action recognition approach based on keyframe selection and feature fusion. The method comprises two core modules. The keyframe selection module uses an attention mechanism to split the input depth feature map sequence into two distinct tensors, which reduces spatially redundant computation and improves the capture of key features. The spatio-temporal and action feature module has two branches with different structures, which extract spatio-temporal features and action features from the two tensors produced by the first module. Together, these closely linked modules extract meaningful video features for the subsequent classification task. We build an end-to-end deep learning model with established frameworks, train and validate it on a generic video dataset, and confirm its effectiveness through comparison and ablation experiments. On this dataset, the model outperforms most prior work.
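
Illustrative note: the abstract does not say how the attention scores are computed or how the two tensors are formed. The sketch below is only one plausible reading, not the authors' implementation. It assumes a hypothetical scoring rule (the mean activation of each frame's feature vector, normalized with a softmax) and a hypothetical parameter `k` for the number of keyframes, and it splits a frame sequence into a keyframe group and a remainder group while preserving temporal order.

```python
import math


def split_by_attention(frame_features, k):
    """Split per-frame feature vectors into (key, rest) groups by a softmax score.

    Assumption: mean activation stands in for a learned attention head.
    """
    # One scalar logit per frame (hypothetical scoring rule).
    logits = [sum(f) / len(f) for f in frame_features]

    # Numerically stable softmax over the frame axis.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Pick the k highest-weighted frames, then restore temporal order.
    ranked = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    top = sorted(ranked[:k])
    top_set = set(top)

    key = [frame_features[i] for i in top]
    rest = [frame_features[i] for i in range(len(frame_features)) if i not in top_set]
    return key, rest, weights


# Toy input: four frames, two feature channels each.
features = [[0.1, 0.2], [0.9, 1.1], [0.0, 0.1], [0.8, 0.7]]
key, rest, weights = split_by_attention(features, k=2)
```

In a two-branch design such as the one the abstract describes, each group would then go to its own branch. Which branch receives which group is not stated in the abstract.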


Pages: 540-553

Page count: 14

