Cluster-guided temporal modeling for action recognition

Cited by: 0
Authors
Kim, Jeong-Hun [1]
Hao, Fei [2]
Leung, Carson Kai-Sang [3]
Nasridinov, Aziz [1]
Affiliations
[1] Chungbuk Natl Univ, Dept Comp Sci, Cheongju 28644, South Korea
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian 710119, Peoples R China
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
Funding
National Research Foundation, Singapore;
Keywords
Keyframe selection; Temporal modeling; Temporal redundancy; Data clustering; Action recognition;
DOI
10.1007/s13735-023-00280-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Action recognition is a video understanding task that aims to recognize the action performed by an object in a video. Recognizing the action requires extracting motion information through temporal modeling. However, videos typically contain high temporal redundancy, such as repeated events and near-identical adjacent frames. This redundancy weakens the information related to the actual action, making it difficult for the final classifier to recognize the action. In this article, we focus on preserving helpful information for action recognition by reducing the high temporal redundancy in videos. To achieve this goal, we propose a novel frame selection method called cluster-guided frame selection (CluFrame). Specifically, in the temporal compression (TC) module, CluFrame compresses an input video into keyframes of clusters discovered by applying k-means clustering to frame-wise features extracted from pre-trained 2D-CNNs. In addition, CluFrame selects keyframes related to the action of the input video by optimizing the TC module based on the action recognition results. Experimental results on five benchmark datasets demonstrate that CluFrame addresses the high temporal redundancy in videos and improves action recognition accuracy by up to 6.6% over existing action recognition methods and by about 0.7% over state-of-the-art frame selection methods.
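The clustering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it only shows the general idea of grouping frame-wise features with k-means and keeping one representative frame per cluster. Several details are assumptions made for illustration: the function names (`kmeans`, `select_keyframes`), the use of the frame nearest to each centroid as the keyframe, the evenly spaced initial centroids, and the toy 2-D vectors that stand in for 2D-CNN features. The end-to-end optimization of the TC module from recognition results is not reproduced here.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(features, centroids):
    """Index of the nearest centroid for every frame feature."""
    return [
        min(range(len(centroids)), key=lambda c: squared_distance(f, centroids[c]))
        for f in features
    ]


def kmeans(features, k, iterations=20):
    """Plain Lloyd-style k-means over frame-wise features.

    Assumption: centroids start at evenly spaced frames, which keeps the
    sketch deterministic; it is not taken from the paper.
    """
    n = len(features)
    centroids = [list(features[(i * n) // k]) for i in range(k)]
    for _ in range(iterations):
        labels = assign(features, centroids)
        for c in range(k):
            members = [f for f, a in zip(features, labels) if a == c]
            if members:  # leave an empty cluster's centroid unchanged
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assign(features, centroids)


def select_keyframes(features, k):
    """Return, in temporal order, one frame index per non-empty cluster.

    Assumption: the keyframe of a cluster is the frame closest to its centroid.
    """
    centroids, labels = kmeans(features, k)
    keyframes = []
    for c in range(k):
        members = [i for i, a in enumerate(labels) if a == c]
        if members:
            keyframes.append(
                min(members, key=lambda i: squared_distance(features[i], centroids[c]))
            )
    return sorted(keyframes)


# Toy "video": 12 frames forming three segments of near-duplicate features
# (hypothetical values standing in for 2D-CNN frame features).
toy_features = [
    [0.0, 0.0], [0.2, 0.0], [0.1, 0.1], [0.1, 0.3],      # segment A
    [5.0, 5.0], [5.2, 5.0], [5.1, 5.3], [5.1, 5.1],      # segment B
    [10.1, 0.1], [10.0, 0.0], [10.2, 0.0], [10.1, 0.3],  # segment C
]

print(select_keyframes(toy_features, k=3))  # → [2, 7, 8]
```

In this toy case the 12 frames are reduced to 3, one per visually distinct segment, which is the kind of temporal redundancy reduction the abstract refers to.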
Pages: 18
Related papers
50 in total
  • [1] Cluster-guided temporal modeling for action recognition
    Jeong-Hun Kim
    Fei Hao
    Carson Kai-Sang Leung
    Aziz Nasridinov
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [2] Cluster-Guided Contrastive Graph Clustering Network
    Yang, Xihong
    Liu, Yue
    Zhou, Sihang
    Wang, Siwei
    Tu, Wenxuan
    Zheng, Qun
    Liu, Xinwang
    Fang, Liming
    Zhu, En
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10834 - 10842
  • [3] Hardness-Aware Metric Learning With Cluster-Guided Attention for Visual Place Recognition
    Guan, Peiyu
    Cao, Zhiqiang
    Fan, Shengxuan
    Yang, Yuequan
    Yu, Junzhi
    Wang, Shuo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 367 - 379
  • [4] Cluster-guided Image Synthesis with Unconditional Models
    Georgopoulos, Markos
    Oldfield, James
    Chrysos, Grigorios G.
    Panagakis, Yannis
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11533 - 11542
  • [5] Cluster-guided graph attention auto-encoder
    Zheng, Zhiwen
    Chen, Xiaoyun
    Huang, Musheng
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (03):
  • [6] Cluster-Guided Unsupervised Domain Adaptation for Deep Speaker Embedding
    Mao, Haiquan
    Hong, Feng
    Mak, Man-wai
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 643 - 647
  • [7] Language-guided temporal primitive modeling for skeleton-based action recognition
    Pan, Qingzhe
    Xie, Xuemei
    NEUROCOMPUTING, 2025, 613
  • [8] Cluster-Guided Label Generation in Extreme Multi-Label Classification
    Jung, Taehee
    Kim, Joo-Kyung
    Lee, Sungjin
    Kang, Dongyeop
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1670 - 1685
  • [9] Cluster-Guided Image Matching Analysis of Multiscale Lung Response to Bronchial Thermoplasty
    Choi, J.
    Choi, S.
    Li, F.
    Hoffman, E. A.
    Castro, M.
    Hall, C.
    Goss, C.
    O'Shaughnessy, P.
    McEleney, S.
    Sieren, J.
    Lin, C.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2019, 199
  • [10] Alignment-guided Temporal Attention for Video Action Recognition
    Zhao, Yizhou
    Li, Zhenyang
    Guo, Xun
    Lu, Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,