Effective Video Summarization by Extracting Parameter-Free Motion Attention

被引:0
|
作者
Han, Tingting [1 ]
Zhou, Quan [2 ]
Yu, Jun [1 ]
Yu, Zhou [2 ]
Zhang, Jianhui [2 ]
Zhao, Sicheng [3 ]
机构
[1] Hangzhou Dianzi Univ, Coll Comp Sci, Hangzhou, Peoples R China
[2] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[3] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Video summarization; parameter-free; motion attention; feature fusion; multi-head attention;
D O I
10.1145/3654670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization remains a challenging task despite increasing research efforts. Traditional methods focus solely on long-range temporalmodeling of video frames, overlooking important local motion information that cannot be captured by frame-level video representations. In this article, we propose the Parameter-free Motion AttentionModule (PMAM) to exploit the crucial motion clues potentially contained in adjacent video frames, using a multi-head attention architecture. The PMAM requires no additional training for model parameters, leading to an efficient and effective understanding of video dynamics. Moreover, we introduce the Multi-feature Motion Attention Network (MMAN), integrating the PMAM with local and global multi-head attention based on object-centric and scene-centric video representations. The synergistic combination of local motion information, extracted by the proposed PMAM, with long-range interactions modeled by the local and global multi-head attention mechanism, can significantly enhance the performance of video summarization. Extensive experimental results on the benchmark datasets, SumMe and TVSum, demonstrate that the proposed MMAN outperforms other state-of-the-art methods, resulting in remarkable performance gains.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Parameter-Free Attention in fMRI Decoding
    Qi, Yong
    Lin, Huawei
    Li, Yanping
    Chen, Jiashu
    IEEE ACCESS, 2021, 9 (09): : 48704 - 48712
  • [2] A Parameter-Free Approach for Lossless Streaming Graph Summarization
    Ma, Ziyi
    Yang, Jianye
    Li, Kenli
    Liu, Yuling
    Zhou, Xu
    Hu, Yikun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 385 - 393
  • [3] Probabilistic parameter-free motion detection
    Veit, T
    Cao, F
    Bouthemy, P
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, : 715 - 721
  • [4] A parameter-free approach to lossless summarization of fully dynamic graphs
    Ma, Ziyi
    Liu, Yuling
    Yang, Zhibang
    Yang, Jianye
    Li, Kenli
    INFORMATION SCIENCES, 2022, 589 : 376 - 394
  • [5] Deep parameter-free attention hashing for image retrieval
    Wenjing Yang
    Liejun Wang
    Shuli Cheng
    Scientific Reports, 12
  • [6] Deep parameter-free attention hashing for image retrieval
    Yang, Wenjing
    Wang, Liejun
    Cheng, Shuli
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [7] SIAM: A parameter-free, Spatial Intersection Attention Module
    Han, Gaoge
    Huang, Shaoli
    Zhao, Fang
    Tang, Jinglei
    PATTERN RECOGNITION, 2024, 153
  • [8] Effective Video Summarization Approach Based on Visual Attention
    Ahmad, Hilal
    Khan, Habib Ullah
    Ali, Sikandar
    Rahman, Syed Ijaz Ur
    Wahid, Fazli
    Khattak, Hizbullah
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1427 - 1442
  • [9] Parameter-free modelling of dislocation motion: the case of silicon
    Bulatov, VV
    Justo, JF
    Cai, W
    Yip, S
    Argon, AS
    Lenosky, T
    de Koning, M
    de la Rubia, TD
    PHILOSOPHICAL MAGAZINE A-PHYSICS OF CONDENSED MATTER STRUCTURE DEFECTS AND MECHANICAL PROPERTIES, 2001, 81 (05): : 1257 - 1281
  • [10] Extracting representative motion flows for effective video retrieval
    Zhao, Zhe
    Cui, Bin
    Cong, Gao
    Huang, Zi
    Shen, Heng Tao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (03) : 687 - 711