Efficient Transformer for Video Summarization

Cited by: 0
Authors
Kolmakova, Tatiana [1]
Makarov, Ilya [2, 3]
Affiliations
[1] HSE Univ, Moscow, Russia
[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia
[3] NUST MISiS, AI Ctr, Moscow, Russia
Keywords
Video Summarization; Deep Learning; Transformers; CREATION;
DOI
10.1007/978-3-031-43078-7_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The amount of user-generated content is increasing daily. This is especially true for video content, which became popular through social media platforms such as TikTok. Other internet sources are keeping up and making video sharing easier. That is why automatic tools that extract the core information of content while reducing its volume are essential. Video summarization aims to address this need. In this work, we propose a transformer-based approach to supervised video summarization. Previous applications of attention architectures either used lighter versions or loaded models with RNN modules, which slow down computation. Our proposed framework uses the full advantages of transformers. Extensive evaluation on two benchmark datasets showed that the introduced model outperforms existing approaches on the SumMe dataset by 3% and shows comparable results on the TVSum dataset.
Pages: 52-65
Number of pages: 14