Efficient Transformer for Video Summarization

被引:0
|
作者
Kolmakova, Tatiana [1 ]
Makarov, Ilya [2 ,3 ]
机构
[1] HSE Univ, Moscow, Russia
[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia
[3] NUST MISiS, AI Ctr, Moscow, Russia
关键词
Video Summarization; Deep Learning; Transformers; CREATION;
D O I
10.1007/978-3-031-43078-7_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The amount of user-generated content is increasing daily. That is especially true for video content that became popular with social media like TikTok. Other internet sources keep up and easier the way for video sharing. That is why automatic tools for finding core information of content but decreasing its volume are essential. Video summarization is aimed to help with it. In this work, we propose a transformer-based approach to supervised video summarization. Previous applications of attention architectures either used lighter versions or loaded models with RNN modules, that slower computations. Our proposed framework uses all advantages of transformers. Extensive evaluation on two benchmark datasets showed that the introduced model outperform existed approaches on the SumMe dataset by 3% and shows comparable results on the TVSum dataset.
引用
收藏
页码:52 / 65
页数:14
相关论文
共 50 条
  • [41] Energy efficient video summarization and transmission over a slow fading wireless channel
    Li, Z
    Zhai, F
    Katsaggelos, AK
    Pappas, TN
    Image and Video Communications and Processing 2005, Pts 1 and 2, 2005, 5685 : 940 - 948
  • [42] KEY FRAMES EXTRACTION USING GRAPH MODULARITY CLUSTERING FOR EFFICIENT VIDEO SUMMARIZATION
    Gharbi, Hana
    Bahroun, Sahbi
    Massaoudi, Mohamed
    Zagrouba, Ezzeddine
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1502 - 1506
  • [43] Clustering of compressed illumination-invariant chromaticity signatures for efficient video summarization
    Drew, MS
    Au, J
    IMAGE AND VISION COMPUTING, 2003, 21 (08) : 705 - 716
  • [44] ESKVS: efficient and secure approach for keyframes-based video summarization framework
    Saini, Parul
    Berwal, Krishan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74563 - 74591
  • [45] EFFICIENT TRANSFORMER WITH LOCALLY SHARED ATTENTION FOR VIDEO QUALITY ASSESSMENT
    You, Junyong
    Lin, Yuan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 356 - 360
  • [46] Temporally Efficient Gabor Transformer for Unsupervised Video Object Segmentation
    Fan, Jiaqing
    Su, Tiankang
    Zhang, Kaihua
    Liu, Bo
    Liu, Qingshan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3394 - 3402
  • [47] From video summarization to real time video summarization in smart cities and beyond: A survey
    Shambharkar, Prashant Giridhar
    Goel, Ruchi
    FRONTIERS IN BIG DATA, 2023, 5
  • [48] Unsupervised video summarization using deep Non-Local video summarization networks
    Zang, Sha-Sha
    Yu, Hui
    Song, Yan
    Zeng, Ru
    NEUROCOMPUTING, 2023, 519 : 26 - 35
  • [49] EDGE-MOTION VIDEO SUMMARIZATION: ECONOMICAL VIDEO SUMMARIZATION FOR LOW POWERED DEVICES
    Anagnastopoulos, Vasileios
    Doulamis, Nikolaos
    Doulamis, Anastasios
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 284 - +
  • [50] Video summarization for large sports video archives
    Takahashi, Y
    Nitta, N
    Babaguchi, N
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1171 - 1174