Efficient Transformer for Video Summarization

Cited by: 0
Authors
Kolmakova, Tatiana [1]
Makarov, Ilya [2, 3]
Affiliations
[1] HSE Univ, Moscow, Russia
[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia
[3] NUST MISiS, AI Ctr, Moscow, Russia
Keywords
Video Summarization; Deep Learning; Transformers; CREATION;
DOI
10.1007/978-3-031-43078-7_5
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The amount of user-generated content is increasing daily. This is especially true for video content, which became popular with social media such as TikTok. Other internet sources are keeping up and making video sharing easier. That is why automatic tools that capture the core information of content while reducing its volume are essential. Video summarization aims to help with this. In this work, we propose a transformer-based approach to supervised video summarization. Previous applications of attention architectures either used lighter versions or loaded models with RNN modules, which slow down computation. Our proposed framework uses all the advantages of transformers. Extensive evaluation on two benchmark datasets showed that the introduced model outperforms existing approaches on the SumMe dataset by 3% and shows comparable results on the TVSum dataset.
Pages: 52-65
Page count: 14
Related papers
50 in total
  • [21] On-the-fly extraction of key frames for efficient video summarization
    Barhoumi, Walid
    Zagrouba, Ezzeddine
    2013 AASRI CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL, 2013, 4 : 78 - 84
  • [22] TSNet: Token Sparsification for Efficient Video Transformer
    Wang, Hao
    Zhang, Wenjia
    Liu, Guohua
    Rodrigues, Joao M. F.
    APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [23] Data Efficient Video Transformer for Violence Detection
    Abdali, Almamon Rasool
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT 2021), 2021, : 195 - 199
  • [24] Joint Video Summarization and Transmission Adaptation for Energy-Efficient Wireless Video Streaming
    Zhu Li
    Fan Zhai
    Aggelos K. Katsaggelos
    EURASIP Journal on Advances in Signal Processing, 2008
  • [25] Insight video: Toward hierarchical video content organization for efficient browsing, summarization and retrieval
    Zhu, XQ
    Elmagarmid, AK
    Xue, XY
    Wu, LD
    Catlin, AC
    IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (04) : 648 - 666
  • [26] Joint video summarization and transmission adaptation for energy-efficient wireless video streaming
    Li, Zhu
    Zhai, Fan
    Katsaggelos, Aggelos K.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
  • [27] Perceptual Video Summarization-A New Framework for Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (08) : 1790 - 1802
  • [28] Key Frames Extraction Based on Local Features for Efficient Video Summarization
    Gharbi, Hana
    Massaoudi, Mohamed
    Bahroun, Sahbi
    Zagrouba, Ezzeddine
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2016, 2016, 10016 : 275 - 285
  • [29] Efficient Video Summarization Based on Motion SIFT-Distribution Histogram
    Hannane, Rachida
    Elboushaki, Abdessamad
    Afdel, Karim
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 312 - 317
  • [30] Efficient filtering and clustering methods for temporal video segmentation and visual summarization
    Ferman, AM
    Tekalp, AM
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1998, 9 (04) : 336 - 351