Efficient Transformer for Video Summarization

Cited by: 0

Authors:

Kolmakova, Tatiana [1]

Makarov, Ilya [2,3]

Affiliations:

[1] HSE Univ, Moscow, Russia

[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia

[3] NUST MISiS, AI Ctr, Moscow, Russia

Source:

ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT II | 2023 / Vol. 14135

Keywords:

Video Summarization; Deep Learning; Transformers; CREATION;

DOI:

10.1007/978-3-031-43078-7_5

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The amount of user-generated content is increasing daily. This is especially true for video content, which became popular through social media platforms such as TikTok. Other internet platforms are keeping up and making video sharing easier. Automatic tools that extract the core information from content while reducing its volume are therefore essential, and video summarization aims to do this. In this work, we propose a transformer-based approach to supervised video summarization. Previous applications of attention architectures either used lighter versions or loaded the models with RNN modules, which slow down computation. Our proposed framework takes full advantage of transformers. Extensive evaluation on two benchmark datasets shows that the introduced model outperforms existing approaches on the SumMe dataset by 3% and achieves comparable results on the TVSum dataset.

Pages: 52 - 65

Page count: 14
