Efficient Transformer for Video Summarization

Cited by: 0

Authors:

Kolmakova, Tatiana [1]

Makarov, Ilya [2,3]

Affiliations:

[1] HSE Univ, Moscow, Russia

[2] Artificial Intelligence Res Inst AIRI, Moscow, Russia

[3] NUST MISiS, AI Ctr, Moscow, Russia

Source:

ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT II | 2023 / Vol. 14135

Keywords:

Video Summarization; Deep Learning; Transformers; CREATION;

DOI:

10.1007/978-3-031-43078-7_5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The amount of user-generated content is increasing daily. This is especially true for video content, which became popular through social media platforms such as TikTok. Other internet sources are keeping up and making video sharing easier. That is why automatic tools that extract the core information of content while reducing its volume are essential. Video summarization aims to address this need. In this work, we propose a transformer-based approach to supervised video summarization. Previous applications of attention architectures either used lighter versions or loaded models with RNN modules, which slow down computation. Our proposed framework uses all the advantages of transformers. Extensive evaluation on two benchmark datasets showed that the introduced model outperforms existing approaches on the SumMe dataset by 3% and shows comparable results on the TVSum dataset.

Pages: 52-65

Page count: 14

50 items in total

[1] Video Summarization With Spatiotemporal Vision Transformer
Hsu, Tzu-Chun
Liao, Yi-Sheng
Huang, Chun-Rong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3013 - 3026
[2] Video summarization with u-shaped transformer
Chen, Yaosen
Guo, Bing
Shen, Yan
Zhou, Renshuang
Lu, Weichen
Wang, Wei
Wen, Xuming
Suo, Xinhua
APPLIED INTELLIGENCE, 2022, 52 (15) : 17864 - 17880
[3] Video Summarization With Frame Index Vision Transformer
Hsu, Tzu-Chun
Liao, Yi-Sheng
Huang, Chun-Rong
PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
[4] Efficient Bronchoscopic Video Summarization
Byrnes, Patrick D.
Higgins, William Evan
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2019, 66 (03) : 848 - 863
[5] Video summarization with u-shaped transformer
Yaosen Chen
Bing Guo
Yan Shen
Renshuang Zhou
Weichen Lu
Wei Wang
Xuming Wen
Xinhua Suo
Applied Intelligence, 2022, 52 : 17864 - 17880
[6] Video summarization with temporal-channel visual transformer
Tian, Xiaoyan
Jin, Ye
Zhang, Zhao
Liu, Peng
Tang, Xianglong
PATTERN RECOGNITION, 2025, 165
[7] LongSum: An Efficient Transformer for Long Document Summarization
Wei, Jitong
Gao, Yang
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 2, 2025, 14851 : 406 - 415
[8] Efficient UAV Video Event Summarization
Trinh, Hoang
Li, Jun
Miyazawa, Sachiko
Moreno, Juan
Pankanti, Sharath
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2226 - 2229
[9] Efficient summarization of stereoscopic video sequences
Doulamis, ND
Doulamis, AD
Avrithis, YS
Ntalianis, KS
Kollias, SD
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2000, 10 (04) : 501 - 517
[10] EFFICIENT VIDEO ENHANCEMENT TRANSFORMER
Vasluianu, Florin
Timofte, Radu
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4068 - 4072