An Aesthetic-Driven Approach to Unsupervised Video Summarization

被引：1

作者：

Huang, Hongben ^{[1
]}

Wu, Zaiqun ^{[2
]}

Pang, Guangyao ^{[3
]}

Xie, Jiehang ^{[3
]}

机构：

[1] Wuzhou Univ, Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China

[2] Baise Univ, Baise 533000, Peoples R China

[3] Guangxi Coll & Univ Key Lab Intelligent Ind Softwa, Wuzhou 543002, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

中国国家自然科学基金;

关键词：

Video summarization; feature extraction; multimodal information; ATTENTION; NETWORK;

D O I：

10.1109/ACCESS.2024.3434508

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The aim of video summarization is to condense lengthy videos into shorter versions, making them more accessible for viewing. Typically, people can identify important shots within a video by using audiovisual cues and assessing the aesthetic attributes of the frames. However, existing methods either focus only on unimodal features or neglect the aesthetic attributes of videos, resulting in the limited quality of the generated summaries. Particularly, the reliance on annotated data for training models also imposes limitations, as it not only demands significant time and resources but may not capture the diverse and subjective nature across different videos. To tackle these issues, we propose an aesthetic-driven approach to unsupervised video summarization, namely ADUVS. Specifically, ADUVS incorporates an aesthetics encoder to extract key aesthetic attributes. Additionally, we design a multimodal fusion module that assesses how different modalities of information complement each other and highlights the most relevant segments for the desired summary. Moreover, the training process for ADUVS does not require reliance on annotated data, thus reducing both time and labor costs. Extensive experiments demonstrate that our proposed method is better than various benchmark methods across commonly used evaluation metrics.

引用

页码：128768 / 128777

页数：10

共 50 条

[21] Unsupervised Reinforcement Learning For Video Summarization Reward Function
Wang, Lei
Zhu, Yaping
Pan, Hong
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 40 - 44
[22] ADVERSARIAL UNSUPERVISED VIDEO SUMMARIZATION AUGMENTED WITH DICTIONARY LOSS
Kaseris, Michail
Mademlis, Ioannis
Pitas, Ioannis
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2683 - 2687
[23] Unsupervised Video Orchestration Based on Aesthetic Features
Neri, Alessandro
Battisti, Federica
Colangelo, Federico
Carli, Marco
2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 428 - 431
[24] An unsupervised constrained optimization approach to compressive summarization
Vanetik, Natalia
Litvak, Marina
Churkin, Elena
Last, Mark
INFORMATION SCIENCES, 2020, 509 : 22 - 35
[25] Unsupervised Video Summarization via Multi-source Features
Kanafani, Hussain
Ghauri, Junaid Ahmed
Hakimov, Sherzod
Ewerth, Ralph
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 466 - 470
[26] Recurrent generative adversarial networks for unsupervised WCE video summarization
Lan, Libin
Ye, Chunxiao
KNOWLEDGE-BASED SYSTEMS, 2021, 222
[27] RL Based Unsupervised Video Summarization Framework for Ultrasound Imaging
Mathews, Roshan P.
Panicker, Mahesh Raveendranatha
Hareendranathan, Abhilash R.
Chen, Yale Tung
Jaremko, Jacob L.
Buchanan, Brian
Narayan, Kiran Vishnu
Chandrasekharan, Kesavadas
Mathews, Greeta
SIMPLIFYING MEDICAL ULTRASOUND, ASMUS 2022, 2022, 13565 : 23 - 33
[28] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
He, Xufeng
Hua, Yang
Song, Tao
Zhang, Zongpu
Xue, Zhengui
Ma, Ruhui
Robertson, Neil
Guan, Haibing
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
[29] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
Yoon, Ui Nyoung
Hong, Myung Duk
Jo, Geun-Sik
SENSORS, 2023, 23 (07)
[30] Unsupervised Video Summarization Based on the Diffusion Model of Feature Fusion
Yu, Qinghao
Yu, Hui
Sun, Ying
Ding, Derui
Jian, Muwei
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (05): : 6010 - 6021

← 1 2 3 4 5 →