An Aesthetic-Driven Approach to Unsupervised Video Summarization

被引:1
|
作者
Huang, Hongben [1 ]
Wu, Zaiqun [2 ]
Pang, Guangyao [3 ]
Xie, Jiehang [3 ]
机构
[1] Wuzhou Univ, Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China
[2] Baise Univ, Baise 533000, Peoples R China
[3] Guangxi Coll & Univ Key Lab Intelligent Ind Softwa, Wuzhou 543002, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Video summarization; feature extraction; multimodal information; ATTENTION; NETWORK;
D O I
10.1109/ACCESS.2024.3434508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of video summarization is to condense lengthy videos into shorter versions, making them more accessible for viewing. Typically, people can identify important shots within a video by using audiovisual cues and assessing the aesthetic attributes of the frames. However, existing methods either focus only on unimodal features or neglect the aesthetic attributes of videos, resulting in the limited quality of the generated summaries. Particularly, the reliance on annotated data for training models also imposes limitations, as it not only demands significant time and resources but may not capture the diverse and subjective nature across different videos. To tackle these issues, we propose an aesthetic-driven approach to unsupervised video summarization, namely ADUVS. Specifically, ADUVS incorporates an aesthetics encoder to extract key aesthetic attributes. Additionally, we design a multimodal fusion module that assesses how different modalities of information complement each other and highlights the most relevant segments for the desired summary. Moreover, the training process for ADUVS does not require reliance on annotated data, thus reducing both time and labor costs. Extensive experiments demonstrate that our proposed method is better than various benchmark methods across commonly used evaluation metrics.
引用
收藏
页码:128768 / 128777
页数:10
相关论文
共 50 条
  • [21] Unsupervised Reinforcement Learning For Video Summarization Reward Function
    Wang, Lei
    Zhu, Yaping
    Pan, Hong
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 40 - 44
  • [22] ADVERSARIAL UNSUPERVISED VIDEO SUMMARIZATION AUGMENTED WITH DICTIONARY LOSS
    Kaseris, Michail
    Mademlis, Ioannis
    Pitas, Ioannis
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2683 - 2687
  • [23] Unsupervised Video Orchestration Based on Aesthetic Features
    Neri, Alessandro
    Battisti, Federica
    Colangelo, Federico
    Carli, Marco
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 428 - 431
  • [24] An unsupervised constrained optimization approach to compressive summarization
    Vanetik, Natalia
    Litvak, Marina
    Churkin, Elena
    Last, Mark
    INFORMATION SCIENCES, 2020, 509 : 22 - 35
  • [25] Unsupervised Video Summarization via Multi-source Features
    Kanafani, Hussain
    Ghauri, Junaid Ahmed
    Hakimov, Sherzod
    Ewerth, Ralph
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 466 - 470
  • [26] Recurrent generative adversarial networks for unsupervised WCE video summarization
    Lan, Libin
    Ye, Chunxiao
    KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [27] RL Based Unsupervised Video Summarization Framework for Ultrasound Imaging
    Mathews, Roshan P.
    Panicker, Mahesh Raveendranatha
    Hareendranathan, Abhilash R.
    Chen, Yale Tung
    Jaremko, Jacob L.
    Buchanan, Brian
    Narayan, Kiran Vishnu
    Chandrasekharan, Kesavadas
    Mathews, Greeta
    SIMPLIFYING MEDICAL ULTRASOUND, ASMUS 2022, 2022, 13565 : 23 - 33
  • [28] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
    He, Xufeng
    Hua, Yang
    Song, Tao
    Zhang, Zongpu
    Xue, Zhengui
    Ma, Ruhui
    Robertson, Neil
    Guan, Haibing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
  • [29] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    SENSORS, 2023, 23 (07)
  • [30] Unsupervised Video Summarization Based on the Diffusion Model of Feature Fusion
    Yu, Qinghao
    Yu, Hui
    Sun, Ying
    Ding, Derui
    Jian, Muwei
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (05): : 6010 - 6021