An Aesthetic-Driven Approach to Unsupervised Video Summarization

Cited by: 1
Authors
Huang, Hongben [1]
Wu, Zaiqun [2]
Pang, Guangyao [3]
Xie, Jiehang [3]
Affiliations
[1] Wuzhou Univ, Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China
[2] Baise Univ, Baise 533000, Peoples R China
[3] Guangxi Coll & Univ Key Lab Intelligent Ind Softwa, Wuzhou 543002, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
National Natural Science Foundation of China
Keywords
Video summarization; feature extraction; multimodal information; ATTENTION; NETWORK;
DOI
10.1109/ACCESS.2024.3434508
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aim of video summarization is to condense lengthy videos into shorter versions, making them more accessible for viewing. Typically, people identify important shots within a video by using audiovisual cues and assessing the aesthetic attributes of the frames. However, existing methods either focus only on unimodal features or neglect the aesthetic attributes of videos, which limits the quality of the generated summaries. In addition, reliance on annotated data for training imposes further limitations: annotation demands significant time and resources, and it may fail to capture the diverse and subjective nature of different videos. To tackle these issues, we propose an aesthetic-driven approach to unsupervised video summarization, namely ADUVS. Specifically, ADUVS incorporates an aesthetics encoder to extract key aesthetic attributes. Additionally, we design a multimodal fusion module that assesses how different modalities of information complement each other and highlights the most relevant segments for the desired summary. Moreover, training ADUVS does not require annotated data, which reduces both time and labor costs. Extensive experiments demonstrate that the proposed method outperforms various benchmark methods across commonly used evaluation metrics.
Pages: 128768-128777
Page count: 10