Multimodal Summarization of User-Generated Videos

被引:9
|
作者
Psallidas, Theodoros [1 ]
Koromilas, Panagiotis [1 ]
Giannakopoulos, Theodoros [1 ]
Spyrou, Evaggelos [1 ,2 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
[2] Univ Thessaly, Dept Comp Sci & Telecommun, Lamia 35100, Greece
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 11期
关键词
video summarization; audiovisual features; benchmark dataset; machine learning;
D O I
10.3390/app11115260
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The exponential growth of user-generated content has increased the need for efficient video summarization schemes. However, most approaches underestimate the power of aural features, while they are designed to work mainly on commercial/professional videos. In this work, we present an approach that uses both aural and visual features in order to create video summaries from user-generated videos. Our approach produces dynamic video summaries, that is, comprising the most "important" parts of the original video, which are arranged so as to preserve their temporal order. We use supervised knowledge from both the aforementioned modalities and train a binary classifier, which learns to recognize the important parts of videos. Moreover, we present a novel user-generated dataset which contains videos from several categories. Every 1 s part of each video from our dataset has been annotated by more than three annotators as being important or not. We evaluate our approach using several classification strategies based on audio, video and fused features. Our experimental results illustrate the potential of our approach.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Efficient Summarization From Multiple Georeferenced User-Generated Videos
    Zhang, Ying
    Zimmermann, Roger
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (03) : 418 - 431
  • [2] Multimodal Semantics Extraction from User-Generated Videos
    Cricri, Francesco
    Dabov, Kostadin
    Roininen, Mikko J.
    Mate, Sujeet
    Curcio, Igor D. D.
    Gabbouj, Moncef
    [J]. ADVANCES IN MULTIMEDIA, 2012, 2012
  • [3] Summarization of User-Generated Videos Fusing Handcrafted and Deep Audiovisual Features
    Psallidas, Theodoros
    Spyrou, Evaggelos
    Perantonis, Stavros J.
    [J]. 2022 17TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION & PERSONALIZATION (SMAP 2022), 2022, : 17 - 22
  • [4] Real-Time Summarization of User-Generated Videos Based on Semantic Recognition
    Wang, Xi
    Jiang, Yu-Gang
    Chai, Zhenhua
    Gu, Zichen
    Du, Xinyu
    Wang, Dong
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 849 - 852
  • [5] Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues
    Xu, Baohan
    Wang, Xi
    Jiang, Yu-Gang
    [J]. IEEE MULTIMEDIA, 2016, 23 (03) : 23 - 33
  • [6] ViComp: composition of user-generated videos
    Bano, Sophia
    Cavallaro, Andrea
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) : 7187 - 7210
  • [7] Predicting Emotions in User-Generated Videos
    Jiang, Yu-Gang
    Xu, Baohan
    Xue, Xiangyang
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 73 - 79
  • [8] ViComp: composition of user-generated videos
    Sophia Bano
    Andrea Cavallaro
    [J]. Multimedia Tools and Applications, 2016, 75 : 7187 - 7210
  • [9] Annotating Objects and Relations in User-Generated Videos
    Shang, Xindi
    Di, Donglin
    Xiao, Junbin
    Cao, Yu
    Yang, Xun
    Chua, Tat-Seng
    [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 279 - 287
  • [10] A corpus of debunked and verified user-generated videos
    Papadopoulou, Olga
    Zampoglou, Markos
    Papadopoulos, Symeon
    Kompatsiaris, Ioannis
    [J]. ONLINE INFORMATION REVIEW, 2019, 43 (01) : 72 - 88