Multimodal Summarization of User-Generated Videos

被引:9
|
作者
Psallidas, Theodoros [1 ]
Koromilas, Panagiotis [1 ]
Giannakopoulos, Theodoros [1 ]
Spyrou, Evaggelos [1 ,2 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
[2] Univ Thessaly, Dept Comp Sci & Telecommun, Lamia 35100, Greece
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 11期
关键词
video summarization; audiovisual features; benchmark dataset; machine learning;
D O I
10.3390/app11115260
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The exponential growth of user-generated content has increased the need for efficient video summarization schemes. However, most approaches underestimate the power of aural features, while they are designed to work mainly on commercial/professional videos. In this work, we present an approach that uses both aural and visual features in order to create video summaries from user-generated videos. Our approach produces dynamic video summaries, that is, comprising the most "important" parts of the original video, which are arranged so as to preserve their temporal order. We use supervised knowledge from both the aforementioned modalities and train a binary classifier, which learns to recognize the important parts of videos. Moreover, we present a novel user-generated dataset which contains videos from several categories. Every 1 s part of each video from our dataset has been annotated by more than three annotators as being important or not. We evaluate our approach using several classification strategies based on audio, video and fused features. Our experimental results illustrate the potential of our approach.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Online political participation and attitudes: Analysing election user-generated videos in Singapore
    Lin, Trisha T. C.
    [J]. COMMUNICATION RESEARCH AND PRACTICE, 2015, 1 (02) : 131 - 146
  • [42] Gyro-based Camera-Motion Detection in User-Generated Videos
    Bano, Sophia
    Cavallaro, Andrea
    Parra, Xavier
    [J]. MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1303 - 1306
  • [43] Discovery and organization of multi-camera user-generated videos of the same event
    Bano, Sophia
    Cavallaro, Andrea
    [J]. INFORMATION SCIENCES, 2015, 302 : 108 - 121
  • [44] Social context summarization using user-generated content and third-party sources
    Minh-Tien Nguyen
    Duc-Vu Tran
    Le-Minh Nguyen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 144 : 51 - 64
  • [45] Beyond Text: Multimodal Credibility Assessment Approaches for Online User-Generated Content
    Choudhary, Monika
    Chouhan, Satyendra Singh
    Rathore, Santosh Singh
    [J]. ACM Transactions on Intelligent Systems and Technology, 2024, 15 (05)
  • [46] Using User-Generated YouTube Videos to Understand Unguided Interactions with Robots in Public Places
    Nielsen, Sara
    Skov, Mikael B.
    Hansen, Karl Damkjaer
    Kaszowska, Aleksandra
    [J]. ACM TRANSACTIONS ON HUMAN-ROBOT INTERACTION, 2023, 12 (01)
  • [47] Decoding international Solo women travelers' experience: A qualitative analysis of user-generated videos
    Ghadban, Socrat
    Kamar, Rim
    Haidar, Rayan
    [J]. JOURNAL OF OUTDOOR RECREATION AND TOURISM-RESEARCH PLANNING AND MANAGEMENT, 2023, 44
  • [48] Why people watch user-generated videos? A systematic review and meta-analysis
    Nguyen, Tin Trung
    Veer, Ekant
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2024, 181
  • [49] User-Generated Racism: An Analysis of Stereotypes of African Americans, Latinos, and Asians in YouTube Videos
    Guo, Lei
    Harlow, Summer
    [J]. HOWARD JOURNAL OF COMMUNICATIONS, 2014, 25 (03) : 281 - 302
  • [50] Learning to encode user-generated short videos with lower bitrate and the same perceptual quality
    Meng, Shengbin
    Li, Yang
    Liao, Yiting
    Li, Junlin
    Wang, Shiqi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 383 - 386