Multimodal Video Summarization based on Fuzzy Similarity Features

被引:3
|
作者
Psallidas, Theodoros [1 ,3 ]
Vasilakakis, Michael D. [2 ]
Spyrou, Evaggelos [1 ,3 ]
Iakovidis, Dimitris K. [2 ]
机构
[1] Univ Thessaly, Dept Comp Sci & Telecommun, Lamia, Greece
[2] Univ Thessaly, Dept Comp Sci & Biomed Informat, Lamia, Greece
[3] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens, Greece
关键词
D O I
10.1109/IVMSP54334.2022.9816266
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The continuously growing number of user-generated videos has increased the need for efficient browsing through content collections and repositories, which in turn requires descriptive, yet compact representations. To this goal, a popular approach is to create a visual summary, which is by far more expressive compared to other approaches, e.g., textual descriptions. In this work, we present a video summarization approach that is based on the extraction and fusion of audio and visual features, in order to produce dynamic video summaries, i.e., comprising of the most important video segments of the original video, while preserving their temporal order. Based on the extracted features, each segment is classified as "interesting," or "uninteresting," thus included in the final summary, or not. The novelty of our approach is that prior to classification, the fused features are fuzzified, thus becoming more intuitive and robust to uncertainty. We evaluate our approach using a large dataset of user-generated videos and demonstrate that fuzzy features are able to boost classification performance, providing for more concrete video summaries.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [41] Gated Recurrent Units and Recurrent Neural Network Based Multimodal Approach for Automatic Video Summarization
    Kaur, Lakhwinder
    Aljrees, Turki
    Kumar, Ankit
    Pandey, Saroj Kumar
    Singh, Kamred Udham
    Mishra, Pankaj Kumar
    Singh, Teekam
    TRAITEMENT DU SIGNAL, 2023, 40 (03) : 1227 - 1234
  • [42] Clip-based similarity measure for query-dependent clip retrieval and video summarization
    Peng, Yuxin
    Ngo, Chong-Wah
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (05) : 612 - 627
  • [43] Graph-based Multimodal Ranking Models for Multimodal Summarization
    Zhu, Junnan
    Xiang, Lu
    Zhou, Yu
    Zhang, Jiajun
    Zong, Chengqing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [44] Unsupervised learning of visual and semantic features for video summarization
    Huang, Yansen
    Zhong, Rui
    Yao, Wenjin
    Wang, Rui
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [45] Fuzzy Rough Set-Based Sentence Similarity Measure and its Application to Text Summarization
    Chatterjee, Niladri
    Yadav, Nidhika
    IETE TECHNICAL REVIEW, 2019, 36 (05) : 517 - 525
  • [46] Progressive Video Summarization via Multimodal Self-supervised Learning
    Li, Haopeng
    Ke, Qiuhong
    Gong, Mingming
    Drummond, Tom
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5573 - 5582
  • [47] Multimodal emotional analysis through hierarchical video summarization and face tracking
    Michael Moses Thiruthuvanathan
    Balachandran Krishnan
    Multimedia Tools and Applications, 2022, 81 : 35535 - 35554
  • [48] Multimodal emotional analysis through hierarchical video summarization and face tracking
    Thiruthuvanathan, Michael Moses
    Krishnan, Balachandran
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 35535 - 35554
  • [49] Video summarization via knowledge-aware multimodal deep networks
    Xie, Jiehang
    Chen, Xuanbai
    Zhao, Sicheng
    Lu, Shao-Ping
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [50] Hierarchical video summarization based on video structure and highlight
    Geng, Yuliang
    Xu, De
    Feng, Songhe
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2006, 4109 : 226 - 234