An iteratively reweighting algorithm for dynamic video summarization

被引:0
|
作者
Pei Dong
Yong Xia
Shanshan Wang
Li Zhuo
David Dagan Feng
机构
[1] The University of Sydney,Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies
[2] Beijing University of Technology,Signal and Information Processing Laboratory
[3] Northwestern Polytechnical University,Shaanxi Key Lab of Speech & Image Information Processing, School of Computer Science
[4] Chinese Academy of Sciences,Shenzhen Institutes of Advanced Technology
[5] Shanghai Jiao Tong University,School of Biomedical Engineering
来源
关键词
Video summarization; Semantic indicator of video segment (SEDOG); Iterative weight estimation; Multimodal features; Saliency ranking;
D O I
暂无
中图分类号
学科分类号
摘要
Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.
引用
收藏
页码:9449 / 9473
页数:24
相关论文
共 50 条
  • [41] Rate-distortion optimal video summarization: A dynamic programming solution
    Li, Z
    Schuster, GM
    Katsaggelos, AK
    Gandhi, B
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 457 - 460
  • [42] Dynamic video summarization using two-level redundancy detection
    Gao, Yue
    Wang, Wei-Bo
    Yong, Jun-Hai
    Gu, He-Jin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 42 (02) : 233 - 250
  • [43] Dynamic video summarization using two-level redundancy detection
    Yue Gao
    Wei-Bo Wang
    Jun-Hai Yong
    He-Jin Gu
    Multimedia Tools and Applications, 2009, 42 : 233 - 250
  • [44] Echocardiogram video summarization
    Ebadollahi, S
    Chang, SF
    Wu, H
    Takoma, S
    MEDICAL IMAGING 2001: ULTRASONIC IMAGING AND SIGNAL PROCESSING, 2001, 4325 : 492 - 501
  • [45] Hierarchical video summarization
    Ratakonda, K
    Sezan, MI
    Crinon, R
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1531 - 1541
  • [46] Video Summarization Overview
    Otani, Mayu
    Song, Yale
    Wang, Yang
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 13 (04): : 284 - 335
  • [47] Video retrieval and summarization
    Sebe, N
    Lew, MS
    Smeulders, AWM
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 92 (2-3) : 141 - 146
  • [48] AudioVisual Video Summarization
    Zhao, Bin
    Gong, Maoguo
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 5181 - 5188
  • [49] Video Co-summarization: Video Summarization by Visual Co-occurrence
    Chu, Wen-Sheng
    Song, Yale
    Jaimes, Alejandro
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3584 - 3592
  • [50] Dynamic graph neural network-based computational paradigm for video summarization
    R. Deepa
    T. Sree Sharmila
    R. Niruban
    Multimedia Tools and Applications, 2024, 83 : 51227 - 51250