An iteratively reweighting algorithm for dynamic video summarization

被引：0

作者：

Pei Dong

Yong Xia

Shanshan Wang

Li Zhuo

David Dagan Feng

机构：

[1] The University of Sydney,Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies

[2] Beijing University of Technology,Signal and Information Processing Laboratory

[3] Northwestern Polytechnical University,Shaanxi Key Lab of Speech & Image Information Processing, School of Computer Science

[4] Chinese Academy of Sciences,Shenzhen Institutes of Advanced Technology

[5] Shanghai Jiao Tong University,School of Biomedical Engineering

来源：

Multimedia Tools and Applications | 2015年 / 74卷

关键词：

Video summarization; Semantic indicator of video segment (SEDOG); Iterative weight estimation; Multimodal features; Saliency ranking;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.

引用

页码：9449 / 9473

页数：24

共 50 条

[41] Rate-distortion optimal video summarization: A dynamic programming solution
Li, Z
Schuster, GM
Katsaggelos, AK
Gandhi, B
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 457 - 460
[42] Dynamic video summarization using two-level redundancy detection
Gao, Yue
Wang, Wei-Bo
Yong, Jun-Hai
Gu, He-Jin
MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 42 (02) : 233 - 250
[43] Dynamic video summarization using two-level redundancy detection
Yue Gao
Wei-Bo Wang
Jun-Hai Yong
He-Jin Gu
Multimedia Tools and Applications, 2009, 42 : 233 - 250
[44] Echocardiogram video summarization
Ebadollahi, S
Chang, SF
Wu, H
Takoma, S
MEDICAL IMAGING 2001: ULTRASONIC IMAGING AND SIGNAL PROCESSING, 2001, 4325 : 492 - 501
[45] Hierarchical video summarization
Ratakonda, K
Sezan, MI
Crinon, R
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1531 - 1541
[46] Video Summarization Overview
Otani, Mayu
Song, Yale
Wang, Yang
FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 13 (04): : 284 - 335
[47] Video retrieval and summarization
Sebe, N
Lew, MS
Smeulders, AWM
COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 92 (2-3) : 141 - 146
[48] AudioVisual Video Summarization
Zhao, Bin
Gong, Maoguo
Li, Xuelong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 5181 - 5188
[49] Video Co-summarization: Video Summarization by Visual Co-occurrence
Chu, Wen-Sheng
Song, Yale
Jaimes, Alejandro
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3584 - 3592
[50] Dynamic graph neural network-based computational paradigm for video summarization
R. Deepa
T. Sree Sharmila
R. Niruban
Multimedia Tools and Applications, 2024, 83 : 51227 - 51250

← 1 2 3 4 5 →