Modality correlation-based video summarization

被引:0
|
作者
Xingrun Wang
Xiushan Nie
Xingbo Liu
Binze Wang
Yilong Yin
机构
[1] Shandong University,School of Computer Science and Technology
[2] Shandong Jianzhu University,School of Computer Science and Technology
[3] Chang’an University,College of Geology Engineering and Geomatics
[4] Shandong University,School of Software Engineering
来源
关键词
Video summarization; Modality correlation; Modality-specific information; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
Video summarization is an important technique to help us browse, store, and retrieve a rapidly increasing amount of video data, which extracts frames or shots from the original video. Text information covers important content of a video, and thus a summarization can be generated by exploring the correlation between the frame and text. In this study, we propose a video summarization method based on the modality correlation. With this method, we first learn the correlation between the text and frame in the respective space, and then fuse two correlations to obtain the importance score of each shot. Finally, video shots that have a high importance score are chosen as the video summarization. Compared to previous methods that seldom apply text to generate the video summarization, or only use the latent common information between text and frame, the proposed method fully utilizes not only the latent common but also modality-specific information for a video summarization. Experiments were conducted on the TVSum50 dataset, and the results verify the effectiveness of our proposed approach.
引用
收藏
页码:33875 / 33890
页数:15
相关论文
共 50 条
  • [1] Modality correlation-based video summarization
    Wang, Xingrun
    Nie, Xiushan
    Liu, Xingbo
    Wang, Binze
    Yin, Yilong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33875 - 33890
  • [2] Feature Maps Correlation-based Video Quality Assessment
    Bakhtiari, Amir Hossein
    Mansouri, Azadeh
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 63309 - 63328
  • [3] Multi-modality Based Affective Video Summarization for Game Players
    Farooq, Sehar Shahzad
    Aziz, Abdullah
    Mukhtar, Hammad
    Fiaz, Mustansar
    Baek, Ki Yeol
    Choi, Naram
    Yun, Sang Bin
    Kim, Kyung Joong
    Jung, Soon Ki
    [J]. FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 59 - 69
  • [4] Correlation-based Interestingness Measure for Video Semantic Concept Detection
    Lin, Lin
    Shyu, Mei-Ling
    Chen, Shu-Ching
    [J]. PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 120 - +
  • [5] Adaptive sizing of tracking window for correlation-based video tracking
    Son, JG
    Lim, CW
    Choi, I
    Kim, NC
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (06) : 1015 - 1021
  • [6] Correlation-based and content-enhanced network for video style transfer
    Lin, Honglin
    Wang, Mengmeng
    Liu, Yong
    Kou, Jiaxin
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (01) : 343 - 355
  • [7] Spatial correlation-based side information refinement for distributed video coding
    Mohamed Haj Taieb
    Jean-Yves Chouinard
    Demin Wang
    [J]. EURASIP Journal on Advances in Signal Processing, 2013
  • [8] Correlation-based and content-enhanced network for video style transfer
    Honglin Lin
    Mengmeng Wang
    Yong Liu
    Jiaxin Kou
    [J]. Pattern Analysis and Applications, 2023, 26 : 343 - 355
  • [9] Spatial correlation-based side information refinement for distributed video coding
    Taieb, Mohamed Haj
    Chouinard, Jean-Yves
    Wang, Demin
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [10] Correlation-based Video Semantic Concept Detection using Multiple Correspondence Analysis
    Lin, Lin
    Ravitz, Guy
    Shyu, Mei-Ling
    Chen, Shu-Ching
    [J]. ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 316 - +