A Knowledge Augmented and Multimodal-Based Framework for Video Summarization

被引:7
|
作者
Xie, Jiehang [1 ]
Chen, Xuanbai [2 ]
Lu, Shao-Ping [1 ]
Yang, Yulu [1 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
Video Summarization; Multimodal Information;
D O I
10.1145/3503161.3548089
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Video summarization aims to generate a compact version of a lengthy video that retains its primary content. In general, humans are gifted with producing a high-quality video summary, because they acquire crucial content through multiple dimensional information and own abundant background knowledge about the original video. However, existing methods rarely consider multichannel information and ignore the impact of external knowledge, resulting in the limited quality of the generated summaries. This paper proposes a knowledge augmented and multimodal-based video summarization method, termed KAMV, to address the problem above. Specifically, we design a knowledge encoder with a hybrid method consisting of generation and retrieval, to capture descriptive content and latent connections between events and entities based on the external knowledge base, which can provide rich implicit knowledge for better comprehending the video viewed. Furthermore, for the sake of exploring the interactions among visual, audio, implicit knowledge and emphasizing the content that is most relevant to the desired summary, we present a fusion module under the supervision of these multimodal information. By conducting extensive experiments on four public datasets, the results demonstrate the superior performance yielded by the proposed KAMV compared to the state-of-the-art video summarization approaches.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A Videography Analysis Framework for Video Retrieval and Summarization
    Li, Kang
    Oh, Sangmin
    Perera, A. G. Amitha
    Fu, Yun
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [32] A Framework towards Domain Specific Video Summarization
    Kaushal, Vishal
    Subramanian, Sandeep
    Kothawade, Suraj
    Iyer, Rishabh
    Ramakrishnan, Ganesh
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 666 - 675
  • [33] SafeCampus: Multimodal-Based Campus-Wide Pandemic Forecasting
    Lu, Sidi
    Wu, Baofu
    Cong, Xiaoda
    Yao, Yongtao
    Shi, Weisong
    [J]. IEEE INTERNET COMPUTING, 2022, 26 (01) : 60 - 67
  • [34] Multimodal Video Summarization via Time-Aware Transformers
    Shang, Xindi
    Yuan, Zehuan
    Wang, Anran
    Wang, Changhu
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1756 - 1765
  • [35] Topic-aware video summarization using multimodal transformer
    Zhu, Yubo
    Zhao, Wentian
    Hua, Rui
    Wu, Xinxiao
    [J]. PATTERN RECOGNITION, 2023, 140
  • [36] Action based Video Summarization
    Raksha, H.
    Namitha, G.
    Sejal, N.
    [J]. PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 457 - 462
  • [37] An Effective Video Summarization Framework Based on the Object of Interest Using Deep Learning
    Ul Haq, Hafiz Burhan
    Asif, Muhammad
    Ahmad, Maaz Bin
    Ashraf, Rehan
    Mahmood, Toqeer
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [38] A multiple visual models based perceptive analysis framework for multilevel video summarization
    You, Junyong
    Liu, Guizhong
    Sun, Li
    Li, Hongliang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (03) : 273 - 285
  • [39] Interactive User Oriented Visual Attention Based Video Summarization and Exploration Framework
    Qian, Yiming
    Kyan, Matthew
    [J]. 2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,
  • [40] ESKVS: efficient and secure approach for keyframes-based video summarization framework
    Saini, Parul
    Berwal, Krishan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74563 - 74591