Topic-aware video summarization using multimodal transformer

被引:9
|
作者
Zhu, Yubo [1 ]
Zhao, Wentian [1 ]
Hua, Rui [1 ]
Wu, Xinxiao [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing Key Lab Intelligent Informat Technol, Beijing, Peoples R China
[2] Shenzhen MSU BIT Univ, Guangdong Lab Machine Percept & Intelligent Comp, Shenzhen, Peoples R China
关键词
Topic-aware video summarization; Multimodal transformer; Video summarization dataset; NETWORKS;
D O I
10.1016/j.patcog.2023.109578
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video summarization aims to generate a short and compact summary to represent the original video. Existing methods mainly focus on how to extract a general objective synopsis that precisely summaries the video content. However, in real scenarios, a video usually contains rich content with multiple top-ics and people may cast diverse interests on the visual contents even for the same video. In this pa -per, we propose a novel topic-aware video summarization task that generates multiple video summaries with different topics. To support the study of this new task, we first build a video benchmark dataset by collecting videos from various types of movies and annotate them with topic labels and frame-level importance scores. Then we propose a multimodal Transformer model for the topic-aware video summa-rization, which simultaneously predicts topic labels and generates topic-related summaries by adaptively fusing multimodal features extracted from the video. Experimental results show the effectiveness of our method. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A novel temporal and topic-aware recommender model
    Dandan Song
    Zhifan Li
    Mingming Jiang
    Lifei Qin
    Lejian Liao
    World Wide Web, 2019, 22 : 2105 - 2127
  • [42] Topic-Aware Sentiment Prediction for Chinese ConceptNet
    Chou, Po-Hao
    Tsai, Richard Tzong-Han
    Hsu, Jane Yung-jen
    2015 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2015, : 419 - 426
  • [43] Topic-aware Web Service Representation Learning
    Shi, Min
    Tang, Yufei
    Zhu, Xingquan
    Liu, Jianxun
    ACM TRANSACTIONS ON THE WEB, 2020, 14 (02)
  • [44] Topic-Aware Information Coverage Maximization in Social Networks
    Li, Zhihang
    Du, Hongwei
    Li, Xiang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1722 - 1732
  • [45] Neural News Recommendation with Topic-Aware News Representation
    Wu, Chuhan
    Wu, Fangzhao
    An, Mingxiao
    Huang, Yongfeng
    Xie, Xing
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1154 - 1159
  • [46] Topic-Aware Dialogue Speech Recognition with Transfer Learning
    Song, Yuanfeng
    Jiang, Di
    Wu, Xueyang
    Xu, Qian
    Wong, Raymond Chi-Wing
    Yang, Qiang
    INTERSPEECH 2019, 2019, : 829 - 833
  • [47] TERG: Topic-Aware Emotional Response Generation for Chatbot
    Huo, Pei
    Yang, Yan
    Zhou, Jie
    Chen, Chengcai
    He, Liang
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [48] Explainable and Discourse Topic-aware Neural Language Understanding
    Chaudhary, Yatin
    Schutze, Hinrich
    Gupta, Pankaj
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [49] Topic-Aware Multi-turn Dialogue Modeling
    Xu, Yi
    Zhao, Hai
    Zhang, Zhuosheng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14176 - 14184
  • [50] Topic-aware latent models for representation learning on networks
    Celikkanat, Abdulkadir
    Malliaros, Fragkiskos D.
    PATTERN RECOGNITION LETTERS, 2021, 144 : 89 - 96