Query-Biased Self-Attentive Network for Query-Focused Video Summarization

被引:34
|
作者
Xiao, Shuwen [1 ]
Zhao, Zhou [1 ,2 ]
Zhang, Zijian [1 ]
Guan, Ziyu [3 ]
Cai, Deng [4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
[2] Alibaba Zhejiang Univ Joint Res Inst Frontier Tec, Hangzhou 310058, Peoples R China
[3] Northwest Univ, Sch Informat & Technol, Xian 710127, Peoples R China
[4] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
基金
浙江省自然科学基金;
关键词
Task analysis; Semantics; Visualization; Computational modeling; Generators; Benchmark testing; Instruments; Video summarization; vision and language; self-attention mechanism; EGOCENTRIC VIDEO;
D O I
10.1109/TIP.2020.2985868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the task of query-focused video summarization, which takes user queries and long videos as inputs and generates query-focused video summaries. Compared to video summarization, which mainly concentrates on finding the most diverse and representative visual contents as a summary, the task of query-focused video summarization considers the user's intent and the semantic meaning of generated summary. In this paper, we propose a method, named query-biased self-attentive network (QSAN) to tackle this challenge. Our key idea is to utilize the semantic information from video descriptions to generate a generic summary and then to combine the information from the query to generate a query-focused summary. Specifically, we first propose a hierarchical self-attentive network to model the relative relationship at three levels, which are different frames from a segment, different segments of the same video, textual information of video description and its related visual contents. We train the model on video caption dataset and employ a reinforced caption generator to generate a video description, which can help us locate important frames or shots. Then we build a query-aware scoring module to compute the query-relevant score for each shot and generate the query-focused summary. Extensive experiments on the benchmark dataset demonstrate the competitive performance of our approach compared to some methods.
引用
收藏
页码:5889 / 5899
页数:11
相关论文
共 50 条
  • [41] Exploring actor–object relationships for query-focused multi-document summarization
    Mohammadreza Valizadeh
    Pavel Brazdil
    Soft Computing, 2015, 19 : 3109 - 3121
  • [42] Generation of query-biased concepts using content and structure for query reformulation
    Chang, Youjin
    Wang, Jun
    Lalmas, Mounia
    NATURAL LANGUAGE AND INFORMATION SYSTEMS, PROCEEDINGS, 2008, 5039 : 136 - 141
  • [43] Constructing Query-Focused Summarization Dataset AMTQFSum Based on ChatGPT and Prompt Engineering
    Jinling, Shang
    Jianyong, Zhang
    Data Analysis and Knowledge Discovery, 2024, 8 (8-9) : 122 - 132
  • [44] Lexical Similarity Based Query-Focused Summarization Using Artificial Immune Systems
    Katiyar, Sulabh
    Borgohain, Samir
    ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 : 287 - 296
  • [45] Query-focused multi-document text summarization using fuzzy inference
    Agarwal, Raksha
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4641 - 4652
  • [46] SDbQfSum: Query-focused summarization framework based on diversity and text semantic analysis
    Mohamed, Muhidin
    Oussalah, Mourad
    Chang, Victor
    EXPERT SYSTEMS, 2024, 41 (01)
  • [47] A Novel Contextual Topic Model for Query-focused Multi-document Summarization
    Yang, Guangbing
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 576 - 583
  • [48] Review on Query-focused Multi-document Summarization (QMDS) with Comparative Analysis
    Roy, Prasenjeet
    Kundu, Suman
    ACM COMPUTING SURVEYS, 2024, 56 (01)
  • [49] Exploiting relevance, coverage, and novelty for query-focused multi-document summarization
    Luo, Wenjuan
    Zhuang, Fuzhen
    He, Qing
    Shi, Zhongzhi
    KNOWLEDGE-BASED SYSTEMS, 2013, 46 : 33 - 42
  • [50] Exploring actor-object relationships for query-focused multi-document summarization
    Valizadeh, Mohammadreza
    Brazdil, Pavel
    SOFT COMPUTING, 2015, 19 (11) : 3109 - 3121