Query-Biased Self-Attentive Network for Query-Focused Video Summarization

被引:34
|
作者
Xiao, Shuwen [1 ]
Zhao, Zhou [1 ,2 ]
Zhang, Zijian [1 ]
Guan, Ziyu [3 ]
Cai, Deng [4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
[2] Alibaba Zhejiang Univ Joint Res Inst Frontier Tec, Hangzhou 310058, Peoples R China
[3] Northwest Univ, Sch Informat & Technol, Xian 710127, Peoples R China
[4] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
基金
浙江省自然科学基金;
关键词
Task analysis; Semantics; Visualization; Computational modeling; Generators; Benchmark testing; Instruments; Video summarization; vision and language; self-attention mechanism; EGOCENTRIC VIDEO;
D O I
10.1109/TIP.2020.2985868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the task of query-focused video summarization, which takes user queries and long videos as inputs and generates query-focused video summaries. Compared to video summarization, which mainly concentrates on finding the most diverse and representative visual contents as a summary, the task of query-focused video summarization considers the user's intent and the semantic meaning of generated summary. In this paper, we propose a method, named query-biased self-attentive network (QSAN) to tackle this challenge. Our key idea is to utilize the semantic information from video descriptions to generate a generic summary and then to combine the information from the query to generate a query-focused summary. Specifically, we first propose a hierarchical self-attentive network to model the relative relationship at three levels, which are different frames from a segment, different segments of the same video, textual information of video description and its related visual contents. We train the model on video caption dataset and employ a reinforced caption generator to generate a video description, which can help us locate important frames or shots. Then we build a query-aware scoring module to compute the query-relevant score for each shot and generate the query-focused summary. Extensive experiments on the benchmark dataset demonstrate the competitive performance of our approach compared to some methods.
引用
收藏
页码:5889 / 5899
页数:11
相关论文
共 50 条
  • [1] Query-Focused Extractive Video Summarization
    Sharghi, Aidean
    Gong, Boqing
    Shah, Mubarak
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 3 - 19
  • [2] Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
    Xiao, Shuwen
    Zhao, Zhou
    Zhang, Zijian
    Yan, Xiaohui
    Yang, Min
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12426 - 12433
  • [3] QSG Transformer: Transformer with Query-Attentive Semantic Graph for Query-Focused Summarization
    Park, Choongwon
    Ko, Youngjoong
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2589 - 2594
  • [4] Bayesian Query-Focused Summarization
    Daume, Hal, III
    Marcu, Daniel
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 305 - 312
  • [5] Hierarchical Variational Network for User-Diversified & Query-Focused Video Summarization
    Jiang, Pin
    Han, Yahong
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 202 - 206
  • [6] Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach
    Sharghi, Aidean
    Laurel, Jacob S.
    Gong, Boqing
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2127 - 2136
  • [7] Query-biased summarization based on lexical chaining
    Manabu, O
    Hajime, M
    COMPUTATIONAL INTELLIGENCE, 2000, 16 (04) : 578 - 585
  • [8] Utility of Missing Concepts in Query-biased Summarization
    Sarwar, Sheikh Muhammad
    Moraes, Felipe
    Jiang, Jiepu
    Allan, James
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2056 - 2060
  • [9] Query-biased summarization considering difference of paragraphs
    Otani C.
    Hoo M.K.
    Oda Y.
    Furue T.
    Uchida Y.
    Yoshie O.
    IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (12) : 2256 - 2265+21
  • [10] A Query-Focused Summarization Method that Guarantees the Inclusion of Query Words
    Yasuda, Norihito
    Nishino, Masaaki
    Hirao, Tsutomu
    Suzuki, Jun
    Kataoka, Ryoji
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 126 - 130