Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach

被引:65
|
作者
Sharghi, Aidean [1 ]
Laurel, Jacob S. [2 ]
Gong, Boqing [1 ]
机构
[1] Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA
[2] Univ Alabama Birmingham, Dept Comp Sci, Birmingham, AL 35294 USA
基金
美国国家科学基金会;
关键词
SCALE;
D O I
10.1109/CVPR.2017.229
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed a resurgence of interest in video summarization. However, one of the main obstacles to the research on video summarization is the user subjectivity - users have various preferences over the summaries. The subjectiveness causes at least two problems. First, no single video summarizer fits all users unless it interacts with and adapts to the individual users. Second, it is very challenging to evaluate the performance of a video summarizer. To tackle the first problem, we explore the recently proposed query-focused video summarization which introduces user preferences in the form of text queries about the video into the summarization process. We propose a memory network parameterized sequential determinantal point process in order to attend the user query onto different video frames and shots. To address the second challenge, we contend that a good evaluation metric for video summarization should focus on the semantic information that humans can perceive rather than the visual features or temporal overlaps. To this end, we collect dense per-video-shot concept annotations, compile a new dataset, and suggest an efficient evaluation method defined upon the concept annotations. We conduct extensive experiments contrasting our video summarizer to existing ones and present detailed analyses about the dataset and the new evaluation method.
引用
收藏
页码:2127 / 2136
页数:10
相关论文
共 50 条
  • [1] Query-Focused Extractive Video Summarization
    Sharghi, Aidean
    Gong, Boqing
    Shah, Mubarak
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 3 - 19
  • [2] Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
    Xiao, Shuwen
    Zhao, Zhou
    Zhang, Zijian
    Yan, Xiaohui
    Yang, Min
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12426 - 12433
  • [3] Query-Biased Self-Attentive Network for Query-Focused Video Summarization
    Xiao, Shuwen
    Zhao, Zhou
    Zhang, Zijian
    Guan, Ziyu
    Cai, Deng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5889 - 5899
  • [4] Hierarchical Variational Network for User-Diversified & Query-Focused Video Summarization
    Jiang, Pin
    Han, Yahong
    [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 202 - 206
  • [5] Constructing Query-Focused Summarization Dataset AMTQFSum Based on ChatGPT and Prompt Engineering
    Jinling, Shang
    Jianyong, Zhang
    [J]. Data Analysis and Knowledge Discovery, 2024, 8 (8-9) : 122 - 132
  • [6] Bayesian Query-Focused Summarization
    Daume, Hal, III
    Marcu, Daniel
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 305 - 312
  • [7] QuerySum: A Multi-Document Query-Focused Summarization Dataset Augmented with Similar Query Clusters
    Liu, Yushan
    Wang, Zili
    Yuan, Ruifeng
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18725 - 18732
  • [8] Improving Query-Focused Summarization with CNN-Based Similarity
    Ying W.
    Xiao X.
    Li S.
    Lü Y.
    Sui Z.
    [J]. Li, Sujian (lisujian@pku.edu.cn), 1600, Peking University (53): : 197 - 203
  • [9] Using query expansion in graph-based approach for query-focused multi-document summarization
    Zhao, Lin
    Wu, Lide
    Huang, Xuanjing
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (01) : 35 - 41
  • [10] A Query-Focused Summarization Method that Guarantees the Inclusion of Query Words
    Yasuda, Norihito
    Nishino, Masaaki
    Hirao, Tsutomu
    Suzuki, Jun
    Kataoka, Ryoji
    [J]. 2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 126 - 130