Video Paragraph Captioning as a Text Summarization Task

被引:0
|
作者
Liu, Hui [1 ]
Wan, Xiaojun
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video paragraph captioning aims to generate a set of coherent sentences to describe a video that contains several events. Most previous methods simplify this task by using ground-truth event segments. In this work, we propose a novel framework by taking this task as a text summarization task. We first generate lots of sentence-level captions focusing on different video clips and then summarize these captions to obtain the final paragraph caption. Our method does not depend on ground-truth event segments. Experiments on two popular datasets ActivityNet Captions and YouCookII demonstrate the advantages of our new framework. On the ActivityNet dataset, our method even outperforms some previous methods using ground-truth event segment labels.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [21] Dense Video Captioning Using Graph-Based Sentence Summarization
    Zhang, Zhiwang
    Xu, Dong
    Ouyang, Wanli
    Zhou, Luping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1799 - 1810
  • [22] Text with Knowledge Graph Augmented Transformer for Video Captioning
    Gu, Xin
    Chen, Guang
    Wang, Yufei
    Zhang, Libo
    Luo, Tiejian
    Wen, Longyin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18941 - 18951
  • [23] FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION
    Lei, Zhuo
    Zhang, Chao
    Zhang, Qian
    Qiu, Guoping
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 368 - 373
  • [24] Video Summarization using Text Subjectivity Classification
    Moraes, Leonardo
    Marcacini, Ricardo Marcondes
    Goularte, Rudinei
    PROCEEDINGS OF THE 28TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, WEBMEDIA 2022, 2022, : 133 - 141
  • [25] Summarization of Text and Image Captioning in Information Retrieval Using Deep Learning Techniques
    Mahalakshmi, P.
    Fatima, N. Sabiyath
    IEEE ACCESS, 2022, 10 : 18289 - 18297
  • [26] Training for Diversity in Image Paragraph Captioning
    Melas-Kyriazi, Luke
    Han, George
    Rush, Alexander M.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 757 - 761
  • [27] Bridging Video and Text: A Two-Step Polishing Transformer for Video Captioning
    Xu, Wanru
    Miao, Zhenjiang
    Yu, Jian
    Tian, Yi
    Wan, Lili
    Ji, Qiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6293 - 6307
  • [28] From task to evaluation: an automatic text summarization review
    Lingfeng Lu
    Yang Liu
    Weiqiang Xu
    Huakang Li
    Guozi Sun
    Artificial Intelligence Review, 2023, 56 : 2477 - 2507
  • [29] From task to evaluation: an automatic text summarization review
    Lu, Lingfeng
    Liu, Yang
    Xu, Weiqiang
    Li, Huakang
    Sun, Guozi
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2477 - 2507
  • [30] Automatic Text Summarization of Video Lectures Using Subtitles
    Garg, Shruti
    RECENT DEVELOPMENTS IN INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, ICCD 2016, 2017, 555 : 45 - 52