Video Paragraph Captioning as a Text Summarization Task

被引：0

作者：

Liu, Hui ^{[1
]}

Wan, Xiaojun

机构：

[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China

来源：

ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2 | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video paragraph captioning aims to generate a set of coherent sentences to describe a video that contains several events. Most previous methods simplify this task by using ground-truth event segments. In this work, we propose a novel framework by taking this task as a text summarization task. We first generate lots of sentence-level captions focusing on different video clips and then summarize these captions to obtain the final paragraph caption. Our method does not depend on ground-truth event segments. Experiments on two popular datasets ActivityNet Captions and YouCookII demonstrate the advantages of our new framework. On the ActivityNet dataset, our method even outperforms some previous methods using ground-truth event segment labels.

引用

页码：55 / 60

页数：6

共 50 条

[21] Dense Video Captioning Using Graph-Based Sentence Summarization
Zhang, Zhiwang
Xu, Dong
Ouyang, Wanli
Zhou, Luping
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1799 - 1810
[22] Text with Knowledge Graph Augmented Transformer for Video Captioning
Gu, Xin
Chen, Guang
Wang, Yufei
Zhang, Libo
Luo, Tiejian
Wen, Longyin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18941 - 18951
[23] FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION
Lei, Zhuo
Zhang, Chao
Zhang, Qian
Qiu, Guoping
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 368 - 373
[24] Video Summarization using Text Subjectivity Classification
Moraes, Leonardo
Marcacini, Ricardo Marcondes
Goularte, Rudinei
PROCEEDINGS OF THE 28TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, WEBMEDIA 2022, 2022, : 133 - 141
[25] Summarization of Text and Image Captioning in Information Retrieval Using Deep Learning Techniques
Mahalakshmi, P.
Fatima, N. Sabiyath
IEEE ACCESS, 2022, 10 : 18289 - 18297
[26] Training for Diversity in Image Paragraph Captioning
Melas-Kyriazi, Luke
Han, George
Rush, Alexander M.
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 757 - 761
[27] Bridging Video and Text: A Two-Step Polishing Transformer for Video Captioning
Xu, Wanru
Miao, Zhenjiang
Yu, Jian
Tian, Yi
Wan, Lili
Ji, Qiang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6293 - 6307
[28] From task to evaluation: an automatic text summarization review
Lingfeng Lu
Yang Liu
Weiqiang Xu
Huakang Li
Guozi Sun
Artificial Intelligence Review, 2023, 56 : 2477 - 2507
[29] From task to evaluation: an automatic text summarization review
Lu, Lingfeng
Liu, Yang
Xu, Weiqiang
Li, Huakang
Sun, Guozi
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2477 - 2507
[30] Automatic Text Summarization of Video Lectures Using Subtitles
Garg, Shruti
RECENT DEVELOPMENTS IN INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, ICCD 2016, 2017, 555 : 45 - 52

← 1 2 3 4 5 →