TIB: A Dataset for Abstractive Summarization of Long Multimodal Videoconference Records

被引:0
|
作者
Gigant, Theo [1 ,2 ]
Dufaux, Frederic [1 ]
Guinaudeau, Camille [3 ,4 ]
Decombas, Marc [2 ]
机构
[1] Univ Paris Saclay, CNRS, CentraleSupelec, Lab Signaux & Syst, Gif Sur Yvette, France
[2] JustAI, Paris, France
[3] Japanese French Lab Informat, CNRS, Tokyo, Japan
[4] Univ Paris Saclay, Gif Sur Yvette, France
关键词
multimedia dataset; multimodal documents; automatic summarization;
D O I
10.1145/3617233.3617238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models and multimodal language-vision models give impressive results on current available summarization benchmarks, but are not designed to handle long multimodal documents. Most summarization datasets are composed of either mono-modal documents or short multimodal documents. In order to develop models designed for understanding and summarizing real-world videoconference records that are typically around 1 hour long, we propose a dataset of 9,103 videoconference records extracted from the German National Library of Science and Technology (TIB) archive, along with their abstract. Additionally, we process the content using automatic tools in order to provide the transcripts and key frames. Finally, we present experiments for abstractive summarization, to serve as baseline for future research work in multimodal approaches.
引用
收藏
页码:61 / 70
页数:10
相关论文
共 43 条
  • [1] SummScreen: A Dataset for Abstractive Screenplay Summarization
    Chen, Mingda
    Chu, Zewei
    Wiseman, Sam
    Gimpel, Kevin
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8602 - 8615
  • [2] CATAMARAN: A Cross-lingual Long Text Abstractive Summarization Dataset
    Chen, Zheng
    Lin, Hongyu
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6932 - 6937
  • [3] Abstractive Text Summarization Using Multimodal Information
    Rafi, Shaik
    Das, Ranjita
    [J]. 2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, : 141 - 145
  • [4] CLTS plus : A New Chinese Long Text Summarization Dataset with Abstractive Summaries
    Liu, Xiaojun
    Zang, Shunan
    Zhang, Chuang
    Chen, Xiaojun
    Ding, Yangyang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 73 - 84
  • [5] Topic-guided abstractive multimodal summarization with multimodal output
    Rafi, Shaik
    Das, Ranjita
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023,
  • [6] CivilSum: A Dataset for Abstractive Summarization of Indian Court Decisions
    Malik, Manuj
    Zhao, Zheng
    Fonseca, Marcio
    Rao, Shrisha
    Cohen, Shay B.
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2241 - 2250
  • [7] Multimodal Abstractive Summarization for How2 Videos
    Palaskar, Shruti
    Libovicky, Jindrich
    Gella, Spandana
    Metze, Florian
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6587 - 6596
  • [8] BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization
    Sharma, Eva
    Li, Chen
    Wang, Lu
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2204 - 2213
  • [9] Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset
    Ertam, Fatih
    Aydin, Galip
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (09):
  • [10] Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization
    Liang, Yunlong
    Meng, Fandong
    Xu, Jinan
    Wang, Jiaan
    Chen, Yufeng
    Zhou, Jie
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2934 - 2951