Cross-lingual timeline summarization

Cited by: 2
Authors
Cagliero, Luca [1]
La Quatra, Moreno [1]
Garza, Paolo [1]
Baralis, Elena [1]
Affiliations
[1] Politecnico di Torino, Dipartimento di Automatica e Informatica, Turin, Italy
Keywords
Cross-lingual summarization; Timeline summarization; Natural language processing
DOI
10.1109/AIKE52691.2021.00014
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Timeline summarization methods analyze timestamped, topic-specific news article collections to select the key dates representing the event flow and to extract the most relevant per-date content. Existing approaches are all tailored to a single language. Hence, they are unable to combine topic-related content available in different languages. Enriching news timelines with multilingual content is particularly useful for (i) summarizing complex events, whose main facets are covered differently by media sources from different countries, and (ii) generating news timelines in low-resource languages, for which there is a lack of news content in the target language. This paper presents three alternative approaches to address cross-lingual timeline summarization. They combine state-of-the-art extractive summarization methods with machine translation steps at different stages of the timeline generation process. The paper also proposes novel Rouge-based evaluation metrics customized for cross-lingual timeline summarization with a twofold aim: (i) quantifying the ability of the cross-lingual process to enhance available content extraction in the target language and (ii) estimating summarizer effectiveness in conveying additional content from other languages. A new multilingual timeline benchmark dataset has been generated to allow a thorough analysis of the factors that mainly influence summarization performance.
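The abstract refers to Rouge-based evaluation metrics without defining them. As background only, the sketch below computes plain ROUGE-1 recall (clipped unigram overlap divided by the reference length) between a generated per-date summary and a reference summary. It is not the customized cross-lingual metric the paper proposes; the function name `rouge1_recall`, the whitespace tokenization, and the example sentences are all assumptions made for illustration.

```python
from collections import Counter


def rouge1_recall(candidate: str, reference: str) -> float:
    """Standard ROUGE-1 recall: clipped unigram overlap / reference length."""
    # Assumption: lowercase whitespace tokenization, no stemming or stopword removal.
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    if not ref_counts:
        return 0.0
    # A reference word is matched at most as often as it occurs in the candidate.
    overlap = sum(min(count, cand_counts[word]) for word, count in ref_counts.items())
    return overlap / sum(ref_counts.values())


# Hypothetical per-date summaries (illustrative data, not from the paper's benchmark).
reference = "the summit ended with a new climate agreement"
candidate = "a new climate agreement was signed at the summit"
score = rouge1_recall(candidate, reference)
print(f"ROUGE-1 recall: {score:.2f}")
```

In a timeline setting, a score of this kind would typically be computed per date and then aggregated; how the paper aggregates scores and accounts for content contributed by other languages is not stated in the abstract.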
Pages: 45-53
Number of pages: 9