CAKES: Cross-lingual Wikipedia Knowledge Enrichment and Summarization

Cited by: 0
Authors
Fionda, Valeria [1]
Pirro, Giuseppe [1]
Affiliations
[1] Free Univ Bolzano Bozen, Bolzano, Italy
DOI
10.3233/978-1-61499-098-7-901
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Wikipedia is a huge source of multilingual knowledge curated by human contributors. Wiki articles are written independently in the various languages and may cover different perspectives on a given subject. The aim of this paper is to exploit Wikipedia's multilingual information for knowledge enrichment and summarization. Investigating the link structure of a Wiki article in a source language and comparing it with the structure of articles about the same subject written in other languages gives insights into the body of knowledge shared among languages. This investigation is also useful for identifying knowledge perspectives not covered in the source language but covered in other languages. We implemented these ideas in CAKES, which: i) exploits Wikipedia information on the fly without requiring any data preprocessing; ii) lets the user specify the set of languages to be considered; and iii) ranks the subjects of interest for a given article on the basis of their popularity among languages.
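The comparison and ranking described in the abstract can be illustrated with a minimal sketch. This is not the CAKES implementation, which the abstract does not detail. It assumes the outgoing links of each language edition have already been fetched and mapped to common subject identifiers (for example via interlanguage links), and it takes "popularity" to mean the number of language editions whose article links to a subject. The function name `rank_subjects` and the sample data are hypothetical.

```python
from collections import Counter


def rank_subjects(links_by_lang, source_lang):
    """Split linked subjects into (shared, missing) lists ranked by popularity.

    links_by_lang: dict mapping a language code to the set of subjects linked
        from that language's article (assumed already aligned across languages).
    source_lang: the language whose article is being enriched.
    """
    source_links = set(links_by_lang[source_lang])

    # Popularity = number of language editions that link to the subject.
    popularity = Counter()
    for subjects in links_by_lang.values():
        popularity.update(set(subjects))

    def ranked(subjects):
        # Most popular first; ties broken alphabetically for determinism.
        return sorted(subjects, key=lambda s: (-popularity[s], s))

    # Subjects in the source article that at least one other language also links.
    shared = ranked(s for s in source_links if popularity[s] > 1)

    # Subjects linked in other languages but absent from the source article.
    other_links = set().union(
        *(set(v) for k, v in links_by_lang.items() if k != source_lang)
    )
    missing = ranked(other_links - source_links)
    return shared, missing


if __name__ == "__main__":
    # Hypothetical link sets for one article in three language editions.
    links = {
        "en": {"Alps", "Italy", "Tourism"},
        "it": {"Alps", "Italy", "Dolomites"},
        "de": {"Alps", "Dolomites", "South Tyrol"},
    }
    shared, missing = rank_subjects(links, "en")
    print("Shared with other languages:", shared)
    print("Candidates for enrichment:", missing)
```

In this sketch the `shared` list approximates a cross-lingual summary of the article, while `missing` lists candidate subjects for enriching the source-language article; restricting the keys of `links_by_lang` corresponds to choosing the set of languages to consider.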
Pages: 901-902
Page count: 2