CAKES: Cross-lingual Wikipedia Knowledge Enrichment and Summarization

被引:0
|
作者
Fionda, Valeria [1 ]
Pirro, Giuseppe [1 ]
机构
[1] Free Univ Bolzano Bozen, Bolzano, Italy
关键词
D O I
10.3233/978-1-61499-098-7-901
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Wikipedia is a huge source of multilingual knowledge curated by human contributors. Wiki articles are independently written in the various languages and may cover different perspectives about a given subject. The aim of this paper is to exploit Wikipedia multilingual information for knowledge enrichment and summarization. Investigating the link structure of a Wiki article in a source language and comparing it with the structure of articles about the same subject written in other languages gives insights about the body of knowledge shared among languages. This investigation is also useful to identify knowledge perspectives not covered in the source language but covered in other languages. We implemented these ideas in CAKES, which: i) exploits Wikipedia information on the fly without requiring any data preprocessing; ii) enables to specify the set of languages to be considered and; iii) ranks subjects interesting for a given article on the basis of their popularity among languages.
引用
收藏
页码:901 / 902
页数:2
相关论文
共 50 条
  • [21] Mixed-Lingual Pre-training for Cross-lingual Summarization
    Xu, Ruochen
    Zhu, Chenguang
    Shi, Yu
    Zeng, Michael
    Huang, Xuedong
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 536 - 541
  • [22] A Variational Hierarchical Model for Neural Cross-Lingual Summarization
    Liang, Yunlong
    Meng, Fandong
    Zhou, Chulun
    Xu, Jinan
    Chen, Yufeng
    Su, Jinsong
    Zhou, Jie
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2088 - 2099
  • [23] Cross-Lingual Korean Speech-to-Text Summarization
    Yoon, HyoJeon
    Dinh Tuyen Hoang
    Ngoc Thanh Nguyen
    Hwang, Dosam
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 198 - 206
  • [24] clstk: The Cross-Lingual Summarization Tool-Kit
    Jhaveri, Nisarg
    Gupta, Manish
    Varma, Vasudeva
    PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 766 - 769
  • [25] OECM: A Cross-Lingual Approach for Ontology Enrichment
    Ibrahim, Shimaa
    Fathalla, Said
    Yazdi, Hamed Shariat
    Lehmann, Jens
    Jabeen, Hajira
    SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, 2019, 11762 : 100 - 104
  • [26] Conversations Powered by Cross-Lingual Knowledge
    Sun, Weiwei
    Meng, Chuan
    Meng, Qi
    Ren, Zhaochun
    Ren, Pengjie
    Chen, Zhumin
    de Rijke, Maarten
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1442 - 1451
  • [27] Cross-Lingual Question Answering Architecture based on ILI and Wikipedia
    Ferrandez Escamez, Sergio
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (42): : 127 - 128
  • [28] Exploiting Wikipedia and EuroWordNet to solve Cross-Lingual Question Answering
    Ferrandez, Sergio
    Toral, Antonio
    Ferrandez, Oscar
    Ferrandez, Antonio
    Munoz, Rafael
    INFORMATION SCIENCES, 2009, 179 (20) : 3473 - 3488
  • [29] English-to-Korean Cross-Lingual Link Detection for Wikipedia
    Marigomen, Ralph
    Kang, In-Su
    U- AND E-SERVICE, SCIENCE AND TECHNOLOGY, 2011, 264 : 274 - 280
  • [30] Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation
    Zhang, Ran
    Ouni, Jihed
    Eger, Steffen
    COMPUTATIONAL LINGUISTICS, 2024, 50 (03) : 1001 - 1047