Discovering Cross-language Links in Wikipedia through Semantic Relatedness

Cited by: 2
Authors
Penta, Antonio [1]
Quercini, Gianluca [2]
Reynaud, Chantal [2]
Shadbolt, Nigel [1]
Affiliations
[1] Univ Southampton, Southampton SO9 5NH, Hants, England
[2] Univ Paris Sud XI, Paris, France
DOI
10.3233/978-1-61499-098-7-642
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Wikipedia is a large multilingual collection of interlinked articles, used and contributed to by millions of users over the Internet, that provides editions in up to 283 languages. Two articles in different language versions of Wikipedia may have information on exactly the same concept, in which case they are often connected through a cross-language link. However, many cross-language links are either missing or incorrect, and this negatively affects both the readers of Wikipedia and multilingual information retrieval applications. In this paper, we propose WIKICL, an algorithm for discovering cross-language links using the semantic relatedness of two articles derived from the Wikipedia graph structure. Our evaluation shows that we achieve comparable, and in some cases better, results than previous methods with much less computational time.
Pages: 642 / +
Page count: 2