An Experimental Comparison of Explicit Semantic Analysis Implementations for Cross-Language Retrieval

Cited by: 0
Authors
Sorg, Philipp [1]
Cimiano, Philipp [2]
Affiliations
[1] Univ Karlsruhe, Inst AIFB, Karlsruhe, Germany
[2] Delft Univ Technol, Web Informat Syst Grp, Delft, Netherlands
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Explicit Semantic Analysis (ESA) has recently been proposed as an approach to computing semantic relatedness between words (and indirectly also between texts). It thus has a natural application in information retrieval, where it shows the potential to alleviate the vocabulary mismatch problem inherent in standard bag-of-words models. The ESA model has also recently been extended to cross-lingual retrieval settings, which can be considered an extreme case of the vocabulary mismatch problem. ESA actually represents a class of approaches and allows for various instantiations. As our first contribution, we generalize ESA in order to clearly show the degrees of freedom it provides. Second, we propose variants of ESA along different dimensions and test their impact on performance on a cross-lingual mate retrieval task on two datasets (JRC-ACQUIS and Multext). Our results are of interest because a systematic investigation has been missing so far and the differences between basic design choices are significant. We also show that the settings adopted in the original ESA implementation are reasonably good, which to our knowledge has not been demonstrated so far, but that they can still be significantly improved by tuning the right parameters, yielding a relative improvement on a cross-lingual mate retrieval task of between 62% (Multext) and 237% (JRC-ACQUIS) with respect to the original ESA model.
Pages: 36 / +
Page count: 3