An Experimental Comparison of Explicit Semantic Analysis Implementations for Cross-Language Retrieval

被引:0
|
作者
Sorg, Philipp [1 ]
Cimiano, Philipp [2 ]
机构
[1] Univ Karlsruhe, Inst AIFB, Karlsruhe, Germany
[2] Delft Univ Technol, Web Informat Syst Grp, Delft, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explicit Semantic Analysis (ESA) has been recently proposed as an approach to computing semantic relatedness between words (and indirectly also between texts) and has thus a natural application in information retrieval, showing the potential to alleviate the vocabulary mismatch problem inherent in standard Bag-of-Word models. The ESA model has been also recently extended to cross-lingual retrieval settings, which can be considered as an extreme case of the vocabulary mismatch problem. The ESA approach actually represents a class of approaches and allows for various instantiations. As our first contribution, we generalize ESA in order to clearly show the degrees of freedom it provides. Second, we propose some variants of ESA along different dimensions, testing their impact on performance on a cross-lingual mate retrieval task on two datasets (JRC-ACQUIS and Multext). Our results are interesting as a systematic investigation has been missing so far and the variations between different basic design choices are significant. We also show that the settings adopted in the original ESA implementation are reasonably good, which to our knowledge has not been demonstrated so far, but can still be significantly improved by tuning the right parameters (yielding a relative improvement on a cross-lingual mate retrieval task of between 62% (Multext) and 237% (JRC-ACQUIS) with respect to the original ESA model).
引用
收藏
页码:36 / +
页数:3
相关论文
共 50 条
  • [1] Evaluating Cross-Language Explicit Semantic Analysis and Cross Querying
    Anderka, Maik
    Lipka, Nedim
    Stein, Benno
    [J]. MULTILINGUAL INFORMATION ACCESS EVALUATION I: TEXT RETRIEVAL EXPERIMENTS, 2010, 6241 : 50 - 57
  • [2] Query Expansion in Cross-Language Information Retrieval Using Latent Semantic Analysis
    Bi Jianting
    Su Yidan
    [J]. ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 220 - 224
  • [3] CROSS-LANGUAGE DOCUMENT RETRIEVAL BY USING NONLINEAR SEMANTIC MAPPING
    Banchs, Rafael E.
    Costa-Jussa, Marta R.
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2013, 27 (09) : 781 - 802
  • [4] Cross-Language Information Retrieval: An analysis of errors
    Ruiz, ME
    Srinivasan, P
    [J]. PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1998, 35 : 153 - 165
  • [5] Cross-Language Information Retrieval: An analysis of errors
    Ruiz, ME
    Srinivasan, P
    [J]. ASIS '98 - PROCEEDINGS OF THE 61ST ASIS ANNUAL MEETING, VOL 35, 1998: INFORMATION ACCESS IN THE GLOBAL INFORMATION ECONOMY, 1998, 35 : 153 - 165
  • [6] Explicit Versus Latent Concept Models for Cross-Language Information Retrieval
    Cimiano, Philipp
    Schultz, Antje
    Sizov, Sergej
    Sorg, Philipp
    Staab, Steffen
    [J]. 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1513 - 1518
  • [7] Cross-language information retrieval
    Nie J.-Y.
    [J]. Synthesis Lectures on Human Language Technologies, 2010, 3 (01): : 1 - 142
  • [8] Semantic and Cross-Language Information Retrieval for Thai Herbs and Modern Medicine
    Akewaranukulsiri, Pitchakorn
    Prompoon, Nakornthip
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
  • [9] Cross-Language Semantic Retrieval and Linking of E-Gov Services
    Narducci, Fedelucio
    Palmonari, Matteo
    Semeraro, Giovanni
    [J]. SEMANTIC WEB - ISWC 2013, PART II, 2013, 8219 : 130 - 145
  • [10] Cross-Language Retrieval with Wikipedia
    Schoenhofen, Peter
    Benczur, Andras
    Biro, Istvan
    Csalogany, Karoly
    [J]. ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 72 - 79