Using thesauri in cross-language retrieval of German and french indexed collections

被引:0
|
作者
Petras, V [1 ]
Perelman, N
Gey, R
机构
[1] Univ Calif Berkeley, Sch Informat Management & Syst, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, UC Data Arch & Tech Assistance, Berkeley, CA 94720 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For CLEF 2002, Berkeley's Group One experimented with Russian, French and English as query languages, and investigated thesaurus-aided retrieval for the special CLEF collections GIRT and Amaryllis. Two techniques were used to locate source language topic terms within the controlled vocabulary and replace them with the document language thesaurus terms to form the query sent against the collection index. This form of controlled vocabulary-aided translation is called thesaurus matching. Results show that thesaurus-aided cross-language retrieval performs slightly worse than machine translation retrieval on average, but can yield decidedly better results for particular queries. In addition, Berkeley submitted runs to the monolingual and bilingual (French and German) CLEF main tasks. We found that bilingual retrieval sometimes outperforms monolingual retrieval and postulate reasons to explain this phenomenon.
引用
收藏
页码:349 / 362
页数:14
相关论文
共 50 条
  • [41] Matching meaning for cross-language information retrieval
    Wang, Jianqiang
    Oard, Douglas W.
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (04) : 631 - 653
  • [42] Influence of WSD on cross-language information retrieval
    Kang, IS
    Na, SH
    Lee, JH
    NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 358 - 366
  • [43] Different approaches to cross-language information retrieval
    Kraaij, W
    Pohlmann, R
    COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS 2000, 2001, (37): : 97 - 110
  • [44] Cross-language information retrieval using EuroWordNet and word sense disambiguation
    Clough, P
    Stevenson, M
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2004, 2997 : 327 - 337
  • [45] Using Comparable Corpora to Improve the Effectiveness of Cross-Language Information Retrieval
    Sadat, Fatiha
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 320 - 331
  • [46] Query disambiguation for Cross-Language Information Retrieval using Web directories
    Kimura, F
    Maeda, A
    Miyazaki, J
    Uemura, S
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 151 - 156
  • [47] Cross-language text retrieval by query translation using term reweighting
    Kang, I
    Kwon, OW
    Lee, JH
    Lee, G
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2000, 14 (05) : 617 - 629
  • [48] Multilingual information access system using cross-language information retrieval
    Hayashi, Yoshihiko
    Matsuo, Yoshihiro
    Nagata, Masaaki
    Furuse, Osamu
    2003, Nippon Telegraph and Telephone Corp. (52):
  • [49] Using cross-language information retrieval methods for bilingual search of the web
    Shim, Sung J.
    International Conference on Computational Intelligence for Modelling, Control & Automation Jointly with International Conference on Intelligent Agents, Web Technologies & Internet Commerce, Vol 2, Proceedings, 2006, : 19 - 23
  • [50] Language translation and media transformation in cross-language image retrieval
    Chen, Hsin-Hsi
    Chang, Yih-Chen
    DIGITAL LIBRARIES: ACHIEVEMENTS, CHALLENGES AND OPPORTUNITIES, PROCEEDINGS, 2006, 4312 : 350 - +