Using thesauri in cross-language retrieval of German and French indexed collections

Authors
Petras, V [1]
Perelman, N
Gey, R
Affiliations
[1] Univ Calif Berkeley, Sch Informat Management & Syst, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, UC Data Arch & Tech Assistance, Berkeley, CA 94720 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For CLEF 2002, Berkeley's Group One experimented with Russian, French and English as query languages, and investigated thesaurus-aided retrieval for the special CLEF collections GIRT and Amaryllis. Two techniques were used to locate source language topic terms within the controlled vocabulary and replace them with the document language thesaurus terms to form the query sent against the collection index. This form of controlled vocabulary-aided translation is called thesaurus matching. Results show that thesaurus-aided cross-language retrieval performs slightly worse than machine translation retrieval on average, but can yield decidedly better results for particular queries. In addition, Berkeley submitted runs to the monolingual and bilingual (French and German) CLEF main tasks. We found that bilingual retrieval sometimes outperforms monolingual retrieval and postulate reasons to explain this phenomenon.
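The thesaurus matching step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which two techniques were used to locate topic terms in the controlled vocabulary, so the exact and fuzzy lookups below are assumptions chosen for illustration. The thesaurus entries, function names, and similarity cutoff are likewise hypothetical and are not taken from GIRT, Amaryllis, or the Berkeley system.

```python
import difflib

# Hypothetical bilingual thesaurus: source-language term -> document-language term.
# Illustrative entries only; not taken from the GIRT or Amaryllis thesauri.
THESAURUS = {
    "unemployment": "Arbeitslosigkeit",
    "labor market": "Arbeitsmarkt",
    "youth": "Jugend",
}


def exact_match(topic, thesaurus):
    """Assumed technique 1: thesaurus terms that occur verbatim in the topic text."""
    text = topic.lower()
    return [target for source, target in thesaurus.items() if source in text]


def fuzzy_match(topic, thesaurus, cutoff=0.8):
    """Assumed technique 2: thesaurus terms that closely resemble a topic word."""
    hits = []
    for word in topic.lower().split():
        close = difflib.get_close_matches(word, list(thesaurus), n=1, cutoff=cutoff)
        for source in close:
            target = thesaurus[source]
            if target not in hits:
                hits.append(target)
    return hits


def thesaurus_query(topic, thesaurus=THESAURUS):
    """Replace matched source-language terms with document-language thesaurus terms."""
    terms = exact_match(topic, thesaurus)
    for target in fuzzy_match(topic, thesaurus):
        if target not in terms:
            terms.append(target)
    return terms


if __name__ == "__main__":
    # The returned terms would form the query sent against the collection index.
    print(thesaurus_query("Youth unemployment and the labor market"))
    print(thesaurus_query("unemployement rates"))  # misspelling caught by fuzzy lookup
```

Only topic terms that have an entry in the controlled vocabulary survive into the query in this sketch. That is one plausible reason such an approach could trail machine translation on average yet do well on topics whose concepts the thesaurus covers.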
Pages: 349-362
Page count: 14