Multilingual information retrieval using machine translation, relevance feedback and decompounding

被引:31
|
作者
Chen, A [1 ]
Gey, FC
机构
[1] Univ Calif Berkeley, Sch Informat Management & Syst, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, UC Data Arch & Tech Assistance UC DATA, Berkeley, CA 94720 USA
来源
INFORMATION RETRIEVAL | 2004年 / 7卷 / 1-2期
关键词
multilingual information retrieval; cross-language information retrieval; relevance feedback; decompounding; results merging;
D O I
10.1023/B:INRT.0000009444.89549.90
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multilingual retrieval ( querying of multiple document collections each in a different language) can be achieved by combining several individual techniques which enhance retrieval: machine translation to cross the language barrier, relevance feedback to add words to the initial query, decompounding for languages with complex term structure, and data fusion to combine monolingual retrieval results from different languages. Using the CLEF 2001 and CLEF 2002 topics and document collections, this paper evaluates these techniques within the context of a monolingual document ranking formula based upon logistic regression. Each individual technique yields improved performance over runs which do not utilize that technique. Moreover the techniques are complementary, in that combining the best techniques outperforms individual technique performance. An approximate but fast document translation using bilingual wordlists created from machine translation systems is presented and evaluated. The fast document translation is as effective as query translation in multilingual retrieval. Furthermore, when fast document translation is combined with query translation in multilingual retrieval, the performance is significantly better than that of query translation or fast document translation.
引用
收藏
页码:149 / 182
页数:34
相关论文
共 50 条
  • [1] Multilingual Information Retrieval Using Machine Translation, Relevance Feedback and Decompounding
    Aitao Chen
    Fredric C. Gey
    Information Retrieval, 2004, 7 : 149 - 182
  • [2] Adaptation of machine translation for multilingual information retrieval in the medical domain
    Pecina, Pavel
    Dusek, Ondrej
    Goeuriot, Lorraine
    Hajic, Jan
    Hlavacova, Jaroslava
    Jones, Gareth J. F.
    Kelly, Liadh
    Leveling, Johannes
    Marecek, David
    Novak, Michal
    Popel, Martin
    Rosa, Rudolf
    Tamchyna, Ales
    Uresova, Zdenka
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2014, 61 (03) : 165 - 185
  • [3] Enhancing query translation with relevance feedback in translingual information retrieval
    He, Daqing
    Wu, Dan
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (01) : 1 - 17
  • [4] Using translation heuristics to improve a multimodal and multilingual information retrieval system
    Garcia-Cumbreras, Miguel Angel
    Martin-Valdivia, Maria Teresa
    Urena-Lopez, Luis Alfonso
    Diaz-Galiano, Manuel Carlos
    Montejo-Raez, Arturo
    APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 438 - +
  • [5] INFORMATION RETRIEVAL AND MACHINE TRANSLATION
    ALLEN, K
    CURRENT SCIENCE, 1961, 30 (11): : 442 - &
  • [6] A Case Study in Decompounding for Bengali Information Retrieval
    Ganguly, Debasis
    Leveling, Johannes
    Jones, Gareth J. F.
    INFORMATION ACCESS EVALUATION: MULTILINGUALITY, MULTIMODALITY, AND VISUALIZATION, 2013, 8138 : 108 - 119
  • [7] Multilingual Information Retrieval using GHSOM
    Yang, Hsin-Chang
    Lee, Chung-Hong
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS, 2008, : 225 - +
  • [8] EUROGENE: Multilingual Retrieval and Machine Translation Applied to Human Genetics
    Knoth, Petr
    Collins, Trevor
    Sklavounou, Elsa
    Zdrahal, Zdenek
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 670 - +
  • [9] Machine translation and monolingual information retrieval
    Franz, M
    McCarley, JS
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 295 - 296
  • [10] User Relevance Feedback in Semantic Information Retrieval
    Picariello, Antonio
    Rinaldi, Antonio M.
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2007, 3 (02) : 36 - 50