Multilingual information retrieval using machine translation, relevance feedback and decompounding

被引:31
|
作者
Chen, A [1 ]
Gey, FC
机构
[1] Univ Calif Berkeley, Sch Informat Management & Syst, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, UC Data Arch & Tech Assistance UC DATA, Berkeley, CA 94720 USA
来源
INFORMATION RETRIEVAL | 2004年 / 7卷 / 1-2期
关键词
multilingual information retrieval; cross-language information retrieval; relevance feedback; decompounding; results merging;
D O I
10.1023/B:INRT.0000009444.89549.90
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multilingual retrieval ( querying of multiple document collections each in a different language) can be achieved by combining several individual techniques which enhance retrieval: machine translation to cross the language barrier, relevance feedback to add words to the initial query, decompounding for languages with complex term structure, and data fusion to combine monolingual retrieval results from different languages. Using the CLEF 2001 and CLEF 2002 topics and document collections, this paper evaluates these techniques within the context of a monolingual document ranking formula based upon logistic regression. Each individual technique yields improved performance over runs which do not utilize that technique. Moreover the techniques are complementary, in that combining the best techniques outperforms individual technique performance. An approximate but fast document translation using bilingual wordlists created from machine translation systems is presented and evaluated. The fast document translation is as effective as query translation in multilingual retrieval. Furthermore, when fast document translation is combined with query translation in multilingual retrieval, the performance is significantly better than that of query translation or fast document translation.
引用
收藏
页码:149 / 182
页数:34
相关论文
共 50 条
  • [41] Multilingual information retrieval using English and Chinese queries
    Chen, AT
    EVLAUATION OF CROSS-LANGUAGE INFORMATION RETRIEVAL SYSTEMS, 2002, 2406 : 44 - 58
  • [42] Combining the evidence of different relevance feedback methods for information retrieval
    Soongsil Univ, Seoul, Korea, Republic of
    Inf Process Manage, 6 (681-691):
  • [43] A user interface of relevance feedback for interactive information retrieval systems
    Vitsentiy, Vitaliy
    IDAACS 2007: PROCEEDINGS OF THE 4TH IEEE WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2007, : 449 - 453
  • [44] Verbosity normalized pseudo-relevance feedback in information retrieval
    Na, Seung-Hoon
    Kim, Kangil
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (02) : 219 - 239
  • [45] Combining the evidence of different relevance feedback methods for information retrieval
    Lee, JH
    INFORMATION PROCESSING & MANAGEMENT, 1998, 34 (06) : 681 - 691
  • [46] Interactive pattern analysis for relevance feedback in multimedia information retrieval
    Wu, YM
    Zhang, AD
    MULTIMEDIA SYSTEMS, 2004, 10 (01) : 41 - 55
  • [47] A weight-based approach to information retrieval and relevance feedback
    Liao, Yi-Chun
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (1-2) : 254 - 261
  • [48] Real relevance feedback information retrieval based on bound model
    Wang, Biao
    Gao, Guanglai
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2010, 40 (SUPPL. 2): : 301 - 306
  • [49] Amharic-English Information Retrieval with Pseudo Relevance Feedback
    Argaw, Atelach Alemu
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 119 - 126
  • [50] Integrating neurophysiologic relevance feedback in intent modeling for information retrieval
    Jacucci, Giulio
    Barral, Oswald
    Daee, Pedram
    Wenzel, Markus
    Serim, Baris
    Ruotsalo, Tuukka
    Pluchino, Patrik
    Freeman, Jonathan
    Gamberini, Luciano
    Kaski, Samuel
    Blankertz, Benjamin
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2019, 70 (09) : 917 - 930