Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models

被引:11
|
作者
Nair, Suraj [1 ,2 ]
Yang, Eugene [2 ]
Lawrie, Dawn [2 ]
Duh, Kevin [2 ]
McNamee, Paul [2 ]
Murray, Kenton [2 ]
Mayfield, James [2 ]
Oard, Douglas W. [1 ,2 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Johns Hopkins Univ, HLTCOE, Baltimore, MD 21211 USA
来源
关键词
CLIR; ColBERT; ColBERT-X; Dense Retrieval;
D O I
10.1007/978-3-030-99736-6_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advent of transformer-based models such as BERT has led to the rise of neural ranking models. These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25. While monolingual retrieval tasks have benefited from large-scale training collections such as MS MARCO and advances in neural architectures, cross-language retrieval tasks have fallen behind these advancements. This paper introduces ColBERT-X, a generalization of the ColBERT multi-representation dense retrieval model that uses the XLM-RoBERTa (XLM-R) encoder to support cross-language information retrieval (CLIR). ColBERT-X can be trained in two ways. In zero-shot training, the system is trained on the English MS MARCO collection, relying on the XLM-R encoder for cross-language mappings. In translate-train, the system is trained on the MS MARCO English queries coupled with machine translations of the associated MS MARCO passages. Results on ad hoc document ranking tasks in several languages demonstrate substantial and statistically significant improvements of these trained dense retrieval models over traditional lexical CLIR baselines.
引用
收藏
页码:382 / 396
页数:15
相关论文
共 50 条
  • [1] Different approaches to cross-language information retrieval
    Kraaij, W
    Pohlmann, R
    [J]. COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS 2000, 2001, (37): : 97 - 110
  • [2] Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation
    Yang, Eugene
    Lawrie, Dawn
    Mayfield, James
    Oard, Douglas W.
    Miller, Scott
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT II, 2024, 14609 : 50 - 65
  • [3] Approaches to Cross-Language Retrieval of Similar Legal Documents Based on Machine Learning
    Zhebel, V. V.
    Devyatkin, D. A.
    Zubarev, D. V.
    Sochenkov, I. V.
    [J]. SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2023, 50 (05) : 494 - 499
  • [4] Approaches to Cross-Language Retrieval of Similar Legal Documents Based on Machine Learning
    V. V. Zhebel
    D. A. Devyatkin
    D. V. Zubarev
    I. V. Sochenkov
    [J]. Scientific and Technical Information Processing, 2023, 50 : 494 - 499
  • [5] Online Learning to Rank for Cross-Language Information Retrieval
    Rahimi, Razieh
    Shakery, Azadeh
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1033 - 1036
  • [6] Cross-Language Retrieval Using Link-Based Language Models
    Roth, Benjamin
    Klakow, Dietrich
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 773 - 774
  • [7] Cross-language information retrieval
    Nie J.-Y.
    [J]. Synthesis Lectures on Human Language Technologies, 2010, 3 (01): : 1 - 142
  • [8] Cross-Language Retrieval with Wikipedia
    Schoenhofen, Peter
    Benczur, Andras
    Biro, Istvan
    Csalogany, Karoly
    [J]. ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 72 - 79
  • [9] Cross-Language Information Retrieval
    Federico, Marcello
    [J]. COMPUTATIONAL LINGUISTICS, 2011, 37 (02) : 411 - 412
  • [10] Cross-language information retrieval
    Oard, DW
    Diekema, AR
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1998, 33 : 223 - 256