Query-dependent learning to rank for cross-lingual information retrieval

Cited by: 6
Authors
Ghanbari, Elham [1]
Shakery, Azadeh [1, 2]
Affiliations
[1] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
Keywords
Learning to rank (LTR); Cross-lingual information retrieval (CLIR); Query-dependent LTR; Cross-lingual features; Query features;
DOI
10.1007/s10115-018-1232-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning to rank (LTR), as a machine learning technique for ranking tasks, has become one of the most popular research topics in the area of information retrieval (IR). Cross-lingual information retrieval (CLIR), in which the language of the query is different from the language of the documents, is one of the important IR tasks that can potentially benefit from LTR. Our focus in this paper is the use of LTR for CLIR. To rank the documents in the target language in response to the query in the source language, we propose a local query-dependent approach based on LTR for CLIR, which is called LQ-DLTR for CLIR. The core idea of LQ-DLTR for CLIR is the use of the local characteristics of similar queries to construct the LTR model, instead of using a single global ranking model for all queries. Since the query and the documents are in different languages, the traditional features that are used in LTR cannot be used directly for CLIR. Thus, defining appropriate features is a major step in the use of LTR for CLIR. In this paper, three categories of cross-lingual features are defined: query-document features, document features, and query features. To define the cross-lingual features, translation resources are used to fill the gap between the documents and the queries. Then, in LQ-DLTR for CLIR, a neighborhood of similar queries based on cross-lingual query features is used to create a local ranking function by the LTR algorithm for a given query. The LTR algorithm uses two cross-lingual feature sets, namely document features and query-document features, to learn the model. The query features that are used to identify the neighbors are not involved in the learning phase. 
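The local, query-dependent idea described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the distance measure, the neighborhood size, or the LTR algorithm, so the Euclidean distance, `k`, the pointwise linear scorer, and all function names and toy values here are assumptions made for the example.

```python
import math

def euclidean(a, b):
    # Distance between two query-feature vectors (assumed metric).
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_queries(query_feats, train_query_feats, k):
    # Ids of the k training queries whose query features are closest.
    return sorted(
        train_query_feats,
        key=lambda qid: euclidean(query_feats, train_query_feats[qid]),
    )[:k]

def train_pointwise(examples, epochs=200, lr=0.1):
    # Stand-in LTR learner: linear scorer fitted to relevance labels by SGD.
    w = [0.0] * len(examples[0][0])
    for _ in range(epochs):
        for x, y in examples:
            err = sum(wi * xi for wi, xi in zip(w, x)) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
    return w

def rank_locally(query_feats, candidates, train_query_feats, train_examples, k):
    # 1) Query features select the neighborhood; they are not used for learning.
    neighbours = nearest_queries(query_feats, train_query_feats, k)
    # 2) Only document and query-document features train the local model.
    local = [ex for qid in neighbours for ex in train_examples[qid]]
    w = train_pointwise(local)
    scored = [(sum(wi * xi for wi, xi in zip(w, x)), d) for d, x in candidates]
    return [d for _, d in sorted(scored, reverse=True)]

# Hypothetical toy data: query features per training query, and
# (document / query-document feature vector, relevance label) pairs.
train_query_feats = {"q1": [0.1, 0.2], "q2": [0.2, 0.1], "q3": [0.9, 0.8]}
train_examples = {
    "q1": [([1.0, 0.0], 1.0), ([0.0, 1.0], 0.0)],
    "q2": [([0.9, 0.1], 1.0), ([0.1, 0.9], 0.0)],
    "q3": [([1.0, 0.0], 0.0), ([0.0, 1.0], 1.0)],
}
candidates = [("dA", [0.2, 0.8]), ("dB", [0.8, 0.2])]

ranking_near_q1_q2 = rank_locally(
    [0.15, 0.15], candidates, train_query_feats, train_examples, k=2
)
ranking_near_q3 = rank_locally(
    [0.9, 0.9], candidates, train_query_feats, train_examples, k=1
)
print(ranking_near_q1_q2, ranking_near_q3)
```

In this toy setup the same two candidate documents are ordered differently depending on which neighborhood the new query falls into, which is the behavior a single global ranking model cannot reproduce.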
Experimental results indicate that the CLIR performance improves with the use of cross-lingual features that use several translations and their probabilities to compute the features, compared to the use of monolingual features in traditional LTR, which translate a query according to the best translation and ignore the probabilities. Moreover, experimental results show that LQ-DLTR for CLIR outperforms the baseline information retrieval methods and other LTR ranking models in terms of the MAP and NDCG measures.
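The contrast between probability-weighted and best-translation features can be illustrated with a small sketch. The abstract does not give the exact feature definitions, so the expected term frequency below is only one plausible example of a cross-lingual query-document feature; the translation table, its probabilities, and the function names are hypothetical.

```python
from collections import Counter

def expected_tf(query_terms, doc_tokens, translations):
    # Cross-lingual variant: every candidate translation contributes its
    # document frequency, weighted by its translation probability.
    tf = Counter(doc_tokens)
    return sum(
        p * tf[t]
        for q in query_terms
        for t, p in translations.get(q, {}).items()
    )

def best_translation_tf(query_terms, doc_tokens, translations):
    # Monolingual variant: keep only the most probable translation of each
    # query term and ignore the probabilities.
    tf = Counter(doc_tokens)
    total = 0
    for q in query_terms:
        options = translations.get(q, {})
        if options:
            total += tf[max(options, key=options.get)]
    return total

# Hypothetical English-to-French translation table and target document.
translations = {"bank": {"banque": 0.7, "rive": 0.3}}
doc = ["la", "rive", "du", "fleuve"]

weighted = expected_tf(["bank"], doc, translations)
best_only = best_translation_tf(["bank"], doc, translations)
print(weighted, best_only)
```

Here the document matches only the less probable translation, so the best-translation feature sees no match at all, while the probability-weighted feature still records partial evidence.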
Pages: 711 - 743
Number of pages: 33
Related papers
50 in total
  • [1] Query-dependent learning to rank for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    [J]. Knowledge and Information Systems, 2019, 59 : 711 - 743
  • [2] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Ghanbari, Elham
    Shakery, Azadeh
    [J]. APPLIED INTELLIGENCE, 2022, 52 (03) : 3156 - 3174
  • [3] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    [J]. Applied Intelligence, 2022, 52 : 3156 - 3174
  • [4] Query by Example for Cross-Lingual Event Retrieval
    Sarwar, Sheikh Muhammad
    Allan, James
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1601 - 1604
  • [5] Semantic Cross-Lingual Information Retrieval
    Pourmahmoud, Solmaz
    Shamsfard, Mehrnoush
    [J]. 23RD INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2008, : 80 - +
  • [6] WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia
    Nguyen, Dong
    Overwijk, Arnold
    Hauff, Claudia
    Trieschnigg, Dolf R. B.
    Hiemstra, Djoerd
    de Jong, Franciska
    [J]. EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 58 - 65
  • [7] Using query-relevant documents pairs for cross-lingual information retrieval
    Pinto, David
    Juan, Alfons
    Rosso, Paolo
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 630 - 637
  • [8] Task-Dependent and Query-Dependent Subspace Learning for Cross-Modal Retrieval
    Wang, Li
    Zhu, Lei
    Yu, En
    Sun, Jiande
    Zhang, Huaxiang
    [J]. IEEE ACCESS, 2018, 6 : 27091 - 27102
  • [9] Learning Query-dependent Prefilters for Scalable Image Retrieval
    Torresani, Lorenzo
    Szummer, Martin
    Fitzgibbon, Andrew
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2607 - +
  • [10] RANKOM: A QUERY-DEPENDENT RANKING SYSTEM FOR INFORMATION RETRIEVAL
    Jiang, Jung-Yi
    Lee, Lian-Wang
    Lee, Shie-Jue
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (12) : 6739 - 6756