Exploiting Comparable Corpora for Cross-Language Information Retrieval

被引:0
|
作者
Sadat, Fatiha [1 ]
机构
[1] Univ Quebec, Dept Comp Sci, Montreal, PQ H3C 3P8, Canada
关键词
Cross-language information retrieval; comparable corpora; similarity; co-occurrence tendency;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large-scale comparable corpora became more abundant and accessible than parallel corpora, with the explosive growth of the World Wide Web. Therefore, strategies on bilingual terminology extraction from comparable texts must be given more attention in order to enrich existing bilingual lexicons and thesauri and to enhance Cross-Language Information Retrieval. In the present paper, we focus on the enhancement of Cross-Language Information Retrieval using a two-stage corpus-based translation model that includes bi-directional extraction of bilingual terminology from comparable corpora and selection of best translation alternatives on the basis of their morphological knowledge. The impact of comparable corpora on the performance of the Cross-Language Information Retrieval process is evaluated in this study and the results indicate that the effect is clearly positive, especially when using the linear combination with bilingual dictionaries and Japanese-English pair of languages.
引用
收藏
页码:662 / 667
页数:6
相关论文
共 50 条
  • [21] Cross-language Information Retrieval Based on Multiple Information
    Liu, Pengyuan
    Zheng, Zhijun
    Su, Qi
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 623 - 626
  • [22] Neural Methods for Cross-Language Information Retrieval
    Yang, Eugene
    Lawrie, Dawn
    Mayfield, James
    Nair, Suraj
    Oard, Douglas W.
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3430 - 3431
  • [23] Translation Techniques in Cross-Language Information Retrieval
    Zhou, Dong
    Truran, Mark
    Brailsford, Tim
    Wade, Vincent
    Ashman, Helen
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [24] Translation Ambiguity in Cross-Language Information Retrieval
    Sadat, Fatiha
    BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 1-2, 2010, : 301 - 303
  • [25] Relevance feedback and cross-language information retrieval
    Orengo, Viviane Moreira
    Huyck, Christian
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (05) : 1203 - 1217
  • [26] Cross-Language Information Retrieval: An analysis of errors
    Ruiz, ME
    Srinivasan, P
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1998, 35 : 153 - 165
  • [27] The BETTER Cross-Language Information Retrieval Datasets
    Soboroff, Ian
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3047 - 3053
  • [28] Arabic Cross-Language Information Retrieval: A Review
    Elayeb, Bilel
    Bounhas, Ibrahim
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (03)
  • [29] Combining evidence for cross-language information retrieval
    Kamps, J
    Monz, C
    de Rijke, M
    ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 111 - 126
  • [30] Disambiguation strategies for Cross-Language Information Retrieval
    Hiemstra, D
    de Jong, F
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 1999, 1696 : 274 - 293