Exploiting Comparable Corpora for Cross-Language Information Retrieval

被引:0
|
作者
Sadat, Fatiha [1 ]
机构
[1] Univ Quebec, Dept Comp Sci, Montreal, PQ H3C 3P8, Canada
关键词
Cross-language information retrieval; comparable corpora; similarity; co-occurrence tendency;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large-scale comparable corpora became more abundant and accessible than parallel corpora, with the explosive growth of the World Wide Web. Therefore, strategies on bilingual terminology extraction from comparable texts must be given more attention in order to enrich existing bilingual lexicons and thesauri and to enhance Cross-Language Information Retrieval. In the present paper, we focus on the enhancement of Cross-Language Information Retrieval using a two-stage corpus-based translation model that includes bi-directional extraction of bilingual terminology from comparable corpora and selection of best translation alternatives on the basis of their morphological knowledge. The impact of comparable corpora on the performance of the Cross-Language Information Retrieval process is evaluated in this study and the results indicate that the effect is clearly positive, especially when using the linear combination with bilingual dictionaries and Japanese-English pair of languages.
引用
收藏
页码:662 / 667
页数:6
相关论文
共 50 条
  • [31] Cross-Language Information Retrieval: An analysis of errors
    Ruiz, ME
    Srinivasan, P
    ASIS '98 - PROCEEDINGS OF THE 61ST ASIS ANNUAL MEETING, VOL 35, 1998: INFORMATION ACCESS IN THE GLOBAL INFORMATION ECONOMY, 1998, 35 : 153 - 165
  • [32] Cross-Language Information Retrieval in Web application
    Yu, SF
    Li, ZZ
    Thomassen, W
    ICCC2004: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION VOL 1AND 2, 2004, : 1198 - 1202
  • [33] Cross-language information retrieval: the way ahead
    Gey, FC
    Kando, N
    Peters, C
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (03) : 415 - 431
  • [34] Matching meaning for cross-language information retrieval
    Wang, Jianqiang
    Oard, Douglas W.
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (04) : 631 - 653
  • [35] Influence of WSD on cross-language information retrieval
    Kang, IS
    Na, SH
    Lee, JH
    NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 358 - 366
  • [36] Different approaches to cross-language information retrieval
    Kraaij, W
    Pohlmann, R
    COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS 2000, 2001, (37): : 97 - 110
  • [37] Effect of cross-language IR in bilingual lexicon acquisition from comparable corpora
    Utsuro, T
    Horiuchi, T
    Hino, K
    Hamamoto, T
    Nakayama, T
    EACL 2003: 10TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 355 - 362
  • [38] Using Mutual Information Technique in Cross-Language Information Retrieval
    Sari, Syandra
    Adriani, Mirna
    DIGITAL LIBRARIES: UNIVERSAL AND UBIQUITOUS ACCESS TO INFORMATION, PROCEEDINGS, 2008, 5362 : 276 - +
  • [39] Leveraging comparable corpora for cross-lingual information retrieval in resource-lean language pairs
    Shakery, Azadeh
    Zhai, ChengXiang
    INFORMATION RETRIEVAL, 2013, 16 (01): : 1 - 29
  • [40] Leveraging comparable corpora for cross-lingual information retrieval in resource-lean language pairs
    Azadeh Shakery
    ChengXiang Zhai
    Information Retrieval, 2013, 16 : 1 - 29