Transliteration Retrieval Model for Cross Lingual Information Retrieval

被引:0
|
作者
Jan, Ea-Ee [1 ]
Lin, Shih-Hsiang [1 ,2 ]
Chen, Berlin [2 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Natl Taiwan Normal Univ, Comp Sci & Informat Engn, Taipei, Taiwan
来源
关键词
cross lingual information retrieval (CLIR); transliteration; retrieval model; statistical machine translation (SMT); NTCIR; ALIGNMENT;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of transliteration from a source language to a target language builds the ground work in support of proper name Cross Lingual Information Retrieval (CLIR). Traditionally, this task is accomplished by two separate modules: transliteration and retrieval. Queries are first transliterated to target language using one or multiple hypotheses. The retrieval is then carried out based on translated queries. The transliteration often results in 30-50% errors with top I hypothesis, thus leading to significant performance degradation in CUR. Therefore, we proposed a unified transliteration retrieval model that incorporates the transliteration similarity measurement into the relevance scoring function. In addition, we presented an efficient and robust method in similarity measurement for a given proper name pair using the Hidden Markov Model (HMM) based alignment and a Statistical Machine Translation (SMT) framework. Experimental data showed significant results with the proposed integrated method on the NTCIR7 IR4QA task, which demonstrated a greater flexibility and acceptance in transliteration.
引用
下载
收藏
页码:183 / +
页数:3
相关论文
共 50 条
  • [21] Japanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration
    Atsushi Fujii
    Tetsuya Ishikawa
    Computers and the Humanities, 2001, 35 : 389 - 420
  • [22] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Ghanbari, Elham
    Shakery, Azadeh
    APPLIED INTELLIGENCE, 2022, 52 (03) : 3156 - 3174
  • [23] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    Applied Intelligence, 2022, 52 : 3156 - 3174
  • [24] CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task
    Sun, Shuo
    Sia, Suzanna
    Duh, Kevin
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 134 - 141
  • [25] Mind the Gap: Cross-Lingual Information Retrieval with Hierarchical Knowledge Enhancement
    Zhang, Fuwei
    Zhang, Zhao
    Ao, Xiang
    Gao, Dehong
    Zhuang, Fuzhen
    Wei, Yi
    He, Qing
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4345 - 4353
  • [26] Query-dependent learning to rank for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    Knowledge and Information Systems, 2019, 59 : 711 - 743
  • [27] A Low Cost Machine Translation Method for Cross-Lingual Information Retrieval
    Bracewell, David B.
    Ren, Fuji
    Kuroiwa, Shingo
    ENGINEERING LETTERS, 2008, 16 (01)
  • [28] Using the Web corpus to translate the queries in cross-lingual information retrieval
    Zhang, JL
    Sun, L
    Min, JM
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 493 - 498
  • [29] Cross-lingual information retrieval and delivery using community mobile networks
    Shriram, R.
    Sugumaran, Vijayan
    Kapetanios, Epaminondas
    2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 320 - +
  • [30] Unsupervised Cross-Lingual Information Retrieval Using Monolingual Data Only
    Litschko, Robert
    Glavas, Goran
    Ponzetto, Simone Paolo
    Vulic, Ivan
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1253 - 1256