Anchor text mining for translation of Web queries: A transitive translation approach

被引:29
|
作者
Lu, WH
Chien, LF
Lee, HJ
机构
[1] Acad Sinica, Inst Informat Sci, Nangang 115, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 300, Taiwan
关键词
algorithms; experimentation; performance; multilingual translation; anchor text mining; cross-language information retrieval; cross-language Web search; competitive linking algorithm;
D O I
10.1145/984321.984324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To discover translation knowledge in diverse data resources on the Web, this article proposes an effective approach to finding translation equivalents of query terms and constructing multilingual lexicons through the mining of Web anchor texts and link structures. Although Web anchor texts are wide-scoped hypertext resources, not every particular pair of languages contains sufficient anchor texts for effective extraction of translations for Web queries. For more generalized applications, the approach is designed based on a transitive translation model. The translation equivalents of a query term can be extracted via its translation in an intermediate language. To reduce interference from translation errors, the approach further integrates a competitive linking algorithm into the process of determining the most probable translation. A series of experiments has been conducted, including performance tests on term translation extraction, cross-language information retrieval, and translation suggestions for practical Web search services, respectively. The obtained experimental results have shown that the proposed approach is effective in extracting translations of unknown queries, is easy to combine with the probabilistic retrieval model to improve the cross-language retrieval performance, and is very useful when the considered language pairs lack a sufficient number of anchor texts. Based on the approach, an experimental system called LiveTrans has been developed for English-Chinese cross-language Web search.
引用
收藏
页码:242 / 269
页数:28
相关论文
共 50 条
  • [1] Anchor text mining for translation of Web queries
    Lu, WH
    Chien, LF
    Lee, HJ
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 401 - 408
  • [2] Text Mining in Radiology Reports by Statistical Machine Translation Approach
    Bodile, Anuradha
    Kshirsagar, Manali
    2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 906 - 909
  • [3] WeMiT: Web-Mining for Translation
    Roche, Mathieu
    Garbasevschi, Oana Mihaela
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 993 - +
  • [4] Semantic Completely Preprocessing for Deep Web Queries Translation
    Liang, Hao
    Ren, Fei
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 43 - 46
  • [5] Improving Web-Based OOV Translation Mining for Query Translation
    Ge, Yun Dong
    Hong, Yu
    Yao, Jian Min
    Zhu, Qiao Ming
    INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 576 - 587
  • [6] Machine Translation: Mining Text for Social Theory
    Evans, James A.
    Aceves, Pedro
    ANNUAL REVIEW OF SOCIOLOGY, VOL 42, 2016, 42 : 21 - 50
  • [7] Implementation of Neural Machine Translation for Nahuatl as a Web Platform: A Focus on Text Translation
    Bello Garcia, S. Khalil
    Sanchez Lucero, E.
    Bonilla Huerta, E.
    Hernandez Hernandez, J. Crispin
    Ramirez Cruz, J. Federico
    Pedroza Mendez, B. Estela
    PROGRAMMING AND COMPUTER SOFTWARE, 2021, 47 (08) : 778 - 792
  • [8] Implementation of Neural Machine Translation for Nahuatl as a Web Platform: A Focus on Text Translation
    S. Khalil Bello García
    E. Sánchez Lucero
    E. Bonilla Huerta
    J. Crispín Hernández Hernández
    J. Federico Ramírez Cruz
    B. Estela Pedroza Méndez
    Programming and Computer Software, 2021, 47 : 778 - 792
  • [9] Web-based terminology translation mining
    Fang, GL
    Yu, H
    Nishino, F
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 1004 - 1016
  • [10] Named Entity Translation with Web Mining and Transliteration
    Jiang, Long
    Zhou, Ming
    Chien, Lee-Feng
    Niu, Cheng
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1629 - 1634