Query expansion with statistical machine translation

被引:0
|
作者
Li Weijiang [1 ]
Zhao Tiejun [1 ]
Wang Xiangang [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, MOE MS Key Lab Nat Language Proc & Speech, Harbin 150001, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2008年 / 17卷 / 01期
关键词
information retrieval; query expansion; language model; statistical machine translation (SMT);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In practical applications of information retrieval, such as the search engine, the query user submitted contains only several keywords usually. This will cause unmatched issues of words between relevant files and the user's query, and result in more seriously negative effects on the performance of information retrieval. On the basis of analyzing the process of producing query, this paper puts forward a new method of query expansion based on the model of statistical machine translation. The approach extract related terms between documents and query through statistical machine translation model, then expand the query with them. The experiment on TREC data collection shows that our method achieved 4 - 17% of the improvement all the time more than the language model method without expanding. Compared to pseudo feedback, our method has the competitive average precision.
引用
收藏
页码:48 / 52
页数:5
相关论文
共 50 条
  • [41] Online Learning for Statistical Machine Translation
    Ortiz-Martinez, Daniel
    COMPUTATIONAL LINGUISTICS, 2016, 42 (01) : 121 - 161
  • [42] Domain Adaptation for Statistical Machine Translation
    Wang, Xiaoxue
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1652 - 1658
  • [43] Distortion Models For Statistical Machine Translation
    Al-Onaizan, Yaser
    Papineni, Kishore
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 529 - 536
  • [44] Topic Adaptation for Statistical Machine Translation
    Taraghi, Mina
    Khadivi, Shahram
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 2147 - 2152
  • [45] Social Annotation in Query Expansion: a Machine Learning Approach
    Lin, Yuan
    Lin, Hongfei
    Jin, Song
    Ye, Zheng
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 405 - 414
  • [46] Query expansion based on statistical learning from code changes
    Huang, Qing
    Yang, Yangrui
    Zhan, Xue
    Wan, Hongyan
    Wu, Guoqing
    SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (07): : 1333 - 1351
  • [47] A Sense-Based Translation Model for Statistical Machine Translation
    Xiong, Deyi
    Zhang, Min
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1459 - 1469
  • [48] Analysing terminology translation errors in statistical and neural machine translation
    Haque, Rejwanul
    Hasanuzzaman, Mohammed
    Way, Andy
    MACHINE TRANSLATION, 2020, 34 (2-3) : 149 - 195
  • [49] A Topic-Triggered Translation Model for Statistical Machine Translation
    SU Jinsong
    WANG Zhihao
    WU Qingqiang
    YAO Junfeng
    LONG Fei
    ZHANG Haiying
    ChineseJournalofElectronics, 2017, 26 (01) : 65 - 72
  • [50] DOCUMENT TRANSLATION RETRIEVAL BASED ON STATISTICAL MACHINE TRANSLATION TECHNIQUES
    Sanchez-Martinez, Felipe
    Carrasco, Rafael C.
    APPLIED ARTIFICIAL INTELLIGENCE, 2011, 25 (05) : 329 - 340