Query expansion with statistical machine translation

Authors
Li Weijiang [1]
Zhao Tiejun [1]
Wang Xiangang [1]
Affiliation
[1] Harbin Inst Technol, Sch Comp Sci & Technol, MOE MS Key Lab Nat Language Proc & Speech, Harbin 150001, Peoples R China
Source
CHINESE JOURNAL OF ELECTRONICS | 2008 / Vol. 17 / No. 01
Keywords
information retrieval; query expansion; language model; statistical machine translation (SMT)
DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract
In practical applications of information retrieval, such as search engines, a user's query usually contains only a few keywords. This causes word mismatch between relevant documents and the user's query, which seriously degrades retrieval performance. Based on an analysis of how queries are produced, this paper proposes a new query expansion method built on a statistical machine translation model. The approach uses the statistical machine translation model to extract terms that relate documents to the query, then expands the query with them. Experiments on the TREC data collection show that our method consistently improves on the language model method without expansion by 4-17%. Compared with pseudo feedback, our method achieves competitive average precision.
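The abstract does not give the model's details, so the following is only a minimal sketch of the general idea of translation-based query expansion, not the authors' implementation. It assumes word-level translation probabilities P(w | q) have already been estimated elsewhere; the `TRANSLATION` table, its values, and the `expand_query` helper are all hypothetical and exist purely for illustration.

```python
from collections import defaultdict

# Hypothetical translation table (illustrative values only):
# TRANSLATION[q][w] approximates P(w | q), the probability that
# query term q "translates" to document term w.
TRANSLATION = {
    "car": {"car": 0.5, "automobile": 0.3, "vehicle": 0.2},
    "price": {"price": 0.6, "cost": 0.25, "vehicle": 0.15},
}


def expand_query(query_terms, translation, top_k=2):
    """Return the original query terms plus the top_k related terms.

    Each candidate term is scored by its translation probability
    averaged over all query terms; terms already in the query are skipped.
    """
    scores = defaultdict(float)
    for q in query_terms:
        for w, p in translation.get(q, {}).items():
            if w not in query_terms:
                scores[w] += p / len(query_terms)
    # Sort by descending score, breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query_terms) + [w for w, _ in ranked[:top_k]]


if __name__ == "__main__":
    print(expand_query(["car", "price"], TRANSLATION))
```

In this toy table, "vehicle" receives probability mass from both query terms, so it ranks ahead of "automobile" and "cost". A real system would estimate the table from data and would typically weight the expansion terms rather than append them unweighted.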
Pages: 48-52
Page count: 5