A new approach to query segmentation for relevance ranking in web search

被引:4
|
作者
Wu, Haocheng [1 ]
Hu, Yunhua [2 ]
Li, Hang [3 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Alibaba Com, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei Technol, Hong Kong, Hong Kong, Peoples R China
来源
INFORMATION RETRIEVAL JOURNAL | 2015年 / 18卷 / 01期
关键词
Web search; Query segmentation; Relevance ranking; Query processing; Re-ranking; BM25; Term dependency model; Key n-gram extraction;
D O I
10.1007/s10791-014-9246-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to determine how best to improve state-of-the-art methods for relevance ranking in web searching by query segmentation. Query segmentation is meant to separate the input query into segments, typically natural language phrases. We propose employing the re-ranking approach in query segmentation, which first employs a generative model to create the top k candidates and then employs a discriminative model to re-rank the candidates to obtain the final segmentation result. The method has been widely utilized for structure prediction in natural language processing, but has not been applied to query segmentation, as far as we know. Furthermore, we propose a new method for using the results of query segmentation in relevance ranking, which takes both the original query words and the segmented query phrases as units of query representation. We investigate whether our method can improve three relevance models, namely n-gram BM25, key n-gram model and term dependency model, within the framework of learning to rank. Our experimental results on large scale web search datasets show that our method can indeed significantly improve relevance ranking in all three cases.
引用
收藏
页码:26 / 50
页数:25
相关论文
共 50 条
  • [41] A novel approach for ranking web documents based on query-optimized personalized pagerank
    Rajendra Kumar Roul
    Jajati Keshari Sahoo
    International Journal of Data Science and Analytics, 2021, 11 : 37 - 55
  • [42] Time heuristics ranking approach for recommended queries using search engine query logs
    Umagandhi, R.
    Kumar, A. V. Senthil
    KUWAIT JOURNAL OF SCIENCE, 2014, 41 (02) : 127 - 149
  • [43] Predicting the relevance of Web search results: A collaborative filtering approach
    Briggs, P
    Smyth, B
    STAIRS 2004, 2004, 109 : 137 - 146
  • [44] FILTERING SEARCH: A NEW APPROACH TO QUERY-ANSWERING.
    Chazelle, Bernard
    1600, (15):
  • [45] FILTERING SEARCH - A NEW APPROACH TO QUERY-ANSWERING
    CHAZELLE, B
    SIAM JOURNAL ON COMPUTING, 1986, 15 (03) : 703 - 724
  • [46] Transactional query identification in Web search
    Kang, IH
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 221 - 232
  • [47] Query association surrogates for Web search
    Scholer, F
    Williams, HE
    Turpin, A
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (07): : 637 - 650
  • [48] Automated ranking of approximate query results of web database
    Meng, Xiang-Fu
    Ma, Zong-Min
    Zhang, Xiao-Yan
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2010, 31 (01): : 23 - 27
  • [49] Results ranking in web search engines
    Courtois, MP
    Berry, MW
    ONLINE, 1999, 23 (03): : 39 - +
  • [50] Joint Ranking for Multilingual Web Search
    Gao, Wei
    Niu, Cheng
    Zhou, Ming
    Wong, Kam-Fai
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 114 - +