A new approach to query segmentation for relevance ranking in web search

Cited by: 4
Authors
Wu, Haocheng [1]
Hu, Yunhua [2]
Li, Hang [3]
Chen, Enhong [1]
Affiliations
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Alibaba Com, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei Technol, Hong Kong, Hong Kong, Peoples R China
Source
INFORMATION RETRIEVAL JOURNAL | 2015 / Vol. 18 / Issue 01
Keywords
Web search; Query segmentation; Relevance ranking; Query processing; Re-ranking; BM25; Term dependency model; Key n-gram extraction;
DOI
10.1007/s10791-014-9246-7
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
In this paper, we try to determine how best to improve state-of-the-art methods for relevance ranking in web searching by query segmentation. Query segmentation is meant to separate the input query into segments, typically natural language phrases. We propose employing the re-ranking approach in query segmentation, which first employs a generative model to create the top k candidates and then employs a discriminative model to re-rank the candidates to obtain the final segmentation result. The method has been widely utilized for structure prediction in natural language processing, but has not been applied to query segmentation, as far as we know. Furthermore, we propose a new method for using the results of query segmentation in relevance ranking, which takes both the original query words and the segmented query phrases as units of query representation. We investigate whether our method can improve three relevance models, namely n-gram BM25, key n-gram model and term dependency model, within the framework of learning to rank. Our experimental results on large scale web search datasets show that our method can indeed significantly improve relevance ranking in all three cases.
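The two-stage procedure described in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the segment log-probabilities, the three features, the weights, and all function names are hypothetical values chosen for illustration. The generative stage here simply scores every segmentation of a short query with a toy segment model and keeps the top k; the discriminative stage is a hand-weighted linear scorer standing in for a trained model. The last function shows one way to form a query representation from both the original words and the segmented phrases.

```python
from itertools import combinations

def enumerate_segmentations(words):
    """Yield every split of `words` into contiguous segments (2^(n-1) in total)."""
    n = len(words)
    for r in range(n):
        for cuts in combinations(range(1, n), r):
            bounds = [0, *cuts, n]
            yield [tuple(words[i:j]) for i, j in zip(bounds, bounds[1:])]

def generative_score(segmentation, segment_logprob, unseen=-10.0):
    """Sum of toy segment log-probabilities; unseen segments are penalised per word."""
    return sum(
        segment_logprob.get(" ".join(seg), unseen * len(seg)) for seg in segmentation
    )

def top_k_candidates(words, segment_logprob, k=3):
    """Stage 1: keep the k segmentations with the highest generative score."""
    candidates = list(enumerate_segmentations(words))
    candidates.sort(key=lambda s: generative_score(s, segment_logprob), reverse=True)
    return candidates[:k]

def features(segmentation, segment_logprob, known_phrases):
    """Hypothetical feature vector for the re-ranker."""
    multiword = [" ".join(s) for s in segmentation if len(s) > 1]
    return [
        generative_score(segmentation, segment_logprob),  # stage-1 score
        float(len(segmentation)),                         # number of segments
        float(sum(p in known_phrases for p in multiword)),  # dictionary hits
    ]

def rerank(candidates, weights, segment_logprob, known_phrases):
    """Stage 2: pick the candidate with the highest linear (discriminative) score."""
    def score(seg):
        feats = features(seg, segment_logprob, known_phrases)
        return sum(w * f for w, f in zip(weights, feats))
    return max(candidates, key=score)

def query_units(words, segmentation):
    """Use both the original words and the multi-word segments as query units."""
    return list(words) + [" ".join(s) for s in segmentation if len(s) > 1]

# Toy data (all values are assumptions made for this example).
words = ["new", "york", "times", "square"]
segment_logprob = {
    "new": -4.0, "york": -5.0, "times": -4.0, "square": -2.0,
    "new york": -2.0, "times square": -2.2, "new york times": -1.5,
}
known_phrases = {"new york", "times square", "new york times"}
weights = [1.0, -0.5, 2.0]  # would normally be learned from labelled segmentations

candidates = top_k_candidates(words, segment_logprob, k=3)
best = rerank(candidates, weights, segment_logprob, known_phrases)
units = query_units(words, best)
print(candidates[0])  # generative top-1
print(best)           # segmentation chosen after re-ranking
print(units)
```

With these toy numbers the generative model alone prefers "new york times | square", while the re-ranker selects "new york | times square", which shows why a second, feature-based pass can change the final segmentation.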
Pages: 26-50
Page count: 25
Related papers
50 in total
  • [32] Improving Web Search User Query Relevance using Content Based Page Rank
    Chouhan, Jayendra Singh
    Gadwal, Anand
    2015 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONTROL (IC4), 2015,
  • [33] Complex-query web image search with concept-based relevance estimation
    Guo, Dan
    Gao, Pengfei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2016, 19 (02): : 247 - 264
  • [34] From Web Search Relevance to Vertical Search Relevance
    Chang, Yi
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 1073 - 1073
  • [35] Fuzzy Clustering and Relevance Ranking of Web Search Results with Differentiating Cluster Label Generation
    Matsumoto, Takazumi
    Hung, Edward
    2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [36] Word distribution analysis for relevance ranking and query expansion
    Galeas, Patricio
    Freisleben, Bernd
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 500 - 511
  • [37] Deep Search Relevance Ranking in Practice
    Pang, Linsey
    Liu, Wei
    Chang, Keng-Hao
    Li, Xue
    Bhattacharya, Moumita
    Liu, Xianjing
    Guo, Stephen
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4810 - 4811
  • [39] Relevance ranking in georeferenced video search
    Ay, Sakire Arslan
    Zimmermann, Roger
    Kim, Seon Ho
    MULTIMEDIA SYSTEMS, 2010, 16 (02) : 105 - 125
  • [40] A novel approach for ranking web documents based on query-optimized personalized pagerank
    Roul, Rajendra Kumar
    Sahoo, Jajati Keshari
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2021, 11 (01) : 37 - 55