A new approach to query segmentation for relevance ranking in web search

被引:4
|
作者
Wu, Haocheng [1 ]
Hu, Yunhua [2 ]
Li, Hang [3 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Alibaba Com, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei Technol, Hong Kong, Hong Kong, Peoples R China
来源
INFORMATION RETRIEVAL JOURNAL | 2015年 / 18卷 / 01期
关键词
Web search; Query segmentation; Relevance ranking; Query processing; Re-ranking; BM25; Term dependency model; Key n-gram extraction;
D O I
10.1007/s10791-014-9246-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to determine how best to improve state-of-the-art methods for relevance ranking in web searching by query segmentation. Query segmentation is meant to separate the input query into segments, typically natural language phrases. We propose employing the re-ranking approach in query segmentation, which first employs a generative model to create the top k candidates and then employs a discriminative model to re-rank the candidates to obtain the final segmentation result. The method has been widely utilized for structure prediction in natural language processing, but has not been applied to query segmentation, as far as we know. Furthermore, we propose a new method for using the results of query segmentation in relevance ranking, which takes both the original query words and the segmented query phrases as units of query representation. We investigate whether our method can improve three relevance models, namely n-gram BM25, key n-gram model and term dependency model, within the framework of learning to rank. Our experimental results on large scale web search datasets show that our method can indeed significantly improve relevance ranking in all three cases.
引用
收藏
页码:26 / 50
页数:25
相关论文
共 50 条
  • [1] A new approach to query segmentation for relevance ranking in web search
    Haocheng Wu
    Yunhua Hu
    Hang Li
    Enhong Chen
    Information Retrieval Journal, 2015, 18 : 26 - 50
  • [2] Relevance Ranking for Web Search
    Lages, Joao
    Carvalho, Joao Paulo
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [3] Coverage, relevance, and ranking: The impact of query operators on web search engine results
    Eastman, CM
    Jansen, BJ
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2003, 21 (04) : 383 - 411
  • [4] Query suggestion by query search: a new approach to user support in web search
    Jiang, Shen
    Zilles, Sandra
    Holte, Robert
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 679 - +
  • [5] Query Sampling for Ranking Learning in Web Search
    Yang, Linjun
    Wang, Li
    Geng, Bo
    Hua, Xian-Sheng
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 754 - 755
  • [6] Cocluster hypothesis and ranking consistency for relevance ranking in web search
    Jiang, Jian-De
    Jiang, Jyun-Yu
    Cheng, Pu-Jen
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2019, 70 (06) : 535 - 546
  • [7] Ranking Entities Using Web Search Query Logs
    Billerbeck, Bodo
    Demartini, Gianluca
    Firan, Claudiu S.
    Iofciu, Teresa
    Krestel, Ralf
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2010, 6273 : 273 - +
  • [8] An approach for the ranking of query results in the semantic web
    Stojanovic, N
    Studer, R
    Stojanovic, L
    SEMANTIC WEB - ISWC 2003, 2003, 2870 : 500 - 516
  • [9] Query aspects approach to web search
    Crabtree, Daniel
    Gao, Xiaoying
    Andreae, Peter
    WEB INTELLIGENCE, 2016, 14 (03) : 173 - 197
  • [10] LambdaRank Acceleration for Relevance Ranking in Web Search Engines
    Yan, Jing
    Xu, Ning-Yi
    Cai, Xiong-Fei
    Gao, Rui
    Wang, Yu
    Luo, Rong
    Hsu, Feng-Hsiung
    FPGA 10, 2010, : 285 - 285