Query reformulation mining: models, patterns, and applications

Cited by: 22
Authors
Boldi, Paolo [1]
Bonchi, Francesco [2]
Castillo, Carlos [2]
Vigna, Sebastiano [1]
Affiliations
[1] Univ Milan, DSI, I-20135 Milan, Italy
[2] Yahoo Res, Barcelona 080018, Spain
Source
INFORMATION RETRIEVAL | 2011 / Vol. 14 / Issue 03
Keywords
Query log mining; Query flow graph; Session segmentation; Query recommendation; WEB; SEARCH;
DOI
10.1007/s10791-010-9155-3
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Understanding query reformulation patterns is a key task towards next-generation web search engines. If we can do that, then we can build systems able to understand and possibly predict user intent, providing the needed assistance at the right time, and thus helping users locate information more effectively and improving their web-search experience. As a step in this direction, we build a very accurate model for classifying user query reformulations into broad classes (generalization, specialization, error correction, or parallel move), achieving 92% accuracy. We then apply the model to automatically label two very large query logs sampled from different geographic areas and containing a total of approximately 17 million query reformulations. We study the resulting reformulation patterns, matching some results from previous studies performed on smaller manually annotated datasets, and discovering new interesting reformulation patterns, including connections between reformulation types and topical categories. We annotate two large query-flow graphs with reformulation type information, and run several graph-characterization experiments on these graphs, extracting new insights about the relationships between the different query reformulation types. Finally, we study query recommendations based on short random walks on the query-flow graphs. Our experiments show that these methods can match, and often improve on, the precision of recommendations based on query-click graphs, without the need for users' clicks. Our experiments also show that considering transition-type labels on edges is important for obtaining good-quality recommendations.
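To make the recommendation idea in the abstract concrete, the following is a minimal sketch of a short random walk with restart over a small query-flow graph whose edges carry a reformulation-type label. It is not the authors' implementation: the toy graph, the edge weights, the one-letter type codes (G, S, C, P), the restart probability, the step count, and the function names short_walk_scores and recommend are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical toy query-flow graph: (source, target) -> (weight, type).
# Assumed type codes: G = generalization, S = specialization,
# C = error correction, P = parallel move.
EDGES = {
    ("dango", "japanese cakes"): (3.0, "G"),
    ("dango", "dango recipe"): (5.0, "S"),
    ("dango", "mochi"): (2.0, "P"),
    ("dngo", "dango"): (4.0, "C"),
    ("dango recipe", "mochi recipe"): (1.0, "P"),
    ("japanese cakes", "mochi"): (2.0, "S"),
}


def short_walk_scores(edges, start, steps=2, restart=0.5, allowed_types=None):
    """Probability mass per query after a few walk steps from `start`.

    Only edges whose type is in `allowed_types` are followed
    (all edges when `allowed_types` is None).
    """
    out = defaultdict(list)
    for (src, dst), (weight, kind) in edges.items():
        if allowed_types is None or kind in allowed_types:
            out[src].append((dst, weight))

    dist = {start: 1.0}
    for _ in range(steps):
        nxt = defaultdict(float)
        nxt[start] += restart  # restart share of the total mass (which is 1)
        for node, mass in dist.items():
            neighbours = out.get(node)
            if not neighbours:
                # Dangling query: send its remaining mass back to the start.
                nxt[start] += (1.0 - restart) * mass
                continue
            total = sum(w for _, w in neighbours)
            for dst, w in neighbours:
                nxt[dst] += (1.0 - restart) * mass * w / total
        dist = dict(nxt)
    return dist


def recommend(edges, query, k=3, **walk_kwargs):
    """Top-k queries by walk score, excluding the input query itself."""
    scores = short_walk_scores(edges, query, **walk_kwargs)
    ranked = sorted((q for q in scores if q != query),
                    key=lambda q: (-scores[q], q))
    return ranked[:k]


if __name__ == "__main__":
    print(recommend(EDGES, "dango"))
    print(recommend(EDGES, "dango", allowed_types={"S"}))
```

Restricting allowed_types is one simple way to let the edge labels influence the result; in this toy graph, following only specialization edges from "dango" leaves "dango recipe" as the sole candidate.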
Pages: 257-289
Page count: 33
Related papers
50 in total
  • [1] Query reformulation mining: models, patterns, and applications
    Paolo Boldi
    Francesco Bonchi
    Carlos Castillo
    Sebastiano Vigna
    [J]. Information Retrieval, 2011, 14 : 257 - 289
  • [2] From "Dango" to "Japanese Cakes": Query Reformulation Models and Patterns
    Boldi, Paolo
    Bonchi, Francesco
    Castillo, Carlos
    Vigna, Sebastiano
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 183 - +
  • [3] Patterns of Query Reformulation During Web Searching
    Jansen, Bernard J.
    Booth, Danielle L.
    Spink, Amanda
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (07): : 1358 - 1371
  • [4] Patterns of Gender-Specializing Query Reformulation
    Raj, Amifa
    Mitra, Bhaskar
    Craswell, Nick
    Ekstrand, Michael
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2241 - 2245
  • [5] Generalized Syntactic and Semantic Models of Query Reformulation
    Herdagdelen, Amac
    Ciaramita, Massimiliano
    Mahler, Daniel
    Holmqvist, Maria
    Hall, Keith
    Riezler, Stefan
    Alfonseca, Enrique
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 283 - 290
  • [6] Patterns and transitions of query reformulation during web searching
    Jansen, Bernard J.
    Zhang, Mimi
    Spink, Amanda
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2007, 3 (04) : 328 - +
  • [7] Applications of Web query mining
    Baeza-Yates, R
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2005, 3408 : 7 - 22
  • [8] Query Reformulation Patterns in Cross-device OPAC Search
    Wu, Dan
    Bi, Renmin
    [J]. 2017 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2017), 2017, : 328 - 329
  • [9] Mining XML frequent query patterns
    Hua, Cheng
    Zhao, Hai-jun
    Chen, Yi
    [J]. INTEGRATION AND INNOVATION ORIENT TO E-SOCIETY, VOL 1, 2007, 251 : 26 - +
  • [10] SEMANTIC FRAMEWORK FOR SPATIAL QUERY REFORMULATION FOR DISASTER MONITORING APPLICATIONS
    Kurte, Kuldeep R.
    Potnis, Abhishek V.
    Durbha, Surya S.
    Shinde, Rajat C.
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 9946 - 9949