Mining Broad Latent Query Aspects from Search Sessions

Cited by: 0
Authors
Wang, Xuanhui [1]
Chakrabarti, Deepayan [1]
Punera, Kunal [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
Keywords
Latent user intents; query aspects; search sessions;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Search queries are typically very short, which means they are often underspecified or have senses that the user did not think of. A broad latent query aspect is a set of keywords that succinctly represents one particular sense, or one particular information need, that can aid users in reformulating such queries. We extract such broad latent aspects from query reformulations found in historical search session logs. We propose a framework under which the problem of extracting such broad latent aspects reduces to that of optimizing a formal objective function under constraints on the total number of aspects the system can store, and the number of aspects that can be shown in response to any given query. We present algorithms to find a good set of aspects, and also to pick the best k aspects matching any query. Empirical results on real-world search engine logs show significant gains over a strong baseline that uses single-keyword reformulations: a gain of 14% and 23% in terms of human-judged accuracy and click-through data respectively, and around 20% in terms of consistency among aspects predicted for "similar" queries. This demonstrates both the importance of broad query aspects, and the efficacy of our algorithms for extracting them.
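The abstract does not state the objective function or the algorithms, so the following is only a minimal sketch of the general shape of the problem, not the authors' method. It assumes a greedy coverage heuristic as a stand-in: an aspect (a set of keywords) "covers" a logged reformulation if it shares at least one keyword with the terms the user added. The function names (`added_terms`, `select_aspects`, `top_k_aspects`), the candidate aspects, and the sample sessions are all hypothetical and invented for illustration.

```python
from collections import defaultdict


def added_terms(query, reformulation):
    """Return the keywords the user added when reformulating the query."""
    return frozenset(reformulation.lower().split()) - frozenset(query.lower().split())


def select_aspects(sessions, candidates, budget):
    """Greedily keep at most `budget` aspects covering the most reformulations.

    Assumption: greedy coverage stands in for the paper's (unstated) objective.
    """
    added = [added_terms(q, r) for q, r in sessions]
    uncovered = {i for i, terms in enumerate(added) if terms}
    remaining = list(candidates)
    chosen = []
    while remaining and uncovered and len(chosen) < budget:
        # Pick the aspect that covers the most still-uncovered reformulations.
        best = max(remaining, key=lambda a: sum(1 for i in uncovered if added[i] & a))
        gain = {i for i in uncovered if added[i] & best}
        if not gain:
            break
        chosen.append(best)
        remaining.remove(best)
        uncovered -= gain
    return chosen


def top_k_aspects(query, sessions, aspects, k):
    """Rank stored aspects by how many reformulations of `query` they cover."""
    scores = defaultdict(int)
    for q, r in sessions:
        if q.lower() != query.lower():
            continue
        terms = added_terms(q, r)
        for aspect in aspects:
            if terms & aspect:
                scores[aspect] += 1
    # Break ties alphabetically so the ranking is deterministic.
    ranked = sorted(scores, key=lambda a: (-scores[a], sorted(a)))
    return ranked[:k]


# Hypothetical (query, reformulation) pairs standing in for a session log.
sessions = [
    ("jaguar", "jaguar car price"),
    ("jaguar", "jaguar dealer"),
    ("jaguar", "jaguar habitat"),
    ("jaguar", "jaguar cat diet"),
    ("python", "python snake habitat"),
    ("python", "python tutorial"),
]
vehicles = frozenset({"car", "price", "dealer"})
animals = frozenset({"habitat", "diet", "cat", "snake"})
learning = frozenset({"tutorial"})

stored = select_aspects(sessions, [vehicles, animals, learning], budget=2)
shown = top_k_aspects("python", sessions, stored, k=1)
```

Here `budget` plays the role of the constraint on the total number of stored aspects, and `k` the constraint on how many aspects are shown for a given query. A real system would also need a way to generate candidate aspects from the logs, which this sketch takes as given.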
Pages: 867-875
Number of pages: 9