Discovering and understanding word level user intent in Web search queries

Cited by: 9
Authors
Roy, Rishiraj Saha [1]
Katare, Rahul [1]
Ganguly, Niloy [1]
Laxman, Srivatsan [2]
Choudhury, Monojit [3]
Affiliations
[1] Indian Inst Technol Kharagpur, Comp Sci & Engn, Kharagpur, W Bengal, India
[2] Scibler Technol Private Ltd, Bengaluru, Karnataka, India
[3] Microsoft Res India, Bengaluru, Karnataka, India
Source
JOURNAL OF WEB SEMANTICS | 2015 / Vol. 30
Keywords
Query understanding; Query intent; Intent words; Co-occurrence entropy; TERM PROXIMITY;
DOI
10.1016/j.websem.2014.07.010
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Identifying and interpreting user intent are fundamental to semantic search. In this paper, we investigate the association of intent with individual words of a search query. We propose that words in queries can be classified as either content or intent, where content words represent the central topic of the query, while users add intent words to make their requirements more explicit. We argue that intelligent processing of intent words can be vital to improving result quality, and in this work we focus on intent word discovery and understanding. Our approach to intent word detection is motivated by the hypothesis that query intent words satisfy certain distributional properties in large query logs, similar to function words in natural language corpora. Following this idea, we first prove the effectiveness of our corpus distributional features, namely, word co-occurrence counts and entropies, for function word detection in five natural languages. Next, we show that reliable detection of intent words in queries is possible using these same features computed from query logs. To make the distinction between content and intent words more tangible, we additionally provide operational definitions of content and intent words as those words that should match, and those that need not match, respectively, in the text of relevant documents. In addition to a standard evaluation against human annotations, we also provide an alternative validation of our ideas using clickthrough data. Concordance of the two orthogonal evaluation approaches provides further support for our original hypothesis of the existence of two distinct word classes in search queries. Finally, we provide a taxonomy of intent words derived through rigorous manual analysis of large query logs. (C) 2014 Elsevier B.V. All rights reserved.
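The abstract names word co-occurrence counts and entropies as its distributional features but does not define them. A minimal sketch of one plausible reading follows, assuming that the co-occurrence count of a word is the number of distinct words sharing a query with it, and that its co-occurrence entropy is the Shannon entropy of that neighbour distribution. The function name `cooccurrence_features` and the toy log are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def cooccurrence_features(queries):
    """Map each word to (distinct co-occurring words, co-occurrence entropy in bits).

    Assumed definitions, not the paper's: two words co-occur when they appear
    in the same query; repeated words within one query are counted once.
    """
    neighbours = defaultdict(Counter)
    for query in queries:
        words = set(query.lower().split())
        for word in words:
            for other in words:
                if other != word:
                    neighbours[word][other] += 1

    features = {}
    for word, counts in neighbours.items():
        total = sum(counts.values())
        # Shannon entropy of the distribution over co-occurring words.
        entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
        features[word] = (len(counts), entropy)
    return features


# Hypothetical toy query log, for illustration only.
toy_log = [
    "harry potter pdf",
    "python tutorial pdf",
    "world map pdf",
    "harry potter movie",
]
features = cooccurrence_features(toy_log)
for word in ("pdf", "potter"):
    print(word, features[word])
```

On this toy log, "pdf" attaches to many unrelated topics and so gets a higher count and entropy than "potter", which is the kind of function-word-like behaviour the abstract hypothesizes for intent words. A real study would compute these features over a large query log and feed them to a classifier.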
Pages: 22 - 38
Page count: 17
Related papers
50 in total
  • [1] Word Order Communicates User Intent in Search Queries
    Smirnova, Anastasia
    CHI'20: EXTENDED ABSTRACTS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2020,
  • [2] Classifying the user intent of web queries using k-means clustering
    Kathuria, Ashish
    Jansen, Bernard J.
    Hafernik, Carolyn
    Spink, Amanda
    INTERNET RESEARCH, 2010, 20 (05) : 563 - 581
  • [3] Comparing Classifiers for Web User Intent Understanding
    Deufemia, Vincenzo
    Granatello, Miriam
    Merola, Alessandro
    Pesce, Emanuele
    Polese, Giuseppe
    EMPOWERING ORGANIZATIONS: ENABLING PLATFORMS AND ARTEFACTS, 2016, 11 : 147 - 159
  • [4] Named Entity Recognition in Local Intent Web Search Queries
    Mittal, Saloni
    Agarwal, Manoj K.
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT I, 2019, 11706 : 407 - 417
  • [5] Exploring effective features for recognizing the user intent behind web queries
    Figueroa, Alejandro
    COMPUTERS IN INDUSTRY, 2015, 68 : 162 - 169
  • [6] User Intent Inference for Web Search and Conversational Agents
    Ahmadvand, Ali
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 911 - 912
  • [7] User Intent and Assessor Disagreement in Web Search Evaluation
    Kazai, Gabriella
    Yilmaz, Emine
    Craswell, Nick
    Tahaghoghi, S. M. M.
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 699 - 708
  • [8] Temporal Dynamics of User Interests in Web Search Queries
    Cayci, Aysegul
    Sumengen, Selcuk
    Turkay, Cagatay
    Balcisoy, Selim
    Saygin, Yucel
    2009 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS: WAINA, VOLS 1 AND 2, 2009, : 762 - 767
  • [9] Understanding user intent on the web through interaction mining
    Caruccio, Loredana
    Deufemia, Vincenzo
    Polese, Giuseppe
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2015, 31 : 230 - 236
  • [10] The 32 Days Of Christmas: Understanding Temporal Intent in Image Search Queries
    Bentley, Frank R.
    Kaye, Joseph 'Jofish'
    Shamma, David A.
    Guerra-Gomez, John Alexis
    34TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2016, 2016, : 5710 - 5714