Discovering and understanding word level user intent in Web search queries

被引:9
|
作者
Roy, Rishiraj Saha [1 ]
Katare, Rahul [1 ]
Ganguly, Niloy [1 ]
Laxman, Srivatsan [2 ]
Choudhury, Monojit [3 ]
机构
[1] Indian Inst Technol Kharagpur, Comp Sci & Engn, Kharagpur, W Bengal, India
[2] Scibler Technol Private Ltd, Bengaluru, Karnataka, India
[3] Microsoft Res India, Bengaluru, Karnataka, India
来源
JOURNAL OF WEB SEMANTICS | 2015年 / 30卷
关键词
Query understanding; Query intent; Intent words; Co-occurrence entropy; TERM PROXIMITY;
D O I
10.1016/j.websem.2014.07.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying and interpreting user intent are fundamental to semantic search. In this paper, we investigate the association of intent with individual words of a search query. We propose that words in queries can be classified as either content or intent, where content words represent the central topic of the query, while users add intent words to make their requirements more explicit. We argue that intelligent processing of intent words can be vital to improving the result quality, and in this work we focus on intent word discovery and understanding. Our approach towards intent word detection is motivated by the hypotheses that query intent words satisfy certain distributional properties in large query logs similar to function words in natural language corpora. Following this idea, we first prove the effectiveness of our corpus distributional features, namely, word co-occurrence counts and entropies, towards function word detection for five natural languages. Next, we show that reliable detection of intent words in queries is possible using these same features computed from query logs. To make the distinction between content and intent words more tangible, we additionally provide operational definitions of content and intent words as those words that should match, and those that need not match, respectively, in the text of relevant documents. In addition to a standard evaluation against human annotations, we also provide an alternative validation of our ideas using clickthrough data. Concordance of the two orthogonal evaluation approaches provide further support to our original hypothesis of the existence of two distinct word classes in search queries. Finally, we provide a taxonomy of intent words derived through rigorous manual analysis of large query logs. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:22 / 38
页数:17
相关论文
共 50 条
  • [21] Learning Multiple Intent Representations for Search Queries
    Hashemi, Helia
    Zamani, Hamed
    Croft, W. Bruce
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 669 - 679
  • [22] Upgrading web search queries
    Owais, Suhail S. J.
    Kroemer, Pavel
    Snasel, Vaclav
    Maleki-Dizaji, S.
    Nyongesa, Henry O.
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 128 - +
  • [23] Understanding the brevity of web queries
    Freund, L
    Toms, EG
    ASIST 2003: PROCEEDINGS OF THE 66TH ASIST ANNUAL MEETING, VOL 40, 2003: HUMANIZING INFORMATION TECHNOLOGY: FROM IDEAS TO BITS AND BACK, 2003, 40 : 517 - 518
  • [24] Task, Information Seeking Intentions, and User Behavior: Toward A Multi-level Understanding of Web Search
    Liu, Jiqun
    Mitsui, Matthew
    Belkin, Nicholas J.
    Shah, Chirag
    PROCEEDINGS OF THE 2019 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL (CHIIR'19), 2019, : 123 - 132
  • [25] Understanding User Situational Relevance in Ranking Web Search Results
    Opoku-Mensah, Eugene
    Zhang, Fengli
    Zhou, Fan
    Kittur, Philemon Kibiwott
    2017 8TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2017, : 405 - 410
  • [26] Determining the informational, navigational, and transactional intent of Web queries
    Jansen, Bernard J.
    Booth, Danielle L.
    Spink, Amanda
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (03) : 1251 - 1266
  • [27] USE OF A MULTI LEVEL SUB STRUCTURE SEARCH SYSTEM - SURVEY OF USER QUERIES
    HYDE, E
    MCARDLE, LA
    LAMBOURN.DR
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1972, : 1 - &
  • [28] Utilizing search intent in topic ontology-based user profie for web mining
    Zhou, Xujuan
    Wu, Sheng-Tang
    Li, Yuefeng
    Xu, Yue
    Lau, Raymond Y. K.
    Bruza, Peter D.
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 558 - +
  • [29] Stepping Towards a Semantic Web Search Engine for Accurate Outcomes in Favor of User Queries
    Suryanarayana, D.
    Hussain, S. Mahaboob
    Kanakam, Prathyusha
    Gupta, Sumit
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 8 - 13
  • [30] Generic Intent Representation in Web Search
    Zhang, Hongfei
    Song, Xia
    Xiong, Chenyan
    Rosset, Corby
    Bennett, Paul N.
    Craswell, Nick
    Tiwary, Saurabh
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 65 - 74