Exploring effective features for recognizing the user intent behind web queries

被引:20
|
作者
Figueroa, Alejandro [1 ,2 ]
机构
[1] Yahoo Res Latin Amer, Santiago 400, Chile
[2] Univ Diego Port, Escuela Ingn Informat, Santiago, Chile
关键词
Search query understanding; Query classification; Query analysis; User intent; User experience; Feature analysis;
D O I
10.1016/j.compind.2015.01.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatically identifying the user intent behind web queries has started to catch the attention of the research community, since it allows search engines to enhance user experience by adapting results to that goal. It is broadly agreed that there are three archetypal intentions behind search queries: navigational, resource/transactional and informational. Thus, as a natural consequence, this task has been interpreted as a multi-class classification problem. At large, recent works have focused on comparing several machine learning Methods built with words as features. Conversely, this paper examines the influence of assorted properties on three classification approaches. In particular, it focuses its attention on the contribution of linguistic-based attributes. However, most of natural language processing tools are designed for documents, not web queries. Therefore, as a means of bridging this linguistic gap, we benefited from caseless models, which are trained with traditionally labeled data, but all terms are converted to lowercase before their generation. Overall, tested attributes proved to be effective by improving on word-based classifiers by up to 8.347% (accuracy), and outperforming a baseline by up to 6.17%. Most notably, linguistic-oriented features, from caseless models, are shown to be instrumental in narrowing the linguistic gap between queries and documents. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:162 / 169
页数:8
相关论文
共 50 条
  • [1] Classifying the user intent of User intent of web queries web queries using k-means clustering
    Kathuria, Ashish
    Jansen, Bernard J.
    Hafernik, Carolyn
    Spink, Amanda
    INTERNET RESEARCH, 2010, 20 (05) : 563 - 581
  • [2] Discovering and understanding word level user intent in Web search queries
    Roy, Rishiraj Saha
    Katare, Rahul
    Ganguly, Niloy
    Laxman, Srivatsan
    Choudhury, Monojit
    JOURNAL OF WEB SEMANTICS, 2015, 30 : 22 - 38
  • [3] Ensembling Classifiers for Detecting User Intentions behind Web Queries
    Figueroa, Alejandro
    Atkinson, John
    IEEE INTERNET COMPUTING, 2016, 20 (02) : 8 - 16
  • [4] Improving User Intent Detection in Urdu Web Queries with Capsule Net Architectures
    Shams, Sana
    Aslam, Muhammad
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [5] Identifying Web Queries with Question Intent
    Tsur, Gilad
    Pinter, Yuval
    Szpektor, Idan
    Carmel, David
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 783 - 793
  • [6] The intention behind Web queries
    Baeza-Yates, Ricardo
    Calderon-Benavides, Liliana
    Gonzalez-Caro, Cristina
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2006, 4209 : 98 - 109
  • [7] Word Order Communicates User Intent in Search Queries
    Smirnova, Anastasia
    CHI'20: EXTENDED ABSTRACTS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2020,
  • [8] Interaction Behind the Scenes: Exploring Knowledge and User Intent in Interactive Decision-Making Processes
    Brandao, Rafael R. M.
    Moreno, Marcio F.
    Cerqueira, Renato F. G.
    UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: DESIGN AND DEVELOPMENT APPROACHES AND METHODS, PT I, 2017, 10277 : 291 - 300
  • [9] Determining the informational, navigational, and transactional intent of Web queries
    Jansen, Bernard J.
    Booth, Danielle L.
    Spink, Amanda
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (03) : 1251 - 1266
  • [10] Exploring features for the automatic identification of user goals in web search
    Herrera, Mauro Rojas
    de Moura, Edleno Silva
    Cristo, Marco
    Silva, Thomaz Philippe
    da Silva, Altigran Soares
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 131 - 142