QUERY INTENT DETECTION BASED ON QUERY LOG MINING

Cited by: 0
Authors
Zamora, Juan [1]
Mendoza, Marcelo [1]
Allende, Hector [1]
Affiliations
[1] Univ Tecn Federico Santa Maria, Dept Comp Sci, Valparaiso, Chile
Source
JOURNAL OF WEB ENGINEERING | 2014 / Vol. 13 / Issue 1-2
Keywords
Query categorization; user intents; query logs; IDENTIFICATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
In this paper we address the problem of automatically detecting query intent in search engines. We study features that have performed well in the state of the art, combined with novel features extracted from click-through data, and show that the combination of these features yields good precision. In a second stage, we study four text-based classifiers to test the usefulness of text-based features. With a low false-positive rate (less than 10%), the proposed classifiers detect query intent in over 90% of the evaluation instances. However, due to a marked class imbalance, the proposed classifiers perform poorly at detecting transactional intents. We address this problem by including a cost-sensitive learning strategy, which compensates for the skewed data distribution. Finally, we explore the use of classifier ensembles, which allow us to achieve the best performance for the task.
Pages: 24 - 52
Page count: 29