QUERY INTENT DETECTION BASED ON QUERY LOG MINING

被引:0
|
作者
Zamora, Juan [1 ]
Mendoza, Marcelo [1 ]
Allende, Hector [1 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Comp Sci, Valparaiso, Chile
来源
JOURNAL OF WEB ENGINEERING | 2014年 / 13卷 / 1-2期
关键词
Query categorization; user intents; query logs; IDENTIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we deal with the problem of automatic detection of query intent in search engines. We studied features that have shown good performance in the state-of-the-art, combined with novel features extracted from click-through data. We show that the combination of these features gives good precision results. In a second stage, four text-based classifiers were studied to test the usefulness of text-based features. With a low rate of false positives (less than 10 %) the proposed classifiers can detect query intent in over 90% of the evaluation instances. However due to a notorious unbalance in the classes, the proposed classifiers show poor results to detect transactional intents. We address this problem by including a cost sensitive learning strategy, allowing to solve the skewed data distribution. Finally, we explore the use of classifier ensembles which allow to us to achieve the best performance for the task.
引用
收藏
页码:24 / 52
页数:29
相关论文
共 50 条
  • [31] On query completion in web search engines based on query stream mining
    Barouni-Ebrahimi, M.
    Ghorbani, Ali A.
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 317 - 320
  • [32] Intent mining in search query logs for automatic search script generation
    Chieh-Jen Wang
    Hsin-Hsi Chen
    Knowledge and Information Systems, 2014, 39 : 513 - 542
  • [33] A Search Log Mining based Query Expansion Technique to Improve Effectiveness in Code Search
    Satter, Abdus
    Sakib, Kazi
    PROCEEDINGS OF THE 2016 19TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2016, : 586 - 591
  • [34] Intent mining in search query logs for automatic search script generation
    Wang, Chieh-Jen
    Chen, Hsin-Hsi
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 39 (03) : 513 - 542
  • [35] Understanding Temporal Query Intent
    Hasanuzzaman, Mohammed
    Saha, Sriparna
    Dias, Gael
    Ferrari, Stephane
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 823 - 826
  • [36] Classifying and Characterizing Query Intent
    Ashkan, Azin
    Clarke, Charles L. A.
    Agichtein, Engene
    Guo, Qi
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 578 - +
  • [37] Visual Detection of Anomalies in DNS Query Log Data
    Shan, Guihua
    Wang, Yang
    Xie, Maojin
    Lv, Haopu
    Chi, Xuebin
    2014 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2014, : 258 - 261
  • [38] Intent-Based User Segmentation with Query Enhancement
    Xiong, Wei
    Recce, Michael
    Wu, Brook
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2013, 3 (04) : 1 - 17
  • [39] The detection model of malignant query and personal information leakage based on log analysis
    Kim, Gei-Young
    Jung, Kyung-Jin
    Shin, Yongtae
    Kim, Sangphil
    Kim, Jong-Bae
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (11): : 105 - 114
  • [40] Privacy-Preserving Query Log Mining for Business Confidentiality Protection
    Poblete, Barbara
    Spiliopoulou, Myra
    Baeza-Yates, Ricardo
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (03) : 1 - 26