Intent mining in search query logs for automatic search script generation

被引:0
|
作者
Chieh-Jen Wang
Hsin-Hsi Chen
机构
[1] National Taiwan University,Department of Computer Science and Information Engineering
来源
关键词
Intent mining; Query log analysis; Search script generation; Web search enhancement;
D O I
暂无
中图分类号
学科分类号
摘要
Capturing users’ information needs is essential in decreasing the barriers in information access. This paper mines sequences of actions called search scripts from search query logs which keep large-scale users’ search experiences. Search scripts can be applied to guide users to satisfy their information needs, improve the search effectiveness of retrieval systems, recommend advertisements at suitable places, and so on. Information quality, query ambiguity, topic diversity, and document relevancy are four major challenging issues in search script mining. In this paper, we determine the relevance of URLs for a query, adopt the Open Directory Project (ODP) categories to disambiguate queries and URLs, explore various features and clustering algorithms for intent clustering, identify critical actions from each intent cluster to form a search script, generate a nature language description for each action, and summarize a topic for each search script. Experiments show that the complete link hierarchical clustering algorithm with the features of query terms, relevant URLs, and disambiguated ODP categories performs the best. Applying the intent clusters created by the best model to intent boundary identification achieves an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F$$\end{document} score of  0.6666. The intent clusters then are applied to generate search scripts.
引用
收藏
页码:513 / 542
页数:29
相关论文
共 50 条
  • [1] Intent mining in search query logs for automatic search script generation
    Wang, Chieh-Jen
    Chen, Hsin-Hsi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 39 (03) : 513 - 542
  • [2] Intent Boundary Detection in Search Query Logs
    Wang, Chieh-Jen
    Lin, Kevin Hsin-Yih
    Chen, Hsin-Hsi
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 749 - 750
  • [3] Query Sub-intent Mining by Incorporating Search Results with Query Logs for Information Retrieval
    Liu, Xinyu
    [J]. 2023 IEEE 8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, ICBDA, 2023, : 180 - 186
  • [4] Mining Search Engine Query Logs via Suggestion Sampling
    Bar-Yossef, Ziv
    Gurevich, Maxim
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 54 - 65
  • [5] Mining Named Entities from Search Engine Query Logs
    Alasiry, Areej
    Levene, Mark
    Poulovassilis, Alexandra
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, : 46 - 56
  • [6] A Search Engine Based on Query Logs, and Search Log Analysis by Automatic Language Identification
    Oakes, Michael
    Xu, Yan
    [J]. MULTILINGUAL INFORMATION ACCESS EVALUATION I: TEXT RETRIEVAL EXPERIMENTS, 2010, 6241 : 526 - 533
  • [7] Query intent mining with multiple dimensions of web search data
    Di Jiang
    Kenneth Wai-Ting Leung
    Wilfred Ng
    [J]. World Wide Web, 2016, 19 : 475 - 497
  • [8] Dynamic Query Intent Mining from a Search Log Stream
    Qian, Yanan
    Sakai, Tetsuya
    Ye, Junting
    Zheng, Qinghua
    Li, Cong
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1205 - 1208
  • [9] Query intent mining with multiple dimensions of web search data
    Jiang, Di
    Leung, Kenneth Wai-Ting
    Ng, Wilfred
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2016, 19 (03): : 475 - 497
  • [10] Mining search engine query logs for social filtering-based query recommendation
    Zhang, Zhiyong
    Nasraoui, Olfa
    [J]. APPLIED SOFT COMPUTING, 2008, 8 (04) : 1326 - 1334