Intent mining in search query logs for automatic search script generation

Authors
Chieh-Jen Wang
Hsin-Hsi Chen
Affiliation
[1] National Taiwan University, Department of Computer Science and Information Engineering
Keywords
Intent mining; Query log analysis; Search script generation; Web search enhancement
DOI
Not available
Abstract
Capturing users' information needs is essential to lowering the barriers to information access. This paper mines sequences of actions, called search scripts, from search query logs, which record the search experiences of a large number of users. Search scripts can be applied to guide users toward satisfying their information needs, improve the search effectiveness of retrieval systems, recommend advertisements at suitable places, and so on. Information quality, query ambiguity, topic diversity, and document relevancy are the four major challenges in search script mining. In this paper, we determine the relevance of URLs for a query, adopt the Open Directory Project (ODP) categories to disambiguate queries and URLs, explore various features and clustering algorithms for intent clustering, identify critical actions from each intent cluster to form a search script, generate a natural language description for each action, and summarize a topic for each search script. Experiments show that the complete-link hierarchical clustering algorithm with the features of query terms, relevant URLs, and disambiguated ODP categories performs best. Applying the intent clusters created by the best model to intent boundary identification achieves an F-score of 0.6666. The intent clusters are then applied to generate search scripts.
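The intent clustering step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Jaccard distance, the stopping threshold, the feature encoding (`q:` for query terms, `url:` for relevant URLs, `odp:` for ODP categories), and the sample sessions are all assumptions made for illustration only.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b| for two feature sets (assumed metric)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def complete_link_clusters(items, threshold):
    """Agglomerative clustering with complete linkage.

    items: list of feature sets, one per search session.
    Merging stops once the closest pair of clusters is farther
    apart than `threshold` (a hypothetical stopping rule).
    Returns a sorted list of clusters, each a sorted list of indices.
    """
    clusters = [[i] for i in range(len(items))]

    def linkage(c1, c2):
        # Complete link: cluster distance is the largest pairwise distance.
        return max(jaccard_distance(items[i], items[j]) for i in c1 for j in c2)

    while len(clusters) > 1:
        # Find the pair of clusters with the smallest complete-link distance.
        (x, y), dist = min(
            (((x, y), linkage(clusters[x], clusters[y]))
             for x, y in combinations(range(len(clusters)), 2)),
            key=lambda pair: pair[1],
        )
        if dist > threshold:
            break
        merged = sorted(clusters[x] + clusters[y])
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)]
        clusters.append(merged)
    return sorted(clusters)


# Hypothetical sessions: query terms, a relevant URL, and an ODP category.
sessions = [
    {"q:cheap", "q:flights", "url:expedia.com", "odp:Recreation/Travel"},
    {"q:flights", "q:taipei", "url:expedia.com", "odp:Recreation/Travel"},
    {"q:python", "q:tutorial", "url:python.org", "odp:Computers/Programming"},
    {"q:python", "q:list", "url:python.org", "odp:Computers/Programming"},
]

print(complete_link_clusters(sessions, threshold=0.5))
```

With these sample sessions, the two travel sessions share three of five features, as do the two programming sessions, so each pair merges, while the two resulting clusters share nothing and stay apart. Complete linkage is conservative in this sense: a session joins a cluster only if it is close to every member of it.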
Pages: 513-542
Page count: 29