Search and Browse Log Mining for Web Information Retrieval: Challenges, Methods, and Applications

被引:0
|
作者
Jiang, Daxin [1 ]
Pei, Jian
Li, Hang [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
关键词
Search and browse logs; log data mining;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Huge amounts of search log data have been accumulated in various search engines. Currently, a commercial search engine receives billions of queries and collects tera-bytes of log data on any single day. Other than search log data, browse logs can be collected by client-side browser plug-ins, which record the browse information if users' permissions are granted. Such massive amounts of search/browse log data, on the one hand, provide great opportunities to mine the wisdom of crowds and improve search results as well as online advertisement. On the other hand, designing effective and efficient methods to clean, model, and process large scale log data also presents great challenges. In this tutorial, we focus on mining search and browse log data for Web information retrieval. We consider a Web information retrieval system consisting of four components, namely, query understanding, document understanding, query-document matching, and user understanding. Accordingly, we organize the tutorial materials along these four aspects. For each aspect, we will survey the major tasks, challenges, fundamental principles, and state-of-the-art methods. The goal of this tutorial is to provide a systematic survey on large-scale search/browse log mining to the IR community. It will help IR researchers to get familiar with the core challenges and promising directions in log mining. At the same time, this tutorial may also serve the developers of Web information retrieval systems as a comprehensive and in-depth reference to the advanced log mining techniques.
引用
收藏
页码:912 / 912
页数:1
相关论文
共 50 条
  • [1] Web Search and Browse Log Mining: Challenges, Methods, and Applications
    Jiang, Daxin
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, 2011, 6588 : 465 - 466
  • [2] Enhancing Web Search by Mining Search and Browse Logs
    Jiang, Daxin
    Pei, Jian
    Li, Hang
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1295 - 1295
  • [3] Mining Search and Browse Logs for Web Search: A Survey
    Jiang, Daxin
    Pei, Jian
    Li, Hang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (04)
  • [4] Web user log mining for web retrieval
    Yu, YJ
    Chen, C
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 97 - 100
  • [5] Applications of an web information mining model to data mining and information retrieval tasks
    Pereira, AR
    Baeza-Yates, R
    SIXTEENTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, : 1031 - 1035
  • [6] Special issue of The Journal of Information Retrieval on web mining for search
    Baeza-Yates, Ricardo
    Pasi, Gabriella
    INFORMATION RETRIEVAL, 2011, 14 (03): : 213 - 214
  • [7] Special issue of The Journal of Information Retrieval on web mining for search
    Ricardo Baeza-Yates
    Gabriella Pasi
    Information Retrieval, 2011, 14 : 213 - 214
  • [8] Privacy in Web Search Query Log Mining
    Jones, Rosie
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 4 - 4
  • [9] Challenges in Web Information Retrieval
    Arora, Monika
    Kanjilal, Uma
    Varshney, Dinesh
    INNOVATIONS IN COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2010, : 141 - 146
  • [10] SEARCH ENGINES: INFORMATION RETRIEVAL ON THE WEB
    Chawla, Suruchi
    EVERYMANS SCIENCE, 2016, 51 (04): : 237 - 240