Web Search and Browse Log Mining: Challenges, Methods, and Applications

被引:0
|
作者
Jiang, Daxin [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
关键词
Search and browse logs; log data summarization; log mining applications;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Huge amounts of search log data have been accumulated in various search engines. Currently, a commercial search engine receives billions of queries and collects tera-bytes of log data on any single day. Other than search log data, browse logs can be collected by client-side browser plug-ins, which record the browse information if users' permissions are granted. Such massive amounts of search/browse log data, on the one hand, provide great opportunities to mine the wisdom of crowds and improve search results as well as online advertisement. On the other hand, designing effective and efficient methods to clean, model, and process large scale log data also presents great challenges. In this tutorial, I will focus on mining search and browse log data for search engines. I will start with an introduction of search and browse log data and an overview of frequently-used data summarization in log mining. I will then elaborate how log mining applications enhance the five major components of a search engine, namely, query understanding, document understanding, query-document matching, user understanding, and monitoring and feedbacks. For each aspect, I will survey the major tasks, fundamental principles, and state-of-the-art methods. Finally, I will discuss the challenges and future trends of log data mining.
引用
收藏
页码:465 / 466
页数:2
相关论文
共 50 条
  • [1] Search and Browse Log Mining for Web Information Retrieval: Challenges, Methods, and Applications
    Jiang, Daxin
    Pei, Jian
    Li, Hang
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 912 - 912
  • [2] Enhancing Web Search by Mining Search and Browse Logs
    Jiang, Daxin
    Pei, Jian
    Li, Hang
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1295 - 1295
  • [3] Mining Search and Browse Logs for Web Search: A Survey
    Jiang, Daxin
    Pei, Jian
    Li, Hang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (04)
  • [4] Privacy in Web Search Query Log Mining
    Jones, Rosie
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 4 - 4
  • [5] Applications of Data Mining in CRM Based on Web Log
    Dang, Jianning
    Zhang, Aiqin
    Jing, Wei
    TRENDS IN CIVIL ENGINEERING, PTS 1-4, 2012, 446-449 : 3762 - 3765
  • [6] Analysis of Text Mining methods in Web search
    Dzhurenko, Tetyana
    Myakshylo, Olena
    Cherednichenko, Galina
    UKRAINIAN FOOD JOURNAL, 2015, 4 (03) : 508 - 519
  • [7] Overview: Web log Mining, Privacy Issues and Application of Web Log Mining
    Singh, Amarjeet
    Sreeram, Y. Chaitanya
    2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2014, : 638 - 641
  • [8] Interaction differences in web search and browse logs
    Thomas, Paul
    O'Neill, Alex
    Paris, Cecile
    ADCS 2010 - Proceedings of the Fifteenth Australasian Document Computing Symposium, 2010, : 52 - 59
  • [9] Application of Convolution Neural Networks in Web Search Log Mining for Effective Web Document Clustering
    Chawla, Suruchi
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [10] WEB LOG MINING - A STUDY
    Krishnagandhi, Geetha
    Dhas, Suresh Gnana
    IIOAB JOURNAL, 2016, 7 (09) : 6 - 15