Web Search and Browse Log Mining: Challenges, Methods, and Applications

Cited by: 0
Author
Jiang, Daxin [1]
Affiliation
[1] Microsoft Research Asia, Beijing, People's Republic of China
Keywords
Search and browse logs; log data summarization; log mining applications;
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Search engines have accumulated huge amounts of search log data. Today, a commercial search engine receives billions of queries and collects terabytes of log data every day. In addition to search logs, browse logs can be collected by client-side browser plug-ins, which record browsing information when users grant permission. On the one hand, such massive amounts of search and browse log data provide great opportunities to mine the wisdom of crowds and improve search results as well as online advertising. On the other hand, designing effective and efficient methods to clean, model, and process large-scale log data presents great challenges. In this tutorial, I will focus on mining search and browse log data for search engines. I will start with an introduction to search and browse log data and an overview of frequently used data summarizations in log mining. I will then elaborate on how log mining applications enhance the five major components of a search engine, namely query understanding, document understanding, query-document matching, user understanding, and monitoring and feedback. For each component, I will survey the major tasks, fundamental principles, and state-of-the-art methods. Finally, I will discuss the challenges and future trends of log data mining.
Pages: 465-466
Number of pages: 2