Usage Data in Web Search: Benefits and Limitations

被引:0
|
作者
Baeza-Yates, Ricardo [1 ]
Maarek, Yoelle [2 ]
机构
[1] Yahoo Res, Barcelona, Spain
[2] Yahoo Res, Haifa, Israel
关键词
Web search; usage data; wisdom of crowds; large scale data mining; privacy; personalization; long tail;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web Search, which takes its root in the mature field of information retrieval, evolved tremendously over the last 20 years. The field encountered its first revolution when it started to deal with huge amounts of Web pages. Then, a major step was accomplished when engines started to consider the structure of the Web graph and link analysis became a differentiator in both crawling and ranking. Finally, a more discrete, but not less critical step, was made when search engines started to monitor and mine the numerous (mostly implicit) signals provided by users while interacting with the search engine. We focus here on this third "revolution" of large scale usage data. We detail the different shapes it takes, illustrating its benefits through a review of some winning search features that could not have been possible without it. We also discuss its limitations and how in some cases it even conflicts with some natural users' aspirations such as personalization and privacy. We conclude by discussing how some of these conflicts can be circumvented by using adequate aggregation principles to create "ad hoc" crowds.
引用
收藏
页码:495 / 506
页数:12
相关论文
共 50 条
  • [1] Usage Data in Web Search: Benefits and Limitations
    Baeza-Yates, Ricardo
    Maarek, Yoelle
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL: 19TH INTERNATIONAL SYMPOSIUM, SPIRE 2012, 2012, 7608 : 16 - 16
  • [2] Biclustering of Web Usage Data Using Gravitational Search Algorithm
    Prabha, V. Diviya
    Rathipriya, R.
    [J]. 2013 INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, INFORMATICS AND MEDICAL ENGINEERING (PRIME), 2013,
  • [3] SIGIR 2012 Tutorial (Big) Usage Data in Web Search
    Baeza-Yates, Ricardo
    Maarek, Yoelle
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1181 - 1182
  • [4] Usage of Domain Ontologies for Web Search
    Aguilar-Lopez, Dulce
    Lopez-Arevalo, Ivan
    Sosa, Victor
    [J]. INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE 2008, 2009, 50 : 319 - 328
  • [5] Prioritizing agile benefits and limitations in relation to practice usage
    Adam Solinski
    Kai Petersen
    [J]. Software Quality Journal, 2016, 24 : 447 - 482
  • [6] Prioritizing agile benefits and limitations in relation to practice usage
    Solinski, Adam
    Petersen, Kai
    [J]. SOFTWARE QUALITY JOURNAL, 2016, 24 (02) : 447 - 482
  • [7] Web usage data mining
    Ortega, Jose-Luis
    Aguillo, Isidro F.
    [J]. PROFESIONAL DE LA INFORMACION, 2009, 18 (01): : 20 - 26
  • [8] Web Usage Classification and Clustering Approach for Web Search Personalization
    Vijayalakshmi, K.
    Jena, Sudarson
    [J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER & COMMUNICATION TECHNOLOGY (ICCCT-2015), 2015, : 376 - 383
  • [9] Benefits and limitations of computerised laboratory data
    Block, C
    [J]. JOURNAL OF CLINICAL PATHOLOGY, 1997, 50 (06) : 448 - 449
  • [10] Personalizing Web Directories with the Aid of Web Usage Data
    Pierrakos, Dimitrios
    Paliouras, Georgios
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1331 - 1344