Vertical Search Blending - A Real-world Counterfactual Dataset

被引:0
|
作者
Prochazka, Pavel [1 ]
Kocian, Matej [1 ]
Drdak, Jakub [1 ]
Vrsovsky, Jan [1 ]
Kadlec, Vladimir [1 ]
Kuchar, Jaroslav [1 ]
机构
[1] Seznam Cz, Prague, Czech Republic
关键词
Multi-armed Contextual bandit; Counterfactual dataset; Search engine Off-policy learning;
D O I
10.1145/3331184.3331345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Blending of search results from several vertical sources became standard among web search engines. Similar scenarios appear in computational advertising, news recommendation, and other interactive systems. As such environments give only partial feedback, the evaluation of new policies conventionally requires expensive online A/B tests. Counterfactual approach is a promising alternative, nevertheless, it requires specific conditions for a valid off-policy evaluation. We release a large-scale, real-world vertical-blending dataset gathered by Seznam.cz web search engine. The dataset contains logged partial feedback with the corresponding propensity and is thus suited for counterfactual evaluation. We provide basic checks for validity and evaluate several learning methods.
引用
收藏
页码:1237 / 1240
页数:4
相关论文
共 50 条
  • [1] Satisfying Real-world Goals with Dataset Constraints
    Goh, Gabriel
    Cotter, Andrew
    Gupta, Maya
    Friedlander, Michael
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [2] VERTICAL RESTRAINTS, EFFICIENCY, AND THE REAL-WORLD
    BURNS, JW
    [J]. FORDHAM LAW REVIEW, 1993, 62 (03) : 597 - 651
  • [3] A Real-World Dataset Generator for Specific Emitter Identification
    Muller, Braeden P.
    Wong, Lauren J.
    Clark, William H.
    Michaels, Alan J.
    [J]. IEEE ACCESS, 2023, 11 : 110023 - 110038
  • [4] WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection
    Zi, Bojia
    Chang, Minghao
    Chen, Jingjing
    Ma, Xingjun
    Jiang, Yu-Gang
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2382 - 2390
  • [5] TANDEM: A Taxonomy and a Dataset of Real-World Performance Bugs
    Sanchez, Ana B.
    Delgado-Perez, Pedro
    Medina-Bulo, Inmaculada
    Segura, Sergio
    [J]. IEEE ACCESS, 2020, 8 (08): : 107214 - 107228
  • [6] Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning
    Jin, Jun
    Graves, Daniel
    Haigh, Cameron
    Luo, Jun
    Jagersand, Martin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 3616 - 3623
  • [7] Semantic Meaningfulness: Evaluating Counterfactual Approaches for Real-World Plausibility and Feasibility
    Hoellig, Jacqueline
    Markus, Aniek F.
    de Slegte, Jef
    Bagave, Prachi
    [J]. EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT II, 2023, 1902 : 636 - 659
  • [8] Tabu search for the real-world carpooling problem
    Huang, Can
    Zhang, Defu
    Si, Yain-Whar
    Leung, Stephen C. H.
    [J]. JOURNAL OF COMBINATORIAL OPTIMIZATION, 2016, 32 (02) : 492 - 512
  • [9] Local search for real-world scheduling and planning
    Chiong, Raymond
    Siarry, Patrick
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (02) : 207 - 208
  • [10] Tabu search for the real-world carpooling problem
    Can Huang
    Defu Zhang
    Yain-Whar Si
    Stephen C. H. Leung
    [J]. Journal of Combinatorial Optimization, 2016, 32 : 492 - 512