50 entries in total
- [21] Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024
- [23] Non-stationary Dueling Bandits for Online Learning to Rank. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422: 166-174
- [24] Multi-Dueling Bandits and Their Application to Online Ranker Evaluation. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016: 2161-2166
- [26] Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022
- [27] Bias-Robust Bayesian Optimization via Dueling Bandits. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021
- [28] Identification of the Generalized Condorcet Winner in Multi-dueling Bandits. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021
- [29] Dueling Bandits: Beyond Condorcet Winners to General Tournament Solutions. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016
- [30] Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 8797-8805