共 50 条
- [1] Minimax Regret Bounds for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [4] Contrastive Counterfactual Learning for Causality-aware Interpretable Recommender Systems PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3564 - 3573
- [5] Minimax Search and Reinforcement Learning for Adversarial Tetris ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, PROCEEDINGS, 2010, 6040 : 417 - 422
- [7] ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4021 - 4031
- [8] No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [10] Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33