共 50 条
- [1] Bandits with Knapsacks (Extended Abstract) [J]. 2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013, : 207 - 216
- [2] Models for Autonomously Motivated Exploration in Reinforcement Learning (Extended Abstract) [J]. ALGORITHMIC LEARNING THEORY, 2011, 6925 : 14 - +
- [3] Guiding Reinforcement Learning Exploration Using Natural Language Extended Abstract [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1956 - 1958
- [4] Improved Learning Complexity in Combinatorial Pure Exploration Bandits [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 1004 - 1012
- [5] Leveraging Currency for Repairing Inconsistent and Incomplete Data (Extended Abstract) [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2315 - 2316
- [7] Learning from Failure [Extended Abstract] [J]. PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, : 145 - 146
- [8] Meta-Learning Effective Exploration Strategies for Contextual Bandits [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9541 - 9548
- [9] Contextual Bandits with Delayed Feedback and Semi-supervised Learning (Student Abstract) [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15943 - 15944
- [10] Deep Residual Reinforcement Learning (Extended Abstract) [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4869 - 4873