- [21] A Priority Experience Replay Sampling Method Based on Upper Confidence Bound. ICDLT 2019: 2019 3rd International Conference on Deep Learning Technologies, 2019: 38-41
- [23] Estimating the Maximum Expected Value through Upper Confidence Bound of Likelihood. 2017 Conference on Technologies and Applications of Artificial Intelligence (TAAI), 2017: 202-207
- [26] Pairwise Regression with Upper Confidence Bound for Contextual Bandit with Multiple Actions. 2013 Conference on Technologies and Applications of Artificial Intelligence (TAAI), 2013: 19-24
- [28] Maximal Expectation as Upper Confidence Bound for Multi-armed Bandit Problems. 2014 IEEE 7th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), 2014: 325-329
- [29] Linear Upper Confidence Bound Algorithm for Contextual Bandit Problem with Piled Rewards. Advances in Knowledge Discovery and Data Mining, PAKDD 2016, Pt II, 2016, 9652: 143-155
- [30] Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem. International Conference on Machine Learning, Vol 32 (Cycle 2), 2014, 32: 10-18