共 50 条
- [12] A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [15] Optimal regret algorithm for Pseudo-1d Bandit Convex Optimization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [17] Discussion of "High-dimensional autocovariance matrices and optimal linear prediction" ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 789 - 791
- [19] Rejoinder of "High-dimensional autocovariance matrices and optimal linear prediction" ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 811 - 822
- [20] An optimal ADP algorithm for a high-dimensional stochastic control problem 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 52 - +