50 entries in total
- [23] Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 8726-8734
- [26] The characteristics of umbilical cord blood (UCB) and UCB transplantation SEMINARS IN THROMBOSIS AND HEMOSTASIS, 1998, 24(05): 491-495
- [27] Reducing Dueling Bandits to Cardinal Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32: 856-864