共 50 条
- [41] Emergency-Response Locomotion of Hexapod Robot with Heuristic Reinforcement Learning Using Q-Learning INTERACTIVE COLLABORATIVE ROBOTICS (ICR 2019), 2019, 11659 : 320 - 329
- [42] Generating Learning Sequences Using Contextual Bandit Algorithms GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024, 2024, 14798 : 320 - 329
- [43] An Online Home Energy Management System using Q-Learning and Deep Q-Learning SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2024, 43
- [44] Reinforcement distribution in a team of cooperative Q-learning agents PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 154 - +
- [45] The Sample Complexity of Teaching-by-Reinforcement on Q-Learning THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10939 - 10947
- [46] HEDGING BARRIER OPTIONS USING REINFORCEMENT LEARNING JOURNAL OF INVESTMENT MANAGEMENT, 2024, 22 (04): : 16 - 25
- [47] Reinforcement Learning for Automatic Parameter Tuning in Apache Spark: A Q-Learning Approach 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 13 - 18
- [49] Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14474 - 14481
- [50] Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,