共 32 条
- [1] Xu Jin, Liu Quan, Zhang Zong-Zhang, Et al., Asynchronous deep reinforcement learning with multiple gating mechanisms, Chinese Journal of Computers, 42, 3, pp. 636-653, (2019)
- [2] Rocha F M, Costa V S, Reis L P., From reinforcement learning towards artificial general intelligence, Proceedings of the 2020 World Conference on Information Systems and Technologies, pp. 401-413, (2020)
- [3] Chai Lai, Zhang Ting-Ting, Dong Hui, Et al., Multi-agent deep reinforcement learning algorithm based on partitioned buffer replay and multiple process interaction, Chinese Journal of Computers, 44, 6, pp. 1140-1152, (2021)
- [4] Cui J, Liu Y, Nallanathan A., Multi-agent reinforcement learning-based resource allocation for UAV networks, IEEE Transactions on Wireless Communications, 19, 2, pp. 729-743, (2019)
- [5] Catacora Ocana J M, Riccio F, Capobianco R, Et al., Cooperative multi-agent deep reinforcement learning in soccer domains, Proceedings of the 18th International Conference on Autonomous Agents and Multi Agent Systems, pp. 1865-1867, (2019)
- [6] Liu X, Yu J, Feng Z, Et al., Multi-agent reinforcement learning for resource allocation in IoT networks with edge computing, China Communications, 17, 9, pp. 220-236, (2020)
- [7] Posor J E, Belzner L, Knapp A., Joint action learning for multi-agent cooperation using recurrent reinforcement learning, Digitale Welt, 4, 1, pp. 79-84, (2020)
- [8] Schollig A, Alonso-Mora J, D'Andrea R., Independent vs. joint estimation in multi-agent iterative learning control, Proceedings of the 49th IEEE Conference on Decision and Control(CDC), pp. 6949-6954, (2010)
- [9] Lowe R, Wu Y, Tamar A, Et al., Multi-agent actor-critic for mixed cooperative-competitive environments, Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6382-6393, (2017)
- [10] Rashid T, Samvelyan M, Schroeder C, Et al., QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning, Proceedings of the 2018 International Conference on Machine Learning, pp. 4295-4304, (2018)