Reinforcement Learning Aided Sequential Optimization for Unsignalized Intersection Management of Robot Traffic

被引：0

作者：

Hoysal, G. Nishchal ^{[1
]}

Tallapragada, Pavankumar ^{[1
,2
]}

机构：

[1] Indian Inst Sci Bengaluru, Robert Bosch Ctr Cyber Phys Syst, Bengaluru 560012, India

[2] Indian Inst Sci Bengaluru, Dept Elect Engn, Bengaluru 560012, India

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Robot kinematics; Trajectory; Collision avoidance; Safety; Reinforcement learning; Real-time systems; Optimization methods; Robot coordination; deep reinforcement learning; autonomous intersection management; warehouse automation; MULTI-AGV SYSTEMS; AUTOMATED VEHICLES; OPTIMAL COORDINATION; TIME;

D O I：

10.1109/ACCESS.2024.3434552

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider the problem of optimal unsignalized intersection management, wherein we seek to obtain safe and optimal trajectories, for a set of robots that arrive randomly and continually. This problem involves repeatedly solving a mixed integer program (with robot acceleration trajectories as decision variables) with different parameters, for which the computation time using a naive optimization algorithm scales exponentially with the number of robots and lanes. Hence, such an approach is not suitable for real-time implementation. In this paper, we propose a solution framework that combines learning and sequential optimization. In particular, we propose an algorithm for learning a shared policy that given the traffic state information, determines the crossing order of the robots. Then, we optimize the trajectories of the robots sequentially according to that crossing order. This approach inherently guarantees safety at all times. We validate the performance of this approach using extensive simulations and compare our approach against 5 different heuristics from the literature in 9 different simulation settings. Our approach, on average, significantly outperforms the heuristics from the literature in various metrics like objective function, weighted average of crossing times and computation time. For example, in some scenarios, we have observed that our approach offers up to 150% improvement in objective value over the first come first serve heuristic. Even on untrained scenarios, our approach shows a consistent improvement (in objective value) of more than 30% over all heuristics under consideration. We also show through simulations that the computation time for our approach scales linearly with the number of robots (assuming all other factors are constant). We further implement the learnt policies on physical robots with a few modifications to the solution framework to address real-world challenges and establish its real-time implementability.

引用

页码：104052 / 104070

页数：19

共 50 条

[41] Determination and Optimization of Reinforcement Learning Parameters for Driver Actions in Traffic
Chong, Linsen
Abbas, Montasir
Higgs, Bryan
Medina, Alejandra
Yang, C. Y. David
2011 14TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2011, : 1785 - 1790
[42] Reinforcement Learning based Interconnection Routing for Adaptive Traffic Optimization
Kao, Sheng-Chun
Yang, Chao-Han Huck
Chen, Pin-Yu
Ma, Xiaoli
Krishna, Tushar
PROCEEDINGS OF THE 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS'19), 2019,
[43] A Reinforcement Learning-Based Distributed Control Scheme for Cooperative Intersection Traffic Control
Guzman, Jose A.
Pizarro, German
Nunez, Felipe
IEEE ACCESS, 2023, 11 : 57037 - 57045
[44] Deep Reinforcement Learning for Vehicle Platooning at a Signalized Intersection in Mixed Traffic with Partial Detection
Hung Tuan Trinh
Bae, Sang-Hoon
Duy Quang Tran
APPLIED SCIENCES-BASEL, 2022, 12 (19):
[45] ROAD ARTERY TRAFFIC LIGHT OPTIMIZATION WITH USE OF REINFORCEMENT LEARNING
Marsetic, Rok
Semrov, Darja
Zura, Marijan
PROMET-TRAFFIC & TRANSPORTATION, 2014, 26 (02): : 101 - 108
[46] A Multi-phase Intersection Traffic Signal Control Strategy with Deep Reinforcement Learning
Li, Congcong
Li, Yuan
Liu, Guihua
2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 959 - 964
[47] Multi-Intersection Management for Connected Autonomous Vehicles by Reinforcement Learning
Jin, Haiming
Wei, Yifei
Yang, Zhaoxing
Liu, Zirui
Fan, Guiyun
2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 649 - 659
[48] FedLight: Federated Reinforcement Learning for Autonomous Multi-Intersection Traffic Signal Control
Ye, Yutong
Zhao, Wupan
Wei, Tongquan
Hu, Shiyan
Chen, Mingsong
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 847 - 852
[49] Optimization Control of Adaptive Traffic Signal with Deep Reinforcement Learning
Cao, Kerang
Wang, Liwei
Zhang, Shuo
Duan, Lini
Jiang, Guiminx
Sfarra, Stefano
Zhang, Hai
Jung, Hoekyung
ELECTRONICS, 2024, 13 (01)
[50] Continuous residual reinforcement learning for traffic signal control optimization
Aslani, Mohainmad
Seipel, Stefan
Wiering, Marco
CANADIAN JOURNAL OF CIVIL ENGINEERING, 2018, 45 (08) : 690 - 702

← 1 2 3 4 5 →