Reinforcement Learning Aided Sequential Optimization for Unsignalized Intersection Management of Robot Traffic

被引:0
|
作者
Hoysal, G. Nishchal [1 ]
Tallapragada, Pavankumar [1 ,2 ]
机构
[1] Indian Inst Sci Bengaluru, Robert Bosch Ctr Cyber Phys Syst, Bengaluru 560012, India
[2] Indian Inst Sci Bengaluru, Dept Elect Engn, Bengaluru 560012, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Robot kinematics; Trajectory; Collision avoidance; Safety; Reinforcement learning; Real-time systems; Optimization methods; Robot coordination; deep reinforcement learning; autonomous intersection management; warehouse automation; MULTI-AGV SYSTEMS; AUTOMATED VEHICLES; OPTIMAL COORDINATION; TIME;
D O I
10.1109/ACCESS.2024.3434552
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of optimal unsignalized intersection management, wherein we seek to obtain safe and optimal trajectories, for a set of robots that arrive randomly and continually. This problem involves repeatedly solving a mixed integer program (with robot acceleration trajectories as decision variables) with different parameters, for which the computation time using a naive optimization algorithm scales exponentially with the number of robots and lanes. Hence, such an approach is not suitable for real-time implementation. In this paper, we propose a solution framework that combines learning and sequential optimization. In particular, we propose an algorithm for learning a shared policy that given the traffic state information, determines the crossing order of the robots. Then, we optimize the trajectories of the robots sequentially according to that crossing order. This approach inherently guarantees safety at all times. We validate the performance of this approach using extensive simulations and compare our approach against 5 different heuristics from the literature in 9 different simulation settings. Our approach, on average, significantly outperforms the heuristics from the literature in various metrics like objective function, weighted average of crossing times and computation time. For example, in some scenarios, we have observed that our approach offers up to 150% improvement in objective value over the first come first serve heuristic. Even on untrained scenarios, our approach shows a consistent improvement (in objective value) of more than 30% over all heuristics under consideration. We also show through simulations that the computation time for our approach scales linearly with the number of robots (assuming all other factors are constant). We further implement the learnt policies on physical robots with a few modifications to the solution framework to address real-world challenges and establish its real-time implementability.
引用
收藏
页码:104052 / 104070
页数:19
相关论文
共 50 条
  • [41] Determination and Optimization of Reinforcement Learning Parameters for Driver Actions in Traffic
    Chong, Linsen
    Abbas, Montasir
    Higgs, Bryan
    Medina, Alejandra
    Yang, C. Y. David
    2011 14TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2011, : 1785 - 1790
  • [42] Reinforcement Learning based Interconnection Routing for Adaptive Traffic Optimization
    Kao, Sheng-Chun
    Yang, Chao-Han Huck
    Chen, Pin-Yu
    Ma, Xiaoli
    Krishna, Tushar
    PROCEEDINGS OF THE 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS'19), 2019,
  • [43] A Reinforcement Learning-Based Distributed Control Scheme for Cooperative Intersection Traffic Control
    Guzman, Jose A.
    Pizarro, German
    Nunez, Felipe
    IEEE ACCESS, 2023, 11 : 57037 - 57045
  • [44] Deep Reinforcement Learning for Vehicle Platooning at a Signalized Intersection in Mixed Traffic with Partial Detection
    Hung Tuan Trinh
    Bae, Sang-Hoon
    Duy Quang Tran
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [45] ROAD ARTERY TRAFFIC LIGHT OPTIMIZATION WITH USE OF REINFORCEMENT LEARNING
    Marsetic, Rok
    Semrov, Darja
    Zura, Marijan
    PROMET-TRAFFIC & TRANSPORTATION, 2014, 26 (02): : 101 - 108
  • [46] A Multi-phase Intersection Traffic Signal Control Strategy with Deep Reinforcement Learning
    Li, Congcong
    Li, Yuan
    Liu, Guihua
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 959 - 964
  • [47] Multi-Intersection Management for Connected Autonomous Vehicles by Reinforcement Learning
    Jin, Haiming
    Wei, Yifei
    Yang, Zhaoxing
    Liu, Zirui
    Fan, Guiyun
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 649 - 659
  • [48] FedLight: Federated Reinforcement Learning for Autonomous Multi-Intersection Traffic Signal Control
    Ye, Yutong
    Zhao, Wupan
    Wei, Tongquan
    Hu, Shiyan
    Chen, Mingsong
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 847 - 852
  • [49] Optimization Control of Adaptive Traffic Signal with Deep Reinforcement Learning
    Cao, Kerang
    Wang, Liwei
    Zhang, Shuo
    Duan, Lini
    Jiang, Guiminx
    Sfarra, Stefano
    Zhang, Hai
    Jung, Hoekyung
    ELECTRONICS, 2024, 13 (01)
  • [50] Continuous residual reinforcement learning for traffic signal control optimization
    Aslani, Mohainmad
    Seipel, Stefan
    Wiering, Marco
    CANADIAN JOURNAL OF CIVIL ENGINEERING, 2018, 45 (08) : 690 - 702