Reinforcement Learning Augmented Optimization for Smart Mobility

被引:0
|
作者
Overko, Roman [1 ]
Ordonez-Hurtado, Rodrigo [2 ]
Zhuk, Sergiy [2 ]
Shorten, Robert [1 ]
机构
[1] Univ Coll Dublin, Dublin, Ireland
[2] IBM Res, Bldg 3,Damastown Ind Pk, Dublin 15, Ireland
来源
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC) | 2019年
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many mobility applications in smart cities are addressed as optimization problems. However, often, these problems are fragile due to their large-scale and non-convex nature, and also due to uncertainties arising because of human activity. In this paper, we apply a model-based Markov-decision-process (MDP) closed-loop identification algorithm to augment classical optimizers, with a view to alleviating this fragility. Specifically, we use deterministic optimal solutions provided by classical optimizers as initial guesses for MDP's policies, which are then "amended" as a result of online interaction with the environment to cope with uncertainty. Applications are described from niche of smart mobility problems, and numerical results are provided.
引用
收藏
页码:1286 / 1292
页数:7
相关论文
共 50 条
  • [1] Augmented Proximal Policy Optimization for Safe Reinforcement Learning
    Dai, Juntao
    Ji, Jiaming
    Yang, Long
    Zheng, Qian
    Pan, Gang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7288 - 7295
  • [2] Policy Optimization with Augmented Value Targets for Generalization in Reinforcement Learning
    Nafi, Nasik Muhammad
    Poggi-Corradini, Giovanni
    Hsu, William
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [3] Retrieval Augmented Reinforcement Learning
    Goyal, Anirudh
    Friesen, Abram L.
    Weber, Theophane
    Banino, Andrea
    Ke, Nan Rosemary
    Badia, Adria Puigdomenech
    Guez, Arthur
    Mirza, Mehdi
    Humphreys, Peter C.
    Konyushkova, Ksenia
    Sifre, Laurent
    Valko, Michal
    Osindero, Simon
    Lillicrap, Timothy
    Heess, Nicolas
    Blundell, Charles
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Reinforcement Learning with Augmented Data
    Laskin, Michael
    Lee, Kimin
    Stooke, Adam
    Pinto, Lerrel
    Abbeel, Pieter
    Srinivas, Aravind
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [5] Adaptive Stabilizing Control of Smart Transformer Based on Reinforcement Learning Optimization
    Tang, Jian
    Zou, Zhixiang
    Yang, Jiajun
    Buticchi, Giampaolo
    Hua, Wei
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2024, 60 (03) : 4324 - 4337
  • [6] A transfer learning approach to minimize reinforcement learning risks in energy optimization for automated and smart buildings
    Genkin, Mikhail
    McArthur, J. J.
    ENERGY AND BUILDINGS, 2024, 303
  • [7] A reinforcement learning optimization for future smart cities using software defined networking
    Rajkumar, Kulandaivel
    Ramachandran, Manikandan
    Al-Turjman, Fadi
    Patan, Rizwan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (11) : 3221 - 3233
  • [8] Self-Optimization in Smart Production Systems using Distributed Reinforcement Learning
    Schwung, Dorothea
    Modali, Madhav
    Schwung, Andreas
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 4063 - 4068
  • [9] Reinforcement Learning-based Routing Optimization Model for Smart Grid Scenarios
    Fu, Jiajia
    Zhang, Peiming
    Liu, Yuanjie
    PROCEEDINGS OF THE 2024 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS, ICWCSN 2024, 2024, : 39 - 43
  • [10] A reinforcement learning optimization for future smart cities using software defined networking
    Kulandaivel Rajkumar
    Manikandan Ramachandran
    Fadi Al-Turjman
    Rizwan Patan
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 3221 - 3233