Bayesian Optimization for Efficient Tuning of Visual Servo and Computed Torque Controllers in a Reinforcement Learning Scenario

被引:0
|
作者
Ribeiro, Eduardo G. [1 ]
Mendes, Raul Q. [1 ]
Terra, Marco H. [1 ]
Grassi Jr, Valdir [1 ]
机构
[1] Univ Sao Paulo, Sao Carlos Sch Engn, Dept Elect & Comp Engn, Sao Carlos, Brazil
基金
巴西圣保罗研究基金会;
关键词
GLOBAL OPTIMIZATION;
D O I
10.1109/ICAR53236.2021.9659363
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although the search for optimal parameters is a central concern for the design stage of control systems, this adjustment is generally not optimized in the design of visual servo controllers. However, for a classic position-based visual servo controller, the choice of the proportional gain that multiplies the computed error may directly affect the system's performance, and may even lead to instability. On the other hand, adjusting such a parameter can be a time-consuming and hard-working task. Thus, in this work, we propose to automate the search for the linear and angular gains of a visual servo controller through Bayesian optimization. We simulate the environment in Matlab with a Kinova GEN3 7DOF robot in a reinforcement learning scenario, in which the projected cost function is evaluated directly on the robot. We demonstrate that Bayesian optimization is capable of finding the visual servo controller gains, as well as the robot internal controller gains, with up to 13 and 14 times fewer iterations when compared to an on-police actor-critic model-free algorithm and the genetic algorithm respectively. Furthermore, we show that the obtained controller performs better considering different control performance parameters and in qualitative evaluations regarding the Cartesian and image spaces.
引用
收藏
页码:282 / 289
页数:8
相关论文
共 50 条
  • [41] Dynamic Tuning of PI-Controllers based on Model-free Reinforcement Learning Methods
    Brujeni, Lena Abbasi
    Lee, Jong Min
    Shah, Sirish L.
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 453 - 458
  • [42] Parameters tuning and optimization for Reinforcement Learning algorithms using Evolutionary Computing
    Fernandez, Franklin Cardenoso
    Caarls, Wouter
    PROCEEDINGS 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER SCIENCE (INCISCOS 2018), 2018, : 301 - 305
  • [43] Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning
    Ruan, Chunqing
    Wang, Hongjian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 293 - 303
  • [44] Reinforcement learning-trained optimisers and Bayesian optimisation for online particle accelerator tuning
    Kaiser, Jan
    Xu, Chenran
    Eichler, Annika
    Garcia, Andrea Santamaria
    Stein, Oliver
    Bruendermann, Erik
    Kuropka, Willi
    Dinter, Hannes
    Mayet, Frank
    Vinatier, Thomas
    Burkart, Florian
    Schlarb, Holger
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [45] Bayesian Optimization for Efficient Heterogeneous MPSoC based DNN Accelerator Runtime Tuning
    Zhu, Xuqi
    Gao, Cong
    Saha, Sangeet
    Zhai, Xiaojun
    McDonald-Maier, Klaus D.
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 355 - 356
  • [46] Guided Bayesian Optimization: Data-Efficient Controller Tuning With Digital Twin
    Nobar, Mahdi
    Keller, Jurg
    Rupenyan, Alisa
    Khosravi, Mohammad
    Lygeros, John
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [47] A visual servo reinforcement learning control of uncalibrated manipulators with multi-channel gain decision
    Wang, Bingsen
    Dong, Jiuxiang
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2025, 47 (02) : 265 - 277
  • [48] Efficient LQR Parameter Tuning for a Flying Inverted Pendulum via Bayesian Optimization
    Park, Jinwoo
    Lee, Changhyeon
    Kim, Donghyeong
    Han, Soohee
    2024 24TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, ICCAS 2024, 2024, : 691 - 696
  • [49] Adaptive PID computed-torque control of robot manipulators based on DDPG reinforcement learning
    Ghediri, Akram
    Lamamra, Kheireddine
    Kaki, Abdelaziz Ait
    Vaidyanathan, Sundarapandian
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2022, 41 (03) : 173 - 182
  • [50] Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization
    Shahrooei, Zahra
    Kochenderfer, Mykel J.
    Baheri, Ali
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,