Application of Neural ODE with embedded hybrid method for robotic manipulator control

被引:6
|
作者
Meleshkova, Zoya [1 ]
Ivanov, Sergei Evgenievich [1 ]
Ivanova, Lubov [1 ]
机构
[1] ITMO Natl Res Univ ITMO Univ St Petersburg, 49 Kronverksky Pr, St Petersburg, Russia
关键词
machine learning; model-based reinforcement learning; neural ODE; hybrid method;
D O I
10.1016/j.procs.2021.10.032
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper presents an approach to machine learning involving the use of a hybrid method in Neural ODE. The authors consider a way of the Neural ODE application with an embedded hybrid method for intelligent manipulator control in a non-deterministic environment. To reduce computational resources, the number of layers of the neural network using ODE is reduced, the solution of which is carried out by a hybrid transformation method. The hybrid method includes normalizing the system, linear transformation, and polynomial transformation. Using the hybrid method in Neural ODE can increase the learning and computation speed. The transformation method has high accuracy with low computational complexity of the algorithm. As a result of applying the hybrid method, the original nonlinear system is transformed to a simpler form. To solve the simplified transformed system, the scheme of the numerical Runge-Kutta method of the third order with an adaptive step is used. The results of the computational experiment show an increase in the speed and accuracy of the calculation. The presented approach to implementing the hybrid method in Neural ODE allows faster training of the neural network of the world model on limited data obtained from the real environment. After training on the data generated by the model of the world, a robotic arm always successfully reaches the object during subsequent interaction with the real environment. This solves the problem of the limited ability of the agent to interact when learning with the real environment in robotic systems. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:314 / 324
页数:11
相关论文
共 50 条
  • [1] Application of "Consecutive Compensator" Method for Robotic Manipulator Control
    Margun, Alexey
    Zimenko, Konstantin
    Bazylev, Dmitry
    Bobtsov, Alexey
    Kremlev, Artem
    2014 6TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2014, : 341 - 345
  • [2] Hybrid Control of Soft Robotic Manipulator
    Garriga-Casanovas, Arnau
    Shakib, Fahim
    Ferrandy, Varell
    Franco, Enrico
    ACTUATORS, 2024, 13 (07)
  • [3] Neural ODE Machine Learning Method with Embedded Numerical Method
    Televnoy, Andrey
    Ivanov, Sergei
    Zudilova, Tatiana
    Voitiuk, Tatiana
    PROCEEDINGS OF THE 28TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2021, : 451 - 457
  • [4] HYBRID TWIST AND WRENCH CONTROL FOR A ROBOTIC MANIPULATOR
    LIPKIN, H
    DUFFY, J
    JOURNAL OF MECHANISMS TRANSMISSIONS AND AUTOMATION IN DESIGN-TRANSACTIONS OF THE ASME, 1988, 110 (02): : 138 - 144
  • [5] Application of discrete learning control to a robotic manipulator
    Poo, AN
    Lim, KB
    Ma, YX
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 1996, 12 (01) : 55 - 64
  • [6] A neural network application to fault diagnosis for robotic manipulator
    Naughton, JM
    Chen, YC
    Jiang, J
    PROCEEDINGS OF THE 1996 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 1996, : 988 - 993
  • [7] Intelligent control of robotic manipulator using neural networks
    Gupta, P
    Sinha, NK
    (SYSID'97): SYSTEM IDENTIFICATION, VOLS 1-3, 1998, : 849 - 854
  • [8] A Reinforcement Learning Neural Network for Robotic Manipulator Control
    Hu, Yazhou
    Si, Bailu
    NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
  • [9] VISUAL CONTROL OF ROBOTIC MANIPULATOR BASED ON NEURAL NETWORKS
    HASHIMOTO, H
    KUBOTA, T
    SATO, M
    HARASHIMA, F
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1992, 39 (06) : 490 - 496
  • [10] Estimating movements of a robotic manipulator by hybrid constructive neural networks
    Lahnajärvi, JJT
    Lehtokangas, MI
    Saarinen, JPP
    NEUROCOMPUTING, 2004, 56 : 345 - 363