Offline Policy Iteration Based Reinforcement Learning Controller for Online Robotic Knee Prosthesis Parameter Tuning

被引:0
|
作者
Li, Minhan [1 ,2 ]
Gao, Xiang [3 ]
Wen, Yue [1 ,2 ]
Si, Jennie [3 ]
Huang, He [1 ,2 ]
机构
[1] NC State Univ, NCSU UNC Dept Biomed Engn, Raleigh, NC 27695 USA
[2] Univ North Carolina Chapel Hill, Chapel Hill, NC 27599 USA
[3] Arizona State Univ, Dept Elect Comp & Energy Engn, Tempe, AZ 85281 USA
基金
美国国家科学基金会;
关键词
ANKLE-FOOT PROSTHESIS; IMPEDANCE CONTROL; DESIGN;
D O I
10.1109/icra.2019.8794212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper aims to develop an optimal controller that can automatically provide personalized control of robotic knee prosthesis in order to best support gait of individual prosthesis wearers. We introduced a new reinforcement learning (RL) controller for this purpose based on the promising ability of RL controllers to solve optimal control problems through interactions with the environment without requiring an explicit system model. However, collecting data from a human-prosthesis system is expensive and thus the design of a RL controller has to take into account data and time efficiency. We therefore propose an offline policy iteration based reinforcement learning approach. Our solution is built on the finite state machine (FSM) impedance control framework, which is the most used prosthesis control method in commercial and prototypic robotic prosthesis. Under such a framework, we designed an approximate policy iteration algorithm to devise impedance parameter update rules for 12 prosthesis control parameters in order to meet individual users' needs. The goal of the reinforcement learning-based control was to reproduce near-normal knee kinematics during gait. We tested the RL controller obtained from offline learning in real time experiment involving the same able-bodied human subject wearing a robotic lower limb prosthesis. Our results showed that the RL control resulted in good convergent behavior in kinematic states, and the offline learning control policy successfully adjusted the prosthesis control parameters to produce near-normal knee kinematics in 10 updates of the impedance control parameters.
引用
收藏
页码:2831 / 2837
页数:7
相关论文
共 50 条
  • [1] Online Reinforcement Learning Control for the Personalization of a Robotic Knee Prosthesis
    Wen, Yue
    Si, Jennie
    Brandt, Andrea
    Gao, Xiang
    Huang, He
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2346 - 2356
  • [2] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
    Gao, Xiang
    Si, Jennie
    Wen, Yue
    Li, Minhan
    Huang, He
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
  • [3] Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
    Zheng, Han
    Luo, Xufang
    Wei, Pengfei
    Song, Xuan
    Li, Dongsheng
    Jiang, Jing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11372 - 11380
  • [4] Online model-free controller for flexible wing aircraft: a policy iteration-based reinforcement learning approach
    Mohammed Abouheaf
    Wail Gueaieb
    International Journal of Intelligent Robotics and Applications, 2020, 4 : 21 - 43
  • [6] Online least-squares policy iteration for reinforcement learning control
    Busoniu, Lucian
    Ernst, Damien
    De Schutter, Bart
    Babuska, Robert
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 486 - 491
  • [7] Reinforcement Learning-based Sim-to-Real Impedance Parameter Tuning for Robotic Assembly
    Kim, Yong-Geon
    Na, Minwoo
    Song, Jae-Bok
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 833 - 836
  • [8] SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
    Feng, Jiaheng
    Feng, Mingxiao
    Song, Haolin
    Zhou, Wengang
    Li, Houqiang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11961 - 11969
  • [9] Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
    Xie, Tengyang
    Jiang, Nan
    Wang, Huan
    Xiong, Caiming
    Bai, Yu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] PID Parameter Tuning of Flight Atmospheric Parameter Test System Based on Policy Iteration
    Xu, Tao
    Zhang, Hexuan
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 555 - 560