Offline Policy Iteration Based Reinforcement Learning Controller for Online Robotic Knee Prosthesis Parameter Tuning

被引：0

作者：

Li, Minhan ^{[1
,2
]}

Gao, Xiang ^{[3
]}

Wen, Yue ^{[1
,2
]}

Si, Jennie ^{[3
]}

Huang, He ^{[1
,2
]}

机构：

[1] NC State Univ, NCSU UNC Dept Biomed Engn, Raleigh, NC 27695 USA

[2] Univ North Carolina Chapel Hill, Chapel Hill, NC 27599 USA

[3] Arizona State Univ, Dept Elect Comp & Energy Engn, Tempe, AZ 85281 USA

来源：

2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2019年

基金：

美国国家科学基金会;

关键词：

ANKLE-FOOT PROSTHESIS; IMPEDANCE CONTROL; DESIGN;

D O I：

10.1109/icra.2019.8794212

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper aims to develop an optimal controller that can automatically provide personalized control of robotic knee prosthesis in order to best support gait of individual prosthesis wearers. We introduced a new reinforcement learning (RL) controller for this purpose based on the promising ability of RL controllers to solve optimal control problems through interactions with the environment without requiring an explicit system model. However, collecting data from a human-prosthesis system is expensive and thus the design of a RL controller has to take into account data and time efficiency. We therefore propose an offline policy iteration based reinforcement learning approach. Our solution is built on the finite state machine (FSM) impedance control framework, which is the most used prosthesis control method in commercial and prototypic robotic prosthesis. Under such a framework, we designed an approximate policy iteration algorithm to devise impedance parameter update rules for 12 prosthesis control parameters in order to meet individual users' needs. The goal of the reinforcement learning-based control was to reproduce near-normal knee kinematics during gait. We tested the RL controller obtained from offline learning in real time experiment involving the same able-bodied human subject wearing a robotic lower limb prosthesis. Our results showed that the RL control resulted in good convergent behavior in kinematic states, and the offline learning control policy successfully adjusted the prosthesis control parameters to produce near-normal knee kinematics in 10 updates of the impedance control parameters.

引用

页码：2831 / 2837

页数：7

共 50 条

[1] Online Reinforcement Learning Control for the Personalization of a Robotic Knee Prosthesis
Wen, Yue
Si, Jennie
Brandt, Andrea
Gao, Xiang
Huang, He
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2346 - 2356
[2] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
Gao, Xiang
Si, Jennie
Wen, Yue
Li, Minhan
Huang, He
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
[3] Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Zheng, Han
Luo, Xufang
Wei, Pengfei
Song, Xuan
Li, Dongsheng
Jiang, Jing
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11372 - 11380
[4] Online model-free controller for flexible wing aircraft: a policy iteration-based reinforcement learning approach
Mohammed Abouheaf
Wail Gueaieb
International Journal of Intelligent Robotics and Applications, 2020, 4 : 21 - 43
[5] Online model-free controller for flexible wing aircraft: a policy iteration-based reinforcement learning approach
Abouheaf, Mohammed
Gueaieb, Wail
INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2020, 4 (01) : 21 - 43
[6] Online least-squares policy iteration for reinforcement learning control
Busoniu, Lucian
Ernst, Damien
De Schutter, Bart
Babuska, Robert
2010 AMERICAN CONTROL CONFERENCE, 2010, : 486 - 491
[7] Reinforcement Learning-based Sim-to-Real Impedance Parameter Tuning for Robotic Assembly
Kim, Yong-Geon
Na, Minwoo
Song, Jae-Bok
2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 833 - 836
[8] SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
Feng, Jiaheng
Feng, Mingxiao
Song, Haolin
Zhou, Wengang
Li, Houqiang
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11961 - 11969
[9] Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Xie, Tengyang
Jiang, Nan
Wang, Huan
Xiong, Caiming
Bai, Yu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[10] PID Parameter Tuning of Flight Atmospheric Parameter Test System Based on Policy Iteration
Xu, Tao
Zhang, Hexuan
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 555 - 560

← 1 2 3 4 5 →