Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

被引：2

作者：

Ruofan Wu ^{[1
]}

Zhikai Yao ^{[1
]}

Jennie Si ^{[2
,1
]}

He(Helen) Huang ^{[2
,3
,4
]}

机构：

[1] the School of Electrical, Computer and Energy Engineering, Arizona State University

[2] IEEE

[3] the Department of Biomedical Engineering, North Carolina State University

[4] the University of North Carolina at Chapel Hill

来源：

IEEE/CAA Journal of Automatica Sinica | 2022年 / 9卷 / 01期

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP242 [机器人];

学科分类号：

1111 ;

摘要：

We address a state-of-the-art reinforcement learning(RL) control approach to automatically configure robotic prosthesis impedance parameters to enable end-to-end, continuous locomotion intended for transfemoral amputee subjects.Specifically, our actor-critic based RL provides tracking control of a robotic knee prosthesis to mimic the intact knee profile. This is a significant advance from our previous RL based automatic tuning of prosthesis control parameters which have centered on regulation control with a designer prescribed robotic knee profile as the target. In addition to presenting the tracking control algorithm based on direct heuristic dynamic programming(d HDP), we provide a control performance guarantee including the case of constrained inputs. We show that our proposed tracking control possesses several important properties, such as weight convergence of the learning networks, Bellman(sub)optimality of the cost-to-go value function and control input, and practical stability of the human-robot system. We further provide a systematic simulation of the proposed tracking control using a realistic human-robot system simulator, the Open Sim, to emulate how the d HDP enables level ground walking, walking on different terrains and at different paces. These results show that our proposed d HDP based tracking control is not only theoretically suitable, but also practically useful.

引用

页码：19 / 30

页数：12

共 50 条

[41] Soft Actor-Critic Deep Reinforcement Learning for Fault-Tolerant Flight Control
Dally, Killian
van Kampen, Erik-Jan
arXiv, 2022,
[42] Soft Actor-Critic Reinforcement Learning-Based Optimization for Analog Circuit Sizing
Park, Sejin
Choi, Youngchang
Kang, Seokhyeong
2023 20TH INTERNATIONAL SOC DESIGN CONFERENCE, ISOCC, 2023, : 47 - 48
[43] Dynamic Pricing Based on Demand Response Using Actor-Critic Agent Reinforcement Learning
Ismail, Ahmed
Baysal, Mustafa
ENERGIES, 2023, 16 (14)
[44] Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design
Elsborg, Jonas
Bhowmik, Arghya
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (12) : 3731 - 3741
[45] Taming chimeras in coupled oscillators using soft actor-critic based reinforcement learning
Ding, Jianpeng
Lei, Youming
Small, Michael
CHAOS, 2025, 35 (01)
[46] A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation
Wen, Shuhuan
Shu, Yili
Rad, Ahmad
Wen, Zeteng
Guo, Zhengzheng
Gong, Simeng
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
[47] DAG-based workflows scheduling using Actor-Critic Deep Reinforcement Learning
Koslovski, Guilherme Piegas
Pereira, Kleiton
Albuquerque, Paulo Roberto
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 150 : 354 - 363
[48] Robust Active Simultaneous Localization and Mapping Based on Bayesian Actor-Critic Reinforcement Learning
Pedraza, Bryan
Dera, Dimah
2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 63 - 66
[49] Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks
Jiang, Wei
Feng, Daquan
Sun, Yao
Feng, Gang
Wang, Zhenzhong
Xia, Xiang-Gen
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (02) : 1239 - 1252
[50] Power Allocation in Dual Connectivity Networks Based on Actor-Critic Deep Reinforcement Learning
Moein, Elham
Hasibi, Ramin
Shokri, Matin
Rasti, Mehdi
17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 170 - 177

← 1 2 3 4 5 →