Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

被引:2
|
作者
Ruofan Wu [1 ]
Zhikai Yao [1 ]
Jennie Si [2 ,1 ]
He(Helen) Huang [2 ,3 ,4 ]
机构
[1] the School of Electrical, Computer and Energy Engineering, Arizona State University
[2] IEEE
[3] the Department of Biomedical Engineering, North Carolina State University
[4] the University of North Carolina at Chapel Hill
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP242 [机器人];
学科分类号
1111 ;
摘要
We address a state-of-the-art reinforcement learning(RL) control approach to automatically configure robotic prosthesis impedance parameters to enable end-to-end, continuous locomotion intended for transfemoral amputee subjects.Specifically, our actor-critic based RL provides tracking control of a robotic knee prosthesis to mimic the intact knee profile. This is a significant advance from our previous RL based automatic tuning of prosthesis control parameters which have centered on regulation control with a designer prescribed robotic knee profile as the target. In addition to presenting the tracking control algorithm based on direct heuristic dynamic programming(d HDP), we provide a control performance guarantee including the case of constrained inputs. We show that our proposed tracking control possesses several important properties, such as weight convergence of the learning networks, Bellman(sub)optimality of the cost-to-go value function and control input, and practical stability of the human-robot system. We further provide a systematic simulation of the proposed tracking control using a realistic human-robot system simulator, the Open Sim, to emulate how the d HDP enables level ground walking, walking on different terrains and at different paces. These results show that our proposed d HDP based tracking control is not only theoretically suitable, but also practically useful.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 50 条
  • [41] Soft Actor-Critic Deep Reinforcement Learning for Fault-Tolerant Flight Control
    Dally, Killian
    van Kampen, Erik-Jan
    arXiv, 2022,
  • [42] Soft Actor-Critic Reinforcement Learning-Based Optimization for Analog Circuit Sizing
    Park, Sejin
    Choi, Youngchang
    Kang, Seokhyeong
    2023 20TH INTERNATIONAL SOC DESIGN CONFERENCE, ISOCC, 2023, : 47 - 48
  • [43] Dynamic Pricing Based on Demand Response Using Actor-Critic Agent Reinforcement Learning
    Ismail, Ahmed
    Baysal, Mustafa
    ENERGIES, 2023, 16 (14)
  • [44] Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design
    Elsborg, Jonas
    Bhowmik, Arghya
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (12) : 3731 - 3741
  • [45] Taming chimeras in coupled oscillators using soft actor-critic based reinforcement learning
    Ding, Jianpeng
    Lei, Youming
    Small, Michael
    CHAOS, 2025, 35 (01)
  • [46] A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation
    Wen, Shuhuan
    Shu, Yili
    Rad, Ahmad
    Wen, Zeteng
    Guo, Zhengzheng
    Gong, Simeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [47] DAG-based workflows scheduling using Actor-Critic Deep Reinforcement Learning
    Koslovski, Guilherme Piegas
    Pereira, Kleiton
    Albuquerque, Paulo Roberto
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 150 : 354 - 363
  • [48] Robust Active Simultaneous Localization and Mapping Based on Bayesian Actor-Critic Reinforcement Learning
    Pedraza, Bryan
    Dera, Dimah
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 63 - 66
  • [49] Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks
    Jiang, Wei
    Feng, Daquan
    Sun, Yao
    Feng, Gang
    Wang, Zhenzhong
    Xia, Xiang-Gen
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (02) : 1239 - 1252
  • [50] Power Allocation in Dual Connectivity Networks Based on Actor-Critic Deep Reinforcement Learning
    Moein, Elham
    Hasibi, Ramin
    Shokri, Matin
    Rasti, Mehdi
    17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 170 - 177