Cooperative Markov Decision Process model for human-machine co-adaptation in robot-assisted rehabilitation

被引:0
|
作者
Guo, Kairui [2 ]
Cheng, Adrian [2 ]
Li, Yaqi [1 ]
Li, Jun [2 ]
Duffield, Rob [2 ]
Su, Steven Weidong [1 ]
机构
[1] Shandong First Med Univ, Shandong Acad Med Sci, Coll Artificial Intelligence & Big Data Med Sci, 6699 Qingdao Rd, Jinan 250024, Shandong, Peoples R China
[2] Univ Technol Sydney, 61 Broadway, Ultimo, NSW 2007, Australia
关键词
Reinforcement learning; Cooperative Markov Decision Processes; Multi-agent reinforcement learning; Model-based reinforcement learning; Rehabilitation engineering; MECHANISM; SYSTEMS;
D O I
10.1016/j.knosys.2024.111572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-machine interaction is a critical component in robotic rehabilitation systems. A mutual learning strategy involving both machine- and human -oriented learning has shown improvements in learning efficiency and receptiveness. Despite these advancements, a theoretical framework that encompasses high-level human responses during robot -assisted rehabilitation is still needed. This paper introduces a novel human-machine interface that uses a Co -adaptive Markov Decision Process (CaMDP) model based on cooperative multi -agent reinforcement learning. The CaMDP model effectively measures user adaptation to machines, treating the entire rehabilitation process as a collaborative learning experience. It quantifies learning rates at a higher system abstraction level. Policy Iteration in Reinforcement Learning is employed for the cooperative adjustment of Policy Improvement between the human and machine. Simulation studies demonstrate that the proposed new Policy Improvement approach has great potential to address non-stationarity issues and significantly reduce the switching frequency of patients - from 16.6% to 2% in a sample of 120,000 cases. The CaMDP model provides valuable insights into rehabilitation effect prediction and risk avoidance through dual -agent simulation, thereby enhancing the overall performance of the rehabilitation process.
引用
收藏
页数:9
相关论文
共 16 条
  • [1] Progressive Co-adaptation in Human-Machine Interaction
    Gallina, Paolo
    Bellotto, Nicola
    Di Luca, Massimiliano
    ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 2, 2015, : 362 - 368
  • [2] AUTOMATED CONTROL MEETS BEHAVIOR: HUMAN-MACHINE CO-ADAPTATION OF THE ARTIFICIAL PANCREAS
    Colmegna, P.
    DIABETES TECHNOLOGY & THERAPEUTICS, 2023, 25 : A15 - A15
  • [3] Analysis of Human-Machine Cooperative Robot and haptic interaction for stroke rehabilitation
    Qian, Zhiqin
    Jiang, Tingting
    Liu, Yifan
    Bi, Zhuming
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 643 - 648
  • [4] First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization
    Reddy, Siddharth
    Levine, Sergey
    Dragan, Anca D.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Editorial: Embodiment and Co-adaptation Through Human-Machine Interfaces: At the Border of Robotics, Neuroscience and Psychology
    Beckerle, Philipp
    Castellini, Claudio
    Lenggenhager, Bigna
    Dosen, Strahinja
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [6] A computational model of human decision making and learning for assessment of co-adaptation in neuro-adaptive human-robot interaction
    Ehrlich, Stefan K.
    Cheng, Gordon
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 264 - 271
  • [7] An Approach Analyzing Cognitive Process of Human-Machine Interaction Based on Extended Markov Decision Process
    Ren, Bingxuan
    Yin, Tangwen
    Fu, Shan
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1306 - 1311
  • [8] Design of Human-Machine Compatible Ankle Rehabilitation Robot Based on Equivalent Human Ankle Model
    Liu, Ya
    Lu, Wenjuan
    Ma, Huiting
    Deng, Yunjiao
    Hu, Bo
    Zeng, Daxing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1230 - 1237
  • [9] Use of Unobtrusive Human-Machine Interface for Rehabilitation of Stroke victims through Robot Assisted Mirror therapy
    Narang, Gautam
    Narang, Arjun
    Singh, Soumya
    Lempiainen, Juhani
    2013 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGIES FOR PRACTICAL ROBOT APPLICATIONS (TEPRA), 2013,
  • [10] Regionalized Decision Algorithm for Human-Machine Shared Control Based on Gaussian Hidden Markov Model
    Liu, Fang
    Zhu, Tian-He
    Su, Wei-Xing
    Liu, Yang
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (11): : 2659 - 2667