Characterizing Motor Control of Mastication With Soft Actor-Critic

被引:4
|
作者
Abdi, Amir H. [1 ]
Sagl, Benedikt [2 ]
Srungarapu, Venkata P. [1 ]
Stavness, Ian [3 ]
Prisman, Eitan [4 ]
Abolmaesumi, Purang [1 ]
Fels, Sidney [1 ]
机构
[1] Univ British Columbia, Elect & Comp Engn Dept, Vancouver, BC, Canada
[2] Med Univ Vienna, Univ Clin Dent, Dept Prosthodont, Vienna, Austria
[3] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[4] Univ British Columbia, Dept Surg, Vancouver, BC, Canada
来源
基金
加拿大健康研究院; 加拿大自然科学与工程研究理事会;
关键词
mastication modeling; reinforcement learning; soft actor-critic; inverse dynamics; jaw; motor control; musculoskeletal modeling; computational biomechanics; INVERSE DYNAMICS; MUSCLE; JAW; MODEL; FORCES; MOVEMENT; PREDICTION; PARAMETERS; HARDNESS; SYSTEM;
D O I
10.3389/fnhum.2020.00188
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The human masticatory system is a complex functional unit characterized by a multitude of skeletal components, muscles, soft tissues, and teeth. Muscle activation dynamics cannot be directly measured on live human subjects due to ethical, safety, and accessibility limitations. Therefore, estimation of muscle activations and their resultant forces is a longstanding and active area of research. Reinforcement learning (RL) is an adaptive learning strategy which is inspired by the behavioral psychology and enables an agent to learn the dynamics of an unknown system via policy-driven explorations. The RL framework is a well-formulated closed-loop system where high capacity neural networks are trained with the feedback mechanism of rewards to learn relatively complex actuation patterns. In this work, we are building on a deep RL algorithm, known as the Soft Actor-Critic, to learn the inverse dynamics of a simulated masticatory system, i.e., learn the activation patterns that drive the jaw to its desired location. The outcome of the proposed training procedure is a parametric neural model which acts as the brain of the biomechanical system. We demonstrate the model's ability to navigate the feasible three-dimensional (3D) envelope of motion with sub-millimeter accuracies. We also introduce a performance analysis platform consisting of a set of quantitative metrics to assess the functionalities of a given simulated masticatory system. This platform assesses the range of motion, metabolic efficiency, the agility of motion, the symmetry of activations, and the accuracy of reaching the desired target positions. We demonstrate how the model learns more metabolically efficient policies by integrating a force regularization term in the RL reward. We also demonstrate the inverse correlation between the metabolic efficiency of the models and their agility and range of motion. The presented masticatory model and the proposed RL training mechanism are valuable tools for the analysis of mastication and other biomechanical systems. We see this framework's potential in facilitating the functional analyses aspects of surgical treatment planning and predicting the rehabilitation performance in post-operative subjects.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Soft Actor-Critic With Integer Actions
    Fan, Ting-Han
    Wang, Yubo
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2611 - 2616
  • [2] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
    Pantoja-Garcia, Luis
    Parra-Vega, Vicente
    Garcia-Rodriguez, Rodolfo
    Vazquez-Garcia, Carlos Ernesto
    [J]. ROBOTICS, 2023, 12 (05)
  • [3] Simultaneous Control and Guidance of an AUV Based on Soft Actor-Critic
    Sola, Yoann
    Le Chenadec, Gilles
    Clement, Benoit
    [J]. SENSORS, 2022, 22 (16)
  • [4] Stepwise Soft Actor-Critic for UAV Autonomous Flight Control
    Hwang, Ha Jun
    Jang, Jaeyeon
    Choi, Jongkwan
    Bae, Jung Ho
    Kim, Sung Ho
    Kim, Chang Ouk
    [J]. DRONES, 2023, 7 (09)
  • [5] Soft Actor-Critic for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Kich, Victor Augusto
    Kolling, Alisson Henrique
    Grando, Ricardo Bedin
    Cuadros, Marco Antonio de Souza Leite
    Gamarra, Daniel Fernando Tello
    [J]. Journal of Intelligent and Robotic Systems: Theory and Applications, 2021, 102 (02):
  • [6] Latent Context Based Soft Actor-Critic
    Pu, Yuan
    Wang, Shaochen
    Yao, Xin
    Li, Bin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] Soft Actor-Critic for Navigation of Mobile Robots
    Junior Costa de Jesus
    Victor Augusto Kich
    Alisson Henrique Kolling
    Ricardo Bedin Grando
    Marco Antonio de Souza Leite Cuadros
    Daniel Fernando Tello Gamarra
    [J]. Journal of Intelligent & Robotic Systems, 2021, 102
  • [8] Soft Actor-Critic for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Kich, Victor Augusto
    Kolling, Alisson Henrique
    Grando, Ricardo Bedin
    Cuadros, Marco Antonio de Souza Leite
    Gamarra, Daniel Fernando Tello
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
  • [9] Model-Based Soft Actor-Critic
    Chien, Jen-Tzung
    Yang, Shu-Hsiang
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 2028 - 2035
  • [10] Characterizing the Gap Between Actor-Critic and Policy Gradient
    Wen, Junfeng
    Kumar, Saurabh
    Gummadi, Ramki
    Schuurmans, Dale
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139