Human motor learning is robust to control-dependent noise

被引:7
|
作者
Pang, Bo [1 ]
Cui, Leilei [1 ]
Jiang, Zhong-Ping [1 ]
机构
[1] NYU, Dept Elect & Comp Engn, 370 Jay St, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Robustness; Reinforcement learning; Policy iteration; Sensorimotor control; Arm reaching; OPTIMAL FEEDBACK-CONTROL; STOCHASTIC-SYSTEMS; IMPEDANCE CONTROL; ARM MOVEMENTS; MODEL; ADAPTATION; REINFORCEMENT; VARIABILITY; MECHANISMS; UNCERTAINTY;
D O I
10.1007/s00422-022-00922-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Noises are ubiquitous in sensorimotor interactions and contaminate the information provided to the central nervous system (CNS) for motor learning. An interesting question is how the CNS manages motor learning with imprecise information. Integrating ideas from reinforcement learning and adaptive optimal control, this paper develops a novel computational mechanism to explain the robustness of human motor learning to the imprecise information, caused by control-dependent noise that exists inherently in the sensorimotor systems. Starting from an initial admissible control policy, in each learning trial the mechanism collects and uses the noisy sensory data (caused by the control-dependent noise) to form an imprecise evaluation of the performance of the current policy and then constructs an updated policy based on the imprecise evaluation. As the number of learning trials increases, the generated policies mathematically provably converge to a (potentially small) neighborhood of the optimal policy under mild conditions, despite the imprecise information in the learning process. The mechanism directly synthesizes the policies from the sensory data, without identifying an internal forward model. Our preliminary computational results on two classic arm reaching tasks are in line with experimental observations reported in the literature. The model-free control principle proposed in the paper sheds more lights into the inherent robustness of human sensorimotor systems to the imprecise information, especially control-dependent noise, in the CNS.
引用
收藏
页码:307 / 325
页数:19
相关论文
共 50 条
  • [41] Robust iterative learning control for permanent magnet linear motor
    Zhang, Hong-Wei
    Yu, Fa-Shan
    Bu, Xu-Hui
    Wang, Fu-Zhong
    Dianji yu Kongzhi Xuebao/Electric Machines and Control, 2012, 16 (06): : 81 - 86
  • [42] Robust active noise control: An information theoretic learning approach
    Kurian, Nikhil Cherian
    Patel, Kashyap
    George, Nithin V.
    APPLIED ACOUSTICS, 2017, 117 : 180 - 184
  • [43] Stochastic Maximum Principle for Optimal Liquidation with Control-Dependent Terminal Time
    Riccardo Cesari
    Harry Zheng
    Applied Mathematics & Optimization, 2022, 85
  • [44] Robust H2/H∞ control for a class of time-varying nonlinear stochastic systems with state- and control-dependent noises
    Gao, Ming
    Zhu, Zhengmao
    Niu, Yichun
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (07) : 1218 - 1228
  • [45] Linearization of affine systems based on control-dependent changes of independent variable
    Fetisov, D. A.
    DIFFERENTIAL EQUATIONS, 2017, 53 (11) : 1483 - 1494
  • [46] Stochastic Nash Games for Weakly Coupled Large Scale Discrete-Time Systems with State- and Control-Dependent Noise
    Mukaidani, Hiroaki
    Xu, Hua
    Dragan, Vasile
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 1429 - 1435
  • [47] Stochastic Maximum Principle for Optimal Liquidation with Control-Dependent Terminal Time
    Cesari, Riccardo
    Zheng, Harry
    APPLIED MATHEMATICS AND OPTIMIZATION, 2022, 85 (03):
  • [48] State-dependent noise and human balance control
    Cabrera, JL
    Bormann, R
    Eurich, C
    Ohira, T
    Milton, J
    FLUCTUATION AND NOISE LETTERS, 2004, 4 (01): : L107 - L117
  • [49] Robust Learning-Based Control via Bootstrapped Multiplicative Noise
    Gravell, Benjamin
    Summers, Tyler
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 599 - 607
  • [50] Robust output feedback learning control for induction motor servo drives
    Tomei, P.
    Verrelli, C. M.
    Montanari, M.
    Tilli, A.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2009, 19 (15) : 1745 - 1759