Neural network-based estimation of biomechanical vocal fold parameters

Authors
Donhauser, Jonas [1]
Tur, Bogac [1]
Doellinger, Michael [1]
Affiliations
[1] Friedrich Alexander Univ Erlangen Nurnberg, Univ Hosp Erlangen, Div Phoniatr & Pediat Audiol, Dept Otorhinolaryngol Head & Neck Surg, Erlangen, Germany
Keywords
convolutional recurrent neural network; high-speed video; mass-spring-damper system; vocal fold dynamics; voice physiology; SUBGLOTTAL PRESSURE; MODEL; CLASSIFICATION; OSCILLATION; VARIABILITY; RECORDINGS; SIMULATION; VIBRATIONS; PHONATION; DISORDERS;
DOI
10.3389/fphys.2024.1282574
Chinese Library Classification (CLC) number
Q4 [Physiology];
Discipline classification code
071003;
Abstract
Vocal fold (VF) vibrations are the primary source of human phonation. High-speed video (HSV) endoscopy enables the computation of descriptive VF parameters for assessing the physiological properties of laryngeal dynamics, i.e., the vibration of the VFs. However, the underlying biomechanical factors responsible for physiological and disordered VF vibrations cannot be accessed this way. In contrast, physically based numerical VF models reveal insights into the organ's oscillations that remain inaccessible through endoscopy. To estimate biomechanical properties, previous research has fitted subglottal pressure-driven mass-spring-damper systems to the HSV-recorded VF trajectories as an inverse problem, using global optimization of the numerical model. A neural network trained on the numerical model may be used as a substitute for computationally expensive optimization, yielding a fast-evaluating surrogate of the biomechanical inverse problem. This paper proposes a convolutional recurrent neural network (CRNN)-based architecture trained on regression of a physiologically based biomechanical six-mass model (6MM). To compare with previous research, prediction of the underlying biomechanical factor "subglottal pressure" was tested against 288 HSV ex vivo porcine recordings. The contributions of this work are twofold. First, the presented CRNN with the 6MM handles multiple trajectories along the VFs, which allows for investigation of local changes in VF characteristics. Second, the network was trained on synthetic data to reproduce further important biomechanical model parameters such as VF mass and stiffness. Unlike in previous work, the network in this study is therefore a complete surrogate of the inverse problem, which allows for explicit computation of the fitted model using our approach.
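To make the idea of fitting a pressure-driven mass-spring-damper system as an inverse problem concrete, the following is a minimal sketch under strong simplifying assumptions. It uses a single mass instead of the six-mass model, a constant driving force equal to pressure times an assumed effective area, and a brute-force grid search as a stand-in for global optimization. All names (`simulate`, `fit_pressure`) and all parameter values are hypothetical and are not taken from the paper.

```python
def simulate(pressure_pa, mass_kg=1e-4, stiffness=80.0, damping=0.02,
             area_m2=1e-5, dt=1e-5, steps=2000):
    """Toy forward model: m*x'' + r*x' + k*x = P*A, semi-implicit Euler.

    All default values are illustrative assumptions, not fitted VF parameters.
    """
    x, v = 0.0, 0.0
    force = pressure_pa * area_m2  # constant driving force (assumption)
    trajectory = []
    for _ in range(steps):
        a = (force - damping * v - stiffness * x) / mass_kg
        v += a * dt   # update velocity first ...
        x += v * dt   # ... then position with the new velocity
        trajectory.append(x)
    return trajectory


def fit_pressure(observed, candidates):
    """Toy inverse problem: return the candidate pressure whose simulated
    trajectory has the smallest sum of squared errors to the observation."""
    def sse(p):
        simulated = simulate(p)
        return sum((s - o) ** 2 for s, o in zip(simulated, observed))
    return min(candidates, key=sse)


# Synthetic "recording" generated with a known pressure of 950 Pa.
observed = simulate(950.0)
candidates = [float(p) for p in range(500, 1501, 50)]
estimate = fit_pressure(observed, candidates)
print(estimate)
```

Every candidate requires a full simulation, which is why optimization-based fitting becomes expensive for larger models; a trained network replaces this search with a single forward pass.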
The presented approach achieves a best-case mean absolute error (MAE) of 133 Pa (13.9%) in subglottal pressure prediction, with a correlation of 76.6% on experimental data, and a re-estimated fundamental frequency MAE of 15.9 Hz (9.9%). Detailed training analysis revealed subglottal pressure as the most learnable parameter. With its physiologically based model design and advances in fast parameter prediction, this work is a next step in biomechanical VF model fitting and the estimation of laryngeal kinematics.
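The reported error measures can be illustrated with a short sketch. The pressure values below are made up for illustration and are not data from the study. The abstract does not state how the percentage error is normalized, so dividing the MAE by the mean measured value is an assumption, as is the use of the Pearson coefficient for the correlation.

```python
import math


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between measured and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical measured and predicted subglottal pressures in Pa.
measured = [800.0, 1000.0, 1200.0, 1400.0]
predicted = [900.0, 950.0, 1300.0, 1250.0]

mae = mean_absolute_error(measured, predicted)
# Assumed normalization: MAE relative to the mean measured pressure.
relative_mae = mae / (sum(measured) / len(measured))
r = pearson_r(measured, predicted)
print(mae, relative_mae, r)
```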
Pages: 16