Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion

Cited by: 5
Authors
Zhang, Zhaoyan [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Head & Neck Surg, 31-24 Rehabil Ctr,1000 Veteran Ave, Los Angeles, CA 90095 USA
Keywords
Voice inversion; Vocal fold geometry; Vocal fold stiffness; Machine learning; BODY-COVER MODEL; PARAMETERS; VIBRATION; ACOUSTICS; VARIABLES; STIFFNESS;
DOI
10.1016/j.jvoice.2021.03.004
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104; 100213;
Abstract
Objective. Estimation of physiological control parameters of the vocal system from the produced voice outcome has important applications in clinical management of voice disorders. Previously we developed a [...] pressure from voice outcome features that characterize the acoustics of the produced voice. The goals of this study are to (1) explore the possibility of improving the estimation accuracy of physiological control parameters by including voice outcome features characterizing vocal fold vibration; and (2) identify voice feature sets that optimize both estimation accuracy and robustness to measurement noise.
Methods. Feedforward neural networks are trained to solve the inversion problem of estimating the physiological control parameters of a three-dimensional body-cover vocal fold model from different sets of voice outcome features that characterize the simulated voice acoustics, glottal flow, and vocal fold vibration. A sensitivity analysis is then performed to evaluate the contribution of individual voice features to the overall performance of the neural networks in estimating the physiological control parameters.
Results and conclusions. Including voice outcome features that characterize vocal fold vibration increases estimation accuracy, but it also reduces the network's robustness to measurement noise. The reason is that network performance is highly sensitive to features measuring the absolute amplitudes of the glottal flow and area waveforms, which are also difficult to measure accurately in practical applications. By excluding such glottal flow-based features and replacing glottal area-based features with their normalized counterparts, we are able to significantly improve both estimation accuracy and robustness to noise. We further show that similar estimation accuracy and robustness can be achieved with an even smaller set of voice outcome features by excluding features of small sensitivity.
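The per-feature sensitivity analysis mentioned in the abstract can be illustrated with a minimal sketch. This is not the author's implementation: it assumes a noise-perturbation definition of sensitivity (the increase in mean absolute error when one input feature is corrupted), and it uses a linear least-squares model on synthetic data as a stand-in for the trained neural network. The names `feature_sensitivity`, `predict`, and `noise_level` are hypothetical.

```python
import numpy as np

def feature_sensitivity(predict, X, y, noise_level=0.1, seed=0):
    """Increase in mean absolute error when noise is added to one feature at a time.

    Assumption: sensitivity is defined by noise perturbation; the paper may
    use a different definition.
    """
    rng = np.random.default_rng(seed)
    base_err = np.mean(np.abs(predict(X) - y))
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        X_noisy = X.copy()
        # Perturb only feature j, with noise scaled by that feature's spread.
        X_noisy[:, j] += noise_level * X[:, j].std() * rng.standard_normal(len(X))
        scores[j] = np.mean(np.abs(predict(X_noisy) - y)) - base_err
    return scores

# Synthetic stand-in data: three "voice features", one "control parameter".
rng = np.random.default_rng(1)
X = rng.standard_normal((500, 3))
true_W = np.array([[2.0], [0.5], [0.0]])  # the third feature carries no information
y = X @ true_W

# Linear least-squares fit standing in for the trained inverse model.
W, *_ = np.linalg.lstsq(X, y, rcond=None)
predict = lambda A: A @ W

scores = feature_sensitivity(predict, X, y)
ranking = np.argsort(scores)[::-1]  # feature indices, most sensitive first
```

Under this definition, a feature with a high score is one whose measurement noise degrades the estimate most, while a feature with a score near zero is a candidate for exclusion from a reduced feature set.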
Pages: 479 - 485
Number of pages: 7
Related papers
50 in total
  • [21] Functional Feature Selection by Weighted Projections in Pathological Voice Detection
    Giraldo, Luis Sanchez
    Martinez Tabares, Fernando
    Castellanos Dominguez, German
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 329 - +
  • [22] Machine learning predictive models with voice parameters for prediction of difficult laryngoscopies: a prospective cohort
    Carvalho, Clistenes
    Souza, Ana Beatriz
    Regueira, Stephanie
    [J]. ANESTHESIA AND ANALGESIA, 2021, 133 (3S_SUPPL): 456 - 456
  • [23] Impact of Feature Selection Techniques on the Performance of Machine Learning Models for Depression Detection Using EEG Data
    Hassan, Marwa
    Kaabouch, Naima
    [J]. Applied Sciences (Switzerland), 2024, 14 (22):
  • [24] An application of machine learning with feature selection to improve diagnosis and classification of neurodegenerative disorders
    Diaz Alvarez, Josefa
    Matias-Guiu, Jordi A.
    Nieves Cabrera-Martin, Maria
    Risco-Martin, Jose L.
    Ayala, Jose L.
    [J]. BMC BIOINFORMATICS, 2019, 20 (01)
  • [25] An application of machine learning with feature selection to improve diagnosis and classification of neurodegenerative disorders
    Josefa Díaz Álvarez
    Jordi A. Matias-Guiu
    María Nieves Cabrera-Martín
    José L. Risco-Martín
    José L. Ayala
    [J]. BMC Bioinformatics, 20
  • [26] Heuristic Model to Improve Feature Selection Based on Machine Learning in Data Mining
    Majumdar, Jahin
    Mal, Anwesha
    Gupta, Shruti
    [J]. 2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 73 - 77
  • [27] DETECTING SLEEP DEFICIENCY WITH VOICE BIOMARKERS AND MACHINE LEARNING
    Zhang, Boyu
    Ronda, Joseph
    Yuan, Robin
    Duffy, Jeanne
    Czeisler, Charles
    [J]. SLEEP, 2024, 47
  • [28] Voice in Parkinson's Disease: A Machine Learning Study
    Suppa, Antonio
    Costantini, Giovanni
    Asci, Francesco
    Di Leo, Pietro
    Al-Wardat, Mohammad Sami
    Di Lazzaro, Giulia
    Scalise, Simona
    Pisani, Antonio
    Saggio, Giovanni
    [J]. FRONTIERS IN NEUROLOGY, 2022, 13
  • [29] Voice Pathology Detection Using Machine Learning Technique
    AL-Dhief, Fahad Taha
    Mu, Nurul
    Abd Malik, Nik Noordini Nik
    Sabri, Naseer
    Baki, Marina Mat
    Albadr, Musatafa Abbas Abbood
    Abbas, Aymen Fadhil
    Hussein, Yaqdhan Mahmood
    Mohammed, Mazin Abed
    [J]. 2020 IEEE 5TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATION TECHNOLOGIES (ISTT), 2020, : 99 - 104
  • [30] A Novel feature reduction method to improve the performance of Machine Learning model
    Mirniaharikandehei, Seyedehnafiseh
    Heidari, Morteza
    Danala, Gopichandh
    Lakshmivarahan, Sivaramakrishnan
    Zheng, Bin
    [J]. MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597