A methodology for voice classification based on the personalized fundamental frequency estimation

被引:22
|
作者
Verde, Laura [1 ]
De Pietro, Giuseppe [2 ]
Sannino, Giovanna [2 ]
机构
[1] Univ Naples Parthenope, Dept Engn, Ctr Direz, Isola C4, Naples, Italy
[2] Natl Res Council Italy, Inst High Performance Comp & Networking CNR ICAR, Via Pietro Castellino 111, Naples, Italy
关键词
Voice disorders; Signal processing; DFundamental frequency analysis; m-Health system; RISK-FACTORS; DISORDERS; SPEECH; GENDER; POPULATION; PREVALENCE; DYSPHONIA; TEACHERS; AGE;
D O I
10.1016/j.bspc.2018.01.007
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Nowadays, the incidence of voice disorders is increasing rapidly, with about a third of the population suffering from dysphonia at some point in their lives. Dysphonia is a disorder that alters vocal quality and can impair and reduce the quality of life. The structural or functional alteration of the phonatory apparatus, unhealthy lifestyles or an excessive use of the vocal cords for work activities (e.g. teaching) can cause voice disorders. Unfortunately, people who suffer from dysphonia often underestimate its symptoms and therefore delay consulting a speech therapist for accurate voice assessment and treatment. Voice disorder evaluation involves a series of tests, including an acoustic analysis. This quantifies the measurements of voice quality through the evaluation of certain characteristic parameters, for example the fundamental frequency (F-0). In this paper, a personalized methodology for the estimation of the F-0 is presented. The personalization is accomplished by taking into account two of the main factors that influence the F-0, the gender and age of the subject. The estimation of the F-0 is crucial for the classification of the voice signal, because the discrimination of a healthy voice from a pathological one is achieved by evaluating the inclusion of the F-0 value within the healthy range. To evaluate the presented methodology, we have carried out a set of tests by using some voice signals selected from an available database in order to compare the classification ability of the proposed methodology with other algorithms existing in the literature. The numerical results obtained show that the proposed methodology provides a good accuracy, sensitivity, and specificity, respectively of over 77%, 72% and 81%, values better than those achieved by the most frequently other used and cited fundamental frequency estimation algorithms. Additionally, a statistical analysis to evaluate whether or not a statistically significant difference exists between the accuracy, sensitivity and specificity has been carried out. The outcome of the ANOVA tests and of the t-tests confirms that there is a significant difference between the proposed methodology and the other algorithms. Finally, the presented methodology could be embedded in a portable and simple m-health application that could be useful for the monitoring of the state of vocal health and the prevention of voice disorders. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:134 / 144
页数:11
相关论文
共 50 条
  • [1] Fundamental frequency estimation of voice of patients with laryngeal disorders
    Mitev, P
    Hadjitodorov, S
    [J]. INFORMATION SCIENCES, 2003, 156 (1-2) : 3 - 19
  • [2] Estimation of air traffic controllers' fatigue based on the analysis of the human voice's fundamental frequency
    Canas, Jose J.
    Munoz-de-Escalona, Enrique
    de Frutos, Patricia Lopez
    Rodriguez, Ruben
    Celorrio, Fernando
    [J]. INTERNATIONAL JOURNAL OF HUMAN FACTORS AND ERGONOMICS, 2022, 9 (04)
  • [3] Speech Emotion Recognition Based on Voice Fundamental Frequency
    Dimitrova-Grekow, Teodora
    Klis, Aneta
    Igras-Cybulska, Magdalena
    [J]. ARCHIVES OF ACOUSTICS, 2019, 44 (02) : 277 - 286
  • [4] ON MODELING FUNDAMENTAL VOICE FREQUENCY
    COOPER, WE
    [J]. CONTEMPORARY PSYCHOLOGY, 1983, 28 (07): : 570 - 570
  • [5] Voice/Non-Voice classification using reliable fundamental frequency estimator for voice activated powered wheelchair control
    Suk, Soo-Young
    Chung, Hyun-Yeol
    Kojima, Hiroaki
    [J]. EMBEDDED SOFTWARE AND SYSTEMS, PROCEEDINGS, 2007, 4523 : 347 - +
  • [6] Validation of an Algorithm for Semi-automated Estimation of Voice Relative Fundamental Frequency
    Lien, Yu-An S.
    Murray, Elizabeth S. Heller
    Calabrese, Carolyn R.
    Michener, Carolyn M.
    Van Stan, Jarrad H.
    Mehta, Daryush D.
    Hillman, Robert E.
    Noordzij, J. Pieter
    Stepp, Cara E.
    [J]. ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2017, 126 (10): : 712 - 716
  • [7] Fundamental Frequency Estimation Based on Mean Values
    Ardeleanu, Andrei Sebastian
    Temneanu, Marinel
    [J]. 2013 8TH INTERNATIONAL SYMPOSIUM ON ADVANCED TOPICS IN ELECTRICAL ENGINEERING (ATEE), 2013,
  • [8] FUNDAMENTAL-FREQUENCY DETERMINATION BASED ON INSTANTANEOUS FREQUENCY ESTIMATION
    QIU, LJ
    YANG, HY
    KOH, SN
    [J]. SIGNAL PROCESSING, 1995, 44 (02) : 233 - 241
  • [9] Fundamental frequency estimation based on instantaneous frequency amplitude spectrum
    Tanaka, T
    Kobayashi, T
    Arifianto, D
    Masuko, T
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 329 - 332
  • [10] VOICE FUNDAMENTAL FREQUENCY IN SATURATION DIVING
    HOLLIEN, H
    HICKS, JW
    SHEARER, W
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S46 - S46