Refining algorithmic estimation of relative fundamental frequency: Accounting for sample characteristics and fundamental frequency estimation method

被引:14
|
作者
Vojtech, Jennifer M. [1 ,2 ]
Segina, Roxanne K. [2 ]
Buckley, Daniel P. [2 ,3 ]
Kolin, Katharine R. [2 ]
Tardif, Monique C. [2 ]
Noordzij, J. Pieter [3 ]
Stepp, Cara E. [1 ,2 ,4 ]
机构
[1] Boston Univ, Dept Biomed Engn, 44 Cummington Mall, Boston, MA 02215 USA
[2] Boston Univ, Dept Speech Language & Hearing Sci, 635 Commonwealth Ave, Boston, MA 02215 USA
[3] Boston Univ, Sch Med, Dept Otolaryngol Head & Neck Surg, 72 East Concord St, Boston, MA 02118 USA
[4] Boston Univ, Sch Med, Dept Otolaryngol Head & Neck Surg, Boston, MA 02215 USA
来源
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
AUDITORY-PERCEPTUAL EVALUATION; VOICING OFFSET; VOCAL EFFORT; SPASMODIC DYSPHONIA; PITCH STRENGTH; ONSET; SPEECH; MUSCLE; HYPERFUNCTION; INDIVIDUALS;
D O I
10.1121/1.5131025
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Relative fundamental frequency (RFF) is a promising acoustic measure for evaluating voice disorders. Yet, the accuracy of the current RFF algorithm varies across a broad range of vocal signals. The authors investigated how fundamental frequency (f(o)) estimation and sample characteristics impact the relationship between manual and semi-automated RFF estimates. Acoustic recordings were collected from 227 individuals with and 256 individuals without voice disorders. Common f(o) estimation techniques were compared to the autocorrelation method currently implemented in the RFF algorithm. Pitch strength-based categories were constructed using a training set (1158 samples), and algorithm thresholds were tuned to each category. RFF was then computed on an independent test set (291 samples) using category-specific thresholds and compared against manual RFF via mean bias error (MBE) and root-mean-square error (RMSE). Auditory-SWIPE' for f(o) estimation led to the greatest correspondence with manual RFF and was implemented in concert with category-specific thresholds. Refining f(o) estimation and accounting for sample characteristics led to increased correspondence with manual RFF [MBE - 0.01 semitones (ST), RMSE - 0.28 ST] compared to the unmodified algorithm (MBE - 0.90 ST, RMSE - 0.34 ST), reducing the MBE and RMSE of semi-automated RFF estimates by 88.4% and 17.3%, respectively. (C) 2019 Acoustical Society of America.
引用
收藏
页码:3184 / 3202
页数:19
相关论文
共 50 条
  • [1] Automated Estimation of Relative Fundamental Frequency
    Lien, Yu-An S.
    Stepp, Cara E.
    [J]. 2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013, : 2136 - 2139
  • [2] AN EXACT SUBSPACE METHOD FOR FUNDAMENTAL FREQUENCY ESTIMATION
    Christensen, Mads Graesboll
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6802 - 6806
  • [3] Toolbox for fundamental frequency estimation
    Kasi, K
    Gu, L
    Zahorian, SA
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4020 - 4020
  • [4] Default Bayesian Estimation of the Fundamental Frequency
    Nielsen, Jesper Kjaer
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (03): : 598 - 610
  • [5] SIFT ALGORITHM FOR FUNDAMENTAL FREQUENCY ESTIMATION
    MARKEL, JD
    [J]. IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (05): : 367 - &
  • [6] FUNDAMENTAL FREQUENCY ESTIMATION BY HARMONIC IDENTIFICATION
    SNOW, TB
    HUGHES, GW
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1969, 45 (01): : 316 - &
  • [7] Fundamental Frequency and its Harmonics Model: A Robust Method of Estimation
    Kundu, Debasis
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (02) : 1007 - 1029
  • [8] A SINGLE SNAPSHOT OPTIMAL FILTERING METHOD FOR FUNDAMENTAL FREQUENCY ESTIMATION
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4272 - 4275
  • [9] Estimation of the fundamental frequency of the speech signal modeled by the SYMPES method
    Milivojevic, Zoran N.
    Mirkovic, Milorad Dj.
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2009, 63 (03) : 200 - 208
  • [10] Fundamental Frequency and its Harmonics Model: A Robust Method of Estimation
    Debasis Kundu
    [J]. Circuits, Systems, and Signal Processing, 2024, 43 (2) : 1007 - 1029