Discrimination of pathological voices using a time-frequency approach

Cited by: 96
Authors
Umapathy, K
Krishnan, S
Parsa, V
Jamieson, DG
Affiliations
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
[2] Univ Western Ontario, Natl Ctr Audiol, London, ON N6G 1H1, Canada
Keywords
matching pursuit; pathological voice; pattern classification; speech disorders; time-frequency distributions
DOI
10.1109/TBME.2004.842962
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Acoustical measures of vocal function are routinely used in the assessment of disordered voice and for monitoring the patient's progress over the course of voice therapy. Typically, acoustic measures are extracted from sustained vowel stimuli, where short-term and long-term perturbations in fundamental frequency and intensity, and the level of "glottal noise", are used to characterize vocal function. However, acoustic measures extracted from continuous speech samples may well be required for accurate prediction of abnormal voice quality that is relevant to the client's "real world" experience. In contrast with sustained vowel research, the literature on the effectiveness of acoustic measures extracted from continuous speech samples is relatively sparse. This is partially due to the challenge of segmenting the speech signal into voiced, unvoiced, and silence periods before features can be extracted for vocal function characterization. In this paper, we propose a joint time-frequency approach for classifying pathological voices using continuous speech signals that obviates the need for such segmentation. The speech signals were decomposed using an adaptive time-frequency transform algorithm, and several features (the octave max, octave mean, energy ratio, length ratio, and frequency ratio) were extracted from the decomposition parameters and analyzed using statistical pattern classification techniques. Experiments with a database consisting of continuous speech samples from 51 normal and 161 pathological talkers yielded a classification accuracy of 93.4%.
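
The keywords name matching pursuit as the adaptive decomposition. As a rough illustration of that general technique only, the sketch below runs a greedy matching pursuit over a small dictionary of Gabor atoms. It is not the authors' implementation: the dictionary grid, the function names (`gabor_atom`, `build_dictionary`, `matching_pursuit`), and the synthetic test signal are all assumptions made for this example, and the paper's feature definitions are not reproduced here.

```python
import math

def gabor_atom(n, center, scale, freq):
    """Unit-norm real Gabor atom: Gaussian window times a cosine (illustrative)."""
    atom = [
        math.exp(-math.pi * ((t - center) / scale) ** 2)
        * math.cos(2 * math.pi * freq * (t - center))
        for t in range(n)
    ]
    norm = math.sqrt(sum(a * a for a in atom))
    return [a / norm for a in atom]

def build_dictionary(n, scales, freqs, hop):
    """Enumerate atoms on a coarse (center, scale, freq) grid (assumed grid)."""
    params, atoms = [], []
    for s in scales:
        for f in freqs:
            for c in range(0, n, hop):
                params.append((c, s, f))
                atoms.append(gabor_atom(n, c, s, f))
    return params, atoms

def matching_pursuit(signal, params, atoms, n_iter):
    """Greedy decomposition: repeatedly subtract the best-correlated atom."""
    residual = list(signal)
    picks = []
    for _ in range(n_iter):
        # Inner product of the current residual with every atom.
        coeffs = [sum(r * a for r, a in zip(residual, atom)) for atom in atoms]
        k = max(range(len(atoms)), key=lambda i: abs(coeffs[i]))
        c = coeffs[k]
        # Remove the selected atom's contribution from the residual.
        residual = [r - c * a for r, a in zip(residual, atoms[k])]
        picks.append((c, params[k]))
    return picks, residual

# Synthetic demo signal built from two dictionary atoms (not speech data).
N = 128
params, atoms = build_dictionary(N, scales=[16, 64], freqs=[0.05, 0.1, 0.2], hop=16)
j = params.index((64, 16, 0.1))
k = params.index((0, 16, 0.2))
signal = [2.0 * a + 0.5 * b for a, b in zip(atoms[j], atoms[k])]

picks, residual = matching_pursuit(signal, params, atoms, n_iter=2)
for coeff, (center, scale, freq) in picks:
    print(f"coeff={coeff:.3f} center={center} scale={scale} freq={freq}")
```

Each selected atom comes with a coefficient and its (center, scale, frequency) parameters. Decomposition parameters of this kind are what the abstract says the features were extracted from. Because the atoms have unit norm, the signal energy equals the sum of squared coefficients plus the residual energy at every iteration.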
Pages: 421-430
Number of pages: 10