Discrimination of pathological voices using a time-frequency approach

被引:96
|
作者
Umapathy, K
Krishnan, S
Parsa, V
Jamieson, DG
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
[2] Univ Western Ontario, Natl Ctr Audiol, London, ON N6G 1H1, Canada
关键词
matching pursuit; pathological voice; pattern classification; speech disorders; time-frequency distributions;
D O I
10.1109/TBME.2004.842962
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Acoustical measures of vocal function are routinely used in the assessments of disordered voice, and for monitoring the patient's progress over the course of voice therapy. Typically, acoustic measures are extracted from sustained vowel stimuli where short-term and long-term perturbations in fundamental frequency and intensity, and the level of "glottal noise" are used to characterize the vocal function. However, acoustic measures extracted from continuous speech samples may well be required for accurate prediction of abnormal voice quality that is relevant to the client's "real world" experience. In contrast with sustained vowel research, there is relatively sparse literature on the effectiveness of acoustic measures extracted from continuous speech samples. This is partially due to the challenge of segmenting the speech signal into voiced, unvoiced, and silence periods before features can be extracted for vocal function characterization. In this paper we propose a joint time-frequency approach for classifying pathological voices using continuous speech signals that obviates the need for such segmentation. The speech signals were decomposed using an adaptive time-frequency transform algorithm, and several features such as the octave max, octave mean, energy ratio, length ratio, and frequency ratio were extracted from the decomposition parameters and analyzed using statistical pattern classification techniques. Experiments with a database consisting of continuous speech samples from 51 normal and 161 pathological talkers yielded a classification accuracy of 93.4%.
引用
下载
收藏
页码:421 / 430
页数:10
相关论文
共 50 条
  • [31] Diagnostics and prognostics based on adaptive time-frequency feature discrimination
    Jae Hyuk Oh
    Chang Gu Kim
    Young Man Cho
    KSME International Journal, 2004, 18 : 1537 - 1548
  • [32] Enhancing Time-Frequency Concentration and Accuracy Using an Improved Time-Frequency Synchrosqueezing Transform
    Yang, Yaocheng
    Zhang, Jialiang
    Li, Yifan
    Ni, Qing
    Wang, Biao
    IEEE Sensors Journal, 2024, 24 (23) : 39334 - 39343
  • [33] Diagnostics and prognostics based on adaptive time-frequency feature discrimination
    Oh, JH
    Kim, CG
    Cho, YM
    KSME INTERNATIONAL JOURNAL, 2004, 18 (09): : 1537 - 1548
  • [34] Time-Frequency Features Extraction for Infant Directed Speech Discrimination
    Mahdhaoui, Ammar
    Chetouani, Mohamed
    Kessous, Loic
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 120 - +
  • [35] A Novel Time-Frequency Analysis Approach for Nonstaionary Time Series Using Multiresolution Wavelet
    Tan, Si-Rui
    Li, Yang
    Li, Ke
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 990 - 995
  • [36] Audio time-scale modification using a hybrid time-frequency domain approach
    Dorran, D
    Coyle, E
    Lawlor, R
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 279 - 282
  • [37] Neural approach to time-frequency signal decomposition
    Grabowski, D
    Walczak, J
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 1118 - 1123
  • [38] A time-frequency approach to the adjustable bandwidth concept
    Galleani, Lorenzo
    Cohen, Leon
    Noga, Andrew
    DIGITAL SIGNAL PROCESSING, 2006, 16 (05) : 454 - 467
  • [39] Testing Stationarity With Surrogates: A Time-Frequency Approach
    Borgnat, Pierre
    Flandrin, Patrick
    Honeine, Paul
    Richard, Cedric
    Xiao, Jun
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (07) : 3459 - 3470
  • [40] An Implementation Approach For Ideal Time-Frequency Distribution
    Zhang, Liming
    Qian, Tao
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 114 - 118