Discrimination of pathological voices using a time-frequency approach

被引:96
|
作者
Umapathy, K
Krishnan, S
Parsa, V
Jamieson, DG
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
[2] Univ Western Ontario, Natl Ctr Audiol, London, ON N6G 1H1, Canada
关键词
matching pursuit; pathological voice; pattern classification; speech disorders; time-frequency distributions;
D O I
10.1109/TBME.2004.842962
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Acoustical measures of vocal function are routinely used in the assessments of disordered voice, and for monitoring the patient's progress over the course of voice therapy. Typically, acoustic measures are extracted from sustained vowel stimuli where short-term and long-term perturbations in fundamental frequency and intensity, and the level of "glottal noise" are used to characterize the vocal function. However, acoustic measures extracted from continuous speech samples may well be required for accurate prediction of abnormal voice quality that is relevant to the client's "real world" experience. In contrast with sustained vowel research, there is relatively sparse literature on the effectiveness of acoustic measures extracted from continuous speech samples. This is partially due to the challenge of segmenting the speech signal into voiced, unvoiced, and silence periods before features can be extracted for vocal function characterization. In this paper we propose a joint time-frequency approach for classifying pathological voices using continuous speech signals that obviates the need for such segmentation. The speech signals were decomposed using an adaptive time-frequency transform algorithm, and several features such as the octave max, octave mean, energy ratio, length ratio, and frequency ratio were extracted from the decomposition parameters and analyzed using statistical pattern classification techniques. Experiments with a database consisting of continuous speech samples from 51 normal and 161 pathological talkers yielded a classification accuracy of 93.4%.
引用
下载
收藏
页码:421 / 430
页数:10
相关论文
共 50 条
  • [41] An Engineering Approach to Time-Frequency Uncertainty Criteria
    Udal, A.
    Kukk, V.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 117 (01) : 3 - 8
  • [42] On a time-frequency approach to translation on finite graphs
    Begue, Matthew
    2015 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2015, : 6 - 10
  • [43] A NEW APPROACH TO TIME-FREQUENCY SIGNAL DECOMPOSITION
    HLAWATSCH, F
    KRATTENTHALER, W
    1989 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-3, 1989, : 1248 - 1251
  • [44] Time-Frequency Approach for Stochastic Signal Detection
    Ghosh, Ripul
    Akula, Aparna
    Kumar, Satish
    Sardana, H. K.
    OPTICS: PHENOMENA, MATERIALS, DEVICES, AND CHARACTERIZATION: OPTICS 2011: INTERNATIONAL CONFERENCE ON LIGHT, 2011, 1391
  • [45] A time-frequency approach for newborn seizure detection
    Boashash, B
    Mesbah, M
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2001, 20 (05): : 54 - 64
  • [46] A NEW APPROACH FOR THE REASSIGNMENT OF TIME-FREQUENCY REPRESENTATIONS
    Sejdic, Ervin
    Ozertem, Umut
    Djurovic, Igor
    Erdogmus, Deniz
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 2997 - +
  • [47] Discrimination of delay-fired mine blasts in Wyoming using an automatic time-frequency discriminant
    Arrowsmith, Stephen J.
    Arrowsmith, Marie D.
    Hedlin, Michael A. H.
    Stump, Brian
    BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2006, 96 (06) : 2368 - 2382
  • [48] Instantaneous frequency estimation by using time-frequency distributions
    Ivanovic, V
    Dakovic, M
    Djurovic, I
    Stankovic, L
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3521 - 3524
  • [49] An Approach to Intelligent Fault Diagnosis of Cryocooler Using Time-Frequency Image and CNN
    Gao, Sheng
    Jiang, Zhenhua
    Liu, Shaoshuai
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] A fractal approach of pathological voices
    Péan, V
    Ouayoun, M
    Fugain, C
    Meyer, B
    Chouard, CH
    FRACTALS IN BIOLOGY AND MEDICINE, VOL III, 2002, : 147 - 151