Study of Time and Frequency Variability in Pathological Speech and Error Reduction Methods for Automatic Speech Recognition

Cited by: 0
Authors
Saz, Oscar [1]
Miguel, Antonio [1]
Lleida, Eduardo [1]
Ortega, Alfonso [1]
Buera, Luis [1]
Affiliation
[1] Univ Zaragoza, Aragon Inst Technol I3A, GTC, E-50009 Zaragoza, Spain
Keywords
speech analysis; pathological speech; automatic speech recognition; local warping
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we study variations in the time and frequency domains within a Spanish-language corpus of speakers with non-pathological and pathological speech. We show that pathological speech has greater variability in word duration than non-pathological speech, while in the frequency domain vowel confusability increases by 18%. Baseline Automatic Speech Recognition (ASR) experiments with this corpus demonstrate that this variability degrades the performance of ASR systems. To reduce the impact of time and frequency variability, we use a recent Vocal Tract Length Normalization (VTLN) system, MATE (augMented stAte space acousTic modEl), to improve ASR performance for speakers with any kind of speech pathology. Experiments with MATE show word error rate (WER) reductions of 17.04% and 11.19% using frequency and time MATE, respectively.
Pages: 993-996
Page count: 4