Study of Time and Frequency Variability in Pathological Speech and Error Reduction Methods for Automatic Speech Recognition

Cited by: 0

Authors:

Saz, Oscar [1]

Miguel, Antonio [1]

Lleida, Eduardo [1]

Ortega, Alfonso [1]

Buera, Luis [1]

Affiliation:

[1] Univ Zaragoza, Aragon Inst Technol I3A, GTC, E-50009 Zaragoza, Spain

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

speech analysis; pathological speech; automatic speech recognition; local warping;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this work, we study the variations in the time and frequency domains within a Spanish-language corpus of speakers with non-pathological and pathological speech. We show that pathological speech has greater variability in word duration than non-pathological speech, while in the frequency domain vowel confusability increases by 18%. Baseline experiments in Automatic Speech Recognition (ASR) with this corpus demonstrate that this variability degrades the performance of ASR systems. To reduce the impact of time and frequency variability, we use a recent Vocal Tract Length Normalization (VTLN) system, MATE (augMented stAte space acousTic modEl), to improve the performance of ASR systems when dealing with speakers who have any kind of speech pathology. Experiments with MATE show word error rate (WER) reductions of 17.04% and 11.19% using frequency and time MATE, respectively.

Pages: 993-996

Page count: 4
