Robustness Analysis of Automatic Speech Signal Recognition System Against Factors Degrading Speech Signal

被引:0
|
作者
Oska, Jaroslaw [1 ]
Wojtun, Jaroslaw [1 ]
Wodecki, Krzysztof [1 ]
Piotrowski, Zbigniew [1 ]
机构
[1] Mil Univ Technol, Fac Elect, Warsaw, Poland
关键词
speech recognition; degrading factors; LPCC; MFCC; GMM-UBM; G.711; G.723.1; iLBC;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the article there are presented the results of research on the influence of the lossy compression, used in codecs G.711, G.723.1 and iLBC, on the efficiency of isolated speech phrase recognition. In the research the degree of robustness against degrading factors in the parameterisation method of audio signal LPCC and MFCC (Linear Prediction Cepstral Coefficients, Mel Frequency Cepstral Coefficients) is compared. The research is based on the classifier of improved Gaussian mixtures making allowance for Universal Background Model GMM-UBM (Gaussian Mixtures Model - Universal Background Model). The research was conducted on the database composed of 3000 isolated speech phrases.
引用
收藏
页码:71 / 75
页数:5
相关论文
共 50 条
  • [21] Emotional feature analysis and recognition in multilingual speech signal
    School of Information Science and Engineering, University of Jinan, Jinan 250022, China
    [J]. ICEMI - Proc. Int. Conf. Electron. Meas. Instrum., 1600, (41046-41050):
  • [22] Recognition and Analysis of Emotion Transition in Mandarin Speech Signal
    Pao, Tsang-Long
    Yeh, Jun-Heng
    Tsai, Yao-Wei
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3326 - 3332
  • [23] A Study on Emotional Feature Analysis and Recognition in Speech Signal
    Cheng, Xin Min
    Cheng, Pei Ying
    Zhao, Li
    [J]. 2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL I, 2009, : 418 - 420
  • [24] Speech Signal Analysis and Pattern Recognition in Diagnosis of Dysarthria
    Thoppilu, Minu George
    Kumar, C. Santhosh
    Kumar, Anand
    Amose, John
    [J]. ANNALS OF INDIAN ACADEMY OF NEUROLOGY, 2017, 20 (04) : 352 - 357
  • [25] Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition
    Sainath, Tara N.
    Weiss, Ron J.
    Wilson, Kevin W.
    Li, Bo
    Narayanan, Arun
    Variani, Ehsan
    Bacchiani, Michiel
    Shafran, Izhak
    Senior, Andrew
    Chin, Kean
    Misra, Ananya
    Kim, Chanwoo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 965 - 979
  • [26] Channel identification and signal spectrum estimation for robust automatic speech recognition
    Zhao, YX
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1998, 5 (12) : 305 - 308
  • [27] Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
    Garner, Philip N.
    [J]. SPEECH COMMUNICATION, 2011, 53 (08) : 991 - 1001
  • [28] Automatic depression recognition by intelligent speech signal processing: A systematic survey
    Wu, Pingping
    Wang, Ruihao
    Lin, Han
    Zhang, Fanlong
    Tu, Juan
    Sun, Miao
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (03) : 701 - 711
  • [29] Automatic Speech Recognition System for Malay Speaking Children Automatic Speech Recognition system
    Rahman, Feisal Dani
    Mohamed, Noraini
    Mustafa, Mumtaz Begum
    Salim, Siti Salwah
    [J]. 2014 THIRD ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2014, : 79 - 82
  • [30] Prototype measurement system for spatial analysis of speech signal for speech therapy
    Kostera, Kinga
    Wieclawek, Wojciech
    Krecichwost, Michal
    [J]. INNOVATIONS IN BIOMEDICAL ENGINEERING, 2018, 623 : 79 - 86