Robustness Analysis of Automatic Speech Signal Recognition System Against Factors Degrading Speech Signal

Cited by: 0
Authors
Oska, Jaroslaw [1]
Wojtun, Jaroslaw [1]
Wodecki, Krzysztof [1]
Piotrowski, Zbigniew [1]
Affiliations
[1] Mil Univ Technol, Fac Elect, Warsaw, Poland
Keywords
speech recognition; degrading factors; LPCC; MFCC; GMM-UBM; G.711; G.723.1; iLBC
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This article presents the results of research on how the lossy compression used in the G.711, G.723.1 and iLBC codecs affects the effectiveness of isolated speech phrase recognition. The study compares the robustness against degrading factors of two audio signal parameterisation methods: LPCC (Linear Prediction Cepstral Coefficients) and MFCC (Mel Frequency Cepstral Coefficients). Recognition is based on an improved Gaussian mixture classifier that incorporates a Universal Background Model, GMM-UBM (Gaussian Mixture Model - Universal Background Model). The experiments were conducted on a database of 3000 isolated speech phrases.
Pages: 71-75
Number of pages: 5
Related papers
50 in total
  • [1] Automatic emotion recognition by the speech signal
    Schuller, B
    Lang, M
    Rigoll, G
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 367 - 372
  • [2] A STUDY ON BIAS-BASED SPEECH SIGNAL CONDITIONING TECHNIQUES FOR IMPROVING THE ROBUSTNESS OF AUTOMATIC SPEECH RECOGNITION
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    [J]. 2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 366 - +
  • [3] Automatic Emotion Recognition of Speech Signal in Mandarin
    Zhang, Sheng
    Ching, P. C.
    Kong, Fanrang
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1810 - +
  • [4] COMPARISON OF SEVERAL SPEECH SIGNAL FEATURE PARAMETERS FOR AUTOMATIC SPEECH RECOGNITION
    PARTALO, M
    SIJERCIC, Z
    [J]. SPEECH COMMUNICATION, 1989, 8 (04) : 347 - 353
  • [5] IMPROVING ROBUSTNESS AGAINST REVERBERATION FOR AUTOMATIC SPEECH RECOGNITION
    Mitra, Vikramjit
    Van Hout, Julien
    Wang, Wen
    Graciarena, Martin
    McLaren, Mitchell
    Franco, Horacio
    Vergyri, Dimitra
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 525 - 532
  • [6] Automatic speech recognition using acoustic doppler signal
    Lee, Ki-Seung
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (01): : 74 - 82
  • [7] Robustness of linear discriminant analysis in automatic speech recognition
    Katz, M
    Meier, HG
    Dolfing, H
    Klakow, D
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 371 - 374
  • [8] Automatic Lip Synchronization by Speech Signal Analysis
    Zoric, Goranka
    Cerekovic, Aleksandra
    Pandzic, Igor S.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2323 - 2323
  • [9] Geometry Analysis and Recognition Research of Speech Signal
    Wan Xianbao
    Xu Chunyan
    Chen Yong
    Pan Xiaoxia
    Wang Shoujue
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1219 - 1222
  • [10] Analysis of speech signal recognition with correlated interferences
    Zhuk, SY
    Machnev, AM
    [J]. IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 2000, 43 (1-2): : A48 - A54