ROBUST SPEECH RECOGNITION USING MULTIVARIATE COPULA MODELS

被引:0
|
作者
Bayestehtashk, Alireza [1 ]
Shafran, Izhak [2 ]
Babaeian, Amir [3 ]
机构
[1] Oregon Hlth & Sci Univ, Portland, OR 97201 USA
[2] Google Inc, Mountain View, CA USA
[3] Univ Calif San Diego, La Jolla, CA 92093 USA
关键词
Copula model; Robust speech recognition; Deep neural network; Aurora; 4;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we continue our investigation into copula models for real-valued multivariate features with the goal of compensating for the mismatch in the training and the testing conditions. Previously, we reported results on UCI classification tasks where our method consistently outperformed other competing classifiers [1]. Here, we extend this work from classification to recognition and elaborate further on the mathematical properties of our models in the form of lemmas. We report results on the Aurora 4 automatic speech recognition (ASR) task which contains utterances with wide range of background noise that are not well represented in the training data. Our results show that the proposed copula-based models improve the accuracy by about 7% (11.6 vs 12.4) over a comparable baseline.
引用
收藏
页码:5890 / 5894
页数:5
相关论文
共 50 条
  • [21] Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition
    Wang, Kuan-Chen
    Li, You-Jin
    Chen, Wei-Lun
    Chen, Yu-Wen
    Wang, Yi-Ching
    Yeh, Ping-Cheng
    Zhang, Chao
    Tsao, Yu
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 426 - 430
  • [22] Structured Log Linear Models for Noise Robust Speech Recognition
    Zhang, Shi-Xiong
    Ragni, Anton
    Gales, Mark John Francis
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (11) : 945 - 948
  • [23] AN INVESTIGATION OF END-TO-END MODELS FOR ROBUST SPEECH RECOGNITION
    Prasad, Archiki
    Jyothi, Preethi
    Velmurugan, Rajbabu
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6893 - 6897
  • [24] STRUCTURED DISCRIMINATIVE MODELS FOR NOISE ROBUST CONTINUOUS SPEECH RECOGNITION
    Ragni, A.
    Gales, M. J. F.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4788 - 4791
  • [25] A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1158 - 1169
  • [26] Comments on: Inference in multivariate Archimedean copula models
    Johan Segers
    TEST, 2011, 20
  • [27] A copula formulation for multivariate latent Markov models
    Russo, Alfonso
    Farcomeni, Alessio
    TEST, 2024, 33 (03) : 731 - 751
  • [28] Comments on: Inference in multivariate Archimedean copula models
    Valdez, Emiliano A.
    TEST, 2011, 20 (02) : 257 - 262
  • [29] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [30] Efficient estimation of semiparametric multivariate copula models
    Chen, Xiaohong
    Fan, Yanqin
    Tsyrennikov, Viktor
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (475) : 1228 - 1240