Multi-candidate missing data imputation for robust speech recognition

被引:0
|
作者
Yujun Wang
Hugo Van hamme
机构
[1] Katholieke Universiteit Leuven,Department of ESAT
关键词
speech recognition; constrained optimization; missing data; noise robustness;
D O I
暂无
中图分类号
学科分类号
摘要
The application of Missing Data Techniques (MDT) to increase the noise robustness of HMM/GMM-based large vocabulary speech recognizers is hampered by a large computational burden. The likelihood evaluations imply solving many constrained least squares (CLSQ) optimization problems. As an alternative, researchers have proposed frontend MDT or have made oversimplifying independence assumptions for the backend acoustic model. In this article, we propose a fast Multi-Candidate (MC) approach that solves the per-Gaussian CLSQ problems approximately by selecting the best from a small set of candidate solutions, which are generated as the MDT solutions on a reduced set of cluster Gaussians. Experiments show that the MC MDT runs equally fast as the uncompensated recognizer while achieving the accuracy of the full backend optimization approach. The experiments also show that exploiting the more accurate acoustic model of the backend does pay off in terms of accuracy when compared to frontend MDT.
引用
收藏
相关论文
共 50 条
  • [1] Multi-candidate missing data imputation for robust speech recognition
    Wang, Yujun
    Van Hamme, Hugo
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [2] Robust Recognition of Noisy Speech Through Partial Imputation of Missing Data
    Kafoori, Kian Ebrahim
    Ahadi, Seyed Mohammad
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (04) : 1625 - 1648
  • [3] Robust Recognition of Noisy Speech Through Partial Imputation of Missing Data
    Kian Ebrahim Kafoori
    Seyed Mohammad Ahadi
    [J]. Circuits, Systems, and Signal Processing, 2018, 37 : 1625 - 1648
  • [4] Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition
    Gemmeke, Jort Florent
    Van Hamme, Hugo
    Cranen, Bert
    Boves, Lou
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (02) : 272 - 287
  • [5] Missing data techniques for robust speech recognition
    Cooke, M
    Morris, A
    Green, P
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 863 - 866
  • [6] MULTI-CANDIDATE EQUILIBRIA
    WITTMAN, D
    [J]. PUBLIC CHOICE, 1984, 43 (03) : 287 - 291
  • [7] Robust automatic speech recognition with missing and unreliable acoustic data
    Cooke, M
    Green, P
    Josifovski, L
    Vizinho, A
    [J]. SPEECH COMMUNICATION, 2001, 34 (03) : 267 - 285
  • [8] Bounded cepstral marginalization of missing data for robust speech recognition
    Kafoori, Kian Ebrahim
    Ahadi, Seyed Mohammad
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 1 - 23
  • [9] Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment
    Keronen, Sami
    Kallasjoki, Heikki
    Remes, Ulpu
    Brown, Guy J.
    Gemmeke, Jort F.
    Palomaki, Kalle J.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 798 - 819
  • [10] A WAVELET-BASED DATA IMPUTATION APPROACH TO SPECTROGRAM RECONSTRUCTION FOR ROBUST SPEECH RECOGNITION
    Badiezadegan, Shirin
    Rose, Richard C.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4780 - 4783