Multi-candidate missing data imputation for robust speech recognition

被引:0
|
作者
Yujun Wang
Hugo Van hamme
机构
[1] Katholieke Universiteit Leuven,Department of ESAT
关键词
speech recognition; constrained optimization; missing data; noise robustness;
D O I
暂无
中图分类号
学科分类号
摘要
The application of Missing Data Techniques (MDT) to increase the noise robustness of HMM/GMM-based large vocabulary speech recognizers is hampered by a large computational burden. The likelihood evaluations imply solving many constrained least squares (CLSQ) optimization problems. As an alternative, researchers have proposed frontend MDT or have made oversimplifying independence assumptions for the backend acoustic model. In this article, we propose a fast Multi-Candidate (MC) approach that solves the per-Gaussian CLSQ problems approximately by selecting the best from a small set of candidate solutions, which are generated as the MDT solutions on a reduced set of cluster Gaussians. Experiments show that the MC MDT runs equally fast as the uncompensated recognizer while achieving the accuracy of the full backend optimization approach. The experiments also show that exploiting the more accurate acoustic model of the backend does pay off in terms of accuracy when compared to frontend MDT.
引用
收藏
相关论文
共 50 条
  • [21] Flexible and Robust Method for Missing Loop Detector Data Imputation
    Henrickson, Kristian
    Zou, Yajie
    Wang, Yinhai
    [J]. TRANSPORTATION RESEARCH RECORD, 2015, (2527) : 29 - 36
  • [22] Iterative Robust Semi-Supervised Missing Data Imputation
    Fazakis, Nikos
    Kostopoulos, Georgios
    Kotsiantis, Sotiris
    Mporas, Iosif
    [J]. IEEE ACCESS, 2020, 8 : 90555 - 90569
  • [23] DOUBLY ROBUST NONPARAMETRIC MULTIPLE IMPUTATION FOR IGNORABLE MISSING DATA
    Long, Qi
    Hsu, Chiu-Hsieh
    Li, Yisheng
    [J]. STATISTICA SINICA, 2012, 22 (01) : 149 - 172
  • [24] Imputation of missing values in multi-view data
    van Loon, Wouter
    Fokkema, Marjolein
    de Vos, Frank
    Koini, Marisa
    Schmidt, Reinhold
    de Rooij, Mark
    [J]. Information Fusion, 2024, 111
  • [25] Missing data techniques using voicing probability dor robust automatic speech recognition
    Kim, LY
    Cho, HY
    Oh, YH
    [J]. ELECTRONICS LETTERS, 2001, 37 (11) : 723 - 724
  • [26] Robust speech recognition using cepstral domain Missing Data Techniques and noisy masks
    Van Hamme, H
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 213 - 216
  • [27] Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments
    Kuehne, Marco
    Togneri, Roberto
    Nordholm, Sven
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 976 - +
  • [28] Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4393 - 4396
  • [29] Multi-Candidate Voting Model Based on Blockchain
    Xu, Dongliang
    Shi, Wei
    Zhai, Wensheng
    Tian, Zhihong
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (12) : 1891 - 1900
  • [30] A multi-stage multi-candidate algorithm for motion estimation
    Liao, TC
    Phoong, SM
    Lin, YP
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1613 - 1616