Missing data techniques for robust speech recognition

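Cited by: 0
Authors
Cooke, M
Morris, A
Green, P
Affiliations
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract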
In noisy listening conditions, the information available on which to base speech recognition decisions is necessarily incomplete: some spectro-temporal regions are dominated by other sources. We report on the application of a variety of techniques for missing data in speech recognition. These techniques may be based on marginal distributions or on reconstruction of missing parts of the spectrum. Application of these ideas in the Resource Management task shows performance which is robust to random removal of up to 80% of the frequency channels, but falls off rapidly with deletions which more realistically simulate masked speech. We report on a vowel classification experiment designed to isolate some of the RM problems for more detailed exploration. The results of this experiment confirm the general superiority of marginals-based schemes, demonstrate the viability of shared covariance statistics, and suggest several ways in which performance improvements on the larger task may be obtained.
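The two families of techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each class is modelled by a single diagonal-covariance Gaussian over frequency channels (the abstract does not specify the model form), and the names `marginal_log_likelihood`, `impute_missing`, `classify`, and the `reliable` mask are hypothetical. Under that assumption, marginalising over missing channels reduces to leaving them out of the likelihood, and the simplest reconstruction replaces each missing channel with the class mean.

```python
import math

def marginal_log_likelihood(x, reliable, mean, var):
    """Log-likelihood of the reliable channels only.

    Assumes a diagonal Gaussian, so integrating out the missing
    channels is the same as dropping their terms from the sum.
    """
    total = 0.0
    for xi, ok, mu, v in zip(x, reliable, mean, var):
        if ok:
            total += -0.5 * (math.log(2.0 * math.pi * v) + (xi - mu) ** 2 / v)
    return total

def impute_missing(x, reliable, mean):
    """Reconstruction alternative: fill missing channels with the class mean.

    With a diagonal covariance the conditional mean of a missing
    channel equals its unconditional mean.
    """
    return [xi if ok else mu for xi, ok, mu in zip(x, reliable, mean)]

def classify(x, reliable, models):
    """Pick the class whose marginal likelihood is highest.

    `models` maps a class label to a (mean, var) pair of per-channel lists.
    """
    return max(
        models,
        key=lambda name: marginal_log_likelihood(x, reliable, *models[name]),
    )

# Hypothetical two-class example with three channels; channel 1 is masked.
models = {
    "a": ([0.0, 0.0, 0.0], [1.0, 1.0, 1.0]),
    "b": ([3.0, 3.0, 3.0], [1.0, 1.0, 1.0]),
}
frame = [0.1, 100.0, -0.2]
mask = [True, False, True]
label = classify(frame, mask, models)
```

In this toy case the masked channel carries a value dominated by another source, and excluding it lets the two reliable channels decide the class. A model with full or shared covariance, as mentioned in the abstract, would need a proper conditional or marginal computation rather than simply dropping terms.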
Pages: 863-866
Page count: 4