Missing data techniques for robust speech recognition

被引:0
|
作者
Cooke, M
Morris, A
Green, P
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In noisy listening conditions, the information available on which to base speech recognition decisions is necessarily incomplete: some spectro-temporal regions are dominated by other sources. We report on the application of a variety of techniques for missing data in speech recognition. These techniques may be based on marginal distributions or on reconstruction of missing parts of the spectrum. Application of these ideas in the Resource Management task shows performance which is robust to random removal of up to 80% of the frequency channels, but falls off rapidly with deletions which more realistically simulate masked speech. We report on a vowel classification experiment designed to isolate some of the RM problems for more detailed exploration. The results of this experiment confirm the general superiority of marginals-based schemes, demonstrate the viability of shared covariance statistics, and suggest several ways in which performance improvements on the larger task may be obtained.
引用
收藏
页码:863 / 866
页数:4
相关论文
共 50 条
  • [41] Speech enhancement with missing data techniques using recurrent neural networks
    Parveen, S
    Green, P
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 733 - 736
  • [42] A combination of discriminative and Maximum Likelihood techniques for noise robust speech recognition
    Laurila, K
    Vasilache, M
    Viikki, O
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 85 - 88
  • [43] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [44] Correction of likelihoods for degrees of freedom in robust speech recognition using missing feature theory
    Van hamme, H
    [J]. SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 401 - 404
  • [45] Using phoneme segmentation in, conjunction with missing feature approaches for noise robust speech recognition
    Mohammadi, Arash
    Almasganj, Farshad
    Taherkhani, Aboozar
    Naderkhani, Farnoosh
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 447 - 451
  • [46] Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition
    Ma, Ning
    Barker, Jon
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2637 - 2640
  • [47] A Simulated-Data Adaptation Technique for Robust Speech Recognition
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchai, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 777 - +
  • [48] Robust speech recognition with selective input data to a NN classifier
    Cong, L
    Asghar, S
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1817 - 1824
  • [49] Robust Speech Recognition in the presence of noise using medical data
    Athanaselis, Theologos
    Bakamidis, Stelios
    Giannopoulos, George
    Dologlou, Ioannis
    Fotinea, Evita
    [J]. 2008 IEEE INTERNATIONAL WORKSHOP ON IMAGING SYSTEMS AND TECHNIQUES, 2008, : 347 - 350
  • [50] Matching training and test data distributions for robust speech recognition
    Molau, S
    Keysers, D
    Ney, H
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 579 - 601