Alignment-based codeword-dependent cepstral normalization

Cited by: 1
Authors
Huerta, JM [1]
Affiliation
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
Source
Keywords
acoustic environment; acoustic modeling; CDCN; feature compensation; hidden Markov models; linear channel model; robust speech recognition;
DOI
10.1109/TSA.2002.804305
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper proposes the alignment-based codeword-dependent cepstral normalization algorithm (ACDC(N)), which aims to alleviate the acoustic mismatch that occurs when a speech recognizer faces environmental conditions not observed in the training data. ACDC(N) is based on the linear channel model of the environment originally proposed by Acero and on the CDCN solution to this model [1]. ACDC(N) replaces the codebook (a Gaussian mixture model, GMM) employed by CDCN with the state distributions of the recognizer's HMMs, under the assumption that these HMM distributions model the associated speech segments better than the general GMM distribution. The feature-frame to HMM-state association is obtained by aligning a first decoding-pass hypothesis. From this alignment, ACDC(N) estimates the environmental parameters (noise and channel vectors), which are then used to obtain an MMSE estimate of the clean speech vectors, in a way similar to [1]. In experiments on the Aurora-2 noisy digits database, ACDC(N) reduces the overall error rate by more than 30% in the 0 to 20 dB noise range.
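The abstract gives no formulas, so the following is only a rough sketch of the compensation step it describes, not the paper's implementation. It makes several simplifying assumptions: features are log-spectral rather than cepstral (which avoids the DCT), each frame is hard-assigned to one HMM state by the alignment, each state is summarized by a single mean vector, and the noise vector n and channel vector q have already been estimated. Under a linear channel model of the form z = x + q + log(1 + exp(n - q - x)), the clean frame is then approximated by subtracting q and a state-dependent correction term. The function names `correction` and `compensate` are hypothetical.

```python
import math


def correction(mu, n, q):
    """State-dependent correction r = log(1 + exp(n - q - mu)), per dimension.

    Assumption: log-spectral features, so the model is applied element-wise.
    """
    return [math.log1p(math.exp(nd - qd - md)) for md, nd, qd in zip(mu, n, q)]


def compensate(frames, alignment, state_means, n, q):
    """Approximate clean frames as x_hat = z - q - r(mu_state, n, q).

    frames:      list of noisy feature vectors z_t
    alignment:   list of state ids, one per frame (from a first decoding pass)
    state_means: dict mapping state id -> mean vector (assumed single Gaussian)
    n, q:        noise and channel vectors, assumed already estimated
    """
    cleaned = []
    for z, state in zip(frames, alignment):
        r = correction(state_means[state], n, q)
        cleaned.append([zd - qd - rd for zd, qd, rd in zip(z, q, r)])
    return cleaned
```

In this sketch the hard alignment plays the role that codeword posteriors play in CDCN: the aligned state alone selects the correction term for each frame. Estimating n and q from the alignment, which the abstract also describes, is left out.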
Pages: 451-459
Number of pages: 9