Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition

Cited by: 0
Authors
Pärssinen, K [1]
Salmela, P [1]
Harju, M [1]
Kiss, I [1]
Affiliations
[1] Tampere Univ Technol, Inst Digital & Comp Syst, FIN-33101 Tampere, Finland
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper, two techniques for Jacobian adaptation (JA) in the presence of additive noise are investigated. Since the original concept of JA was presented only for static cepstral coefficients, the performance of JA is first studied when it is extended to cover the delta cepstrum as well. However, neither this extension nor the original concept can provide accurate recognition performance when the mismatch between the training and recognition environments is outside the linear range of JA. This problem can be alleviated to some extent by dividing JA into two steps. First, the adaptation is done, e.g., from clean to a target environment with a "high" SNR level. After that, new JA matrices are calculated and used in the second step to adapt the system to the lower target SNR level. Both adaptation methods were compared to cepstral mean normalization (CMN) and parallel model combination (PMC) in an isolated word recognition task with a vocabulary of 200 English words. The best performance was achieved with PMC, but JA showed performance comparable to CMN and outperformed it when JA was done in two steps from an SNR of 25 dB to 5 dB. The system was tested with the SpeechDat(II) database by adding noise recorded inside a car to the test set utterances at various SNR levels.
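The two-step idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a simplified log-spectral domain, where the Jacobian of the noisy log-spectrum with respect to the noise log-spectrum is diagonal, N / (S + N). The paper works with cepstral coefficients, where this diagonal term would be wrapped between the DCT and its inverse. All function names and the speech and noise levels (natural-log values, not dB) are hypothetical and chosen only for illustration.

```python
import math


def noisy_log_spectrum(speech, noise):
    """Exact log-spectrum of speech plus additive noise, per frequency bin."""
    return [math.log(math.exp(s) + math.exp(n)) for s, n in zip(speech, noise)]


def jacobian_diag(noisy, noise):
    """Diagonal of d(noisy)/d(noise), i.e. N / (S + N) = exp(noise - noisy)."""
    return [math.exp(n - y) for y, n in zip(noisy, noise)]


def adapt(noisy_ref, noise_ref, noise_new):
    """First-order update: y_new ~ y_ref + J * (n_new - n_ref)."""
    jac = jacobian_diag(noisy_ref, noise_ref)
    return [
        y + j * (n_new - n_old)
        for y, j, n_old, n_new in zip(noisy_ref, jac, noise_ref, noise_new)
    ]


def max_abs_error(estimate, target):
    """Largest per-bin absolute difference between two vectors."""
    return max(abs(e - t) for e, t in zip(estimate, target))


# Hypothetical three-bin log-spectra (illustrative values only).
speech = [0.0, 1.0, 2.0]
quiet = [-5.0, -5.0, -5.0]   # near-clean reference environment
mid = [-2.0, -2.0, -2.0]     # intermediate, "high" SNR environment
loud = [1.0, 1.0, 1.0]       # low-SNR target environment

reference = noisy_log_spectrum(speech, quiet)
exact = noisy_log_spectrum(speech, loud)

# One step: linearize once around the near-clean reference.
one_step = adapt(reference, quiet, loud)

# Two steps: adapt to the intermediate level, recompute the Jacobian
# from the adapted values, then adapt to the target level.
intermediate = adapt(reference, quiet, mid)
two_step = adapt(intermediate, mid, loud)

print("one-step max error:", max_abs_error(one_step, exact))
print("two-step max error:", max_abs_error(two_step, exact))
```

On these toy values the two-step estimate lands closer to the exact noisy log-spectrum than the single-step one, because the second linearization is taken nearer to the target noise level. This only illustrates the linear-range argument and says nothing about the recognition results reported in the paper.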
Pages: 193-196
Page count: 4