A Log-Index Weighted Cepstral Distance Measure for Speech Recognition

Cited by: 1
Authors
郑方
吴文虎
方棣棠
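Affiliations
Keywords
Log-index weighted cepstral distance measure; speech recognition
DOI
Not available
Chinese Library Classification number
TN912 [Electroacoustic technology and speech signal processing]
Discipline classification code
081002
Abstract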
A log-index weighted cepstral distance measure is proposed and tested in speaker-independent and speaker-dependent isolated word recognition systems using statistical techniques. In this measure, the weight applied to each cepstral coefficient equals the logarithm of that coefficient's index. Experimental results on three speech databases show that this measure outperforms the other weighted Euclidean cepstral distance measures tested. The error rate obtained with this measure averages about 1.8% across the three databases, a 25% reduction relative to the other measures and a 40% reduction relative to the Log Likelihood Ratio (LLR) measure. The results also show that this distance measure works well in both speaker-dependent and speaker-independent speech recognition systems.
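The weighting described in the abstract can be illustrated with a minimal Python sketch. The abstract states only that each weight equals the logarithm of the corresponding index; the natural logarithm, the 1-based indexing, the squared-difference form of the weighted Euclidean distance, and the function names `log_index_weights` and `weighted_cepstral_distance` are assumptions made here for illustration and may differ from the paper's exact formulation.

```python
import math


def log_index_weights(order):
    """Return weights w_k = ln(k) for cepstral indices k = 1..order.

    Assumption: indices are 1-based and the logarithm is natural.
    """
    return [math.log(k) for k in range(1, order + 1)]


def weighted_cepstral_distance(c_ref, c_test, weights):
    """Weighted squared Euclidean distance between two cepstral vectors.

    Assumption: d = sum_k w_k * (c_ref[k] - c_test[k]) ** 2.
    """
    if not (len(c_ref) == len(c_test) == len(weights)):
        raise ValueError("cepstral vectors and weights must have equal length")
    return sum(w * (a - b) ** 2 for w, a, b in zip(weights, c_ref, c_test))


# Toy cepstral vectors (made-up values, not data from the paper).
ref = [1.0, 0.5, -0.2, 0.1]
test = [0.0, 0.3, -0.1, 0.4]

w = log_index_weights(len(ref))
d = weighted_cepstral_distance(ref, test, w)
print(d)
```

Under these assumptions the first coefficient receives zero weight, since ln(1) = 0, and higher-order coefficients are emphasized progressively but slowly.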
Pages: 177-184
Number of pages: 8