Log-index weighted cepstral distance measure for speech recognition

被引:0
|
作者
Zheng Fang [1 ]
Wu Wenhu [1 ]
Fang Ditang [1 ]
机构
[1] Tsinghua Univ, Beijing
关键词
Log-index weighted cepstral distance measure; speech recognition;
D O I
10.1007/BF02951337
中图分类号
学科分类号
摘要
A log-index weighted cepstral distance measure is proposed and tested in speaker-independent and speaker-dependent isolated word recognition systems using statistic techniques. The weights for the cepstral coefficients of this measure equal the logarithm of the corresponding indices. The experimental results show that this kind of measure works better than any other weighted Euclidean cepstral distance measures on three speech databases. The error rate obtained using this measure is about 1.8 percent for three databases on average, which is a 25% reduction from that obtained using other measures, and a 40% reduction from that obtained using Log Likelihood Ratio (LLR) measure. The experimental results also show that this kind of distance measure works well in both speaker-dependent and speaker-independent speech recognition systems.
引用
收藏
页码:177 / 184
页数:7
相关论文
共 50 条
  • [31] Cepstral gain normalization for noise robust speech recognition
    Yoshizawa, S
    Hayasaka, N
    Wada, N
    Miyanaga, Y
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 209 - 212
  • [32] A novel weighted likelihood measure for speech recognition under G-Force
    Lei, Z
    Han, JQ
    Wang, CF
    [J]. PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 692 - 695
  • [33] A new perceptually weighted distance measure for vector quantization of the STFT amplitudes in the speech application
    Doost, Roghayeh
    Sayadian, Abolghasem
    Shamsi, Hossein
    [J]. IEICE ELECTRONICS EXPRESS, 2009, 6 (12): : 824 - 830
  • [34] A Minimum Classification Error Based Distance Measure for Template Based Speech Recognition
    Matton, Mike
    Van Compernolle, Dirk
    Cools, Ronald
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2386 - 2389
  • [35] NMF-based Cepstral Features for Speech Emotion Recognition
    Lashkari, Milad
    Seyedin, Sanaz
    [J]. 2018 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2018, : 189 - 193
  • [36] Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition
    Hora, Baveet Singh
    Uthiraa, S.
    Patil, Hemant A.
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 116 - 129
  • [37] Whispered Speech Recognition Based on Gammatone Filterbank Cepstral Coefficients
    Markovic, B.
    Galic, J.
    Grozdic, D.
    Jovicic, S. T.
    Mijic, M.
    [J]. JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2017, 62 (11) : 1255 - 1261
  • [38] DELTA-SPECTRAL CEPSTRAL COEFFICIENTS FOR ROBUST SPEECH RECOGNITION
    Kumar, Kshitiz
    Kim, Chanwoo
    Stern, Richard M.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4784 - 4787
  • [39] Recognition of emotion from speech using evolutionary cepstral coefficients
    Bakhshi, Ali
    Chalup, Stephan
    Harimi, Ali
    Mirhassani, Seyed Mostafa
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35739 - 35759
  • [40] Bounded cepstral marginalization of missing data for robust speech recognition
    Kafoori, Kian Ebrahim
    Ahadi, Seyed Mohammad
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 1 - 23