New Methods for Template Selection and Compression in Continuous Speech Recognition

被引:0
|
作者
Sun, Xie [1 ]
Zhao, Yunxin [1 ]
机构
[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
关键词
template matching; template selection; KL divergence; DTW;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a maximum likelihood method for selecting template representatives, and in order to include more information in the selected template representatives, we further propose to create compressed template representatives by Gaussian mixture model (GMM) merging algorithm. A Kullback-Leibler (KL) divergence based local distance is proposed for Dynamic Time Warping (DTW) in template matching. Experimental results on the tasks of TIMIT phone recognition and large vocabulary continuous speech recognition demonstrated that the proposed template selection method significantly improved the recognition accuracy over the HMM baseline while only 5% or 10% templates were selected from the total templates, and the template compression method has provided further recognition accuracy gains over the template selection method.
引用
收藏
页码:992 / 995
页数:4
相关论文
共 50 条
  • [21] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Sun, Xie
    Zhao, Yunxin
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [22] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Xie Sun
    Yunxin Zhao
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [23] SPEECH RECOGNITION MODEL COMPRESSION
    Sakthi, Madhumitha
    Tewfik, Ahmed
    Pawate, Raj
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7869 - 7873
  • [24] Influence of features extraction methods in performance of continuous speech recognition for Romanian
    Dumitru, C. O.
    Gavat, Inge
    2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 40 - 43
  • [25] Comparing Speaker Adaptation Methods for Visual Speech Recognition for Continuous Spanish
    Gimeno-Gomez, David
    Martinez-Hinarejos, Carlos-D.
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [26] Feature Selection Filtering Methods for Emotion Recognition in Chinese Speech Signal
    Zhang, Shiqing
    Zhao, Zhijin
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 1700 - +
  • [27] Low bit rate compression methods of feature vectors for distributed speech recognition
    Enrique Garcia, Jose
    Ortega, Alfonso
    Miguel, Antonio
    Lleida, Eduardo
    SPEECH COMMUNICATION, 2014, 58 : 111 - 123
  • [28] Use of Gaussian Selection in large vocabulary continuous speech recognition using HMMS
    Knill, KM
    Gales, MJF
    Young, SJ
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 470 - 473
  • [29] Speech recognition methods for speech theraphy
    Türk, O
    Arslan, LM
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 410 - 413
  • [30] Influence of Emotional Speech on Continuous Speech Recognition
    Zgank, Andrej
    Maucec, Mirjam Sepesy
    13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,