New Methods for Template Selection and Compression in Continuous Speech Recognition

Cited by: 0

Authors:

Sun, Xie [1]

Zhao, Yunxin [1]

Affiliations:

[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA

Source:

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011

Keywords:

template matching; template selection; KL divergence; DTW;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a maximum likelihood method for selecting template representatives. To include more information in the selected representatives, we further propose creating compressed template representatives with a Gaussian mixture model (GMM) merging algorithm. A Kullback-Leibler (KL) divergence based local distance is proposed for Dynamic Time Warping (DTW) in template matching. Experimental results on TIMIT phone recognition and large vocabulary continuous speech recognition tasks demonstrate that the proposed template selection method significantly improves recognition accuracy over the HMM baseline while selecting only 5% or 10% of the total templates, and that the template compression method provides further accuracy gains over the template selection method.
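
The idea of a KL divergence based local distance inside DTW can be illustrated with a minimal sketch. The abstract does not give the exact form of the local distance, so the code below makes several assumptions that are not taken from the paper: each frame is represented by a single diagonal-covariance Gaussian (a mean list and a variance list), the local distance is the symmetric KL divergence between two such Gaussians, and the DTW uses the basic three-way step pattern. The function names `sym_kl_diag_gauss` and `dtw_cost` are hypothetical.

```python
import math


def sym_kl_diag_gauss(mu_p, var_p, mu_q, var_q):
    """Symmetric KL divergence KL(p||q) + KL(q||p) between two
    diagonal-covariance Gaussians (assumed frame representation)."""
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        diff2 = (mp - mq) ** 2
        # Closed-form KL divergence per dimension, in both directions.
        kl_pq = 0.5 * (math.log(vq / vp) + (vp + diff2) / vq - 1.0)
        kl_qp = 0.5 * (math.log(vp / vq) + (vq + diff2) / vp - 1.0)
        total += kl_pq + kl_qp
    return total


def dtw_cost(seq_a, seq_b, local_dist):
    """Accumulated DTW cost between two frame sequences using the
    basic step pattern (insertion, deletion, match)."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = local_dist(seq_a[i - 1], seq_b[j - 1])
            acc[i][j] = d + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]


def frame_dist(x, y):
    """Local distance between two frames given as (mean, variance) pairs."""
    return sym_kl_diag_gauss(x[0], x[1], y[0], y[1])


# Toy one-dimensional example: the second sequence repeats its first frame.
template = [([0.0], [1.0]), ([1.0], [1.0])]
test_seq = [([0.0], [1.0]), ([0.0], [1.0]), ([1.0], [1.0])]
cost = dtw_cost(template, test_seq, frame_dist)
```

In this toy case the warping path absorbs the repeated frame, so the two sequences align at no cost; a real system would presumably derive the per-frame distributions from trained acoustic models rather than hand-written values.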


Pages: 992-995

Page count: 4

50 items in total

[21] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
Sun, Xie
Zhao, Yunxin
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
[22] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
Xie Sun
Yunxin Zhao
EURASIP Journal on Audio, Speech, and Music Processing, 2014
[23] SPEECH RECOGNITION MODEL COMPRESSION
Sakthi, Madhumitha
Tewfik, Ahmed
Pawate, Raj
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7869 - 7873
[24] Influence of features extraction methods in performance of continuous speech recognition for Romanian
Dumitru, C. O.
Gavat, Inge
2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 40 - 43
[25] Comparing Speaker Adaptation Methods for Visual Speech Recognition for Continuous Spanish
Gimeno-Gomez, David
Martinez-Hinarejos, Carlos-D.
APPLIED SCIENCES-BASEL, 2023, 13 (11):
[26] Feature Selection Filtering Methods for Emotion Recognition in Chinese Speech Signal
Zhang, Shiqing
Zhao, Zhijin
ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 1700 - +
[27] Low bit rate compression methods of feature vectors for distributed speech recognition
Enrique Garcia, Jose
Ortega, Alfonso
Miguel, Antonio
Lleida, Eduardo
SPEECH COMMUNICATION, 2014, 58 : 111 - 123
[28] Use of Gaussian Selection in large vocabulary continuous speech recognition using HMMS
Knill, KM
Gales, MJF
Young, SJ
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 470 - 473
[29] Speech recognition methods for speech theraphy
Türk, O
Arslan, LM
PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 410 - 413
[30] Influence of Emotional Speech on Continuous Speech Recognition
Zgank, Andrej
Maucec, Mirjam Sepesy
13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
