Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引：0

作者：

Huang, Guangpu ^{[1
]}

Er, Meng Joo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore

来源：

2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年

关键词：

NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.

引用

页数：8

共 50 条

[21] Syllable-level desynchronisation of phonetic features for speech recognition
Kirchhoff, K
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2274 - 2276
[22] Spectral difference for statistical model-based speech enhancement in speech recognition
Lee, Soojeong
Chang, Joon-Hyuk
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (23) : 24917 - 24929
[23] Spectral difference for statistical model-based speech enhancement in speech recognition
Soojeong Lee
Joon-Hyuk Chang
Multimedia Tools and Applications, 2017, 76 : 24917 - 24929
[24] One-Model Speech Recognition and Synthesis Based on Articulatory Movement HMMs
Nitta, Tsuneo
Onoda, Takayuki
Kimura, Masashi
Iribe, Yurie
Katsurada, Kouichi
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2970 - +
[25] Hidden Markov model-based speech emotion recognition
Schuller, B
Rigoll, G
Lang, M
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 401 - 404
[26] Model-Based Feature Enhancement for Reverberant Speech Recognition
Krueger, Alexander
Haeb-Umbach, Reinhold
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1692 - 1707
[27] Model-based feature compensation for robust speech recognition
Shen, Haifeng
Li, Qunxia
Guo, Jun
Liu, Gang
FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
[28] Model-based feature enhancement for noisy speech recognition
Couvreur, C
Van hamme, H
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722
[29] Adaptive model-based technique for robust speech recognition
Graciarena, M
CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2000, : 1512 - 1516
[30] Model-based speaker normalization methods for speech recognition
Naito, M
Deng, L
Sagisaka, Y
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2003, 86 (02): : 45 - 56

← 1 2 3 4 5 →