Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引：0

作者：

Huang, Guangpu ^{[1
]}

Er, Meng Joo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore

来源：

2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年

关键词：

NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.

引用

页数：8

共 50 条

[41] ARTICULATORY TIMING IN SPEECH PRODUCTION . ARTICULATORY DISTINCTIVE FEATURES
COKER, CH
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1970, 47 (1P1): : 94 - +
[42] Unsupervised noise model estimation for model-based robust speech recognition
Graciarena, M
Franco, H
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 351 - 356
[43] Within and cross-corpus speech emotion recognition using latent topic model-based features
Mohit Shah
Chaitali Chakrabarti
Andreas Spanias
EURASIP Journal on Audio, Speech, and Music Processing, 2015
[44] Within and cross-corpus speech emotion recognition using latent topic model-based features
Shah, Mohit
Chakrabarti, Chaitali
Spanias, Andreas
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
[45] Improved Fault Recognition for Model-Based Diagnostic Systems
Koetter, Matthias
Pungs, Andreas
Wolkenar, Bernd
INTERNATIONALER MOTORENKONGRESS 2015: MIT NUTZFAHRZEUGMOTOREN - SPEZIAL, 2015, : 499 - 514
[46] SUBMODULAR DATA SELECTION WITH ACOUSTIC AND PHONETIC FEATURES FOR AUTOMATIC SPEECH RECOGNITION
Ni, Chongjia
Wang, Lei
Liu, Haibo
Leung, Cheung-Chi
Lu, Li
Ma, Bin
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4629 - 4633
[47] Multi-resolution phonetic/segmental features and models for HMM-based speech recognition
Vaseghi, S
Harte, N
Milner, B
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1263 - 1266
[48] Predictive model-based compensation schemes for robust speech recognition
Gales, MJF
SPEECH COMMUNICATION, 1998, 25 (1-3) : 49 - 74
[49] Model-Based Wiener filter for noise robust speech recognition
Arakawa, Takayuki
Tsujikawa, Masanori
Isotani, Ryosuke
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 537 - 540
[50] Speech synthesis based on a physiological articulatory model
Fang, Qiang
Dang, Jianwu
Chinese Spoken Language Processing, Proceedings, 2006, 4274 : 211 - 222

← 1 2 3 4 5 →