Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引：0

作者：

Huang, Guangpu ^{[1
]}

Er, Meng Joo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore

来源：

2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年

关键词：

NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.

引用

页数：8

共 50 条

[31] Model-based feature compensation for robust speech recognition
School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
不详
不详
Fundam Inf, 2006, 4 (529-539):
[32] Hidden Markov model-based speech emotion recognition
Schuller, B
Rigoll, G
Lang, M
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 1 - 4
[33] Self-organizing speech recognition that processes acoustic and articulatory features
Viana, Hesdras O.
Araujo, Aluizio F. R.
Barbosa, Danilo S.
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 39169 - 39195
[34] Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features
Zhan, Qingran
Motlicek, Petr
Du, Shixuan
Shan, Yahui
Ma, Sifan
Xie, Xiang
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1912 - 1916
[35] Speech recognition using syllable and pseudo-articulatory features modeling
Zhang, L
PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 137 - 141
[36] Combining Articulatory Features with End-to-End Learning in Speech Recognition
Qu, Leyuan
Weber, Cornelius
Lakomkin, Egor
Twiefel, Johannes
Wermter, Stefan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 500 - 510
[37] ARTICULATORY FEATURES FROM DEEP NEURAL NETWORKS AND THEIR ROLE IN SPEECH RECOGNITION
Mitra, Vikramjit
Sivaraman, Ganesh
Nam, Hosung
Espy-Wilson, Carol
Saltzman, Elliot
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[38] ARTICULATORY INFORMATION AND MULTIVIEW FEATURES FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Mitra, Vikramjit
Wang, Wen
Bartels, Chris
Franco, Horacio
Vergyri, Dimitra
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5634 - 5638
[39] Self-organizing speech recognition that processes acoustic and articulatory features
Hesdras O. Viana
Aluízio F. R. Araújo
Danilo S. Barbosa
Multimedia Tools and Applications, 2024, 83 : 39169 - 39195
[40] A new phonetic model for continuous speech recognition systems
Fagundes, RDR
Corrêa, JS
Dumouchel, P
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 572 - 575

← 1 2 3 4 5 →