Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引：0

作者：

Huang, Guangpu ^{[1
]}

Er, Meng Joo ^{[1
]}

机构：

[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore

来源：

2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年

关键词：

NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.

引用

页数：8

共 50 条

[1] Deep Learning of Speech Features for Improved Phonetic Recognition
Lee, Jaehyung
Lee, Soo-Young
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1256 - 1259
[2] IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION
Chao, Hao
Yang, Zhanlei
Liu, Wenju
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4741 - 4744
[3] Articulatory Features for "Meeting" Speech Recognition
Metze, Florian
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 581 - 584
[4] Speech recognition based on a combination of acoustic features with articulatory information
LU Xugang DANG Jianwu (Japan Advanced Institute of Science and Technology
ChineseJournalofAcoustics, 2005, (03) : 271 - 279
[5] Speech recognition using cepstral articulatory features
Najnin, Shamima
Banerjee, Bonny
SPEECH COMMUNICATION, 2019, 107 : 26 - 37
[6] Applying articulatory features to speech emotion recognition
Zhou, Yu
Sun, Yanqing
Yang, Lin
Yan, Yonghong
2009 INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN COMPUTER SCIENCE, ICRCCS 2009, 2009, : 73 - 76
[7] Towards capturing fine phonetic variation in speech using articulatory features
Scharenborg, Odette
Wan, Vincent
Moore, Roger K.
SPEECH COMMUNICATION, 2007, 49 (10-11) : 811 - 826
[8] Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: Benefit to speech recognition
Sudhakar, Prasad
Ghosh, Prasanta Kumar
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 169 - 173
[9] Robust Speech Recognition Combining Cepstral and Articulatory Features
Zha, Zhuan-ling
Hu, Jin
Zhan, Qing-ran
Shan, Ya-hui
Xie, Xiang
Wang, Jing
Cheng, Hao-bo
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
[10] Whispery speech recognition using adapted articulatory features
Jou, SC
Schultz, T
Waibel, A
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1009 - 1012

← 1 2 3 4 5 →