Acoustic-to-Articulatory Mapping;
critical articulators;
Deep Neural Networks;
phone recognition;
DOI:
Not available
Chinese Library Classification (CLC) number:
TP18 [Artificial Intelligence Theory];
Discipline classification codes:
081104;
0812;
0835;
1405;
Abstract:
We present a strategy for learning Deep Neural Network (DNN)-based Acoustic-to-Articulatory Mapping (AAM) functions in which the contribution of each articulatory feature (AF) to the global reconstruction error is weighted by its relevance. We first show empirically that the more crucial an articulator is for the production of a given phone, the less variable it is, confirming previous findings. We then compute the relevance of an articulatory feature as a function of its frame-wise variance conditioned on the acoustic evidence, which is estimated by a Mixture Density Network (MDN). Finally, we combine acoustic and recovered articulatory features in a hybrid DNN-HMM phone recognizer. Tested on the MOCHA-TIMIT corpus, articulatory features reconstructed by a standardly trained DNN lead to an 8.4% relative phone error reduction (with respect to a recognizer that uses only MFCCs), whereas articulatory features reconstructed taking their relevance into account increase the relative phone error reduction to 10.9%.
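
The relevance-weighted reconstruction error described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact function that maps frame-wise variance to relevance, so the inverse-variance weighting below is an assumption made only for illustration, and the names `relevance_weights` and `weighted_reconstruction_error` are hypothetical. The variances are assumed to come from an MDN and are passed in as a plain array here.

```python
import numpy as np

def relevance_weights(variances, eps=1e-8):
    """Map per-frame AF variances (frames x features) to relevance weights.

    ASSUMPTION: relevance is taken as inverse variance, rescaled so the
    weights in each frame sum to the number of features. The paper's
    actual function is not stated in the abstract.
    """
    inv = 1.0 / (variances + eps)
    n_features = variances.shape[1]
    return inv * n_features / inv.sum(axis=1, keepdims=True)

def weighted_reconstruction_error(targets, predictions, weights):
    """Mean over frames of the relevance-weighted squared error."""
    sq_err = (targets - predictions) ** 2
    return float(np.mean(np.sum(weights * sq_err, axis=1)))

# Toy example: one frame, two articulatory features.
variances = np.array([[1.0, 4.0]])    # second feature is more variable
targets = np.array([[0.0, 0.0]])
predictions = np.array([[1.0, 1.0]])

w = relevance_weights(variances)
loss = weighted_reconstruction_error(targets, predictions, w)
```

In this toy case the less variable feature receives the larger weight, so an error on it costs more than the same error on the more variable feature. With equal variances all weights are 1 and the loss reduces to the ordinary summed squared error.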