Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引:0
|
作者
Huang, Guangpu [1 ]
Er, Meng Joo [1 ]
机构
[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore
来源
2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年
关键词
NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Deep Learning of Speech Features for Improved Phonetic Recognition
    Lee, Jaehyung
    Lee, Soo-Young
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1256 - 1259
  • [2] IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION
    Chao, Hao
    Yang, Zhanlei
    Liu, Wenju
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4741 - 4744
  • [3] Articulatory Features for "Meeting" Speech Recognition
    Metze, Florian
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 581 - 584
  • [4] Speech recognition based on a combination of acoustic features with articulatory information
    LU Xugang DANG Jianwu (Japan Advanced Institute of Science and Technology
    ChineseJournalofAcoustics, 2005, (03) : 271 - 279
  • [5] Speech recognition using cepstral articulatory features
    Najnin, Shamima
    Banerjee, Bonny
    SPEECH COMMUNICATION, 2019, 107 : 26 - 37
  • [6] Applying articulatory features to speech emotion recognition
    Zhou, Yu
    Sun, Yanqing
    Yang, Lin
    Yan, Yonghong
    2009 INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN COMPUTER SCIENCE, ICRCCS 2009, 2009, : 73 - 76
  • [7] Towards capturing fine phonetic variation in speech using articulatory features
    Scharenborg, Odette
    Wan, Vincent
    Moore, Roger K.
    SPEECH COMMUNICATION, 2007, 49 (10-11) : 811 - 826
  • [8] Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: Benefit to speech recognition
    Sudhakar, Prasad
    Ghosh, Prasanta Kumar
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 169 - 173
  • [9] Robust Speech Recognition Combining Cepstral and Articulatory Features
    Zha, Zhuan-ling
    Hu, Jin
    Zhan, Qing-ran
    Shan, Ya-hui
    Xie, Xiang
    Wang, Jing
    Cheng, Hao-bo
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
  • [10] Whispery speech recognition using adapted articulatory features
    Jou, SC
    Schultz, T
    Waibel, A
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1009 - 1012