Application of EαNets to feature recognition of articulation manner in knowledge-based automatic speech recognition

Cited by: 0
Authors
Siniscalchi, Sabato M. [1 ]
Li, Jinyu
Pilato, Giovanni
Vassallo, Giorgio
Clements, Mark A.
Gentile, Antonio
Sorbello, Filippo
Affiliations
[1] Georgia Inst Technol, Ctr Signal & Image Proc, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Italian Natl Res Council, Ist Calcolo & Reti Ad Alte Prestaz, I-90128 Palermo, Italy
[3] Univ Palermo, Dipartimento Ingn Informat, I-90128 Palermo, Italy
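Source
NEURAL NETS | 2006 / Vol. 3931
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract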
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic knowledge into the design of Automatic Speech Recognition (ASR) systems has proven to be a viable approach to raising ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as detectors for manner of articulation attributes, starting from representations of speech signal frames. In this paper, a set of six detectors for the above-mentioned attributes is designed based on the E-alpha Net model of neural networks. This model was chosen for its capability to learn hidden activation functions, which results in better generalization properties. The experimental setup and results are presented; they show an average 3.5% improvement over a baseline neural network implementation.
Pages: 140 - 146
Page count: 7
Related papers
50 in total
  • [31] Application of knowledge-based cascade-correlation to vowel recognition
    Rivest, F
    Shultz, TR
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 53 - 58
  • [32] DUAL APPLICATION OF SPEECH ENHANCEMENT FOR AUTOMATIC SPEECH RECOGNITION
    Pandey, Ashutosh
    Liu, Chunxi
    Wang, Yun
    Saraf, Yatharth
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 223 - 228
  • [33] Automatic Speech Recognition and Dependency Network to Identification of Articulation Error Patterns
    Chen, Yeou-Jiunn
    Wu, Jiunn-Liang
    Yang, Hui-Mei
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 4009 - +
  • [34] AN INTEGRATED KNOWLEDGE BASE FOR SPEECH SYNTHESIS AND AUTOMATIC SPEECH RECOGNITION
    TATHAM, MAA
    JOURNAL OF PHONETICS, 1985, 13 (02) : 175 - 188
  • [35] A Robust Feature Normalization Algorithm for Automatic Speech Recognition
    Lei, Jianjun
    Yang, Zhen
    Wang, Jian
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 473 - +
  • [36] Characterizing feature variability in automatic speech recognition systems
    Barrault, Loic
    Matrouf, Driss
    De Mori, Renato
    Gemello, Roberto
    Mana, Franco
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5887 - 5890
  • [37] Soft Margin Feature Extraction for Automatic Speech Recognition
    Li, Jinyu
    Lee, Chin-Hui
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 293 - 296
  • [38] KNOWLEDGE-BASED CHINESE SENTENCE RECOGNITION
    ZHENG, YC
    YUAN, BZ
    14TH INTERNATIONAL CONGRESS ON ACOUSTICS, PROCEEDINGS, VOLS 1-4, 1992, : 1113 - 1114
  • [39] The application of optimization in feature extraction of speech recognition
    Gu, L
    Liu, RS
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 745 - 748
  • [40] Knowledge-based adaptive decision tree state tying for conversational speech recognition
    Hu, Rusheng
    Zhao, Yunxin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2160 - 2168