Application of EαNets to feature recognition of articulation manner in knowledge-based automatic speech recognition

被引:0
|
作者
Siniscalchi, Sabato M. [1 ]
Li, Jinyu
Pilato, Giovanni
Vassallo, Giorgio
Clements, Mark A.
Gentile, Antonio
Sorbello, Filippo
机构
[1] Georgia Inst Technol, Ctr Signal & Image Proc, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Italian Natl Res Council, Ist CAlcolo & Reti Ad Alte Prestaz, I-90128 Palermo, Italy
[3] Univ Palermo, Dipartimento Ingn Informat, I-90128 Palermo, Italy
来源
NEURAL NETS | 2006年 / 3931卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic knowledge into Automatic Speech Recognition (ASR) systems design has been proven a viable approach to rise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as detectors for manner of articulation attributes starting from representations of speech signal frames. In this paper, a set of six detectors for the above mentioned attributes is designed based on the E-alpha Net model of neural networks. This model was chosen for its capability to learn hidden activation functions that results in better generalization properties. Experimental set-up and results are presented that show an average 3.5% improvement over a baseline neural network implementation.
引用
收藏
页码:140 / 146
页数:7
相关论文
共 50 条
  • [21] Feature extraction for automatic speech recognition (ASR)
    Swartz, B
    Magotra, N
    THIRTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1997, : 748 - 751
  • [22] Articulation constrained learning with application to speech emotion recognition
    Shah, Mohit
    Tu, Ming
    Berisha, Visar
    Chakrabarti, Chaitali
    Spanias, Andreas
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
  • [23] Articulation constrained learning with application to speech emotion recognition
    Mohit Shah
    Ming Tu
    Visar Berisha
    Chaitali Chakrabarti
    Andreas Spanias
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [24] AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION
    BRUNDAGE, WJ
    CONTROL ENGINEERING, 1983, 30 (04) : 117 - 117
  • [25] Development of articulation training system with speech recognition based automatic pronunciation detection mechanism
    Chen, Yeou-Jiunn
    Huang, Jing-Wei
    3RD KUALA LUMPUR INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING 2006, 2007, 15 : 637 - +
  • [26] Model based feature enhancement for automatic speech recognition in reverberant environments
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1239 - 1242
  • [27] Prosodic knowledge sources for automatic speech recognition
    Vergyri, D
    Stolcke, A
    Gadde, VRR
    Ferrer, L
    Shriberg, E
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 208 - 211
  • [28] A Feature Extraction Method for Automatic Speech Recognition Based on the Cochlear Nucleus
    Haque, Serajul
    Togneri, Roberto
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2454 - 2457
  • [29] Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition
    de-La-Calle-Silos, Fernando
    Stern, Richard M.
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1158 - 1162
  • [30] KNOWLEDGE-BASED AND MODEL-BASED AUTOMATIC TARGET RECOGNITION ALGORITHM ADAPTATION
    SADJADI, FA
    NASR, H
    AMEHDI, H
    BAZAKOS, M
    OPTICAL ENGINEERING, 1991, 30 (02) : 183 - 188